A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to engines in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
engines (0) - 13 freq
engine (1) - 50 freq
ingines (1) - 4 freq
engine's (1) - 2 freq
engineers (2) - 7 freq
ingine (2) - 33 freq
injines (2) - 2 freq
endins (2) - 13 freq
engineer (2) - 33 freq
eggies (2) - 4 freq
ingins (2) - 15 freq
erlines (2) - 2 freq
ingines- (2) - 1 freq
nines (2) - 12 freq
eines (2) - 3 freq
engages (2) - 3 freq
agnes (3) - 29 freq
ginos (3) - 1 freq
gings (3) - 75 freq
shines (3) - 37 freq
ingyres (3) - 1 freq
entire (3) - 32 freq
anjings (3) - 1 freq
leggins (3) - 3 freq
jeggins (3) - 1 freq
Double Levenshtein (distance in brackets)
engines (0) - 13 freq
ingines (1) - 4 freq
ingins (2) - 15 freq
engine's (2) - 2 freq
engine (2) - 50 freq
ingens (3) - 1 freq
ingines- (3) - 1 freq
engages (3) - 3 freq
nines (3) - 12 freq
engineer (3) - 33 freq
engineers (3) - 7 freq
ingine (3) - 33 freq
injines (3) - 2 freq
ingans (3) - 9 freq
endins (3) - 13 freq
ongaens (4) - 2 freq
ongaeins (4) - 2 freq
ingin (4) - 18 freq
injins (4) - 1 freq
ongyans (4) - 4 freq
youngins (4) - 1 freq
ingles (4) - 1 freq
gunes (4) - 2 freq
penguins (4) - 7 freq
angina (4) - 3 freq
SoundEx code - E525
ensnared - 2 freq
engine - 50 freq
ensemble - 4 freq
ensaumpil - 11 freq
encoonters - 9 freq
encoonter - 10 freq
enjoyin - 70 freq
ensaumple - 9 freq
ensaumples - 1 freq
engines - 13 freq
encoontered - 2 freq
engine's - 2 freq
encounters - 3 freq
ensconced - 1 freq
encountenn - 1 freq
enigmatic - 7 freq
ensamples - 2 freq
enjoyen - 1 freq
engineer - 33 freq
enjoying - 24 freq
enchantit - 2 freq
enjoyin' - 2 freq
ensample - 6 freq
encoonther - 1 freq
enchant - 2 freq
enchantment - 2 freq
enagement - 1 freq
encounter - 7 freq
engineering - 4 freq
enchanted - 1 freq
enchantin - 1 freq
encompasst - 1 freq
enginivver - 1 freq
enginivverin - 1 freq
eenocent - 2 freq
emissions - 4 freq
engineers - 7 freq
enjambment - 1 freq
engendered - 2 freq
enchanter - 2 freq
engineerin - 8 freq
enjoyment - 7 freq
enchantan - 1 freq
engineerin' - 1 freq
ensuin - 4 freq
enigmatical - 1 freq
ensconsed - 1 freq
ensample-or - 1 freq
ensenyie - 1 freq
ensaumpils - 5 freq
engineered - 1 freq
eimaginative - 2 freq
enackin - 2 freq
enackment - 1 freq
eimage-makin - 1 freq
enjyin - 2 freq
enjoayin - 1 freq
encompass - 2 freq
'engineer' - 1 freq
engineer's - 1 freq
engenderit - 1 freq
ensampul - 8 freq
ensampuls - 1 freq
eimagine - 2 freq
encumbered - 1 freq
enigma - 1 freq
enjyement - 1 freq
enchauntit - 3 freq
encampment - 1 freq
‘enchauntit - 1 freq
engenderin - 2 freq
encompassin - 3 freq
engenders - 1 freq
encoontert - 1 freq
enjoyan - 1 freq
encompasses - 1 freq
encooonter - 1 freq
enkoontirs - 1 freq
encountered - 2 freq
ewmqn - 1 freq
eoinsweeney - 1 freq
enjeyin - 1 freq
emmasnpharper - 1 freq
emmajanestanle - 150 freq
eiumqn - 1 freq
emmaissandy - 1 freq
emcnzeommb - 1 freq
MetaPhone code - ENJNS
engines - 13 freq
engine's - 2 freq
Manually curated - ENGINES
Time to execute Levenshtein function - 0.214958 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
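The definition above can be sketched in a few lines of Python. This is a minimal illustration using the standard dynamic-programming approach, not the site's own code; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and
    deletions needed to turn string a into string b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["engines", "engine", "ingines", "ingine"]:
        print(word, levenshtein("engines", word))
```

Run against the query word engines, this gives 0 for engines, 1 for engine and ingines, and 2 for ingine, matching the bracketed values in the Levenshtein listing.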
Time to execute Double Levenshtein function - 0.337894 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
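A rough sketch of how such a double pass might look in Python follows. The helper names and the vowel set are assumptions made for this example: the page does not say which letters count as vowels, and `aeiouy` is used here only because it is consistent with listing entries such as ongyans (4) and youngins (4).

```python
VOWELS = set("aeiouy")  # assumption: the site's actual vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete all cost 1)."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["ingines", "engine", "ingins", "ongyans"]:
        print(word, double_levenshtein("engines", word))
```

Because a vowel change only shows up in the first pass while a consonant change shows up in both, engine (one consonant dropped) scores 2 while ingines (one vowel swapped) scores 1.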
Time to execute SoundEx function - 0.028309 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
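To make the encoding concrete, here is a minimal Python sketch of the commonly described American Soundex rules (keep the first letter, map consonants to digits, collapse adjacent repeats, pad to four characters). Whether the site uses exactly this variant is an assumption; the function name `soundex` and the table `CODES` are chosen for this example.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y get no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code such as 'E525'."""
    # Ignore anything that is not a plain a-z letter (quotes, hyphens, ...).
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit may follow
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ["engines", "enjoyin", "ensnared", "emissions"]:
        print(word, soundex(word))
```

With these rules, engines, enjoyin, ensnared and emissions all encode to E525, which is why such different words share one SoundEx list.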
Time to execute MetaPhone function - 0.037272 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000916 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
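A lookup of this kind can be sketched as a plain dictionary. The entries and the names `LEXICON` and `lookup_lemma` below are hypothetical, made up for illustration, and are not taken from the site's lexicon.

```python
from typing import Optional

# Hypothetical hand-made entries: spelling variant -> lemma.
LEXICON = {
    "ingine": "ENGINE",
    "ingines": "ENGINE",
    "injines": "ENGINE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    for word in ["Ingines", "injines", "penguins"]:
        print(word, lookup_lemma(word))
```

A dictionary lookup takes roughly constant time regardless of corpus size, which fits with this method having the shortest execution time of the five. Words missing from the table simply return nothing.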