A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eemis in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
eemis (0) - 10 freq
semis (1) - 4 freq
eesin (2) - 1 freq
eamin (2) - 1 freq
exis (2) - 1 freq
seems (2) - 517 freq
emit (2) - 4 freq
deemies (2) - 1 freq
permis (2) - 1 freq
eeses (2) - 1 freq
denis (2) - 11 freq
leis (2) - 1 freq
feemin (2) - 1 freq
feemit (2) - 1 freq
nevis (2) - 4 freq
jeesis (2) - 2 freq
semi (2) - 20 freq
eerie (2) - 34 freq
penis (2) - 1 freq
eekin (2) - 1 freq
ermie (2) - 1 freq
eerins (2) - 6 freq
deemit (2) - 2 freq
demit (2) - 2 freq
eet's (2) - 58 freq
eemis (0) - 10 freq
ems (2) - 4 freq
semis (2) - 4 freq
emus (2) - 1 freq
mis (2) - 3 freq
nemos (3) - 1 freq
eemur (3) - 1 freq
eens (3) - 131 freq
eenies (3) - 15 freq
teems (3) - 1 freq
dems (3) - 6 freq
nems (3) - 19 freq
memes (3) - 4 freq
exems (3) - 1 freq
enemies (3) - 36 freq
mos (3) - 9 freq
feeis (3) - 1 freq
erms (3) - 42 freq
eomin (3) - 1 freq
geis (3) - 8 freq
deems (3) - 6 freq
hems (3) - 8 freq
jeems (3) - 91 freq
eyewis (3) - 5 freq
weems (3) - 1 freq
SoundEx code - E520
enough - 883 freq
ens - 16 freq
enns - 11 freq
eneuch - 748 freq
eence - 316 freq
eens - 131 freq
enjoay - 1 freq
enjoy - 331 freq
eneugh - 49 freq
ense - 15 freq
eemage - 18 freq
emmma's - 1 freq
enuch - 89 freq
eemis - 10 freq
een's - 13 freq
eyn's - 1 freq
eyns - 7 freq
enjey - 11 freq
eense - 16 freq
eneaise - 1 freq
enn's - 1 freq
enough-he - 1 freq
enouch - 4 freq
enugh - 2 freq
enc - 1 freq
enic's - 1 freq
'enjoy - 2 freq
eunice - 1 freq
enoch - 17 freq
enosh - 3 freq
emmaus - 6 freq
eemock - 8 freq
einas - 1 freq
eins - 2 freq
enyoch - 36 freq
eens-shö - 1 freq
eans - 10 freq
emus - 1 freq
eimage - 17 freq
enogh - 20 freq
enjye - 1 freq
enough-a - 1 freq
'enough - 1 freq
eng - 10 freq
eneuch- - 1 freq
eyeing - 1 freq
eines - 3 freq
enack - 1 freq
eince - 1 freq
eneoch - 2 freq
€˜eence - 1 freq
eenies - 15 freq
€“eneuch - 1 freq
ems - 4 freq
enschew - 1 freq
enjy - 2 freq
emma's - 1 freq
ewing - 6 freq
enes - 2 freq
€œenoch - 1 freq
enyoch' - 1 freq
eneÂ’s - 1 freq
emms - 1 freq
emosh - 1 freq
engy - 2 freq
emaaq - 1 freq
emz - 4 freq
emoji - 3 freq
euang - 1 freq
e'en's - 1 freq
eyemask - 1 freq
eimsj - 1 freq
enoug - 1 freq
enj - 1 freq
euankay - 1 freq
MetaPhone code - EMS
emmma's - 1 freq
eemis - 10 freq
emmaus - 6 freq
emus - 1 freq
embassie - 1 freq
embassy - 1 freq
ems - 4 freq
emma's - 1 freq
emms - 1 freq
emz - 4 freq
EEMIS
Time to execute Levenshtein function - 0.185257 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.363952 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028366 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038864 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.