A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to emus in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
emus (0) - 1 freq
mus (1) - 1 freq
ecus (1) - 1 freq
ems (1) - 4 freq
emms (1) - 1 freq
semis (2) - 4 freq
mugs (2) - 17 freq
mis (2) - 3 freq
ess (2) - 258 freq
eess (2) - 1 freq
erms (2) - 42 freq
e's (2) - 11 freq
rvus (2) - 1 freq
mug (2) - 45 freq
opus (2) - 2 freq
eers (2) - 9 freq
elks (2) - 2 freq
nemos (2) - 1 freq
nus (2) - 5 freq
ehrs (2) - 2 freq
semes (2) - 2 freq
menus (2) - 3 freq
nms (2) - 2 freq
ers (2) - 25 freq
lemes (2) - 2 freq
emus (0) - 1 freq
ems (1) - 4 freq
mus (1) - 1 freq
mes (2) - 3 freq
umes (2) - 1 freq
amys (2) - 1 freq
ms (2) - 29 freq
muse (2) - 35 freq
maus (2) - 1 freq
amuse (2) - 8 freq
mos (2) - 9 freq
eemis (2) - 10 freq
emms (2) - 1 freq
ecus (2) - 1 freq
mis (2) - 3 freq
mous (2) - 49 freq
mas (2) - 7 freq
amos (2) - 3 freq
eyes (3) - 612 freq
mss (3) - 2 freq
eras (3) - 2 freq
peus (3) - 1 freq
pus (3) - 14 freq
emosh (3) - 1 freq
m's (3) - 2 freq
SoundEx code - E520
enough - 883 freq
ens - 16 freq
enns - 11 freq
eneuch - 748 freq
eence - 316 freq
eens - 131 freq
enjoay - 1 freq
enjoy - 331 freq
eneugh - 49 freq
ense - 15 freq
eemage - 18 freq
emmma's - 1 freq
enuch - 89 freq
eemis - 10 freq
een's - 13 freq
eyn's - 1 freq
eyns - 7 freq
enjey - 11 freq
eense - 16 freq
eneaise - 1 freq
enn's - 1 freq
enough-he - 1 freq
enouch - 4 freq
enugh - 2 freq
enc - 1 freq
enic's - 1 freq
'enjoy - 2 freq
eunice - 1 freq
enoch - 17 freq
enosh - 3 freq
emmaus - 6 freq
eemock - 8 freq
einas - 1 freq
eins - 2 freq
enyoch - 36 freq
eens-shö - 1 freq
eans - 10 freq
emus - 1 freq
eimage - 17 freq
enogh - 20 freq
enjye - 1 freq
enough-a - 1 freq
'enough - 1 freq
eng - 10 freq
eneuch- - 1 freq
eyeing - 1 freq
eines - 3 freq
enack - 1 freq
eince - 1 freq
eneoch - 2 freq
€˜eence - 1 freq
eenies - 15 freq
€“eneuch - 1 freq
ems - 4 freq
enschew - 1 freq
enjy - 2 freq
emma's - 1 freq
ewing - 6 freq
enes - 2 freq
€œenoch - 1 freq
enyoch' - 1 freq
eneÂ’s - 1 freq
emms - 1 freq
emosh - 1 freq
engy - 2 freq
emaaq - 1 freq
emz - 4 freq
emoji - 3 freq
euang - 1 freq
e'en's - 1 freq
eyemask - 1 freq
eimsj - 1 freq
enoug - 1 freq
enj - 1 freq
euankay - 1 freq
MetaPhone code - EMS
emmma's - 1 freq
eemis - 10 freq
emmaus - 6 freq
emus - 1 freq
embassie - 1 freq
embassy - 1 freq
ems - 4 freq
emma's - 1 freq
emms - 1 freq
emz - 4 freq
EMUS
Time to execute Levenshtein function - 0.336171 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.517660 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027544 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070058 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000924 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.