A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to euang in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
euang (0) - 1 freq
huang (1) - 2 freq
euan (1) - 3 freq
geung (2) - 5 freq
whang (2) - 6 freq
guan (2) - 7 freq
amang (2) - 690 freq
luatg (2) - 1 freq
guano (2) - 1 freq
eland (2) - 1 freq
yung (2) - 72 freq
bang (2) - 97 freq
shang (2) - 1 freq
evan (2) - 12 freq
evans (2) - 27 freq
fang (2) - 12 freq
mang (2) - 17 freq
belang (2) - 85 freq
twang (2) - 13 freq
xiang (2) - 4 freq
iang (2) - 1 freq
rung (2) - 25 freq
clang (2) - 12 freq
guans (2) - 1 freq
spang (2) - 18 freq
Double Levenshtein (combined distance in brackets)
euang (0) - 1 freq
yang (2) - 3 freq
iang (2) - 1 freq
eng (2) - 10 freq
ang (2) - 8 freq
ung (2) - 1 freq
huang (2) - 2 freq
euan (2) - 3 freq
ouyang (2) - 1 freq
yung (2) - 72 freq
eyan (3) - 1 freq
geang (3) - 3 freq
loang (3) - 3 freq
young (3) - 1087 freq
deuing (3) - 1 freq
gang (3) - 1098 freq
ean (3) - 46 freq
jang (3) - 3 freq
kang (3) - 1 freq
ewing (3) - 6 freq
soang (3) - 1 freq
kiang (3) - 1 freq
dung (3) - 26 freq
jung (3) - 3 freq
eurig (3) - 1 freq
SoundEx code - E520
enough - 883 freq
ens - 16 freq
enns - 11 freq
eneuch - 748 freq
eence - 316 freq
eens - 131 freq
enjoay - 1 freq
enjoy - 331 freq
eneugh - 49 freq
ense - 15 freq
eemage - 18 freq
emmma's - 1 freq
enuch - 89 freq
eemis - 10 freq
een's - 13 freq
eyn's - 1 freq
eyns - 7 freq
enjey - 11 freq
eense - 16 freq
eneaise - 1 freq
enn's - 1 freq
enough-he - 1 freq
enouch - 4 freq
enugh - 2 freq
enc - 1 freq
enic's - 1 freq
'enjoy - 2 freq
eunice - 1 freq
enoch - 17 freq
enosh - 3 freq
emmaus - 6 freq
eemock - 8 freq
einas - 1 freq
eins - 2 freq
enyoch - 36 freq
eens-shö - 1 freq
eans - 10 freq
emus - 1 freq
eimage - 17 freq
enogh - 20 freq
enjye - 1 freq
enough-a - 1 freq
'enough - 1 freq
eng - 10 freq
eneuch- - 1 freq
eyeing - 1 freq
eines - 3 freq
enack - 1 freq
eince - 1 freq
eneoch - 2 freq
‘eence - 1 freq
eenies - 15 freq
–eneuch - 1 freq
ems - 4 freq
enschew - 1 freq
enjy - 2 freq
emma's - 1 freq
ewing - 6 freq
enes - 2 freq
“enoch - 1 freq
enyoch' - 1 freq
ene’s - 1 freq
emms - 1 freq
emosh - 1 freq
engy - 2 freq
emaaq - 1 freq
emz - 4 freq
emoji - 3 freq
euang - 1 freq
e'en's - 1 freq
eyemask - 1 freq
eimsj - 1 freq
enoug - 1 freq
enj - 1 freq
euankay - 1 freq
MetaPhone code - ENK
enc - 1 freq
eng - 10 freq
enack - 1 freq
euang - 1 freq
enoug - 1 freq
euankay - 1 freq
Manually curated - EUANG
Time to execute Levenshtein function - 0.223361 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
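As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the prefix of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning i characters of a into an empty string
        for j, char_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,                       # delete char_a
                current[j - 1] + 1,                    # insert char_b
                previous[j - 1] + (char_a != char_b),  # replace, free if equal
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("euang", "huang", "euan", "amang"):
        print(word, levenshtein("euang", word))
```

Run against the query word, this reproduces the distances listed in the results: 0 for euang, 1 for huang and euan, 2 for amang.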
Time to execute Double Levenshtein function - 0.380214 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
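The combined measure can be sketched in Python as follows. This is a guess at the approach, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented for the example, and treating y as a vowel is an assumption, made because it is consistent with the distances listed in the results (for instance yang at 2 and ewing at 3).

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel, w does not


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,                       # delete
                current[j - 1] + 1,                    # insert
                previous[j - 1] + (char_a != char_b),  # replace
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("huang", "euan", "yang", "ewing", "young"):
        print(word, double_levenshtein("euang", word))
```

Because a changed consonant is counted in both passes while a changed vowel is counted only in the first, huang moves from distance 1 to 2, while yang stays at 2.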
Time to execute SoundEx function - 0.031305 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
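To show how words such as euang, enough and eunice end up with the same code, here is a minimal Python sketch of the classic Soundex rules. It is an illustration only, and the site's own function may differ in edge cases; the handling of apostrophes, hyphens and non-ASCII letters (they are simply skipped) is an assumption made for this example.

```python
# Consonant groups that share a digit in classic Soundex.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: anything that is not an ASCII letter is ignored.
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()          # the first letter is kept as is
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue                     # h and w do not separate equal codes
        code = CODES.get(ch, "")         # vowels and y have no code
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code                  # a vowel resets the previous code
    return result.ljust(4, "0")          # pad short codes with zeros


if __name__ == "__main__":
    for word in ("euang", "enough", "eneuch", "eunice"):
        print(word, soundex(word))
```

All four words print E520, the code reported for euang in the results: only the initial letter and the first consonant classes (n, then g or c) contribute, which is why the SoundEx list is so much longer than the others.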
Time to execute MetaPhone function - 0.044462 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000809 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
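A lookup of this kind can be sketched in Python as a plain dictionary. The table layout and the names `LEXICON` and `lookup_lemma` are hypothetical; the single entry assumes that the EUANG shown in the results is the curated lemma for euang, and the real lexicon is presumably far larger.

```python
from typing import Optional

# Hypothetical hand-built table: surface form -> lemma.
# Only the one pairing visible in the results is included here.
LEXICON = {
    "euang": "EUANG",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("euang"))
    print(lookup_lemma("ahint"))  # not in this toy table, so None
```

A dictionary lookup does no distance computation at all, which fits with this method having the shortest execution time of the five.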