A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to est in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
est (0) - 22 freq
ese (1) - 2 freq
es (1) - 798 freq
esd (1) - 1 freq
ept (1) - 1 freq
lst (1) - 1 freq
eet (1) - 581 freq
yest (1) - 1 freq
rst (1) - 1 freq
best (1) - 1574 freq
ast (1) - 8 freq
egt (1) - 1 freq
ect (1) - 25 freq
test (1) - 145 freq
xst (1) - 1 freq
ett (1) - 97 freq
vst (1) - 1 freq
eat (1) - 460 freq
aest (1) - 21 freq
esh (1) - 6 freq
eso (1) - 1 freq
gest (1) - 1 freq
esk (1) - 22 freq
pest (1) - 25 freq
esa (1) - 2 freq
Double Levenshtein
est (0) - 22 freq
aest (1) - 21 freq
eest (1) - 24 freq
ast (1) - 8 freq
yest (1) - 1 freq
ist (1) - 11 freq
esto (1) - 2 freq
east (1) - 304 freq
st (1) - 378 freq
seyt (2) - 2 freq
sti (2) - 1 freq
seet (2) - 5 freq
sot (2) - 6 freq
isit (2) - 1 freq
jest (2) - 80 freq
eit (2) - 644 freq
lest (2) - 215 freq
zest (2) - 2 freq
hst (2) - 1 freq
'st (2) - 1 freq
usit (2) - 4 freq
uist (2) - 9 freq
ste (2) - 41 freq
usty (2) - 1 freq
syt (2) - 1 freq
SoundEx code - E230
eichty - 7 freq
eicht - 61 freq
eesed - 65 freq
eest - 24 freq
eight - 66 freq
est - 22 freq
eaught - 1 freq
eejit - 69 freq
eiked - 3 freq
eighth - 4 freq
east - 304 freq
eikit - 65 freq
echtie - 4 freq
eschewed - 2 freq
echoed - 15 freq
echt - 114 freq
exit - 27 freq
eeejit - 1 freq
eegit - 4 freq
ecuid - 3 freq
echaed - 2 freq
eked - 3 freq
echae'd - 1 freq
egged - 5 freq
eeight - 1 freq
eichtie - 2 freq
echty - 22 freq
eestae - 3 freq
eastae - 1 freq
ejit - 1 freq
eesta - 1 freq
exude - 4 freq
eighty - 12 freq
eggheid - 1 freq
excite - 2 freq
'eicht - 2 freq
ect - 25 freq
echth - 1 freq
eaucht - 1 freq
eeyjit - 1 freq
eyght - 4 freq
¬‚eggit - 1 freq
egt - 1 freq
equate - 5 freq
exceed - 1 freq
eekit - 6 freq
esto - 2 freq
equity - 7 freq
eskside - 1 freq
ees't - 8 freq
eeside - 1 freq
eastawa - 3 freq
ekit - 1 freq
eased - 3 freq
eigged - 1 freq
eiged - 1 freq
‘east - 1 freq
'exit' - 1 freq
“eight - 1 freq
’est - 4 freq
“exit - 1 freq
eused - 1 freq
‘eighty - 1 freq
ekd - 1 freq
echyty - 1 freq
eoggudh - 1 freq
exdt - 1 freq
ezd - 1 freq
eijit - 1 freq
eaxxd - 1 freq
ejjit - 1 freq
egid - 1 freq
esd - 1 freq
ezsgwt - 1 freq
eyesite - 1 freq
exsdu - 1 freq
MetaPhone code - EST
eesed - 65 freq
eest - 24 freq
est - 22 freq
east - 304 freq
eestae - 3 freq
eastae - 1 freq
eesta - 1 freq
aest - 21 freq
aesed - 2 freq
esto - 2 freq
ees't - 8 freq
eeside - 1 freq
eased - 3 freq
‘east - 1 freq
’est - 4 freq
eused - 1 freq
ezd - 1 freq
esd - 1 freq
Manually curated
EST
Time to execute Levenshtein function - 0.174299 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
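The definition above can be sketched in Python as follows. This is a minimal illustration using the standard dynamic-programming approach, not the code behind this page. The function names `levenshtein` and `neighbours` and the `SAMPLE` table are assumptions made for the example; the frequencies in `SAMPLE` are copied from the listings on this page.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


# Hypothetical word -> frequency table; values taken from the lists above.
SAMPLE = {"est": 22, "best": 1574, "test": 145, "eat": 460, "lest": 215, "eicht": 61}


def neighbours(word, vocabulary, max_distance=1):
    """Return (word, distance, frequency) for every vocabulary entry within
    max_distance, nearest first and most frequent first within a distance."""
    hits = []
    for candidate, freq in vocabulary.items():
        d = levenshtein(word, candidate)
        if d <= max_distance:
            hits.append((candidate, d, freq))
    return sorted(hits, key=lambda hit: (hit[1], -hit[2]))


if __name__ == "__main__":
    for candidate, d, freq in neighbours("est", SAMPLE):
        print(f"{candidate} ({d}) - {freq} freq")
```

Keeping only two rows of the table at a time keeps memory proportional to the shorter word, which matters little for single words but costs nothing.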
Time to execute Double Levenshtein function - 0.321442 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
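A rough sketch of that idea in Python is shown below. It is not the site's own code: the names `double_levenshtein` and `strip_vowels` are made up for the example, and treating `y` as a vowel is an assumption (it is consistent with the listed score of 1 for "yest", but the page does not state which letters are removed).

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: which letters count as vowels is not stated in the source.
VOWELS = "aeiouy"


def strip_vowels(word: str, vowels: str = VOWELS) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str, vowels: str = VOWELS) -> int:
    """Distance on the full words plus distance on the consonant skeletons,
    so a consonant change is counted twice and a vowel change once."""
    full = levenshtein(a, b)
    skeleton = levenshtein(strip_vowels(a, vowels), strip_vowels(b, vowels))
    return full + skeleton


if __name__ == "__main__":
    for candidate in ("est", "aest", "yest", "jest", "eit"):
        print(candidate, double_levenshtein("est", candidate))
```

Under these assumptions a vowel-only variant such as "aest" stays at distance 1, while a consonant change such as "jest" moves out to distance 2.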
Time to execute SoundEx function - 0.027555 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
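For illustration, here is a minimal Python sketch of the common American Soundex rules (keep the first letter, map the remaining consonants to digits, collapse adjacent equal codes, pad to four characters). It is an assumption that the site's function follows these same rules; the name `soundex` and the `CODES` table are written for this example only.

```python
# Consonant groups of American Soundex; vowels, h, w and y have no code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if word has no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    previous = CODES.get(first, "")  # the first letter's code can suppress a repeat
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = CODES.get(ch, "")
        if code and code != previous:
            digits.append(code)
        previous = code  # a vowel resets this, so a repeated code counts again
    return (first.upper() + "".join(digits) + "000")[:4]


if __name__ == "__main__":
    for word in ("est", "east", "eicht", "eschewed"):
        print(word, soundex(word))
```

Because only the first letter and up to three consonant codes survive, quite different words such as "est" and "eschewed" end up with the same code, which is why the SoundEx list above is so much longer than the others.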
Time to execute MetaPhone function - 0.036979 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000842 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
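A lookup of this kind might be sketched as below. The `LEXICON` entries and the function name `lemma_for` are hypothetical and exist only to show the shape of the lookup; they are not taken from the site's actual table.

```python
from typing import Optional

# Hypothetical hand-made entries: spelling variant -> lemma.
LEXICON = {
    "eicht": "eight",
    "echt": "eight",
    "eichty": "eighty",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for word, or None when the word is not
    covered by the lexicon."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("Eicht"))   # covered: prints the lemma
    print(lemma_for("sonsie"))  # not in this toy table: prints None
```

A plain dictionary lookup involves no distance computation at all, which fits with this method having the shortest execution time reported above.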