A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example "ablow".

Words similar to "ect" in the corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein
ect (0) - 25 freq
eit (1) - 644 freq
egt (1) - 1 freq
ept (1) - 1 freq
echt (1) - 115 freq
eat (1) - 467 freq
mect (1) - 1 freq
ecg (1) - 1 freq
act (1) - 241 freq
fect (1) - 12 freq
ict (1) - 1 freq
ecz (1) - 1 freq
ecv (1) - 1 freq
eft (1) - 3 freq
ent (1) - 3 freq
sect (1) - 3 freq
ects (1) - 4 freq
ecb (1) - 1 freq
zct (1) - 1 freq
ecn (1) - 2 freq
et (1) - 256 freq
est (1) - 22 freq
elt (1) - 2 freq
eet (1) - 581 freq
ett (1) - 97 freq
Double Levenshtein
ect (0) - 25 freq
act (1) - 241 freq
oct (1) - 4 freq
ct (1) - 8 freq
ict (1) - 1 freq
ech (2) - 5 freq
eck (2) - 425 freq
sct (2) - 1 freq
ec (2) - 18 freq
auct (2) - 1 freq
ett (2) - 97 freq
ecc (2) - 1 freq
cat (2) - 569 freq
eco (2) - 2 freq
cet (2) - 1 freq
cot (2) - 44 freq
cut (2) - 461 freq
gct (2) - 1 freq
eet (2) - 581 freq
cto (2) - 1 freq
ecy (2) - 1 freq
cit (2) - 4 freq
ert (2) - 18 freq
ecg (2) - 1 freq
eit (2) - 644 freq
SoundEx code - E230
eichty - 7 freq
eicht - 61 freq
eesed - 65 freq
eest - 24 freq
eight - 69 freq
est - 22 freq
eaught - 1 freq
eejit - 71 freq
eiked - 3 freq
eighth - 4 freq
east - 307 freq
eikit - 65 freq
echtie - 4 freq
eschewed - 2 freq
echoed - 15 freq
echt - 115 freq
exit - 28 freq
eeejit - 1 freq
eegit - 4 freq
ecuid - 3 freq
echaed - 2 freq
eked - 3 freq
echae'd - 1 freq
egged - 5 freq
eeight - 1 freq
eichtie - 2 freq
echty - 22 freq
eestae - 3 freq
eastae - 1 freq
ejit - 1 freq
eesta - 1 freq
exude - 4 freq
eighty - 13 freq
eggheid - 1 freq
excite - 2 freq
'eicht - 2 freq
ect - 25 freq
echth - 1 freq
eaucht - 1 freq
eeyjit - 1 freq
eyght - 4 freq
¬‚eggit - 1 freq
egt - 1 freq
equate - 5 freq
exceed - 1 freq
eekit - 6 freq
esto - 2 freq
equity - 7 freq
eskside - 1 freq
ees't - 8 freq
eeside - 1 freq
eastawa - 3 freq
ekit - 1 freq
eased - 3 freq
eigged - 1 freq
eiged - 1 freq
‘east - 1 freq
'exit' - 1 freq
“eight - 1 freq
’est - 4 freq
“exit - 1 freq
eused - 1 freq
‘eighty - 1 freq
ekd - 1 freq
echyty - 1 freq
eoggudh - 1 freq
exdt - 1 freq
ezd - 1 freq
eijit - 1 freq
eaxxd - 1 freq
ejjit - 1 freq
egid - 1 freq
esd - 1 freq
ezsgwt - 1 freq
eyesite - 1 freq
exsdu - 1 freq
MetaPhone code - EKT
eiked - 3 freq
eikit - 65 freq
ecuid - 3 freq
eked - 3 freq
egged - 5 freq
eggheid - 1 freq
ect - 25 freq
¬‚eggit - 1 freq
egt - 1 freq
equate - 5 freq
eekit - 6 freq
equity - 7 freq
ekit - 1 freq
eigged - 1 freq
ekd - 1 freq
eoggudh - 1 freq
Manually curated
ECT
Time to execute Levenshtein function - 0.184010 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
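As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] is the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


# A few of the neighbours listed for "ect" above.
for word in ("ect", "echt", "eit", "ects"):
    print(word, levenshtein("ect", word))
```

Only two rows of the table are kept at any time, so memory use grows with the length of the second word rather than with the product of both lengths.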
Time to execute Double Levenshtein function - 0.382524 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
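The following Python sketch shows one way to read that description: sum the distance between the full words and the distance between the same words with vowels stripped. The vowel set `aeiou` and the helper names are assumptions made for the example; the site's own rules (for instance, how it treats "y") are not stated.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with every vowel removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus consonant-only distance."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# Vowel changes count once, consonant changes count twice.
for word in ("act", "eck", "cat", "auct"):
    print(word, double_levenshtein("ect", word))
```

Under these assumptions the sketch gives "act" a score of 1 and "eck", "cat" and "auct" a score of 2, in line with the Double Levenshtein list on this page.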
Time to execute SoundEx function - 0.027628 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
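To make the encoding concrete, here is a minimal Python sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is an illustration under those standard rules, not the function this site calls, and the name `soundex` is chosen for the example.

```python
import string

# Consonant groups of classic Soundex; vowels, h, w and y carry no digit.
CODES = {}
for digit, letters in (("1", "bfpv"), ("2", "cgjkqsxz"), ("3", "dt"),
                       ("4", "l"), ("5", "mn"), ("6", "r")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if ch in string.ascii_lowercase]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        previous = digit  # a vowel resets this, so a later repeat is coded again
        if len(code) == 4:
            break
    return code.ljust(4, "0")


# Words from the SoundEx list above share one code with "ect".
for word in ("ect", "echt", "eight", "east"):
    print(word, soundex(word))
```

Because vowels carry no digit and only the first three consonant codes are kept, quite different words such as "east" and "eight" end up in the same bucket as "ect".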
Time to execute MetaPhone function - 0.040113 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000925 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
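A lookup of this kind can be sketched in Python as a plain dictionary from word form to lemma. The table contents and the names `LEXICON` and `lemma_for` are hypothetical; only the pairing of "ect" with "ECT" is taken from the result shown on this page, and even that reading of the output is an assumption.

```python
from typing import Optional

# Hypothetical hand-made lexicon: surface form -> lemma.
LEXICON = {
    "ect": "ECT",   # pairing taken from the result shown on this page
    "ects": "ECT",  # hypothetical entry, for illustration only
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("ect"))
print(lemma_for("ablow"))  # not in this toy table, so the lookup returns None
```

A dictionary lookup takes constant time on average, which fits with this method having the shortest execution time of the five, but the result is only as good as the table's coverage.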