A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eichteen in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
eichteen (0) - 15 freq
aichteen (1) - 13 freq
eighteen (1) - 26 freq
echteen (1) - 27 freq
eichteen' (1) - 1 freq
eichteent (1) - 2 freq
eiehteen (1) - 1 freq
eichtiet (2) - 1 freq
eichtie (2) - 2 freq
auchteen (2) - 5 freq
eichties (2) - 3 freq
nicht-een (2) - 1 freq
lichten (2) - 8 freq
echtteen (2) - 1 freq
eighteent (2) - 1 freq
achteen (2) - 2 freq
lighteen (2) - 2 freq
eichteenth (2) - 3 freq
echteent (2) - 7 freq
eightein (2) - 1 freq
tichten (2) - 3 freq
fithteen (2) - 2 freq
fitteen (3) - 1 freq
fichterin (3) - 1 freq
lichtet (3) - 2 freq
eichteen (0) - 15 freq
echteen (1) - 27 freq
aichteen (1) - 13 freq
auchteen (2) - 5 freq
eiehteen (2) - 1 freq
achteen (2) - 2 freq
eighteen (2) - 26 freq
eichteent (2) - 2 freq
eichteen' (2) - 1 freq
achten (3) - 1 freq
echteent (3) - 7 freq
tichten (3) - 3 freq
echtteen (3) - 1 freq
lichten (3) - 8 freq
eichtiet (3) - 1 freq
eichtie (3) - 2 freq
eichties (3) - 3 freq
eightein (3) - 1 freq
cheen (4) - 2 freq
eicht (4) - 61 freq
lichtan (4) - 2 freq
sichtin (4) - 4 freq
lichtin (4) - 26 freq
fichtin (4) - 2 freq
eichts (4) - 4 freq
SoundEx code - E235
excitin - 33 freq
eichteen - 15 freq
extent - 48 freq
eighteen - 26 freq
eastenders - 9 freq
esteem - 15 freq
excitement - 100 freq
extends - 10 freq
extending - 2 freq
equations - 5 freq
equation - 5 freq
echteen - 27 freq
extend - 9 freq
eichteenth - 3 freq
easedom - 4 freq
extensions - 5 freq
extendin - 1 freq
extinct - 12 freq
exciting - 16 freq
eichteen-nineteen - 2 freq
echtie-nine - 1 freq
ection - 16 freq
excitemint - 9 freq
echteenth - 7 freq
estimatit - 2 freq
echtteen - 1 freq
extension - 11 freq
ecudnae - 1 freq
esteim - 1 freq
ectin - 1 freq
extendet - 2 freq
extenuatin - 1 freq
easdom - 1 freq
exten - 1 freq
eichteen-eichty - 2 freq
eichteen' - 1 freq
eighteent - 1 freq
estimate - 4 freq
extinction - 6 freq
extinguished - 2 freq
eichty-nine - 1 freq
extenders - 1 freq
equaetions - 2 freq
extensive - 8 freq
extended - 15 freq
ections - 13 freq
excitin' - 2 freq
ectiononie - 1 freq
extendin' - 1 freq
exitement - 2 freq
extempore - 1 freq
echteent - 7 freq
'extinct' - 1 freq
estimated - 1 freq
eichteent - 2 freq
extant - 9 freq
extendit - 9 freq
extensioun - 1 freq
estimabil - 1 freq
€™easton - 1 freq
extens - 1 freq
excitment - 1 freq
eightein - 1 freq
exceedin - 1 freq
echidna - 4 freq
echidn - 1 freq
echty-nine - 1 freq
extensively - 4 freq
extenso - 1 freq
easton - 3 freq
echt-an-twuntie - 1 freq
extendable - 1 freq
echteent-century - 1 freq
eastendmark - 1 freq
exiting - 1 freq
eukdmmvxpz - 1 freq
extint - 1 freq
estimation - 1 freq
extension” - 1 freq
estimates - 1 freq
eastdumfriessnp - 1 freq
MetaPhone code - EXTN
eichteen - 15 freq
echteen - 27 freq
echtteen - 1 freq
eichteen' - 1 freq
echidna - 4 freq
echidn - 1 freq
EICHTEEN
Time to execute Levenshtein function - 0.194462 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.342819 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028097 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037224 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000897 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.