A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eton in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
eton (0) - 7 freq
egon (1) - 1 freq
ton (1) - 17 freq
elton (1) - 2 freq
seton (1) - 18 freq
eaton (1) - 1 freq
eon (1) - 3 freq
etin (1) - 10 freq
ston (1) - 9 freq
elon (1) - 1 freq
rtin (2) - 1 freq
getn (2) - 1 freq
yto (2) - 1 freq
roon (2) - 1633 freq
yoon (2) - 6 freq
jhon (2) - 2 freq
en (2) - 618 freq
beyon (2) - 1 freq
luton (2) - 2 freq
ection (2) - 16 freq
poon (2) - 2 freq
enou (2) - 1 freq
toi (2) - 34 freq
'yon (2) - 14 freq
ipon (2) - 16 freq
eton (0) - 7 freq
eaton (1) - 1 freq
etin (1) - 10 freq
ton (1) - 17 freq
tun (2) - 4 freq
eatin (2) - 154 freq
toni (2) - 2 freq
atin (2) - 3 freq
itn (2) - 2 freq
etioun (2) - 2 freq
eaten (2) - 31 freq
tony (2) - 33 freq
toun (2) - 379 freq
aten (2) - 2 freq
eatan (2) - 1 freq
aetan (2) - 3 freq
tion (2) - 1 freq
aeten (2) - 7 freq
toon (2) - 698 freq
tn (2) - 8 freq
tin (2) - 184 freq
ten (2) - 638 freq
toin (2) - 1 freq
etna (2) - 2 freq
atone (2) - 2 freq
SoundEx code - E350
eatin - 154 freq
'eaten - 1 freq
eaten - 31 freq
edwin - 29 freq
'edwin - 4 freq
etten - 35 freq
euidnae - 3 freq
eden - 36 freq
edam - 45 freq
edam' - 1 freq
'edam - 3 freq
eatin' - 2 freq
eidein - 1 freq
etna - 2 freq
eton - 7 freq
ettin - 13 freq
edwun - 1 freq
'edwun - 1 freq
edwina - 1 freq
eetin - 3 freq
eediom - 15 freq
etin - 10 freq
eitan - 1 freq
eithne - 1 freq
eidiom - 2 freq
etioun - 2 freq
eiehteen - 1 freq
eaton - 1 freq
eatan - 1 freq
ehodni - 1 freq
edm - 1 freq
ethno - 1 freq
edmo - 1 freq
ethan - 1 freq
edin - 4 freq
edina - 2 freq
edunn - 2 freq
MetaPhone code - ETN
eatin - 154 freq
'eaten - 1 freq
eaten - 31 freq
etten - 35 freq
aetin - 27 freq
euidnae - 3 freq
aeten - 7 freq
eden - 36 freq
eatin' - 2 freq
eidein - 1 freq
etna - 2 freq
eton - 7 freq
ettin - 13 freq
aetan - 3 freq
eetin - 3 freq
aeteen - 1 freq
etin - 10 freq
eitan - 1 freq
eiehteen - 1 freq
eaton - 1 freq
eatan - 1 freq
edin - 4 freq
edina - 2 freq
edunn - 2 freq
ETON
Time to execute Levenshtein function - 0.191342 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.347263 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027624 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037624 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001044 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.