A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to toenail in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
toenail (0) - 1 freq
taenails (2) - 2 freq
hobnail (2) - 2 freq
tonal (2) - 8 freq
teenage (3) - 21 freq
twentie (3) - 1 freq
tonyip (3) - 1 freq
tendis (3) - 1 freq
donal (3) - 7 freq
o'neil (3) - 1 freq
trail (3) - 57 freq
connal (3) - 1 freq
doenae (3) - 1 freq
tenant (3) - 14 freq
ontil (3) - 53 freq
onabil (3) - 1 freq
donnal (3) - 1 freq
teil (3) - 4 freq
poneil (3) - 3 freq
teitil (3) - 6 freq
total (3) - 96 freq
prevail (3) - 10 freq
toonie (3) - 1 freq
teal (3) - 2 freq
tendit (3) - 19 freq
toenail (0) - 1 freq
tonal (2) - 8 freq
taenails (3) - 2 freq
tingil (4) - 4 freq
tounis (4) - 1 freq
teename (4) - 1 freq
teenas (4) - 1 freq
tonnel (4) - 1 freq
penal (4) - 5 freq
teenaige (4) - 1 freq
teenie (4) - 54 freq
teena (4) - 3 freq
nail (4) - 46 freq
tenniel (4) - 2 freq
snail (4) - 33 freq
tonic (4) - 13 freq
tail (4) - 263 freq
tenable (4) - 1 freq
tentie (4) - 38 freq
toyal (4) - 1 freq
toil (4) - 25 freq
tentily (4) - 22 freq
tencel (4) - 1 freq
toni (4) - 2 freq
total (4) - 96 freq
SoundEx code - T540
tunnel - 85 freq
tummle - 19 freq
thimmle - 4 freq
them'll - 4 freq
toonhill - 1 freq
thinly - 2 freq
twae-mile - 1 freq
timely - 4 freq
tinhalla - 1 freq
tummel - 5 freq
tummile - 1 freq
tenniel - 2 freq
thimel - 4 freq
thim-aal - 1 freq
tonal - 8 freq
twin'll - 1 freq
timmle - 4 freq
time'll - 3 freq
toenail - 1 freq
them-all - 1 freq
thoum-nail - 1 freq
tannahill - 7 freq
tonallie - 1 freq
team'll - 1 freq
thon'll - 1 freq
towneley - 1 freq
tonnel - 1 freq
tamil - 1 freq
ten'll - 1 freq
€˜timely - 1 freq
tommy'll - 1 freq
tam'll - 1 freq
tonyhill - 1 freq
themole - 1 freq
thnl - 1 freq
tanle - 1 freq
tommyle - 1 freq
tommel - 1 freq
MetaPhone code - TNL
donal - 7 freq
tunnel - 85 freq
denial - 14 freq
dinah'll - 9 freq
daniel - 127 freq
doon-low - 1 freq
'daniel - 1 freq
dunloy - 4 freq
dinnle - 2 freq
tenniel - 2 freq
tonal - 8 freq
donnal - 1 freq
danielle - 1 freq
toenail - 1 freq
dunneil - 1 freq
danelaw - 3 freq
tonallie - 1 freq
donella - 5 freq
towneley - 1 freq
dunghill - 1 freq
tonnel - 1 freq
dinnil - 1 freq
€™dinnle - 1 freq
ten'll - 1 freq
€˜daniel - 7 freq
donnell - 1 freq
donnelly - 1 freq
dunollie - 6 freq
daniella - 1 freq
duanalla - 1 freq
tanle - 1 freq
dunli - 1 freq
daniela - 2 freq
TOENAIL
Time to execute Levenshtein function - 0.317586 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.585836 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028328 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.070315 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000950 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.