A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to anaith in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
anaith (0) - 6 freq
aneith (1) - 8 freq
anaeth (1) - 1 freq
anits (2) - 2 freq
saith (2) - 2 freq
angieh (2) - 1 freq
nait (2) - 1 freq
inwith (2) - 11 freq
braith (2) - 215 freq
skaith (2) - 50 freq
awaits (2) - 8 freq
agait (2) - 1 freq
raith (2) - 9 freq
'faith (2) - 1 freq
daith (2) - 286 freq
await (2) - 7 freq
anagh (2) - 1 freq
sneith (2) - 2 freq
laith (2) - 25 freq
praith (2) - 1 freq
antit (2) - 1 freq
graith (2) - 128 freq
aneath (2) - 105 freq
anithr (2) - 2 freq
anaht (2) - 1 freq
anaith (0) - 6 freq
aneith (1) - 8 freq
anaeth (1) - 1 freq
aneath (2) - 105 freq
nith (2) - 12 freq
aneth (2) - 177 freq
baith (3) - 1582 freq
anithr (3) - 2 freq
paith (3) - 4 freq
nairth (3) - 1 freq
anneith (3) - 3 freq
waith (3) - 2 freq
faith (3) - 161 freq
neth (3) - 5 freq
inouth (3) - 1 freq
noth (3) - 5 freq
neath (3) - 4 freq
aith (3) - 23 freq
inooth (3) - 1 freq
anat (3) - 1 freq
anotha (3) - 1 freq
inwith (3) - 11 freq
raith (3) - 9 freq
angieh (3) - 1 freq
saith (3) - 2 freq
SoundEx code - A530
and - 24890 freq
ahint - 743 freq
ayont - 272 freq
'and - 179 freq
aneth - 177 freq
aneath - 105 freq
ahent - 100 freq
aimed - 15 freq
andy - 151 freq
annoyed - 45 freq
aunt - 75 freq
amid - 9 freq
'aneth - 1 freq
aunty - 42 freq
auntie - 157 freq
awned - 27 freq
'andy' - 1 freq
ained - 19 freq
ane-eed - 1 freq
awnt - 1 freq
ahind - 11 freq
ahin't - 2 freq
annuity - 2 freq
aneath' - 1 freq
andie - 1 freq
aeneid - 8 freq
aneith - 8 freq
anneith - 3 freq
aunte - 1 freq
amn't - 1 freq
'auntie - 1 freq
anyday - 3 freq
ain't - 9 freq
awnit - 2 freq
annoyt - 5 freq
aint - 13 freq
anyd - 2 freq
aimeth - 1 freq
ant - 8 freq
a'waant - 1 freq
amd - 2 freq
'aunty - 4 freq
anddy - 1 freq
aaind - 1 freq
aind - 1 freq
an'the - 1 freq
aintae - 4 freq
amdee - 1 freq
anaeth - 1 freq
aund - 2 freq
anti - 11 freq
ande - 4 freq
aaned - 1 freq
'annoyed - 1 freq
anaith - 6 freq
ane-twa - 4 freq
€˜aunt - 1 freq
€˜and - 14 freq
anet - 1 freq
€”and - 8 freq
€œayont - 1 freq
€œand - 15 freq
€¦and - 2 freq
ahmed - 6 freq
aand - 1 freq
and-aa - 1 freq
anotha - 1 freq
aantie - 2 freq
€˜annoyed - 1 freq
€˜ayont - 1 freq
amitie - 5 freq
amity - 2 freq
€™and - 2 freq
annette - 1 freq
anto - 1 freq
“and - 1 freq
ando - 1 freq
aaaaaand - 1 freq
anoot - 1 freq
annewitha - 1 freq
'aunty' - 1 freq
anaht - 1 freq
anat - 1 freq
andyh - 1 freq
ante - 1 freq
andÂ… - 1 freq
ainÂ’t - 1 freq
MetaPhone code - AN0
aneth - 177 freq
aneath - 105 freq
'aneth - 1 freq
aneath' - 1 freq
aneith - 8 freq
anneith - 3 freq
an'the - 1 freq
anaeth - 1 freq
anaith - 6 freq
anotha - 1 freq
ANAITH
Time to execute Levenshtein function - 0.180705 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.348869 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027335 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036485 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000849 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.