A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to avatar in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
avatar (0) - 2 freq
avatars (1) - 3 freq
altar (2) - 22 freq
watar (2) - 1 freq
waaatar (2) - 1 freq
waatar (2) - 1 freq
attar (2) - 1 freq
vaar (2) - 1 freq
avaa (2) - 30 freq
aaltar (2) - 2 freq
agata (2) - 2 freq
qatar (2) - 9 freq
votar (2) - 1 freq
awaar (2) - 20 freq
astair (3) - 1 freq
aaeam (3) - 2 freq
pentar (3) - 1 freq
aifter (3) - 339 freq
votars (3) - 4 freq
alter (3) - 15 freq
aetan (3) - 3 freq
plater (3) - 1 freq
at'r (3) - 1 freq
ahaa (3) - 1 freq
tamar (3) - 2 freq
avatar (0) - 2 freq
votar (2) - 1 freq
avatars (2) - 3 freq
qatar (3) - 9 freq
voter (3) - 11 freq
aaltar (3) - 2 freq
vaar (3) - 1 freq
watar (3) - 1 freq
altar (3) - 22 freq
waatar (3) - 1 freq
waaatar (3) - 1 freq
attar (3) - 1 freq
vital (4) - 31 freq
aaltir (4) - 1 freq
dater (4) - 1 freq
atr (4) - 1 freq
aftur (4) - 1 freq
hatr (4) - 3 freq
orator (4) - 2 freq
lavatory (4) - 5 freq
avera (4) - 1 freq
natur (4) - 24 freq
water (4) - 252 freq
notar (4) - 2 freq
aftir (4) - 1 freq
SoundEx code - A136
after - 352 freq
aifter - 339 freq
aifterneen - 26 freq
aifterwart - 1 freq
aifter-life - 1 freq
afternoon - 103 freq
afterwards - 14 freq
aifternuin - 1 freq
aifternuins - 1 freq
after-shave - 2 freq
aiftertimes - 4 freq
aafaither - 1 freq
afterstang - 1 freq
after-kin - 1 freq
aftershave - 4 freq
affatr - 1 freq
afterneen - 6 freq
after-school - 1 freq
afterward - 3 freq
'afternoon - 1 freq
aifter-oors - 1 freq
aifterthoucht - 1 freq
afternoons - 1 freq
afternane - 1 freq
aftur - 1 freq
afturnoon - 1 freq
abderus - 2 freq
aff-there - 1 freq
aftir - 1 freq
aftairthocht - 1 freq
€˜aifter - 1 freq
€˜afternoon - 1 freq
aff-drawn - 1 freq
aifterwards - 3 freq
avatars - 3 freq
avatar - 2 freq
aiftir - 1 freq
aefter - 4 freq
aifterhins - 2 freq
aifterhin - 2 freq
€œafter - 3 freq
afternuin - 2 freq
aifternoon - 3 freq
aftermorning - 1 freq
afters - 3 freq
abother - 1 freq
'afters' - 1 freq
MetaPhone code - AFTR
after - 352 freq
aifter - 339 freq
affatr - 1 freq
aftur - 1 freq
aftir - 1 freq
€˜aifter - 1 freq
avatar - 2 freq
aiftir - 1 freq
€œafter - 3 freq
AVATAR
Time to execute Levenshtein function - 0.258777 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.400566 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029076 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037105 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000840 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.