A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to others in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
others (0) - 59 freq
otters (1) - 3 freq
mothers (1) - 9 freq
uthers (1) - 1 freq
ithers (1) - 577 freq
dothers (1) - 5 freq
thers (1) - 1 freq
other (1) - 579 freq
owthers (1) - 1 freq
other' (1) - 2 freq
other's (1) - 10 freq
bothers (1) - 9 freq
nothers (1) - 1 freq
'there (2) - 50 freq
osiers (2) - 1 freq
thes (2) - 12 freq
fathers (2) - 3 freq
ters (2) - 1 freq
'other' (2) - 1 freq
thars (2) - 2 freq
thors (2) - 1 freq
ooers (2) - 11 freq
orders (2) - 51 freq
theis (2) - 19 freq
owther (2) - 5 freq
others (0) - 59 freq
ithers (1) - 577 freq
uthers (1) - 1 freq
thers (1) - 1 freq
ithirs (2) - 10 freq
thirs (2) - 41 freq
thors (2) - 1 freq
aithers (2) - 7 freq
utheris (2) - 1 freq
theirs (2) - 44 freq
thars (2) - 2 freq
thurs (2) - 16 freq
theres (2) - 110 freq
eithers (2) - 1 freq
other's (2) - 10 freq
dothers (2) - 5 freq
other (2) - 579 freq
owthers (2) - 1 freq
other' (2) - 2 freq
mothers (2) - 9 freq
bothers (2) - 9 freq
otters (2) - 3 freq
nothers (2) - 1 freq
there (3) - 6822 freq
southers (3) - 1 freq
SoundEx code - O362
others - 59 freq
outricht - 1 freq
other's - 10 freq
otters - 3 freq
ootricht - 6 freq
odours - 3 freq
otherweys - 1 freq
otherwise - 19 freq
ootrageous - 3 freq
ootright - 1 freq
ootraged - 2 freq
otherside's - 1 freq
owthars - 2 freq
owthar's - 5 freq
ootreach - 9 freq
ootrage - 7 freq
outrage - 4 freq
otherwise' - 1 freq
'otters' - 1 freq
ootrekkit - 1 freq
oot-through - 7 freq
ootdoors - 9 freq
outdoors - 2 freq
owthors - 15 freq
owther's - 1 freq
oot-richt - 2 freq
ottirskynnis - 1 freq
otteris - 1 freq
out-drauchtit - 1 freq
outright - 1 freq
outraged - 1 freq
outrageous - 3 freq
ootersyde - 6 freq
ootdoorsy - 3 freq
outdoorsy - 1 freq
owthers - 1 freq
ootthrough - 1 freq
ootriggit - 1 freq
ohtyrqe - 1 freq
otherchrises - 1 freq
MetaPhone code - O0RS
others - 59 freq
other's - 10 freq
owthars - 2 freq
owthar's - 5 freq
owthors - 15 freq
owther's - 1 freq
owthers - 1 freq
OTHERS
other - 579 freq
tither - 202 freq
ither - 3074 freq
others - 59 freq
ithers - 577 freq
Time to execute Levenshtein function - 0.215456 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.435433 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027696 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041254 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000939 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.