A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to others in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
others (0) - 52 freq
ithers (1) - 574 freq
other's (1) - 7 freq
thers (1) - 1 freq
nothers (1) - 1 freq
dothers (1) - 5 freq
uthers (1) - 1 freq
otters (1) - 3 freq
owthers (1) - 1 freq
bothers (1) - 9 freq
other (1) - 530 freq
other' (1) - 2 freq
mothers (1) - 9 freq
southers (2) - 1 freq
mother's (2) - 3 freq
othir (2) - 1 freq
thors (2) - 1 freq
thes (2) - 12 freq
ether (2) - 26 freq
dother (2) - 36 freq
pouthers (2) - 1 freq
there (2) - 6720 freq
ther's (2) - 9 freq
theis (2) - 19 freq
hers (2) - 84 freq
others (0) - 52 freq
thers (1) - 1 freq
uthers (1) - 1 freq
ithers (1) - 574 freq
theres (2) - 108 freq
thors (2) - 1 freq
eithers (2) - 1 freq
utheris (2) - 1 freq
theirs (2) - 44 freq
thurs (2) - 16 freq
thars (2) - 2 freq
ithirs (2) - 10 freq
thirs (2) - 41 freq
aithers (2) - 7 freq
mothers (2) - 9 freq
dothers (2) - 5 freq
nothers (2) - 1 freq
other's (2) - 7 freq
owthers (2) - 1 freq
otters (2) - 3 freq
other' (2) - 2 freq
other (2) - 530 freq
bothers (2) - 9 freq
withers (3) - 5 freq
ither' (3) - 1 freq
SoundEx code - O362
others - 52 freq
outricht - 1 freq
other's - 7 freq
otters - 3 freq
ootricht - 6 freq
odours - 3 freq
otherweys - 1 freq
otherwise - 18 freq
ootrageous - 3 freq
ootright - 1 freq
ootraged - 2 freq
otherside's - 1 freq
owthars - 2 freq
owthar's - 5 freq
ootreach - 9 freq
ootrage - 7 freq
outrage - 4 freq
otherwise' - 1 freq
'otters' - 1 freq
ootrekkit - 1 freq
oot-through - 7 freq
ootdoors - 9 freq
outdoors - 2 freq
owthors - 15 freq
owther's - 1 freq
oot-richt - 2 freq
ottirskynnis - 1 freq
otteris - 1 freq
out-drauchtit - 1 freq
outright - 1 freq
outraged - 1 freq
outrageous - 3 freq
ootersyde - 6 freq
ootdoorsy - 3 freq
outdoorsy - 1 freq
owthers - 1 freq
ootthrough - 1 freq
ootriggit - 1 freq
ohtyrqe - 1 freq
otherchrises - 1 freq
MetaPhone code - O0RS
others - 52 freq
other's - 7 freq
owthars - 2 freq
owthar's - 5 freq
owthors - 15 freq
owther's - 1 freq
owthers - 1 freq
OTHERS
other - 530 freq
tither - 202 freq
ither - 3062 freq
others - 52 freq
ithers - 574 freq
Time to execute Levenshtein function - 0.403351 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.688907 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032721 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.075500 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000854 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.