A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to mcholas in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
mcholas (0) - 1 freq
scholls (2) - 1 freq
colas (2) - 1 freq
scholars (2) - 35 freq
nicholas (2) - 4 freq
mcthomas (2) - 1 freq
scholar (2) - 21 freq
mccoys (3) - 3 freq
scolds (3) - 2 freq
cools (3) - 6 freq
mccoy's (3) - 7 freq
mccoll (3) - 9 freq
acroas (3) - 4 freq
chokan (3) - 1 freq
moolah (3) - 1 freq
echoes (3) - 26 freq
colts (3) - 2 freq
chowks (3) - 9 freq
ochils (3) - 8 freq
hols (3) - 6 freq
choiss (3) - 2 freq
tholes (3) - 17 freq
chos (3) - 1 freq
chokes (3) - 4 freq
ghlas (3) - 1 freq
mcholas (0) - 1 freq
nicholas (3) - 4 freq
mochles (3) - 7 freq
muchas (4) - 1 freq
nichols (4) - 27 freq
schules (4) - 3 freq
ochils (4) - 8 freq
michaels (4) - 1 freq
colas (4) - 1 freq
scholls (4) - 1 freq
scholars (4) - 35 freq
mcthomas (4) - 1 freq
schools (4) - 44 freq
scholar (4) - 21 freq
schol (5) - 2 freq
chopes (5) - 1 freq
mactomas (5) - 1 freq
muchly (5) - 2 freq
machetes (5) - 1 freq
machars (5) - 4 freq
schons (5) - 1 freq
colls (5) - 1 freq
chills (5) - 7 freq
childs (5) - 1 freq
chocs (5) - 1 freq
SoundEx code - M242
misluck - 3 freq
muscles - 34 freq
mucklest - 8 freq
michael's - 8 freq
mossgeil's - 2 freq
mcleish's - 1 freq
missiles - 6 freq
meek-like - 1 freq
measles - 5 freq
muckle's - 13 freq
muslcians - 1 freq
maclahose - 1 freq
missals - 1 freq
michaels - 1 freq
mcholas - 1 freq
mauchless - 2 freq
mussels - 5 freq
mucklegubber's - 1 freq
mucklegubber - 2 freq
michelle's - 4 freq
musles - 1 freq
mukkil's - 1 freq
muckle-ish - 1 freq
mccolgan - 3 freq
maikless - 2 freq
mccolgan's - 1 freq
maclaughlan's - 1 freq
maxwell's - 1 freq
mcculloch - 6 freq
mcculloch's - 1 freq
miklés - 1 freq
macklike - 1 freq
mcleish - 5 freq
mizzles - 1 freq
macilliosa - 1 freq
'michael's - 1 freq
mcalister - 4 freq
mochles - 7 freq
moguls - 1 freq
mislikit - 2 freq
mccleish - 2 freq
muckle-scale - 1 freq
muggles - 3 freq
mashles - 1 freq
maze-like - 1 freq
meiklejohn - 1 freq
muckle-great - 1 freq
michaelswood - 4 freq
€œmichaelswood - 1 freq
mclachlan - 3 freq
miscalculated - 1 freq
muckles - 1 freq
mculkkzke - 1 freq
mcculluch - 1 freq
michaeljmarra - 1 freq
michaelgove - 5 freq
michaelgauld - 2 freq
maxwellsnp - 1 freq
mslizcee - 1 freq
mickgallowgate - 1 freq
michaellcrick - 2 freq
michaelglasper - 1 freq
mcallister - 5 freq
macauleyclare - 2 freq
mclaugh - 5 freq
maskless - 1 freq
mzxlq - 2 freq
MetaPhone code - MXLS
michael's - 8 freq
michaels - 1 freq
mcholas - 1 freq
mauchless - 2 freq
michelle's - 4 freq
'michael's - 1 freq
mochles - 7 freq
mashles - 1 freq
MCHOLAS
Time to execute Levenshtein function - 0.246183 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.453633 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027930 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039355 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001073 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.