A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to mcalister in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
mcalister (0) - 4 freq
mcallister (1) - 5 freq
canister (2) - 3 freq
alister (2) - 1 freq
mcenister (2) - 1 freq
maister (2) - 216 freq
clister (2) - 1 freq
glaister (3) - 2 freq
plister (3) - 1 freq
maistert (3) - 3 freq
cairter (3) - 1 freq
scaiter (3) - 1 freq
billister (3) - 1 freq
blister (3) - 3 freq
glister (3) - 8 freq
cluster (3) - 5 freq
meenister (3) - 145 freq
meinister (3) - 34 freq
caster (3) - 2 freq
bazmcalister (3) - 1 freq
banister (3) - 2 freq
plaister (3) - 20 freq
minister (3) - 67 freq
waister (3) - 1 freq
maitter (3) - 383 freq
Double Levenshtein (distance in brackets)
mcalister (0) - 4 freq
mcallister (2) - 5 freq
clister (3) - 1 freq
mcenister (3) - 1 freq
molester (4) - 1 freq
mecnister (4) - 1 freq
cluster (4) - 5 freq
mcmaster (4) - 2 freq
cloister (4) - 2 freq
canister (4) - 3 freq
alister (4) - 1 freq
maister (4) - 216 freq
master (5) - 21 freq
minister (5) - 67 freq
allister (5) - 1 freq
ocklester (5) - 1 freq
mayster (5) - 1 freq
moister (5) - 7 freq
maistery (5) - 1 freq
mister (5) - 84 freq
slaister (5) - 8 freq
lister (5) - 6 freq
plaister (5) - 20 freq
billister (5) - 1 freq
meenister (5) - 145 freq
SoundEx code - M242
misluck - 3 freq
muscles - 34 freq
mucklest - 8 freq
michael's - 8 freq
mossgeil's - 2 freq
mcleish's - 1 freq
missiles - 6 freq
meek-like - 1 freq
measles - 5 freq
muckle's - 13 freq
muslcians - 1 freq
maclahose - 1 freq
missals - 1 freq
michaels - 1 freq
mcholas - 1 freq
mauchless - 2 freq
mussels - 5 freq
mucklegubber's - 1 freq
mucklegubber - 2 freq
michelle's - 4 freq
musles - 1 freq
mukkil's - 1 freq
muckle-ish - 1 freq
mccolgan - 3 freq
maikless - 2 freq
mccolgan's - 1 freq
maclaughlan's - 1 freq
maxwell's - 1 freq
mcculloch - 6 freq
mcculloch's - 1 freq
miklés - 1 freq
macklike - 1 freq
mcleish - 5 freq
mizzles - 1 freq
macilliosa - 1 freq
'michael's - 1 freq
mcalister - 4 freq
mochles - 7 freq
moguls - 1 freq
mislikit - 2 freq
mccleish - 2 freq
muckle-scale - 1 freq
muggles - 3 freq
mashles - 1 freq
maze-like - 1 freq
meiklejohn - 1 freq
muckle-great - 1 freq
michaelswood - 4 freq
“michaelswood - 1 freq
mclachlan - 3 freq
miscalculated - 1 freq
muckles - 1 freq
mculkkzke - 1 freq
mcculluch - 1 freq
michaeljmarra - 1 freq
michaelgove - 5 freq
michaelgauld - 2 freq
maxwellsnp - 1 freq
mslizcee - 1 freq
mickgallowgate - 1 freq
michaellcrick - 2 freq
michaelglasper - 1 freq
mcallister - 5 freq
macauleyclare - 2 freq
mclaugh - 5 freq
maskless - 1 freq
mzxlq - 2 freq
MetaPhone code - MKLSTR
mcalister - 4 freq
mcallister - 5 freq
Manually curated - MCALISTER
Time to execute Levenshtein function - 0.433383 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
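As an illustration only (this is not the site's own code), the distance can be computed with the standard dynamic-programming method. The function name `levenshtein` below is chosen for this sketch.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("mcalister", "mcallister", "canister", "maister"):
        print(word, levenshtein("mcalister", word))
```

Run against the query word, this gives 0, 1, 2 and 2 for the four words printed, matching the bracketed distances in the Levenshtein list.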
Time to execute Double Levenshtein function - 0.871299 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
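A minimal sketch of the idea, assuming the vowels removed are a, e, i, o and u (the site does not say which letters it treats as vowels). The names `double_levenshtein` and `strip_vowels` are made up for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def strip_vowels(word: str) -> str:
    """Return the word with every assumed vowel removed."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


if __name__ == "__main__":
    for word in ("mcallister", "canister", "master"):
        print(word, double_levenshtein("mcalister", word))
```

Under that vowel assumption the sketch reproduces the listed scores for these words: mcallister 2, canister 4 and master 5.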
Time to execute SoundEx function - 0.080394 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
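For illustration, here is a sketch of the common American Soundex rules: keep the first letter, code the following consonants as digits, skip vowels, collapse adjacent equal codes, and pad to four characters. It assumes that non-letters such as apostrophes are simply dropped. Whether the site's function handles every edge case the same way is not confirmed here.

```python
# Digit codes for consonants; vowels, h, w and y have no code.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. a letter followed by three digits."""
    # Assumption: anything that is not a-z (apostrophes, hyphens, accents) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = _CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code counts again
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("mcalister", "mcallister", "michael's", "muscles"):
        print(word, soundex(word))
```

All four words printed above come out as M242, the code shown for the query word.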
Time to execute MetaPhone function - 0.044222 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, so it does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000917 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
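A lookup of this kind can be sketched as a plain dictionary from word form to lemma. Only the mcalister entry comes from the results above; the other entries and the names `LEXICON` and `lookup_lemma` are hypothetical, added purely to show the shape of the table.

```python
from typing import Optional

# Hand-built table: word form -> lemma.
# Only "mcalister" is taken from the page; the rest are hypothetical examples.
LEXICON = {
    "mcalister": "MCALISTER",
    "meenister": "MINISTER",   # hypothetical entry
    "meinister": "MINISTER",   # hypothetical entry
    "minister": "MINISTER",    # hypothetical entry
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("mcalister"))  # curated entry
    print(lookup_lemma("ablow"))      # not in this toy table, so None
```

Because it is a single dictionary access, this kind of lookup is very cheap, which fits with it having the shortest execution time of the five methods.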