A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to mccormack in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
mccormack (0) - 5 freq
mccormick (1) - 1 freq
cormack (2) - 5 freq
maccormick (2) - 2 freq
mcconnach (3) - 1 freq
cmack (4) - 1 freq
mccready (4) - 6 freq
accordain (4) - 2 freq
crack (4) - 347 freq
correck (4) - 3 freq
lukecormack (4) - 2 freq
moorpark (4) - 10 freq
mccracken (4) - 1 freq
kermack (4) - 4 freq
craack (4) - 1 freq
mccolgan (4) - 3 freq
accordance (4) - 3 freq
contack (4) - 37 freq
cloack (4) - 20 freq
corpach (4) - 1 freq
accuracy (4) - 7 freq
cossack (4) - 8 freq
mermaid (5) - 3 freq
mcdermid (5) - 1 freq
moorcocks (5) - 1 freq
mccormack (0) - 5 freq
mccormick (1) - 1 freq
maccormick (2) - 2 freq
cormack (4) - 5 freq
lukecormack (6) - 2 freq
mccracken (6) - 1 freq
mcconnach (6) - 1 freq
kermack (7) - 4 freq
craack (7) - 1 freq
mccready (7) - 6 freq
accuracy (7) - 7 freq
mccrumb (7) - 2 freq
crack (7) - 347 freq
correck (7) - 3 freq
cmack (7) - 1 freq
rachcromack (7) - 1 freq
mccrone (8) - 5 freq
carmic (8) - 2 freq
tcmck (8) - 1 freq
mcculluch (8) - 1 freq
occurence (8) - 1 freq
carrick (8) - 6 freq
edcrick (8) - 5 freq
mccarthy (8) - 3 freq
craick (8) - 1 freq
SoundEx code - M265
mushroom - 51 freq
migrant's - 1 freq
mushrooms - 5 freq
meisurin - 3 freq
mushrump - 2 freq
measurements - 1 freq
measuring - 2 freq
mushroom's - 1 freq
miscreant - 2 freq
measurin - 4 freq
mizzerment - 1 freq
mizzerments - 1 freq
maccormick - 2 freq
mccormick - 1 freq
macaroni - 15 freq
mccormack - 5 freq
measuirin - 1 freq
micron - 1 freq
microns - 1 freq
megrim - 4 freq
megrims - 1 freq
migrants - 6 freq
misrememberin - 1 freq
macarena - 1 freq
macaroni-cheese - 1 freq
maesharan - 1 freq
mccrumb - 2 freq
missurement - 1 freq
mauk-wirm - 1 freq
misgrown - 1 freq
miscryin - 1 freq
maccruimein - 1 freq
maccrimmon - 1 freq
mascorn - 5 freq
mascorns - 1 freq
mccrone - 5 freq
measurement - 3 freq
migrant - 6 freq
misremember - 1 freq
micronation - 1 freq
migraine - 3 freq
miscreants - 1 freq
migraines - 1 freq
miscairryin - 1 freq
mackeerin - 1 freq
meisurements - 1 freq
mazerment - 1 freq
maigrant - 1 freq
majormcbloodnok - 2 freq
macaroni's - 1 freq
mishearing - 1 freq
micromoth - 1 freq
MetaPhone code - MKKRMK
maccormick - 2 freq
mccormick - 1 freq
mccormack - 5 freq
MCCORMACK
Time to execute Levenshtein function - 0.204843 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.389767 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027716 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038041 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000924 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.