A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to alexmassie in Corpus

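Results are grouped by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. A number in brackets is the distance from alexmassie.

Levenshtein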
alexmassie (0) - 3 freq
alessia (4) - 2 freq
glessie (4) - 3 freq
cramassie (4) - 1 freq
alexkane (4) - 1 freq
embassie (4) - 1 freq
massie (4) - 3 freq
lassie (4) - 790 freq
leadsie (4) - 1 freq
alexrae (4) - 1 freq
germanie (5) - 4 freq
excaise (5) - 6 freq
blemishis (5) - 1 freq
palliasse (5) - 11 freq
beatsie (5) - 1 freq
alexandra (5) - 7 freq
glessis (5) - 5 freq
mansie (5) - 22 freq
alikeapie (5) - 1 freq
glesgie (5) - 6 freq
jeemsie (5) - 22 freq
creashie (5) - 2 freq
leemartin (5) - 1 freq
lemsip (5) - 1 freq
aless (5) - 26 freq

Double Levenshtein
alexmassie (0) - 3 freq
alessia (6) - 2 freq
lassie (6) - 790 freq
massie (6) - 3 freq
cramassie (6) - 1 freq
aless (7) - 26 freq
mossie (7) - 4 freq
ramasse (7) - 2 freq
lassi (7) - 3 freq
paulajmossie (7) - 1 freq
almaist (7) - 24 freq
drumossie (7) - 2 freq
lemsip (7) - 1 freq
almas' (7) - 1 freq
leadsie (7) - 1 freq
embassie (7) - 1 freq
glessie (7) - 3 freq
lassies (7) - 308 freq
alexrae (7) - 1 freq
masse (7) - 1 freq
lossie (7) - 7 freq
lemans (7) - 1 freq
alexkane (7) - 1 freq
less (8) - 552 freq
lass (8) - 496 freq
SoundEx code - A425
alchemy - 3 freq
alison - 410 freq
alzheimer's - 16 freq
alison's - 11 freq
alson - 1 freq
'alzheimer's - 1 freq
alexander - 76 freq
allegiance - 7 freq
alexander's - 2 freq
alignment - 2 freq
al'seen - 1 freq
alchemist - 2 freq
alexandra - 7 freq
align - 2 freq
alisaunder's - 1 freq
allusion - 2 freq
algonquin - 1 freq
aligned - 2 freq
alignit - 1 freq
alkin - 1 freq
alleghenies - 1 freq
alcan-foil - 1 freq
alexender - 1 freq
alkemical - 1 freq
aelic-and-scots-scottish-languages-bill - 1 freq
alicante - 1 freq
allegiances - 1 freq
alzheimer - 2 freq
alexandrine - 1 freq
ailison - 1 freq
allusions - 1 freq
alokxnxpxi - 1 freq
alexandra's - 1 freq
alisonharriso - 1 freq
allisonmorris - 2 freq
alexmaskeymla - 1 freq
alcham - 1 freq
alexkane - 1 freq
alexamichmusic - 1 freq
alisonsdiary - 2 freq
aeylskins - 1 freq
alexandria - 1 freq
allusionistshow - 3 freq
alisonmcfar - 3 freq
alexunleashed - 1 freq
alexanderso - 1 freq
alisonthewliss - 2 freq
alexmassie - 3 freq
alicemarramusic - 1 freq
alzheimerssoc - 1 freq
alexmacleod - 1 freq
alzheimers - 2 freq
alexand - 2 freq
alexandrabulat - 1 freq
alisonmoyet - 1 freq
alexneilsnp - 1 freq
allisonpearson - 1 freq
alexandermcrow - 1 freq
alisonhendry - 1 freq
alisonmccaffer - 3 freq
MetaPhone code - ALKSMS
alexmassie - 3 freq
Manually curated - ALEXMASSIE
Time to execute Levenshtein function - 0.434384 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
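To make the definition concrete, here is a minimal Python sketch of the classic dynamic-programming calculation. It is an illustration only, not the code behind this page; the function name `levenshtein` is chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions or
    replacements needed to turn a into b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("alexmassie", "lassie"))  # 4, as in the list above
print(levenshtein("alexmassie", "massie"))  # 4
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.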
Time to execute Double Levenshtein function - 0.821692 milliseconds
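In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants. For example, lassie is at distance 4 on the full words and 6 here, so the vowel-free comparison contributes 2.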
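The doubled score can be sketched in Python as below. This is a guess at the approach from the description and the scores listed, not the site's own code. In particular, treating exactly a, e, i, o and u as the vowels is an assumption, and the names `strip_vowels` and `double_levenshtein` are invented for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the site's vowel set is not documented


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("alexmassie", "lassie"))  # 4 + 2 = 6
```

With this vowel set the sketch reproduces the scores shown above for lassie, massie and alessia (all 6).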
Time to execute SoundEx function - 0.079345 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
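As an illustration of how such a code is built, here is a minimal Python sketch of the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. The implementation used on this page is not shown, so this is a sketch under that assumption, and the name `soundex` is chosen for the example.

```python
# Digit assigned to each consonant; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, e.g. A425 for alexmassie."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit counts again
    return result.ljust(4, "0")


print(soundex("alexmassie"))  # A425
print(soundex("alison"))      # A425
```

Because only the first letter and the first three coded consonants survive, long and unrelated words such as alchemy and alexander end up with the same code as alexmassie.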
Time to execute MetaPhone function - 0.038691 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000911 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas, including obvious typos and spelling variations. Not all words are covered.
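A lookup of this kind can be sketched in Python as a plain dictionary. The structure and the names `LEXICON` and `lookup_lemma` are assumptions made for the example. Only the alexmassie entry comes from the output above; the other entries are hypothetical.

```python
from typing import Optional

# Hand-built table mapping a surface form to its lemma.
LEXICON = {
    "alexmassie": "ALEXMASSIE",  # taken from the result shown on this page
    "lassie": "LASSIE",          # hypothetical entry
    "lassies": "LASSIE",         # hypothetical entry: plural mapped to lemma
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("alexmassie"))  # ALEXMASSIE
print(lookup_lemma("sonsie"))      # None: not in this toy table
```

A dictionary lookup does no string comparison against the rest of the corpus, which fits with this method having the shortest execution time reported above.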