A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to mjr in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
mjr (0) - 23 freq
mer (1) - 8 freq
pjr (1) - 1 freq
mr (1) - 1243 freq
mcr (1) - 1 freq
tjr (1) - 1 freq
mar (1) - 16 freq
njr (1) - 1 freq
jr (1) - 16 freq
mtr (1) - 1 freq
mj (1) - 6 freq
ajr (1) - 1 freq
mor (1) - 15 freq
mir (1) - 5 freq
gjr (1) - 1 freq
oj (2) - 2 freq
mary (2) - 763 freq
mxj (2) - 1 freq
ndr (2) - 1 freq
mp (2) - 39 freq
jc (2) - 10 freq
msy (2) - 1 freq
fja (2) - 1 freq
msw (2) - 1 freq
mid (2) - 60 freq
Double Levenshtein (distance in brackets)
mjr (0) - 23 freq
ajr (2) - 1 freq
mj (2) - 6 freq
mor (2) - 15 freq
mir (2) - 5 freq
majer (2) - 1 freq
mtr (2) - 1 freq
major (2) - 108 freq
gjr (2) - 1 freq
mr (2) - 1243 freq
jr (2) - 16 freq
pjr (2) - 1 freq
mcr (2) - 1 freq
mer (2) - 8 freq
tjr (2) - 1 freq
njr (2) - 1 freq
mar (2) - 16 freq
moj (3) - 2 freq
mear (3) - 3 freq
ajer (3) - 2 freq
amor (3) - 1 freq
marr (3) - 5 freq
jer (3) - 1 freq
jar (3) - 41 freq
murr (3) - 1 freq
SoundEx code - M260
measure - 27 freq
meesure - 2 freq
makar - 97 freq
major - 108 freq
misery - 31 freq
maker - 17 freq
mascara - 3 freq
meesery - 5 freq
mixer - 5 freq
maugre - 34 freq
mucker - 11 freq
meisure - 20 freq
moger - 1 freq
'major - 1 freq
mcguire - 4 freq
meisur - 17 freq
mockery - 9 freq
meagre - 9 freq
maigre - 1 freq
mauger - 8 freq
machair - 3 freq
mowser - 12 freq
micro - 2 freq
mckerrow - 2 freq
makker - 7 freq
makar' - 1 freq
maisser - 1 freq
majer - 1 freq
mayjer - 1 freq
misure - 8 freq
m'grew - 1 freq
makeower - 1 freq
meissure - 1 freq
micra - 1 freq
masseur - 2 freq
maguire - 26 freq
miser - 4 freq
mazr - 1 freq
mizzour - 3 freq
miesjir - 1 freq
mouser - 3 freq
mizzer - 4 freq
mcr - 1 freq
'micro' - 3 freq
makkar - 3 freq
maaker - 1 freq
maeshur - 1 freq
mcgraw - 1 freq
missure - 1 freq
émigré - 1 freq
maskara - 1 freq
maisure - 1 freq
miserie - 5 freq
megara - 3 freq
macro - 2 freq
maager - 1 freq
'makar - 1 freq
meiserie - 1 freq
macrae - 5 freq
machar - 9 freq
miscarry - 2 freq
misyur - 1 freq
mcrae - 4 freq
mcwhir - 1 freq
macwhir - 1 freq
mcquarry - 2 freq
maisrie - 2 freq
measuir - 1 freq
mizzure - 1 freq
mjr - 23 freq
mccr - 1 freq
mikeyr - 1 freq
moocher - 2 freq
mogre - 1 freq
mcgerry - 1 freq
meysr - 1 freq
mjchr - 1 freq
macari - 1 freq
MetaPhone code - MJR
major - 108 freq
moger - 1 freq
'major - 1 freq
mauger - 8 freq
majer - 1 freq
mayjer - 1 freq
maager - 1 freq
mjr - 23 freq
MJR
Time to execute Levenshtein function - 0.216359 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
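As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. The function name `levenshtein` is chosen for this example and is not necessarily how the site computes its distances.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits (replace, insert, delete) turning a into b."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a with char_b, or keep it
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("mjr", "mer", "mr", "mary"):
        print(word, levenshtein("mjr", word))
```

Run against a few words from the list above, this gives the same distances as the page: 0 for mjr, 1 for mer and mr, 2 for mary.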
Time to execute Double Levenshtein function - 0.393198 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
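A small Python sketch of this idea follows. It assumes that "vowels" means the letters a, e, i, o and u; the page does not say which letters are stripped. The names `strip_vowels` and `double_levenshtein` are invented for this example.

```python
VOWELS = set("aeiou")  # assumption: the page does not list which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with its vowels removed, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("major", "mer", "moj", "mear"):
        print(word, double_levenshtein("mjr", word))
```

Under that assumption the sketch agrees with the scores listed above: major and mer score 2, moj and mear score 3. Note how major ties with mer here even though its plain Levenshtein distance is larger, because its consonants match mjr exactly.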
Time to execute SoundEx function - 0.032815 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
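To show how such a code is built, here is a sketch of the common American Soundex rules in Python: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. Skipping characters outside a-z (apostrophes, accented letters) is an assumption made here so that entries such as 'major and émigré come out as listed; the site's own implementation may handle them differently.

```python
# Digit assigned to each consonant group; vowels, y, h and w get no digit.
CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in group:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code such as 'M260'."""
    # Assumption: anything outside a-z is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit may follow it
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("mjr", "major", "measure", "mcguire"):
        print(word, soundex(word))
```

All four words in the usage example map to M260, matching the code shown for mjr above.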
Time to execute MetaPhone function - 0.038372 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000874 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
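A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are invented for illustration and are not taken from the site's actual lexicon; `LEXICON` and `lookup_lemma` are hypothetical names.

```python
from typing import Optional

# Hypothetical hand-curated entries: spelling variant -> lemma.
LEXICON = {
    "majer": "major",
    "mayjer": "major",
    "meisure": "measure",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Mayjer"))  # a covered variant
    print(lookup_lemma("ahint"))   # not in this toy lexicon, so None
```

A single dictionary lookup does no string comparison against the rest of the corpus, which is consistent with this method having the shortest execution time reported above.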