A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to major in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
major (0) - 109 freq
'major (1) - 1 freq
mayor (1) - 3 freq
manor (1) - 4 freq
majer (1) - 1 freq
mjr (2) - 23 freq
nahor (2) - 3 freq
lajos (2) - 2 freq
amor (2) - 1 freq
major's (2) - 4 freq
maiur (2) - 1 freq
bator (2) - 1 freq
bajoe (2) - 2 freq
favor (2) - 1 freq
majig (2) - 1 freq
mazr (2) - 1 freq
mayo (2) - 3 freq
tabor (2) - 1 freq
mano (2) - 1 freq
majorca (2) - 8 freq
mano' (2) - 1 freq
ajod (2) - 1 freq
mair (2) - 6172 freq
majored (2) - 1 freq
mayjer (2) - 1 freq
major (0) - 109 freq
majer (1) - 1 freq
mayjer (2) - 1 freq
mjr (2) - 23 freq
manor (2) - 4 freq
mayor (2) - 3 freq
'major (2) - 1 freq
ajer (3) - 2 freq
motor (3) - 164 freq
mar (3) - 16 freq
wajer (3) - 1 freq
marr (3) - 5 freq
minor (3) - 15 freq
maer (3) - 3 freq
mailr (3) - 1 freq
mater (3) - 16 freq
ajr (3) - 1 freq
maner (3) - 4 freq
mojo (3) - 4 freq
ajar (3) - 4 freq
marjory (3) - 1 freq
matr (3) - 7 freq
malir (3) - 1 freq
majorly (3) - 2 freq
majin (3) - 1 freq
SoundEx code - M260
measure - 27 freq
meesure - 2 freq
makar - 98 freq
major - 109 freq
misery - 31 freq
maker - 18 freq
mascara - 3 freq
meesery - 5 freq
mixer - 5 freq
maugre - 34 freq
mucker - 11 freq
meisure - 20 freq
moger - 1 freq
'major - 1 freq
mcguire - 4 freq
meisur - 18 freq
mockery - 9 freq
meagre - 9 freq
maigre - 1 freq
mauger - 8 freq
machair - 3 freq
mowser - 12 freq
micro - 3 freq
mckerrow - 2 freq
makker - 7 freq
makar' - 1 freq
maisser - 1 freq
majer - 1 freq
mayjer - 1 freq
misure - 8 freq
m'grew - 1 freq
makeower - 1 freq
meissure - 1 freq
micra - 1 freq
masseur - 2 freq
maguire - 26 freq
miser - 4 freq
mazr - 1 freq
mizzour - 3 freq
miesjir - 1 freq
mouser - 3 freq
mizzer - 4 freq
mcr - 1 freq
'micro' - 3 freq
makkar - 3 freq
maaker - 1 freq
maeshur - 1 freq
mcgraw - 1 freq
missure - 1 freq
émigré - 1 freq
maskara - 1 freq
maisure - 1 freq
miserie - 5 freq
megara - 3 freq
macro - 2 freq
maager - 1 freq
'makar - 1 freq
meiserie - 1 freq
macrae - 5 freq
machar - 9 freq
miscarry - 2 freq
misyur - 1 freq
mcrae - 4 freq
mcwhir - 1 freq
macwhir - 1 freq
mcquarry - 2 freq
maisrie - 2 freq
measuir - 1 freq
mizzure - 1 freq
mjr - 23 freq
mccr - 1 freq
mikeyr - 1 freq
moocher - 2 freq
mogre - 1 freq
mcgerry - 1 freq
meysr - 1 freq
mjchr - 1 freq
macari - 1 freq
MetaPhone code - MJR
major - 109 freq
moger - 1 freq
'major - 1 freq
mauger - 8 freq
majer - 1 freq
mayjer - 1 freq
maager - 1 freq
mjr - 23 freq
MAJOR
Time to execute Levenshtein function - 0.606857 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.160568 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.037815 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.103030 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001139 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.