A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to maup in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in brackets
maup (0) - 1 freq
maum (1) - 2 freq
maus (1) - 1 freq
map (1) - 91 freq
maux (1) - 3 freq
macp (1) - 1 freq
maud (1) - 10 freq
maut (1) - 12 freq
mup (1) - 2 freq
mauk (1) - 5 freq
gaup (1) - 4 freq
caup (1) - 11 freq
jaup (1) - 3 freq
maun (1) - 1163 freq
aup (1) - 1 freq
taul (2) - 11 freq
yauk (2) - 3 freq
baul (2) - 2 freq
tup (2) - 6 freq
wasp (2) - 5 freq
aug (2) - 6 freq
mask (2) - 70 freq
'yup (2) - 1 freq
mams (2) - 5 freq
maw (2) - 392 freq
Double Levenshtein - distance in brackets
maup (0) - 1 freq
mup (1) - 2 freq
map (1) - 91 freq
aup (2) - 1 freq
maun (2) - 1163 freq
amp (2) - 10 freq
meep (2) - 2 freq
moip (2) - 2 freq
mp (2) - 39 freq
mop (2) - 16 freq
jaup (2) - 3 freq
mep (2) - 13 freq
maus (2) - 1 freq
maux (2) - 3 freq
maum (2) - 2 freq
caup (2) - 11 freq
macp (2) - 1 freq
maut (2) - 12 freq
mauk (2) - 5 freq
gaup (2) - 4 freq
maud (2) - 10 freq
makeup (3) - 12 freq
mul (3) - 2 freq
sup (3) - 53 freq
maer (3) - 3 freq
SoundEx code - M100
mibbe - 372 freq
'mibbe - 24 freq
mebbe - 692 freq
move - 314 freq
muive - 26 freq
mbe - 5 freq
mayhap - 3 freq
maybe - 428 freq
mob - 26 freq
map - 91 freq
mop - 16 freq
meeve - 17 freq
mibbie - 104 freq
mibby - 27 freq
muve - 20 freq
mp - 39 freq
mebbie - 6 freq
mebbee - 1 freq
mibbee - 7 freq
movie - 15 freq
moufae - 2 freq
mmph - 2 freq
m-mebbe - 1 freq
mibee - 16 freq
'mayhap - 1 freq
mappa - 1 freq
moby - 2 freq
moov - 3 freq
mappey - 2 freq
maybee - 3 freq
'maybe - 5 freq
'meep - 2 freq
meep - 2 freq
'mibbie - 3 freq
mafia - 6 freq
mef - 2 freq
'move - 2 freq
moofu - 4 freq
maave - 4 freq
mibey - 1 freq
mibe - 3 freq
moave - 1 freq
maybae - 14 freq
maif - 1 freq
mauve - 4 freq
mi'bae - 1 freq
moofae - 1 freq
maup - 1 freq
muv - 1 freq
moab - 2 freq
mfaa - 4 freq
mappie - 20 freq
mov - 3 freq
mibbe-' - 1 freq
mouvie - 1 freq
mave - 1 freq
mibbbe - 1 freq
maiv - 1 freq
maebbi - 6 freq
möv - 1 freq
möfi - 1 freq
myweb - 1 freq
mavie - 4 freq
mibbi - 1 freq
‘maybe - 3 freq
‘mibbe - 2 freq
moufu - 5 freq
“mibbe - 2 freq
“mibbie - 2 freq
maebae - 2 freq
“maybe - 7 freq
mep - 13 freq
“mebbe - 3 freq
mabbie - 1 freq
meuv - 4 freq
mb - 5 freq
mpy - 1 freq
mayb - 1 freq
mubba - 1 freq
mv - 2 freq
myvwe - 1 freq
mebe - 1 freq
mby - 1 freq
mfi - 1 freq
mi'be - 6 freq
mup - 2 freq
mf - 1 freq
mebee - 2 freq
mve - 1 freq
mph - 2 freq
mbpfuu - 1 freq
mvhwi - 1 freq
mbu - 1 freq
moip - 2 freq
mbh - 1 freq
movie’ - 1 freq
MetaPhone code - MP
map - 91 freq
mop - 16 freq
mp - 39 freq
mappa - 1 freq
mappey - 2 freq
ymp - 2 freq
'meep - 2 freq
meep - 2 freq
maup - 1 freq
mappie - 20 freq
mep - 13 freq
mpy - 1 freq
mup - 2 freq
moip - 2 freq
mbappe - 1 freq
hmp - 1 freq
Manually curated - MAUP
Time to execute Levenshtein function - 0.241169 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
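The definition above can be sketched with the usual dynamic-programming approach. This is a minimal illustration in Python, not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Keep the shorter word as the row so the working list stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the first i-1 characters of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


for word in ["maup", "map", "maun", "taul", "mask"]:
    print(word, levenshtein("maup", word))
```

The distances printed for map, maun, taul and mask agree with the bracketed values in the Levenshtein results listed above.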
Time to execute Double Levenshtein function - 0.398985 milliseconds
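In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants. For example, maup and maun are 1 apart as written and 1 apart again without vowels (mp and mn), so their Double Levenshtein distance is 2.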
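A rough sketch of that idea in Python follows. It assumes that "vowels" means the letters a, e, i, o and u only; how the site treats y or accented letters is not stated, so `VOWELS` and the function names are assumptions made for this example.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance, computed one row at a time."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["map", "maun", "aup", "makeup"]:
    print(word, double_levenshtein("maup", word))
```

Under that assumption the sketch gives 1 for map, 2 for maun and aup, and 3 for makeup, which matches the Double Levenshtein results listed above.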
Time to execute SoundEx function - 0.034406 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
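As an illustration, here is a short Python sketch of the standard American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. It is not necessarily identical to the function this site calls, and characters outside a-z are simply ignored here, which is an assumption.

```python
# Consonant groups that share a Soundex digit.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code such as M100."""
    # Assumption: anything outside a-z (apostrophes, accents) is skipped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    first = letters[0]
    digits = []
    last = CODES.get(first, "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != last:
            digits.append(code)
        last = code  # a vowel resets the run, so a repeated code counts again
    return (first.upper() + "".join(digits) + "000")[:4]


for word in ["maup", "mibbe", "maybe", "move", "mmph"]:
    print(word, soundex(word))
```

All five sample words come out as M100, which is why such different spellings share one SoundEx list above.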
Time to execute MetaPhone function - 0.047456 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000888 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
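In code, a lookup of this kind amounts to a dictionary from spelling to lemma. The Python sketch below is hypothetical: the name `LEXICON` and the function `lookup_lemma` are invented for the example, and the only entry shown is the maup to MAUP result reported on this page, not the site's real table.

```python
from typing import Optional

# Hypothetical stand-in for the hand-built lexicon: spelling -> lemma.
# Only the pairing shown on this page is included.
LEXICON = {
    "maup": "MAUP",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("maup"))   # a covered word
print(lookup_lemma("ablow"))  # not in this toy table, so None
```

Because it is a single dictionary access with no distance calculation, a lookup like this is very cheap, which is consistent with it having the shortest execution time reported above.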