A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to midori in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
midori (0) - 2 freq
midi (2) - 1 freq
minor (2) - 15 freq
minors (2) - 1 freq
maori (2) - 2 freq
midrit (2) - 1 freq
dork (3) - 5 freq
adore (3) - 12 freq
miners (3) - 41 freq
widyi (3) - 21 freq
cider (3) - 38 freq
mixers (3) - 1 freq
igor (3) - 6 freq
migo (3) - 1 freq
mairi (3) - 4 freq
mindo (3) - 1 freq
midas (3) - 1 freq
idot (3) - 1 freq
adorin (3) - 1 freq
modern (3) - 144 freq
doris (3) - 58 freq
mixer (3) - 5 freq
vidi (3) - 1 freq
manor (3) - 4 freq
midges (3) - 26 freq
midori (0) - 2 freq
moadore (3) - 9 freq
modere (3) - 1 freq
maori (3) - 2 freq
midrit (3) - 1 freq
midi (3) - 1 freq
moder (3) - 2 freq
minor (3) - 15 freq
miser (4) - 4 freq
memore (4) - 3 freq
memor (4) - 1 freq
mire (4) - 14 freq
macari (4) - 1 freq
fedora (4) - 3 freq
minder (4) - 4 freq
misery (4) - 31 freq
milder (4) - 2 freq
mitre (4) - 2 freq
micra (4) - 1 freq
more (4) - 461 freq
madoe (4) - 1 freq
misdoor (4) - 1 freq
miner (4) - 6 freq
mora (4) - 8 freq
medow (4) - 1 freq
SoundEx code - M360
maitter - 382 freq
mither - 1376 freq
metter - 53 freq
matter - 175 freq
matr - 7 freq
mother - 97 freq
mutter - 10 freq
motor - 162 freq
mature - 14 freq
maiter - 67 freq
mid-air - 8 freq
madder - 5 freq
m'dear - 1 freq
'mither - 12 freq
metier - 1 freq
'motor - 6 freq
matter' - 1 freq
miuther - 1 freq
mither' - 4 freq
motorway - 8 freq
moter - 5 freq
meter - 8 freq
metther - 1 freq
metre - 13 freq
'mature - 1 freq
mettér - 5 freq
motir - 12 freq
mithir - 15 freq
mettèr - 7 freq
'mother' - 2 freq
meteor - 1 freq
midder - 64 freq
motthor - 2 freq
maettir - 1 freq
mitherie - 1 freq
mother' - 1 freq
matther - 2 freq
muthur - 1 freq
mater - 15 freq
matur - 1 freq
möder - 1 freq
mettir - 2 freq
€˜midder - 1 freq
€œmither - 3 freq
modere - 1 freq
€˜mother - 2 freq
maeter - 2 freq
€˜mither - 1 freq
maither - 1 freq
moather - 5 freq
mettur - 1 freq
mitre - 2 freq
metro - 3 freq
muther - 26 freq
mithier - 1 freq
motòr - 2 freq
myther - 2 freq
'mither' - 1 freq
mitter - 1 freq
midori - 2 freq
mtr - 1 freq
moadore - 9 freq
moder - 2 freq
'maitter' - 1 freq
MetaPhone code - MTR
maitter - 382 freq
metter - 53 freq
matter - 175 freq
matr - 7 freq
mutter - 10 freq
motor - 162 freq
mature - 14 freq
maiter - 67 freq
mid-air - 8 freq
madder - 5 freq
m'dear - 1 freq
metier - 1 freq
'motor - 6 freq
matter' - 1 freq
moter - 5 freq
meter - 8 freq
metther - 1 freq
metre - 13 freq
'mature - 1 freq
mettér - 5 freq
motir - 12 freq
mettèr - 7 freq
meteor - 1 freq
midder - 64 freq
motthor - 2 freq
maettir - 1 freq
matther - 2 freq
mater - 15 freq
matur - 1 freq
möder - 1 freq
mettir - 2 freq
€˜midder - 1 freq
modere - 1 freq
maeter - 2 freq
mettur - 1 freq
mitre - 2 freq
metro - 3 freq
motòr - 2 freq
mitter - 1 freq
midori - 2 freq
mtr - 1 freq
moadore - 9 freq
moder - 2 freq
'maitter' - 1 freq
MIDORI
Time to execute Levenshtein function - 0.376854 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.002349 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.099395 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.108268 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000861 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.