A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to midge in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
midge (0) - 10 freq
aidge (1) - 5 freq
midget (1) - 6 freq
ridge (1) - 18 freq
midse (1) - 1 freq
midges (1) - 26 freq
midgie (1) - 27 freq
mudge (1) - 2 freq
fidge (1) - 9 freq
madge (1) - 9 freq
minge (1) - 1 freq
nidge (1) - 2 freq
widge (1) - 1 freq
mibee (2) - 16 freq
hide (2) - 185 freq
fudge (2) - 18 freq
mikie (2) - 7 freq
juidge (2) - 17 freq
mage (2) - 3 freq
ide (2) - 3 freq
mings (2) - 1 freq
middie (2) - 2 freq
fidged (2) - 7 freq
nudge (2) - 79 freq
tide (2) - 117 freq
Double Levenshtein
midge (0) - 10 freq
midgie (1) - 27 freq
mudge (1) - 2 freq
madge (1) - 9 freq
nidge (2) - 2 freq
aidge (2) - 5 freq
widge (2) - 1 freq
fidge (2) - 9 freq
minge (2) - 1 freq
midges (2) - 26 freq
ridge (2) - 18 freq
midget (2) - 6 freq
midse (2) - 1 freq
dge (3) - 1 freq
fodge (3) - 6 freq
maidie (3) - 2 freq
aedge (3) - 8 freq
midas (3) - 1 freq
budge (3) - 12 freq
badge (3) - 14 freq
midgies (3) - 20 freq
fidgie (3) - 1 freq
fidgy (3) - 1 freq
mingie (3) - 2 freq
wedge (3) - 8 freq
SoundEx code - M320
mooths - 74 freq
mats - 8 freq
madge - 9 freq
mids - 70 freq
meidaes - 2 freq
medic - 3 freq
mauts - 4 freq
meadows - 13 freq
meets - 33 freq
mates - 88 freq
meiths - 3 freq
muids - 3 freq
match - 174 freq
maths - 32 freq
matthew's - 7 freq
maieutic - 1 freq
mitts - 8 freq
meedows - 8 freq
meeeets - 1 freq
moods - 7 freq
modes - 2 freq
maids - 14 freq
muits - 1 freq
mouths - 7 freq
moths - 7 freq
meats - 2 freq
mathews - 1 freq
mut's - 1 freq
midgie - 27 freq
meedie's - 2 freq
meeda's - 1 freq
mutch - 1 freq
mooth's - 6 freq
mate's - 4 freq
myths - 18 freq
matthey's - 1 freq
mayota's - 1 freq
midweek - 3 freq
mutes - 1 freq
maddies - 1 freq
mattha's - 1 freq
mythic - 2 freq
media's - 2 freq
midse - 1 freq
motes - 3 freq
moth's - 1 freq
moots - 3 freq
maet's - 2 freq
medics - 4 freq
mowdies - 7 freq
mitch - 6 freq
maatch - 4 freq
midge - 10 freq
motts - 1 freq
meethis - 1 freq
mowat's - 1 freq
metch - 1 freq
mits - 3 freq
matiz - 1 freq
meids - 3 freq
mites - 1 freq
mudge - 2 freq
matsuo - 2 freq
matty's - 15 freq
maid's - 1 freq
meths - 1 freq
meidies - 1 freq
moutach - 1 freq
matisse - 1 freq
mínties - 1 freq
modus - 1 freq
midas - 1 freq
“match - 1 freq
meadshaw - 12 freq
mínits - 2 freq
mythos - 1 freq
mods - 3 freq
mtgha - 1 freq
midwik - 1 freq
meds - 3 freq
mets - 1 freq
mtwz - 1 freq
mdj - 1 freq
motoki - 1 freq
madass - 2 freq
mnits - 1 freq
mínutes - 1 freq
medias - 1 freq
mdozy - 1 freq
myttqqg - 1 freq
mwatts - 1 freq
matthewjooooo - 6 freq
madhouse - 1 freq
mtzj - 1 freq
mtqu - 1 freq
mdduq - 1 freq
midhoose - 1 freq
matties - 1 freq
MetaPhone code - MJ
madge - 9 freq
mojo - 4 freq
midgie - 27 freq
mage - 3 freq
magie' - 2 freq
midge - 10 freq
mudge - 2 freq
magee - 1 freq
mj - 6 freq
moj - 2 freq
mhj - 1 freq
mgy - 1 freq
magi - 3 freq
meej - 1 freq
Manually curated - MIDGE
Time to execute Levenshtein function - 0.180086 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
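As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the site's own code (which is not shown on this page), and the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] is the distance between the prefix of a handled so far
    # (initially the empty string) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning the first i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("midge", "midgie"))  # 1
print(levenshtein("midge", "hide"))    # 2
```

Both values agree with the distances shown in the Levenshtein listing for midge.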
Time to execute Double Levenshtein function - 0.376509 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
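The idea can be sketched in Python as follows. This is an assumption-laden illustration rather than the site's implementation: the names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set (a, e, i, o, u) is assumed, since the page does not say which letters are stripped.

```python
VOWELS = set("aeiou")  # assumption: the site's exact vowel set is not stated


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one row at a time."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-stripped words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("midge", "midgie"))  # 1
print(double_levenshtein("midge", "nidge"))   # 2
```

Under these assumptions midgie stays at 1 because its consonant skeleton (mdg) is identical to that of midge, while nidge rises to 2 because the changed letter is a consonant and is counted in both passes.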
Time to execute SoundEx function - 0.028428 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
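To show how a code such as M320 arises, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, encode the remaining consonants as digits, collapse adjacent letters with the same digit, and pad to four characters. The function name `soundex` and the handling of non-alphabetic characters (dropped before encoding) are assumptions of this sketch; the site's own function may differ in edge cases.

```python
# Digit codes for consonants; vowels, y, h and w have no code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a code
        digit = CODES.get(ch, "")
        if digit and digit != last:
            result += digit
            if len(result) == 4:
                break
        last = digit  # a vowel resets this, so the same code may repeat after it
    return result.ljust(4, "0")


print(soundex("midge"))  # M320
print(soundex("match"))  # M320
```

Both words reduce to M, then 3 (d or t), then 2 (g or c), padded with a zero, which is why match appears in the SoundEx listing for midge.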
Time to execute MetaPhone function - 0.066482 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000831 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
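A lookup of this kind can be sketched in Python as a plain dictionary. The table contents below are hypothetical examples written for illustration (only the midge to MIDGE pairing is taken from the result shown on this page), and the names `LEXICON` and `lemma_for` are not the site's.

```python
from typing import Optional

# Hypothetical hand-made entries: surface form -> lemma.
# Only "midge" -> "MIDGE" reflects the result shown on this page.
LEXICON = {
    "midge": "MIDGE",
    "midges": "MIDGE",
    "midgie": "MIDGE",
    "midgies": "MIDGE",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.strip().lower())


print(lemma_for("midge"))  # MIDGE
print(lemma_for("ridge"))  # None
```

Because it is a single dictionary access with no distance calculation, a lookup like this is very cheap, though it only helps for words someone has already entered by hand.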