A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to midgie in Corpus

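Results by method, in order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. In the two Levenshtein lists, the number in brackets is the distance from midgie.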
Levenshtein
midgie (0) - 27 freq
midgies (1) - 20 freq
mingie (1) - 2 freq
midge (1) - 10 freq
fidgie (1) - 1 freq
tidgie (1) - 1 freq
middie (1) - 2 freq
midi (2) - 1 freq
airgie (2) - 1 freq
yiddie (2) - 1 freq
meggie (2) - 1 freq
radgie (2) - 10 freq
gadgie (2) - 38 freq
fidgit (2) - 3 freq
fudgie (2) - 4 freq
midse (2) - 1 freq
smiddie (2) - 1 freq
biddie (2) - 1 freq
minnie (2) - 214 freq
ciggie (2) - 4 freq
midwife (2) - 5 freq
pidgin (2) - 4 freq
midget (2) - 6 freq
minge (2) - 1 freq
millie (2) - 5 freq
Double Levenshtein
midgie (0) - 27 freq
midge (1) - 10 freq
madge (2) - 9 freq
tidgie (2) - 1 freq
mudge (2) - 2 freq
middie (2) - 2 freq
midgies (2) - 20 freq
fidgie (2) - 1 freq
mingie (2) - 2 freq
widge (3) - 1 freq
maidie (3) - 2 freq
moggie (3) - 1 freq
budgie (3) - 9 freq
aidge (3) - 5 freq
maddie (3) - 1 freq
muggie (3) - 62 freq
maggie (3) - 153 freq
misgae (3) - 1 freq
meidie (3) - 1 freq
mudgit (3) - 1 freq
fidge (3) - 9 freq
hedgie (3) - 3 freq
codgie (3) - 2 freq
mangie (3) - 2 freq
mudgies (3) - 1 freq
SoundEx code - M320
mooths - 74 freq
mats - 8 freq
madge - 9 freq
mids - 70 freq
meidaes - 2 freq
medic - 3 freq
mauts - 4 freq
meadows - 13 freq
meets - 33 freq
mates - 88 freq
meiths - 3 freq
muids - 3 freq
match - 174 freq
maths - 32 freq
matthew's - 7 freq
maieutic - 1 freq
mitts - 8 freq
meedows - 8 freq
meeeets - 1 freq
moods - 7 freq
modes - 2 freq
maids - 14 freq
muits - 1 freq
mouths - 7 freq
moths - 7 freq
meats - 2 freq
mathews - 1 freq
mut's - 1 freq
midgie - 27 freq
meedie's - 2 freq
meeda's - 1 freq
mutch - 1 freq
mooth's - 6 freq
mate's - 4 freq
myths - 18 freq
matthey's - 1 freq
mayota's - 1 freq
midweek - 3 freq
mutes - 1 freq
maddies - 1 freq
mattha's - 1 freq
mythic - 2 freq
media's - 2 freq
midse - 1 freq
motes - 3 freq
moth's - 1 freq
moots - 3 freq
maet's - 2 freq
medics - 4 freq
mowdies - 7 freq
mitch - 6 freq
maatch - 4 freq
midge - 10 freq
motts - 1 freq
meethis - 1 freq
mowat's - 1 freq
metch - 1 freq
mits - 3 freq
matiz - 1 freq
meids - 3 freq
mites - 1 freq
mudge - 2 freq
matsuo - 2 freq
matty's - 15 freq
maid's - 1 freq
meths - 1 freq
meidies - 1 freq
moutach - 1 freq
matisse - 1 freq
mínties - 1 freq
modus - 1 freq
midas - 1 freq
“match - 1 freq
meadshaw - 12 freq
mínits - 2 freq
mythos - 1 freq
mods - 3 freq
mtgha - 1 freq
midwik - 1 freq
meds - 3 freq
mets - 1 freq
mtwz - 1 freq
mdj - 1 freq
motoki - 1 freq
madass - 2 freq
mnits - 1 freq
mínutes - 1 freq
medias - 1 freq
mdozy - 1 freq
myttqqg - 1 freq
mwatts - 1 freq
matthewjooooo - 6 freq
madhouse - 1 freq
mtzj - 1 freq
mtqu - 1 freq
mdduq - 1 freq
midhoose - 1 freq
matties - 1 freq
MetaPhone code - MJ
madge - 9 freq
mojo - 4 freq
midgie - 27 freq
mage - 3 freq
magie' - 2 freq
midge - 10 freq
mudge - 2 freq
magee - 1 freq
mj - 6 freq
moj - 2 freq
mhj - 1 freq
mgy - 1 freq
magi - 3 freq
meej - 1 freq
Manually curated
MIDGIE
Time to execute Levenshtein function - 0.182375 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
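As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. The function name `levenshtein` is chosen for this example and is an assumption; it is not taken from the site's own code.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    or deletions needed to turn word a into word b."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string costs i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # delete char_a
                current[j - 1] + 1,      # insert char_b
                previous[j - 1] + cost,  # replace char_a (free if equal)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["midgie", "midgies", "midge", "gadgie", "midwife"]:
        print(word, levenshtein("midgie", word))
```

For the words it is run on here, the sketch should agree with the bracketed distances in the Levenshtein list, for example 1 for midgies and 2 for gadgie.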
Time to execute Double Levenshtein function - 0.344272 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
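The doubled measure can be sketched in Python as follows. This is an illustration under assumptions: the names `double_levenshtein` and `strip_vowels` are invented for the example, and treating only a, e, i, o and u as vowels is a guess rather than something the page states.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (replace, insert, delete all cost 1)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = "aeiou") -> str:
    """Drop vowels, keeping only the consonant skeleton (assumed vowel set)."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the whole words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["midge", "madge", "midgies", "widge", "maggie"]:
        print(word, double_levenshtein("midgie", word))
```

Under these assumptions midge scores 1 + 0 = 1 and midgies scores 1 + 1 = 2, which matches the Double Levenshtein list and shows why a changed vowel costs less than a changed consonant.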
Time to execute SoundEx function - 0.028882 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
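To show how a code such as M320 comes about, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse repeated digits, and pad to four characters. Whether the site uses exactly these rules is an assumption, as is the choice here to ignore any character outside a to z.

```python
# Digit assigned to each consonant group in standard Soundex.
_GROUPS = {"1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r"}
_CODES = {ch: digit for digit, letters in _GROUPS.items() for ch in letters}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if word has no a-z letters."""
    # Assumption: apostrophes, accented letters and other symbols are dropped.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = _CODES.get(letters[0], "")  # code of the previous coded position
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = _CODES.get(ch, "")  # vowels and y give '' and reset 'last'
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ["midgie", "mooths", "match", "midweek", "matthewjooooo"]:
        print(word, soundex(word))
```

For midgie the steps are M, then d gives 3, then g gives 2, padded with a zero to M320. Only the first three coded consonants count, which is why long or unrelated words such as matthewjooooo and madhouse end up in the same group.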
Time to execute MetaPhone function - 0.041861 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001119 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
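A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are hypothetical and chosen only for illustration; they are not the corpus's actual lexicon, and the names `LEXICON` and `lookup_lemma` are assumptions.

```python
from typing import Dict, Optional

# Hypothetical entries: each spelling, variant or typo maps to one lemma.
LEXICON: Dict[str, str] = {
    "midgie": "MIDGIE",
    "midgies": "MIDGIE",
}


def lookup_lemma(word: str, lexicon: Dict[str, str] = LEXICON) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return lexicon.get(word.strip().lower())


if __name__ == "__main__":
    print(lookup_lemma("Midgies"))  # covered by the hypothetical table
    print(lookup_lemma("ablow"))    # not in the table, so None
```

A dictionary lookup involves no string comparison against the rest of the vocabulary, which fits with this being the fastest of the five timings reported above. The trade-off is coverage, since a word missing from the table simply returns nothing.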