A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to mild in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (edit distance in parentheses)
mild (0) - 40 freq
mird (1) - 6 freq
mold (1) - 2 freq
meld (1) - 4 freq
milo (1) - 1 freq
mily (1) - 2 freq
myld (1) - 1 freq
eild (1) - 17 freq
mid (1) - 60 freq
milk (1) - 243 freq
mill (1) - 80 freq
wild (1) - 236 freq
moild (1) - 1 freq
mile (1) - 275 freq
miln (1) - 3 freq
muild (1) - 4 freq
mind (1) - 2299 freq
aild (1) - 1 freq
gild (1) - 1 freq
haild (2) - 2 freq
mole (2) - 11 freq
owld (2) - 166 freq
meed (2) - 54 freq
iiid (2) - 1 freq
cilt (2) - 7 freq
Double Levenshtein (combined distance in parentheses)
mild (0) - 40 freq
myld (1) - 1 freq
moild (1) - 1 freq
meld (1) - 4 freq
mold (1) - 2 freq
muild (1) - 4 freq
gild (2) - 1 freq
aild (2) - 1 freq
mind (2) - 2299 freq
mooild (2) - 1 freq
moold (2) - 1 freq
mailed (2) - 2 freq
mould (2) - 14 freq
muldy (2) - 1 freq
moldy (2) - 1 freq
melde (2) - 1 freq
mid (2) - 60 freq
mily (2) - 2 freq
miln (2) - 3 freq
eild (2) - 17 freq
milk (2) - 243 freq
milo (2) - 1 freq
wild (2) - 236 freq
mile (2) - 275 freq
mill (2) - 80 freq
SoundEx code - M430
melt - 41 freq
melodie - 2 freq
melled - 34 freq
malt - 18 freq
muild - 4 freq
mold - 2 freq
meld - 4 freq
mild - 40 freq
milled - 3 freq
mellit - 4 freq
melody - 6 freq
mildew - 2 freq
mellowed - 2 freq
mouldy - 4 freq
mould - 14 freq
multi - 7 freq
muldy - 1 freq
mail't - 1 freq
malady - 1 freq
meltt - 1 freq
moold - 1 freq
maelody - 1 freq
m'lud - 1 freq
mailed - 2 freq
malta - 2 freq
malti - 1 freq
möld - 4 freq
myld - 1 freq
militia - 1 freq
melde - 1 freq
moladh - 1 freq
mniled - 1 freq
mulled - 2 freq
molto - 1 freq
mill-lade - 1 freq
mullet - 3 freq
mullt - 1 freq
mileetia - 3 freq
mailoot - 3 freq
mooild - 1 freq
moild - 1 freq
moldy - 1 freq
malty - 6 freq
mlitt - 2 freq
MetaPhone code - MLT
melt - 41 freq
melodie - 2 freq
melled - 34 freq
malt - 18 freq
muild - 4 freq
mold - 2 freq
meld - 4 freq
mild - 40 freq
milled - 3 freq
mellit - 4 freq
melody - 6 freq
mildew - 2 freq
mouldy - 4 freq
mould - 14 freq
multi - 7 freq
muldy - 1 freq
mail't - 1 freq
malady - 1 freq
meltt - 1 freq
moold - 1 freq
maelody - 1 freq
m'lud - 1 freq
mailed - 2 freq
malta - 2 freq
malti - 1 freq
möld - 4 freq
myld - 1 freq
melde - 1 freq
moladh - 1 freq
mulled - 2 freq
molto - 1 freq
mullet - 3 freq
mullt - 1 freq
mailoot - 3 freq
mooild - 1 freq
moild - 1 freq
moldy - 1 freq
malty - 6 freq
mlitt - 2 freq
Manually curated - MILD
Time to execute Levenshtein function - 0.175542 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
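As a rough illustration of the definition above, the distance can be computed with the standard dynamic-programming recurrence. This is a minimal Python sketch, not necessarily the function this site runs; the name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the prefix of a processed so far
    # (before the current character) and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into an empty prefix costs i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (free if equal)
            ))
        previous = current
    return previous[-1]
```

With this sketch, `levenshtein("mild", "mird")` gives 1 and `levenshtein("mild", "haild")` gives 2, in line with the distances listed above.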
Time to execute Double Levenshtein function - 0.309110 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
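The idea can be sketched in Python as follows. This is an illustration under stated assumptions rather than the site's own code: the helper names are hypothetical, and the vowel set `aeiouy` (with y counted as a vowel) is an assumption chosen because it reproduces listed results such as myld (1) and muldy (2).

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(c for c in word if c not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    a, b = a.lower(), b.lower()
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))
```

Under these assumptions, `double_levenshtein("mild", "myld")` is 1 (the consonant skeletons are identical), while `double_levenshtein("mild", "mind")` is 2 because the changed consonant is counted in both passes.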
Time to execute SoundEx function - 0.027118 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
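For illustration, here is a minimal Python sketch of one common formulation, American Soundex. Variants differ in details such as how H and W are handled, so this may not match the function used on this page in every case; the function name and the choice to ignore non-ASCII characters are assumptions.

```python
import string

# Letter groups that share a digit in American Soundex.
_CODES = {}
for digit, group in enumerate(["BFPV", "CGJKQSXZ", "DT", "L", "MN", "R"], start=1):
    for letter in group:
        _CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """Return a 4-character Soundex code, or "" if the word has no A-Z letters."""
    # Assumption: anything outside A-Z (apostrophes, hyphens, accented
    # letters) is simply ignored.
    letters = [c for c in word.upper() if c in string.ascii_uppercase]
    if not letters:
        return ""
    result = letters[0]
    last = _CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "HW":
            continue  # H and W do not separate letters with the same code
        code = _CODES.get(c, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets `last`, so a repeated code is kept after it
    return (result + "000")[:4]  # pad with zeros, then truncate to 4 characters
```

With this sketch, `soundex("mild")`, `soundex("melody")` and `soundex("mill-lade")` all give `M430`, the code shown in the results above.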
Time to execute MetaPhone function - 0.036800 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000831 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
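A lookup of this kind can be sketched as a plain dictionary. The entries and the names `LEXICON` and `lemma_for` below are hypothetical and for illustration only; the real lexicon is not shown on this page.

```python
from typing import Optional

# Hypothetical entries: each spelling variant maps to its lemma.
LEXICON = {
    "mild": "MILD",
    "myld": "MILD",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())
```

A single dictionary lookup does no per-word computation, which fits this method having the shortest execution time reported above. Words missing from the table simply return `None`.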