A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to mould in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
mould (0) - 14 freq
mou'd (1) - 1 freq
fould (1) - 1 freq
tould (1) - 2 freq
mound (1) - 7 freq
ould (1) - 21 freq
would (1) - 690 freq
mold (1) - 2 freq
mouldy (1) - 4 freq
mouls (1) - 5 freq
moulds (1) - 2 freq
moild (1) - 1 freq
could (1) - 2637 freq
sould (1) - 6 freq
moul (1) - 1 freq
eould (1) - 2 freq
moold (1) - 1 freq
couldo (2) - 1 freq
'auld (2) - 11 freq
cold (2) - 44 freq
bou'd (2) - 5 freq
'wuld (2) - 1 freq
boud (2) - 1 freq
mouss (2) - 26 freq
mooed (2) - 2 freq
mould (0) - 14 freq
mouldy (1) - 4 freq
moild (1) - 1 freq
mold (1) - 2 freq
moold (1) - 1 freq
ould (2) - 21 freq
myld (2) - 1 freq
muild (2) - 4 freq
mild (2) - 40 freq
muldy (2) - 1 freq
mooild (2) - 1 freq
moldy (2) - 1 freq
meld (2) - 4 freq
mou'd (2) - 1 freq
eould (2) - 2 freq
mouls (2) - 5 freq
tould (2) - 2 freq
mound (2) - 7 freq
would (2) - 690 freq
moul (2) - 1 freq
moulds (2) - 2 freq
sould (2) - 6 freq
fould (2) - 1 freq
could (2) - 2637 freq
melde (3) - 1 freq
SoundEx code - M430
melt - 41 freq
melodie - 2 freq
melled - 34 freq
malt - 18 freq
muild - 4 freq
mold - 2 freq
meld - 4 freq
mild - 40 freq
milled - 3 freq
mellit - 4 freq
melody - 6 freq
mildew - 2 freq
mellowed - 2 freq
mouldy - 4 freq
mould - 14 freq
multi - 7 freq
muldy - 1 freq
mail't - 1 freq
malady - 1 freq
meltt - 1 freq
moold - 1 freq
maelody - 1 freq
m'lud - 1 freq
mailed - 2 freq
malta - 2 freq
malti - 1 freq
möld - 4 freq
myld - 1 freq
militia - 1 freq
melde - 1 freq
moladh - 1 freq
mniled - 1 freq
mulled - 2 freq
molto - 1 freq
mill-lade - 1 freq
mullet - 3 freq
mullt - 1 freq
mileetia - 3 freq
mailoot - 3 freq
mooild - 1 freq
moild - 1 freq
moldy - 1 freq
malty - 6 freq
mlitt - 2 freq
MetaPhone code - MLT
melt - 41 freq
melodie - 2 freq
melled - 34 freq
malt - 18 freq
muild - 4 freq
mold - 2 freq
meld - 4 freq
mild - 40 freq
milled - 3 freq
mellit - 4 freq
melody - 6 freq
mildew - 2 freq
mouldy - 4 freq
mould - 14 freq
multi - 7 freq
muldy - 1 freq
mail't - 1 freq
malady - 1 freq
meltt - 1 freq
moold - 1 freq
maelody - 1 freq
m'lud - 1 freq
mailed - 2 freq
malta - 2 freq
malti - 1 freq
möld - 4 freq
myld - 1 freq
melde - 1 freq
moladh - 1 freq
mulled - 2 freq
molto - 1 freq
mullet - 3 freq
mullt - 1 freq
mailoot - 3 freq
mooild - 1 freq
moild - 1 freq
moldy - 1 freq
malty - 6 freq
mlitt - 2 freq
MOULD
Time to execute Levenshtein function - 0.238896 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.358740 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028002 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036675 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000836 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.