A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to mdruc in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
mdruc (0) - 1 freq
merc (2) - 2 freq
drug (2) - 25 freq
muc (2) - 1 freq
marc (2) - 16 freq
mrc (2) - 1 freq
mdduq (2) - 1 freq
madrum (2) - 1 freq
myrus (2) - 2 freq
druv (2) - 1 freq
drum (2) - 72 freq
mruq (2) - 1 freq
sruc (2) - 1 freq
rxruc (2) - 1 freq
duc (2) - 2 freq
mnrur (2) - 1 freq
trua (3) - 1 freq
ruch (3) - 18 freq
murs (3) - 1 freq
strut (3) - 8 freq
crum (3) - 1 freq
mup (3) - 2 freq
rub (3) - 51 freq
erum (3) - 1 freq
mrez (3) - 1 freq
mdruc (0) - 1 freq
madrum (3) - 1 freq
marc (3) - 16 freq
mrc (3) - 1 freq
merc (3) - 2 freq
madras (4) - 1 freq
morice (4) - 1 freq
dreic (4) - 1 freq
merci (4) - 1 freq
modren (4) - 202 freq
medic (4) - 3 freq
marco (4) - 5 freq
doric (4) - 480 freq
midrit (4) - 1 freq
godric (4) - 1 freq
madrid (4) - 8 freq
metric (4) - 3 freq
mercy (4) - 94 freq
madram (4) - 1 freq
duc (4) - 2 freq
rxruc (4) - 1 freq
sruc (4) - 1 freq
mdduq (4) - 1 freq
drug (4) - 25 freq
mnrur (4) - 1 freq
SoundEx code - M362
mattress - 18 freq
mither's - 138 freq
mothers - 9 freq
maitters - 115 freq
matters - 29 freq
mutters - 12 freq
mithers - 80 freq
mither-side - 1 freq
metres - 15 freq
mattresses - 6 freq
mithers-in-law - 1 freq
mothers-in-law - 1 freq
meteoric - 1 freq
mattrass - 5 freq
motor's - 3 freq
motors - 35 freq
maiters - 28 freq
maitter's - 3 freq
matures - 2 freq
motorists - 3 freq
'mithers - 1 freq
motthors - 2 freq
motthor-car - 2 freq
motirs - 2 freq
motrice - 1 freq
maitiers - 1 freq
metters - 8 freq
meters - 12 freq
mother's - 3 freq
mither-son - 1 freq
mettèrs - 1 freq
mither-goat - 2 freq
mither-gab - 1 freq
midders - 8 freq
motorised - 1 freq
maittèrs - 10 freq
matters' - 1 freq
maittèrs' - 1 freq
matrix - 1 freq
midder's - 1 freq
mottir-kaars - 1 freq
mithir's - 6 freq
mither-speik - 1 freq
mattirs - 1 freq
maitters-na - 1 freq
maetirs - 1 freq
metrical - 1 freq
motorist - 2 freq
mathrick - 1 freq
mithers' - 1 freq
mathers - 2 freq
metric - 3 freq
metrically - 1 freq
motor-coach - 1 freq
motòrs - 1 freq
moters - 1 freq
metrosplaining - 1 freq
mither’s - 5 freq
mdruc - 1 freq
mattrichardson - 50 freq
madtrust - 1 freq
mattreguson - 1 freq
madras - 1 freq
mothersday - 3 freq
motorcycle - 1 freq
'mothers - 1 freq
muthers - 1 freq
MetaPhone code - MTRK
meteoric - 1 freq
metric - 3 freq
mdruc - 1 freq
MDRUC
Time to execute Levenshtein function - 0.289464 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.494829 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.034355 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042942 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000929 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.