A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dummies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dummies (0) - 4 freq
mummies (1) - 3 freq
dummie (1) - 1 freq
dumbies (1) - 1 freq
gummies (1) - 2 freq
rummies (1) - 1 freq
rammies (2) - 4 freq
crummies (2) - 1 freq
dumpie (2) - 1 freq
mummils (2) - 1 freq
dullies (2) - 2 freq
duties (2) - 16 freq
jimmies (2) - 7 freq
deemies (2) - 1 freq
tummles (2) - 2 freq
dumfries (2) - 41 freq
drammies (2) - 1 freq
mummles (2) - 1 freq
cummins (2) - 1 freq
duddies (2) - 2 freq
jammies (2) - 14 freq
tammies (2) - 1 freq
damies (2) - 1 freq
dunkies (2) - 1 freq
hummles (2) - 1 freq
dummies (0) - 4 freq
rummies (2) - 1 freq
dumbies (2) - 1 freq
gummies (2) - 2 freq
mummies (2) - 3 freq
dummie (2) - 1 freq
jammies (3) - 14 freq
damies (3) - 1 freq
domms (3) - 2 freq
lammies (3) - 5 freq
mammies (3) - 8 freq
drammies (3) - 1 freq
sommies (3) - 3 freq
gemmies (3) - 1 freq
tammies (3) - 1 freq
jimmies (3) - 7 freq
deemies (3) - 1 freq
rammies (3) - 4 freq
dimmin (4) - 3 freq
cumms (4) - 2 freq
dames (4) - 4 freq
domsie (4) - 1 freq
drames (4) - 7 freq
dumps (4) - 14 freq
domains (4) - 6 freq
SoundEx code - D520
deems - 6 freq
ding - 87 freq
dying - 15 freq
dinah's - 5 freq
damage - 45 freq
dance - 380 freq
denies - 12 freq
dwynes - 10 freq
dwaums - 5 freq
damies - 1 freq
dennis's - 1 freq
dings - 16 freq
dwams - 18 freq
dung - 29 freq
dens - 21 freq
dams - 6 freq
dam-s - 1 freq
daunce - 42 freq
demise - 12 freq
danny's - 7 freq
dimaggio - 1 freq
doing - 85 freq
dunkey - 19 freq
dunkie - 5 freq
deen's - 3 freq
donk - 1 freq
dawn's - 2 freq
dinghy - 3 freq
dims - 3 freq
denish' - 1 freq
dionysia - 1 freq
douns - 2 freq
dwines - 6 freq
dons - 31 freq
dunce - 27 freq
dennis - 11 freq
dunes - 6 freq
donkey - 16 freq
donsie - 2 freq
daeins - 125 freq
doons - 11 freq
dames - 4 freq
dang - 12 freq
dense - 12 freq
domes - 2 freq
danes - 2 freq
damayge - 1 freq
dammeyge - 1 freq
danns - 1 freq
damege - 1 freq
don's - 8 freq
dunse - 1 freq
daein's - 1 freq
dunk - 7 freq
dam's - 1 freq
dinnis - 1 freq
danse - 4 freq
donks - 5 freq
doensae - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
duns - 2 freq
dmnk - 1 freq
'daunce - 1 freq
doyens - 1 freq
dunch - 27 freq
danish - 19 freq
do-eeng - 1 freq
do-ing - 1 freq
damask - 2 freq
dimewise - 1 freq
denys - 1 freq
dinng - 1 freq
daince - 1 freq
dunky - 1 freq
dina's - 3 freq
deeins - 7 freq
dank - 11 freq
dinkie - 3 freq
denis - 11 freq
damish - 4 freq
danss - 2 freq
daunss - 1 freq
domms - 2 freq
dwangs - 3 freq
'ding - 4 freq
dong - 5 freq
daeneesh - 1 freq
daeings - 3 freq
deans - 3 freq
deanies - 1 freq
damns - 2 freq
dwyns - 1 freq
dwymes - 1 freq
doonways - 1 freq
dwance - 1 freq
dawins - 1 freq
diine's - 1 freq
dummies - 4 freq
'dance - 1 freq
deemikie - 2 freq
dinks - 1 freq
donne's - 1 freq
donnachie - 1 freq
deames' - 1 freq
dywnes - 1 freq
deimos - 1 freq
dinnaes - 1 freq
dawns - 5 freq
dons' - 1 freq
dunsh - 1 freq
dooeeng - 1 freq
domsie - 1 freq
dink - 2 freq
dans - 3 freq
dingey - 1 freq
deeing - 5 freq
diniz - 1 freq
daimige - 1 freq
dame's' - 1 freq
'daeing' - 1 freq
daing - 1 freq
dowiness - 1 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
dansk - 1 freq
dannsa - 1 freq
daeing - 2 freq
deans's - 1 freq
deuing - 1 freq
dingy - 1 freq
dinky - 1 freq
dms - 1 freq
dynesy - 1 freq
dingo - 1 freq
dunskey - 5 freq
dmgjei - 1 freq
donsÂ’ - 2 freq
dunns - 1 freq
dunc - 4 freq
dnycq - 1 freq
dmjco - 1 freq
dmxyo - 1 freq
dimmock - 1 freq
damij - 1 freq
dancey - 1 freq
dhmck - 1 freq
dunak's - 1 freq
dunnocks - 1 freq
donna's - 1 freq
denise - 1 freq
demoss - 1 freq
dumsch - 1 freq
dunsy - 2 freq
dunnock - 1 freq
MetaPhone code - TMS
times - 934 freq
deems - 6 freq
damies - 1 freq
tam's - 31 freq
tams - 4 freq
timeous - 7 freq
dams - 6 freq
dam-s - 1 freq
time's - 28 freq
tammies - 1 freq
demise - 12 freq
team's - 4 freq
teams - 84 freq
dims - 3 freq
tymes - 14 freq
tom's - 5 freq
dames - 4 freq
tammas - 188 freq
domes - 2 freq
'tammas - 1 freq
tyme's - 8 freq
dumbs - 1 freq
dam's - 1 freq
tammie's - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
tommy's - 6 freq
tems - 1 freq
tmsa - 8 freq
tombs - 9 freq
tims - 8 freq
'tims - 1 freq
times' - 4 freq
domms - 2 freq
tomboys - 1 freq
dumb's - 1 freq
dwymes - 1 freq
dummies - 4 freq
tuim's - 1 freq
deames' - 1 freq
deimos - 1 freq
teems - 1 freq
dumbies - 1 freq
tums - 1 freq
domsie - 1 freq
tamas - 4 freq
€˜timeous - 1 freq
tymous - 2 freq
dame's' - 1 freq
toamy's - 2 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
€œtimes - 1 freq
€˜times - 1 freq
taims - 1 freq
€™times - 1 freq
tomÂ’s - 1 freq
dms - 1 freq
teamÂ’s - 2 freq
toms - 1 freq
'teams' - 1 freq
demoss - 1 freq
DUMMIES
Time to execute Levenshtein function - 0.205158 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373780 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027939 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038893 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000886 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.