A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dms in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dms (0) - 1 freq
dus (1) - 24 freq
dfs (1) - 2 freq
ems (1) - 4 freq
dims (1) - 3 freq
dis (1) - 1552 freq
hms (1) - 8 freq
kms (1) - 1 freq
dos (1) - 4 freq
lms (1) - 1 freq
des (1) - 23 freq
ds (1) - 9 freq
cdms (1) - 1 freq
dams (1) - 6 freq
dss (1) - 2 freq
dm (1) - 12 freq
pms (1) - 7 freq
dmi (1) - 1 freq
dps (1) - 1 freq
ms (1) - 29 freq
nms (1) - 2 freq
dems (1) - 6 freq
doms (1) - 3 freq
das (1) - 23 freq
du's (2) - 113 freq
dms (0) - 1 freq
dams (1) - 6 freq
dems (1) - 6 freq
doms (1) - 3 freq
dims (1) - 3 freq
dmi (2) - 1 freq
nms (2) - 2 freq
ms (2) - 29 freq
dps (2) - 1 freq
das (2) - 23 freq
dooms (2) - 1 freq
adams (2) - 9 freq
deems (2) - 6 freq
demos (2) - 2 freq
pms (2) - 7 freq
domes (2) - 2 freq
dames (2) - 4 freq
dus (2) - 24 freq
kms (2) - 1 freq
dfs (2) - 2 freq
ems (2) - 4 freq
dis (2) - 1552 freq
dm (2) - 12 freq
hms (2) - 8 freq
ds (2) - 9 freq
SoundEx code - D520
deems - 6 freq
ding - 87 freq
dying - 15 freq
dinah's - 5 freq
damage - 45 freq
dance - 380 freq
denies - 12 freq
dwynes - 10 freq
dwaums - 5 freq
damies - 1 freq
dennis's - 1 freq
dings - 16 freq
dwams - 18 freq
dung - 29 freq
dens - 21 freq
dams - 6 freq
dam-s - 1 freq
daunce - 42 freq
demise - 12 freq
danny's - 7 freq
dimaggio - 1 freq
doing - 85 freq
dunkey - 19 freq
dunkie - 5 freq
deen's - 3 freq
donk - 1 freq
dawn's - 2 freq
dinghy - 3 freq
dims - 3 freq
denish' - 1 freq
dionysia - 1 freq
douns - 2 freq
dwines - 6 freq
dons - 31 freq
dunce - 27 freq
dennis - 11 freq
dunes - 6 freq
donkey - 16 freq
donsie - 2 freq
daeins - 125 freq
doons - 11 freq
dames - 4 freq
dang - 12 freq
dense - 12 freq
domes - 2 freq
danes - 2 freq
damayge - 1 freq
dammeyge - 1 freq
danns - 1 freq
damege - 1 freq
don's - 8 freq
dunse - 1 freq
daein's - 1 freq
dunk - 7 freq
dam's - 1 freq
dinnis - 1 freq
danse - 4 freq
donks - 5 freq
doensae - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
duns - 2 freq
dmnk - 1 freq
'daunce - 1 freq
doyens - 1 freq
dunch - 27 freq
danish - 19 freq
do-eeng - 1 freq
do-ing - 1 freq
damask - 2 freq
dimewise - 1 freq
denys - 1 freq
dinng - 1 freq
daince - 1 freq
dunky - 1 freq
dina's - 3 freq
deeins - 7 freq
dank - 11 freq
dinkie - 3 freq
denis - 11 freq
damish - 4 freq
danss - 2 freq
daunss - 1 freq
domms - 2 freq
dwangs - 3 freq
'ding - 4 freq
dong - 5 freq
daeneesh - 1 freq
daeings - 3 freq
deans - 3 freq
deanies - 1 freq
damns - 2 freq
dwyns - 1 freq
dwymes - 1 freq
doonways - 1 freq
dwance - 1 freq
dawins - 1 freq
diine's - 1 freq
dummies - 4 freq
'dance - 1 freq
deemikie - 2 freq
dinks - 1 freq
donne's - 1 freq
donnachie - 1 freq
deames' - 1 freq
dywnes - 1 freq
deimos - 1 freq
dinnaes - 1 freq
dawns - 5 freq
dons' - 1 freq
dunsh - 1 freq
dooeeng - 1 freq
domsie - 1 freq
dink - 2 freq
dans - 3 freq
dingey - 1 freq
deeing - 5 freq
diniz - 1 freq
daimige - 1 freq
dame's' - 1 freq
'daeing' - 1 freq
daing - 1 freq
dowiness - 1 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
dansk - 1 freq
dannsa - 1 freq
daeing - 2 freq
deans's - 1 freq
deuing - 1 freq
dingy - 1 freq
dinky - 1 freq
dms - 1 freq
dynesy - 1 freq
dingo - 1 freq
dunskey - 5 freq
dmgjei - 1 freq
donsÂ’ - 2 freq
dunns - 1 freq
dunc - 4 freq
dnycq - 1 freq
dmjco - 1 freq
dmxyo - 1 freq
dimmock - 1 freq
damij - 1 freq
dancey - 1 freq
dhmck - 1 freq
dunak's - 1 freq
dunnocks - 1 freq
donna's - 1 freq
denise - 1 freq
demoss - 1 freq
dumsch - 1 freq
dunsy - 2 freq
dunnock - 1 freq
MetaPhone code - TMS
times - 934 freq
deems - 6 freq
damies - 1 freq
tam's - 31 freq
tams - 4 freq
timeous - 7 freq
dams - 6 freq
dam-s - 1 freq
time's - 28 freq
tammies - 1 freq
demise - 12 freq
team's - 4 freq
teams - 84 freq
dims - 3 freq
tymes - 14 freq
tom's - 5 freq
dames - 4 freq
tammas - 188 freq
domes - 2 freq
'tammas - 1 freq
tyme's - 8 freq
dumbs - 1 freq
dam's - 1 freq
tammie's - 2 freq
doms - 3 freq
deemies - 1 freq
dems - 6 freq
tommy's - 6 freq
tems - 1 freq
tmsa - 8 freq
tombs - 9 freq
tims - 8 freq
'tims - 1 freq
times' - 4 freq
domms - 2 freq
tomboys - 1 freq
dumb's - 1 freq
dwymes - 1 freq
dummies - 4 freq
tuim's - 1 freq
deames' - 1 freq
deimos - 1 freq
teems - 1 freq
dumbies - 1 freq
tums - 1 freq
domsie - 1 freq
tamas - 4 freq
€˜timeous - 1 freq
tymous - 2 freq
dame's' - 1 freq
toamy's - 2 freq
dooms - 1 freq
demos- - 2 freq
demos - 2 freq
€œtimes - 1 freq
€˜times - 1 freq
taims - 1 freq
€™times - 1 freq
tomÂ’s - 1 freq
dms - 1 freq
teamÂ’s - 2 freq
toms - 1 freq
'teams' - 1 freq
demoss - 1 freq
DMS
Time to execute Levenshtein function - 0.223917 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.420426 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031675 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041264 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000880 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.