A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to dem in Corpus

Levenshtein
dem (0) - 722 freq
deem (1) - 19 freq
dep (1) - 2 freq
demn (1) - 2 freq
dum (1) - 16 freq
vem (1) - 1 freq
bem (1) - 1 freq
rem (1) - 2 freq
dqm (1) - 1 freq
nem (1) - 53 freq
des (1) - 23 freq
det (1) - 3 freq
deb (1) - 2 freq
dei (1) - 3 freq
deu (1) - 40 freq
dems (1) - 6 freq
dec (1) - 9 freq
em (1) - 354 freq
del (1) - 15 freq
dey (1) - 1241 freq
dm (1) - 12 freq
der (1) - 303 freq
de (1) - 262 freq
'em (1) - 44 freq
demo (1) - 3 freq
Double Levenshtein
dem (0) - 722 freq
diem (1) - 2 freq
demo (1) - 3 freq
dm (1) - 12 freq
demi (1) - 1 freq
daem (1) - 1 freq
dam (1) - 29 freq
dom (1) - 3 freq
adem (1) - 1 freq
dim (1) - 51 freq
dum (1) - 16 freq
deem (1) - 19 freq
edam (2) - 45 freq
lem (2) - 5 freq
dev (2) - 4 freq
dame (2) - 16 freq
ded (2) - 4 freq
adam (2) - 189 freq
gem (2) - 24 freq
doom (2) - 34 freq
drem (2) - 7 freq
mem (2) - 3 freq
adom (2) - 1 freq
ydm (2) - 1 freq
duim (2) - 1 freq
SoundEx code - D500
doon - 7067 freq
dinna - 1825 freq
dwam - 102 freq
done - 821 freq
dawn - 92 freq
down - 208 freq
daein - 882 freq
dinnae - 1942 freq
deein - 295 freq
den - 104 freq
'dinna - 52 freq
deen - 289 freq
diane - 9 freq
duin - 393 freq
doun - 1278 freq
dean - 18 freq
dinah - 75 freq
doon-hoi - 1 freq
dunno - 43 freq
doon' - 9 freq
dinn- - 3 freq
dinn - 6 freq
deny - 53 freq
din - 75 freq
dwaum - 19 freq
damn - 73 freq
dwyne - 23 freq
dyin - 51 freq
dawin - 32 freq
dinny - 10 freq
dune - 103 freq
dain - 144 freq
domme - 1 freq
daen - 223 freq
daena - 26 freq
deem - 19 freq
doom - 34 freq
danny - 135 freq
'danny - 1 freq
dooin - 2 freq
doony - 2 freq
'dunno - 1 freq
doin - 75 freq
dae'in - 1 freq
dine - 26 freq
'dinnae - 63 freq
dwamie - 2 freq
demn - 2 freq
dwine - 19 freq
dinn-- - 1 freq
downy - 1 freq
dinae - 40 freq
daein' - 14 freq
'dam - 1 freq
'doon - 6 freq
'deein - 2 freq
dummy - 28 freq
dwaumie - 2 freq
don - 260 freq
dam - 29 freq
dein - 22 freq
dem - 722 freq
demi - 1 freq
dann - 9 freq
dana - 1 freq
dan - 472 freq
dun - 36 freq
'dame - 1 freq
dim - 51 freq
dinnnae - 1 freq
day-in - 1 freq
dome - 24 freq
downie - 2 freq
doyen - 1 freq
dinaye - 1 freq
dinnaye - 2 freq
duun - 1 freq
dina - 43 freq
demo - 3 freq
da'en - 1 freq
din''na - 1 freq
dum - 16 freq
diowin - 1 freq
ddnae - 1 freq
dno - 1 freq
dene - 3 freq
deean - 3 freq
daenae - 4 freq
'done - 4 freq
dame - 16 freq
diine - 2 freq
dna - 16 freq
doenae - 1 freq
dianae - 1 freq
diana - 8 freq
dwamy - 1 freq
dwammy - 2 freq
denn- - 1 freq
denn - 3 freq
deuan - 16 freq
deun - 31 freq
dane - 55 freq
dinah' - 1 freq
dimnae - 1 freq
'damn - 3 freq
don' - 1 freq
dïdnae - 24 freq
dïnnae - 124 freq
'dïnnae - 4 freq
daem - 1 freq
dinnae' - 2 freq
done' - 1 freq
deane - 3 freq
daen' - 1 freq
dwaam - 8 freq
dee'in - 3 freq
'daein - 2 freq
dönna - 4 freq
denee - 1 freq
dunna - 159 freq
döne - 6 freq
döin - 16 freq
dem-aa - 1 freq
dem- - 1 freq
'dan - 2 freq
dön - 52 freq
dom - 3 freq
'dunna - 4 freq
ddin - 5 freq
döön - 1 freq
'dom - 2 freq
dyan - 2 freq
down' - 3 freq
dae'n - 2 freq
dae-in - 3 freq
dinno - 16 freq
domm - 1 freq
dwaumy - 2 freq
'deen' - 1 freq
dønna - 1 freq
daean - 8 freq
doomy - 1 freq
dehan - 1 freq
duma - 1 freq
dona - 1 freq
dwiny - 1 freq
dyein - 1 freq
doein - 1 freq
duina - 6 freq
di-yan - 1 freq
'dain - 1 freq
doan - 9 freq
døn - 8 freq
døin - 5 freq
-dom - 2 freq
doen - 4 freq
dunnae - 3 freq
dwaamie - 1 freq
dima-' - 1 freq
deemie - 3 freq
deen- - 1 freq
duana - 1 freq
dun' - 2 freq
duim - 1 freq
dàin - 1 freq
dyne - 2 freq
dunn - 8 freq
dummie - 1 freq
dhan - 1 freq
‘dan - 1 freq
dione - 1 freq
diinne - 1 freq
donnie - 9 freq
‘don - 12 freq
–dinna - 1 freq
douna - 1 freq
‘down - 1 freq
‘dinnae - 7 freq
dün - 1 freq
dunnie - 6 freq
dane' - 1 freq
“dinna - 34 freq
“dinnae - 19 freq
“doon - 1 freq
‘doom - 1 freq
“doun - 1 freq
dunny - 1 freq
“don - 10 freq
dinnna - 1 freq
“dunna - 3 freq
‘dunna - 1 freq
dien - 1 freq
dunnoo - 2 freq
dyeen - 1 freq
‘danny - 7 freq
diem - 2 freq
doin' - 1 freq
'don - 1 freq
donna - 51 freq
diean - 1 freq
‘dem - 4 freq
doon-o - 1 freq
‘dinna - 4 freq
daun - 1 freq
'daein' - 1 freq
‘daein - 1 freq
doune - 3 freq
dain' - 2 freq
dinni - 10 freq
dini - 9 freq
deemy - 1 freq
’doon - 2 freq
“daena - 3 freq
“damn - 3 freq
diin - 2 freq
’don - 1 freq
dùn - 1 freq
—dinnae - 1 freq
damo - 10 freq
‘done - 1 freq
dyin' - 2 freq
denie - 1 freq
’dinnae - 2 freq
dee’in - 1 freq
“dinna - 1 freq
dm - 12 freq
duine - 1 freq
‘done - 1 freq
downey - 1 freq
deano - 1 freq
de'in - 4 freq
dtn - 1 freq
deein' - 1 freq
'dinna' - 1 freq
dmi - 1 freq
deain - 1 freq
denny - 2 freq
daein” - 1 freq
deaein - 1 freq
“daein - 1 freq
deen' - 1 freq
“dunna - 2 freq
“dinnae - 1 freq
damoa - 7 freq
donny - 1 freq
deena - 1 freq
deun' - 1 freq
dinnea - 4 freq
MetaPhone code - TM
time - 5974 freq
'time - 4 freq
teem - 47 freq
tam - 522 freq
tom - 134 freq
time- - 2 freq
toom - 35 freq
tame - 23 freq
'tam - 14 freq
domme - 1 freq
'tm - 2 freq
tom' - 1 freq
deem - 19 freq
doom - 34 freq
tame' - 3 freq
team - 306 freq
tam' - 1 freq
teeeeeam - 1 freq
teeeamy - 1 freq
tuim - 103 freq
t'm - 3 freq
tyme - 220 freq
tm - 7 freq
time' - 6 freq
team' - 2 freq
'dam - 1 freq
tomb - 31 freq
dumb - 38 freq
't'm - 3 freq
'tom - 1 freq
dummy - 28 freq
dam - 29 freq
dem - 722 freq
demi - 1 freq
tomboy - 1 freq
timé - 1 freq
'dame - 1 freq
dim - 51 freq
dome - 24 freq
demo - 3 freq
tme - 3 freq
ttttaaaahhhhaaaaaam - 1 freq
taaahhmm - 1 freq
dum - 16 freq
tammie - 25 freq
tim - 47 freq
dame - 16 freq
tommy - 100 freq
ttme - 1 freq
t'im - 1 freq
t'im' - 1 freq
tum - 17 freq
htm - 17 freq
'tommy - 2 freq
daem - 1 freq
tim' - 43 freq
'dumb - 2 freq
tome - 4 freq
dem-aa - 1 freq
dem- - 1 freq
dom - 3 freq
'team'' - 1 freq
'dom - 2 freq
tammy - 48 freq
töm - 2 freq
tume - 14 freq
tym - 2 freq
ti'm - 1 freq
teum - 8 freq
domm - 1 freq
dumbo - 2 freq
tummy - 4 freq
toam - 1 freq
doomy - 1 freq
duma - 1 freq
tiym - 6 freq
-dom - 2 freq
tøm - 1 freq
tumb - 1 freq
dima-' - 1 freq
tae'm - 2 freq
deemie - 3 freq
duim - 1 freq
dummie - 1 freq
tombaugh - 1 freq
‘tam - 6 freq
'teem - 1 freq
‘doom - 1 freq
diem - 2 freq
‘dem - 4 freq
dumba - 1 freq
“time - 1 freq
damb - 1 freq
toamy - 12 freq
deemy - 1 freq
“dumb - 1 freq
“tommy - 1 freq
damo - 10 freq
taim - 1 freq
’time - 1 freq
wdma - 1 freq
teamie - 1 freq
dm - 12 freq
timmy - 2 freq
tommie - 1 freq
time’ - 1 freq
dmi - 1 freq
tmi - 1 freq
tomb” - 1 freq
time” - 1 freq
ydm - 1 freq
'taim' - 1 freq
damoa - 7 freq
tomow - 1 freq
taeyma - 1 freq
Manually curated lemma - DEM
thaim - 2522 freq
them - 5422 freq
thems - 1 freq
dem - 722 freq
dems - 6 freq
thum - 463 freq
Time to execute Levenshtein function - 0.239319 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
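As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs; the function name and the test words are chosen only for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance with a cost of 1 for each insert, delete or replace."""
    # previous[j] is the distance between the characters of `a` processed
    # so far (before the current one) and the first j characters of `b`.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of `a` into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("dem", "deem", "dm", "dey"):
        print(word, levenshtein("dem", word))
```

With dem as the query, deem, dm and dey each come out at distance 1, in line with the listing.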
Time to execute Double Levenshtein function - 0.456823 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
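The idea can be sketched in Python as below. This is an illustration under assumptions rather than the site's own code: the vowel set is a guess (y is counted as a vowel only because the listing scores ydm at 2), and the helper names are made up for the example.

```python
VOWELS = set("aeiouy")  # assumption: the real vowel list is not documented


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace all cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, keeping the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("deem", "edam", "lem"):
        print(word, double_levenshtein("dem", word))
```

A vowel-only change such as dem to deem costs 1, while a consonant change such as dem to lem is counted in both passes and costs 2.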
Time to execute SoundEx function - 0.027917 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
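To show how words like dem and doon end up under the same code, here is a minimal Python sketch of the classic American Soundex rules. It is an assumption that the site uses this exact variant. Dropping non-ASCII characters and punctuation before encoding is also a simplification made for the example.

```python
# Consonant groups that share a Soundex digit; vowels, h, w and y have none.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, e.g. 'dem' -> 'D500'."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for c in letters[1:]:
        code = CODES.get(c, "")
        if code and code != previous:
            result += code
        if c not in "hw":  # h and w do not separate repeated codes
            previous = code
    return (result + "000")[:4]  # pad or truncate to four characters


if __name__ == "__main__":
    for word in ("dem", "doon", "dwam", "dinna"):
        print(word, soundex(word))
```

Because only the first letter and the consonant classes survive, all four sample words map to D500, which is why the SoundEx list is so much longer than the Levenshtein lists.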
Time to execute MetaPhone function - 0.037521 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000942 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.
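A lookup of this kind can be sketched as a plain dictionary from word form to lemma. The table below is hypothetical: its six entries are copied from the manually curated results above, on the assumption that DEM is the lemma they share, and the names `LEMMA_OF` and `variants` are invented for the example.

```python
# Hypothetical hand-built lexicon: word form -> lemma.
LEMMA_OF = {
    "thaim": "DEM",
    "them": "DEM",
    "thems": "DEM",
    "dem": "DEM",
    "dems": "DEM",
    "thum": "DEM",
}


def variants(word: str) -> list[str]:
    """Return every form filed under the same lemma, or [] if not covered."""
    lemma = LEMMA_OF.get(word.lower())
    if lemma is None:
        return []  # not all words are covered by the lexicon
    return sorted(form for form, l in LEMMA_OF.items() if l == lemma)


if __name__ == "__main__":
    print(variants("dem"))
    print(variants("ahint"))
```

A dictionary lookup involves no distance calculation at all, which fits with this being the fastest of the five methods timed above.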