A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example "sonsie".


Words similar to dïd in the corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
dïd (0) - 22 freq
hïd (1) - 6 freq
bïd (1) - 10 freq
rïd (1) - 4 freq
dïg (1) - 2 freq
öd (2) - 3 freq
bïds (2) - 1 freq
pït (2) - 1 freq
divïd (2) - 8 freq
lït (2) - 2 freq
fïg (2) - 4 freq
dohd (2) - 1 freq
djkd (2) - 1 freq
bït (2) - 7 freq
dood (2) - 1 freq
sït (2) - 10 freq
dön (2) - 52 freq
díds (2) - 1 freq
röd (2) - 2 freq
(2) - 1 freq
dün (2) - 1 freq
(2) - 4 freq
dæs (2) - 2 freq
ddd (2) - 13 freq
göd (2) - 254 freq
Double Levenshtein
dïd (0) - 22 freq
dïg (2) - 2 freq
rïd (2) - 4 freq
bïd (2) - 10 freq
hïd (2) - 6 freq
divïd (3) - 8 freq
díed (3) - 1 freq
döl (4) - 1 freq
töd (4) - 1 freq
du'd (4) - 2 freq
dird (4) - 2 freq
gød (4) - 5 freq
dvd (4) - 11 freq
dös (4) - 1 freq
dnd (4) - 1 freq
bød (4) - 3 freq
dùn (4) - 1 freq
deid (4) - 946 freq
duid (4) - 1 freq
deyd (4) - 13 freq
daed (4) - 19 freq
dyed (4) - 25 freq
did (4) - 2817 freq
dadd (4) - 2 freq
(4) - 16 freq
SoundEx code - D000
day - 5942 freq
dae - 4498 freq
'dae - 55 freq
dee - 1204 freq
da - 9776 freq
due - 177 freq
dao - 1 freq
die - 122 freq
d'ye - 109 freq
d - 462 freq
doo - 126 freq
day- - 8 freq
de - 260 freq
dowie - 119 freq
dou - 13 freq
deh - 5 freq
dau - 1 freq
dew - 29 freq
daw - 20 freq
do - 837 freq
'do - 13 freq
d-day - 3 freq
du - 727 freq
dowe - 1 freq
doh - 31 freq
'd - 10 freq
doe - 6 freq
de- - 2 freq
d' - 3 freq
'd'ye - 23 freq
di - 68 freq
d'you - 10 freq
''d - 3 freq
dewy - 2 freq
dow - 59 freq
doei - 1 freq
dha - 1 freq
dieu - 2 freq
deaw - 1 freq
dhu - 2 freq
d-dae - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
daiy - 1 freq
duo - 3 freq
die' - 1 freq
dye - 12 freq
dae' - 7 freq
day' - 7 freq
dy - 236 freq
dee' - 4 freq
diy - 9 freq
deu - 40 freq
d'yi - 22 freq
doy - 1 freq
dïd - 22 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
dyow - 6 freq
- 127 freq
'du' - 2 freq
'dey - 3 freq
öd - 3 freq
'du - 8 freq
'd' - 2 freq
d'eau - 1 freq
d'eau' - 2 freq
daa - 4 freq
'do' - 2 freq
'day - 2 freq
de'i - 1 freq
- 4 freq
dææ - 1 freq
dieh - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
- 16 freq
d - 3 freq
d - 2 freq
'de - 3 freq
-d - 8 freq
d'ae - 1 freq
dee-ye - 1 freq
daie - 3 freq
d'a - 1 freq
d - 1680 freq
da - 9 freq
duy - 3 freq
- 1 freq
da - 4 freq
do - 14 freq
dy - 23 freq
dae - 10 freq
dae - 17 freq
do - 4 freq
- 1 freq
dia- - 1 freq
d - 4 freq
d - 2 freq
ddd - 13 freq
daa - 3 freq
dah - 3 freq
dai - 2 freq
'd'you - 1 freq
day - 1 freq
diyi - 1 freq
dee - 1 freq
dewie - 1 freq
dyew - 1 freq
doa - 1 freq
doe-e - 1 freq
d - 5 freq
-day - 2 freq
da - 10 freq
dey - 4 freq
du - 8 freq
dy - 2 freq
da-a - 1 freq
dae - 1 freq
do - 3 freq
- 1 freq
dw - 4 freq
da- - 1 freq
dhe - 2 freq
dt - 3 freq
duu - 1 freq
doea - 1 freq
‘d’you - 1 freq
d’ye - 2 freq
dei - 3 freq
dae’e - 1 freq
ddewwy - 1 freq
doña - 1 freq
dd - 1 freq
d'day - 1 freq
dee” - 1 freq
“dee” - 1 freq
dii - 1 freq
da'y - 2 freq
dhowie - 20 freq
dh - 1 freq
dyi - 1 freq
duh - 2 freq
dwa - 1 freq
MetaPhone code - TT
did - 2817 freq
deid - 946 freq
'did - 51 freq
doot - 565 freq
tied - 119 freq
tide - 117 freq
tattoo - 12 freq
tait - 42 freq
tottie - 106 freq
daud - 78 freq
dee'd - 93 freq
tattie - 157 freq
dead - 272 freq
ted - 13 freq
dout - 167 freq
wydit - 1 freq
toot - 33 freq
totey - 20 freq
dodo - 79 freq
tidy - 50 freq
died - 152 freq
tae-tae - 2 freq
tit - 23 freq
deid-a - 1 freq
tid - 25 freq
dod - 129 freq
tote - 4 freq
dad - 261 freq
dotty - 1 freq
duty - 77 freq
dot - 47 freq
toaty - 8 freq
today - 154 freq
totie - 18 freq
d-day - 3 freq
dae't - 7 freq
dee't - 36 freq
'deed - 19 freq
deed - 235 freq
tod - 275 freq
deity - 2 freq
toatie - 6 freq
date - 160 freq
diet - 70 freq
teddy - 16 freq
tae-dae - 6 freq
t'd - 2 freq
tat - 16 freq
daddy - 124 freq
dautie - 5 freq
towt - 41 freq
tite - 2 freq
deid' - 3 freq
dae'd - 1 freq
wytit - 20 freq
deet - 24 freq
'deid - 3 freq
tate - 12 freq
dawtie - 6 freq
'deid' - 2 freq
wyted - 13 freq
tut - 67 freq
diddy - 5 freq
doddie - 4 freq
ditty - 4 freq
tot - 4 freq
ta-ta - 2 freq
dottie - 84 freq
d-dae - 1 freq
toad - 5 freq
daed - 19 freq
deat - 1 freq
day-oot - 1 freq
tutu - 3 freq
tae't - 7 freq
dowt - 6 freq
tyde - 24 freq
toty - 41 freq
totty - 12 freq
did' - 1 freq
deeit - 1 freq
tatty - 11 freq
todd - 7 freq
tad - 54 freq
'daddy - 7 freq
duddie - 1 freq
'tattie' - 2 freq
doyt - 1 freq
'tit - 1 freq
data - 68 freq
dei'd - 1 freq
dude - 20 freq
toate - 1 freq
dote - 2 freq
'dod - 1 freq
doad - 3 freq
taid - 2 freq
td - 9 freq
dïd - 22 freq
dutie - 1 freq
tottte - 1 freq
date' - 1 freq
'deed' - 1 freq
daad - 5 freq
tood - 1 freq
doughty - 3 freq
deit - 9 freq
'dad - 2 freq
toddy - 15 freq
't'd - 1 freq
t'da - 5 freq
dat - 1391 freq
dadd - 2 freq
toeht - 1 freq
taday - 2 freq
'dat - 2 freq
dadda - 1 freq
taut - 2 freq
deday - 5 freq
dud - 2 freq
totue - 1 freq
'tottie - 1 freq
ti'd - 1 freq
dewtie - 8 freq
tittie - 30 freq
'tod - 4 freq
tod' - 2 freq
'dee'd - 1 freq
'dee'd' - 1 freq
tead - 1 freq
tet - 1 freq
deeid - 15 freq
doit - 3 freq
t'tow - 1 freq
daday - 74 freq
tiyt - 1 freq
dee'at - 1 freq
de'ed - 7 freq
teedee - 1 freq
daid - 5 freq
duid - 1 freq
dowdy - 1 freq
day-daw - 1 freq
tee'd - 2 freq
teet - 13 freq
du'd - 2 freq
tyd - 4 freq
t'die - 1 freq
daddie - 3 freq
daddie' - 1 freq
ded - 4 freq
doute - 2 freq
dey'd - 4 freq
daatie - 6 freq
tedd - 1 freq
dyte - 2 freq
dede - 3 freq
dei't - 2 freq
díed - 1 freq
tide' - 1 freq
dodie - 18 freq
dat - 3 freq
dodie - 1 freq
did - 17 freq
doddy - 4 freq
doddy - 1 freq
duty - 1 freq
töd - 1 freq
tout - 3 freq
taed - 15 freq
dow'd - 1 freq
wyded - 1 freq
det - 3 freq
étude - 1 freq
to-day - 1 freq
tito - 2 freq
dad - 2 freq
dat - 8 freq
tat' - 1 freq
doot - 1 freq
didie - 5 freq
dide - 2 freq
did - 6 freq
date - 1 freq
daddy - 1 freq
daode - 1 freq
dyde - 1 freq
towt - 1 freq
dydie - 3 freq
didi - 1 freq
tad' - 1 freq
tayto - 1 freq
toto - 6 freq
tattie - 1 freq
deyd - 13 freq
dada - 12 freq
tad - 2 freq
dotie - 1 freq
dode - 2 freq
deud - 1 freq
today - 1 freq
did - 2 freq
doat - 1 freq
yytde - 1 freq
dohd - 1 freq
titi - 1 freq
“daddy - 1 freq
“daaaaaaaaaaaaaaaad” - 1 freq
ditto - 2 freq
dought - 1 freq
dt - 3 freq
‘did - 1 freq
totti - 8 freq
toadie - 1 freq
dado - 1 freq
teady - 1 freq
duyd - 1 freq
dwyhd - 1 freq
dht - 1 freq
diddie - 2 freq
'dido - 1 freq
teduio - 1 freq
tawtie - 1 freq
dood - 1 freq
tatt - 1 freq
d'day - 1 freq
“dey’d - 1 freq
tide” - 1 freq
todo - 1 freq
teat - 1 freq
twt - 1 freq
taeday - 1 freq
toht - 1 freq
Manually curated - DÏD
Time to execute Levenshtein function - 0.230134 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
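A minimal Python sketch of the distance described above, using the usual dynamic-programming approach. The function name `levenshtein` is an illustrative choice, and this is not presented as the code behind this page.

```python
from typing import Sequence


def levenshtein(a: Sequence, b: Sequence) -> int:
    """Minimum number of single-item insertions, deletions or
    substitutions needed to turn sequence a into sequence b."""
    if len(a) < len(b):
        a, b = b, a  # the distance is symmetric; keep the rows short
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(
                previous[j] + 1,         # delete item_a
                current[j - 1] + 1,      # insert item_b
                previous[j - 1] + cost,  # substitute (or keep if equal)
            ))
        previous = current
    return previous[-1]


print(levenshtein("dïd", "hïd"))    # 1
print(levenshtein("dïd", "divïd"))  # 2
# The same function also accepts bytes, giving a byte-wise distance.
print(levenshtein("dïd".encode("utf-8"), "did".encode("utf-8")))  # 2
```

Compared as Python strings, dïd and did are one substitution apart. The listing above does not show did at distance 1 and gives it a Double Levenshtein score of 4, which looks consistent with a byte-wise comparison of UTF-8 text (ï occupies two bytes). That is an inference from the numbers, not something the page states.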
Time to execute Double Levenshtein function - 0.570223 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the whole words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
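The two-pass idea can be sketched in Python as follows. The vowel set (plain a, e, i, o, u only) and the names `strip_vowels` and `double_levenshtein` are assumptions made for illustration; the page does not say which characters it treats as vowels.

```python
from typing import Sequence

VOWELS = set("aeiouAEIOU")  # assumption: accented vowels such as ï are kept


def levenshtein(a: Sequence, b: Sequence) -> int:
    """Plain edit distance (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the characters listed in VOWELS, keeping everything else."""
    return "".join(ch for ch in word if ch not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-stripped words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("dïd", "dïg"))    # 1 + 1 = 2
print(double_levenshtein("dïd", "divïd"))  # 2 + 1 = 3
```

A consonant change is counted in both passes while a change to a plain vowel is counted only in the first, which is where the double weight comes from.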
Time to execute SoundEx function - 0.079317 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
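As a rough illustration, here is a Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent equal codes, and pad to four characters. Skipping every character outside A to Z is an assumption; it happens to reproduce the D000 code shown above for dïd, but it is not presented as the function this page calls.

```python
import string

# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code; empty string if there are no A-Z letters."""
    # Assumption: anything outside A-Z (including accented letters) is ignored.
    letters = [ch for ch in word.upper() if ch in string.ascii_uppercase]
    if not letters:
        return ""
    result = letters[0]
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate two equal codes
        code = CODES.get(ch, "")  # vowels and Y have no code
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code can recur
    return result.ljust(4, "0")


print(soundex("Robert"))     # R163
print(soundex("d\u00efd"))   # dïd -> D000
print(soundex("dowie"))      # D000
```

Because the second d in dïd follows the first with nothing codable in between, it is treated as a repeat and dropped, which is why the word lands in the same D000 bucket as day, dae and dowie.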
Time to execute MetaPhone function - 0.085050 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
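A lookup of this kind might be sketched in Python as a plain dictionary. The table contents and the names `LEXICON` and `curated_lemma` are hypothetical: the first entry mirrors the result shown above for dïd, and the second is an invented typo entry included only to show the shape of the data.

```python
from typing import Optional

# Hypothetical lexicon: surface form -> lemma.
LEXICON = {
    "d\u00efd": "D\u00cfD",  # dïd -> DÏD, the pairing shown on this page
    "teh": "THE",            # invented example of a typo entry
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("d\u00efd"))  # DÏD
print(curated_lemma("sonsie"))    # None
```

Returning `None` for a missing word keeps "not covered" distinct from any real lemma, so a caller could fall back to one of the fuzzy methods.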