A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to dou in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
dou (0) - 13 freq
ou (1) - 17 freq
dow (1) - 59 freq
dour (1) - 124 freq
dous (1) - 6 freq
du (1) - 727 freq
ou (1) - 1 freq
pou (1) - 53 freq
gou (1) - 1 freq
doa (1) - 1 freq
dout (1) - 167 freq
dog (1) - 157 freq
wou (1) - 7 freq
dor (1) - 5 freq
dhu (1) - 2 freq
duu (1) - 1 freq
cou (1) - 8 freq
doh (1) - 31 freq
fou (1) - 317 freq
doug (1) - 81 freq
dos (1) - 4 freq
doun (1) - 1274 freq
dof (1) - 1 freq
mou (1) - 176 freq
deu (1) - 40 freq
Double Levenshtein
dou (0) - 13 freq
duu (1) - 1 freq
doo (1) - 126 freq
dau (1) - 1 freq
doa (1) - 1 freq
du (1) - 727 freq
deu (1) - 40 freq
do (1) - 837 freq
doy (1) - 1 freq
doe (1) - 6 freq
doea (2) - 1 freq
duy (2) - 3 freq
due (2) - 177 freq
dye (2) - 12 freq
ydo (2) - 1 freq
dii (2) - 1 freq
you (2) - 6457 freq
dob (2) - 1 freq
doul (2) - 4 freq
nou (2) - 1372 freq
ado (2) - 4 freq
od (2) - 8 freq
youd (2) - 4 freq
dae (2) - 4498 freq
daa (2) - 4 freq
SoundEx code - D000
day - 5942 freq
dae - 4498 freq
'dae - 55 freq
dee - 1204 freq
da - 9776 freq
due - 177 freq
dao - 1 freq
die - 122 freq
d'ye - 109 freq
d - 462 freq
doo - 126 freq
day- - 8 freq
de - 260 freq
dowie - 119 freq
dou - 13 freq
deh - 5 freq
dau - 1 freq
dew - 29 freq
daw - 20 freq
do - 837 freq
'do - 13 freq
d-day - 3 freq
du - 727 freq
dowe - 1 freq
doh - 31 freq
'd - 10 freq
doe - 6 freq
de- - 2 freq
d' - 3 freq
'd'ye - 23 freq
di - 68 freq
d'you - 10 freq
''d - 3 freq
dewy - 2 freq
dow - 59 freq
doei - 1 freq
dha - 1 freq
dieu - 2 freq
deaw - 1 freq
dhu - 2 freq
d-dae - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
daiy - 1 freq
duo - 3 freq
die' - 1 freq
dye - 12 freq
dae' - 7 freq
day' - 7 freq
dy - 236 freq
dee' - 4 freq
diy - 9 freq
deu - 40 freq
d'yi - 22 freq
doy - 1 freq
dïd - 22 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
dyow - 6 freq
- 127 freq
'du' - 2 freq
'dey - 3 freq
öd - 3 freq
'du - 8 freq
'd' - 2 freq
d'eau - 1 freq
d'eau' - 2 freq
daa - 4 freq
'do' - 2 freq
'day - 2 freq
de'i - 1 freq
- 4 freq
dææ - 1 freq
dieh - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
- 16 freq
d - 3 freq
d - 2 freq
'de - 3 freq
-d - 8 freq
d'ae - 1 freq
dee-ye - 1 freq
daie - 3 freq
d'a - 1 freq
d - 1680 freq
da - 9 freq
duy - 3 freq
- 1 freq
da - 4 freq
do - 14 freq
dy - 23 freq
dae - 10 freq
dae - 17 freq
do - 4 freq
- 1 freq
dia- - 1 freq
d - 4 freq
d - 2 freq
ddd - 13 freq
daa - 3 freq
dah - 3 freq
dai - 2 freq
'd'you - 1 freq
day - 1 freq
diyi - 1 freq
dee - 1 freq
dewie - 1 freq
dyew - 1 freq
doa - 1 freq
doe-e - 1 freq
d - 5 freq
-day - 2 freq
da - 10 freq
dey - 4 freq
du - 8 freq
dy - 2 freq
da-a - 1 freq
dae - 1 freq
do - 3 freq
- 1 freq
dw - 4 freq
da- - 1 freq
dhe - 2 freq
dt - 3 freq
duu - 1 freq
doea - 1 freq
‘d’you - 1 freq
d’ye - 2 freq
dei - 3 freq
dae’e - 1 freq
ddewwy - 1 freq
doña - 1 freq
dd - 1 freq
d'day - 1 freq
dee” - 1 freq
“dee” - 1 freq
dii - 1 freq
da'y - 2 freq
dhowie - 20 freq
dh - 1 freq
dyi - 1 freq
duh - 2 freq
dwa - 1 freq
MetaPhone code - T
tae - 64038 freq
day - 5942 freq
dae - 4498 freq
'dae - 55 freq
tea - 560 freq
dee - 1204 freq
'tae - 47 freq
to - 4049 freq
da - 9776 freq
due - 177 freq
dao - 1 freq
die - 122 freq
t - 5646 freq
ta - 2534 freq
'ti - 3 freq
tt - 36 freq
tie - 88 freq
wyde - 11 freq
toy - 44 freq
ti - 4160 freq
too - 992 freq
taw - 9 freq
tue - 9 freq
d - 462 freq
doo - 126 freq
tae- - 5 freq
day- - 8 freq
tea- - 5 freq
de - 260 freq
dou - 13 freq
hyte - 3 freq
deh - 5 freq
dau - 1 freq
dew - 29 freq
dough - 18 freq
daw - 20 freq
tae-' - 2 freq
do - 837 freq
'do - 13 freq
'to - 6 freq
wyte - 82 freq
du - 727 freq
tee - 165 freq
tw - 6 freq
- 1 freq
doh - 31 freq
yt - 12 freq
toi - 34 freq
'd - 10 freq
tah - 2 freq
'toi - 1 freq
'yt - 1 freq
doe - 6 freq
hyde - 16 freq
toa - 2 freq
de- - 2 freq
'tt - 4 freq
d' - 3 freq
tow - 44 freq
t'ae - 1 freq
wyt - 2 freq
di - 68 freq
''d - 3 freq
toe - 21 freq
dewy - 2 freq
te - 1569 freq
dow - 59 freq
doei - 1 freq
tay - 185 freq
dieu - 2 freq
deaw - 1 freq
ta¢ - 5 freq
tae' - 3 freq
't - 21 freq
doughie - 1 freq
toue - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
yte - 1 freq
daiy - 1 freq
duo - 3 freq
'hd - 1 freq
to' - 2 freq
tai - 1 freq
w'it - 1 freq
die' - 1 freq
dae' - 7 freq
hd - 4 freq
t' - 2 freq
day' - 7 freq
dy - 236 freq
t'a - 17 freq
dee' - 4 freq
diy - 9 freq
ty - 7 freq
tthey - 1 freq
deu - 40 freq
teu - 29 freq
ht - 8 freq
taa - 3 freq
doy - 1 freq
ït - 331 freq
hïd - 6 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
- 127 freq
öt - 7 freq
- 19 freq
'du' - 2 freq
ta-' - 1 freq
'dey - 3 freq
ta- - 1 freq
tu - 23 freq
öd - 3 freq
'du - 8 freq
'ta - 2 freq
hyt - 1 freq
'd' - 2 freq
'to' - 1 freq
d'eau - 1 freq
d'eau' - 2 freq
doughy - 1 freq
daa - 4 freq
wd - 6 freq
'do' - 2 freq
wéit - 1 freq
'day - 2 freq
'tae' - 1 freq
tau - 2 freq
'too - 1 freq
de'i - 1 freq
- 4 freq
dææ - 1 freq
- 3 freq
dieh - 1 freq
wt' - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
ytt - 1 freq
taé - 1 freq
- 16 freq
t - 2 freq
t - 6 freq
d - 3 freq
d - 2 freq
'de - 3 freq
-d - 8 freq
þæt - 2 freq
t - 1 freq
teh - 1 freq
tao - 2 freq
d'ae - 1 freq
daie - 3 freq
tey - 3 freq
tua - 2 freq
d'a - 1 freq
d - 1680 freq
da - 9 freq
duy - 3 freq
- 1 freq
tae'a - 1 freq
da - 4 freq
tih - 7 freq
do - 14 freq
- 1 freq
t - 688 freq
dy - 23 freq
dae - 10 freq
dae - 17 freq
ti - 2 freq
do - 4 freq
to - 17 freq
t - 2 freq
tae - 6 freq
- 1 freq
dia- - 1 freq
taew - 1 freq
d - 4 freq
ti - 2 freq
tu - 2 freq
d - 2 freq
-to- - 1 freq
'tea' - 1 freq
'te - 1 freq
wt - 3 freq
ddd - 13 freq
daa - 3 freq
dah - 3 freq
too - 2 freq
to - 1 freq
hte - 1 freq
dai - 2 freq
toh - 4 freq
tui - 2 freq
hyde' - 1 freq
tae - 1 freq
day - 1 freq
dee - 1 freq
w't - 1 freq
tei - 5 freq
tae - 5 freq
doa - 1 freq
doe-e - 1 freq
to - 4 freq
t - 1 freq
tae - 1 freq
d - 5 freq
t - 3 freq
-day - 2 freq
tae - 1 freq
hyd - 7 freq
too - 1 freq
-t - 3 freq
da - 10 freq
dey - 4 freq
du - 8 freq
dy - 2 freq
da-a - 1 freq
hd - 1 freq
doagh - 1 freq
'tooooo' - 1 freq
dae - 1 freq
do - 3 freq
- 1 freq
wwdh - 1 freq
dw - 4 freq
da- - 1 freq
- 1 freq
ytuo - 1 freq
‘to - 2 freq
ydo - 1 freq
tii - 1 freq
duu - 1 freq
wtt - 1 freq
doea - 1 freq
tooo - 1 freq
tuo - 1 freq
“to - 1 freq
“tae - 2 freq
dei - 3 freq
dae’e - 1 freq
ddewwy - 1 freq
tiu - 1 freq
doña - 1 freq
dd - 1 freq
dee” - 1 freq
“dee” - 1 freq
“tea - 1 freq
dii - 1 freq
da'y - 2 freq
- 1 freq
htew - 1 freq
tth - 1 freq
tew - 1 freq
tea’ - 1 freq
yto - 1 freq
tthe - 1 freq
tuw - 1 freq
dh - 1 freq
tuu - 1 freq
htt - 1 freq
'teu' - 1 freq
ta” - 1 freq
duh - 2 freq
teo - 1 freq
Manually curated
DOU
Time to execute Levenshtein function - 0.410550 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
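The distance described above can be sketched with the usual dynamic-programming table. This is a minimal illustration in Python, not the code this site runs; the function name `levenshtein` and the small word list are chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # At the start of each outer step, previous[j] is the distance between
    # the characters of a processed so far and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


# Rank a few words from the results above by their distance from "dou".
words = ["doun", "dae", "dow", "dou"]
ranked = sorted(words, key=lambda w: levenshtein("dou", w))
```

Sorting a vocabulary by this distance gives a nearest-neighbour list of the kind shown in the Levenshtein results.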
Time to execute Double Levenshtein function - 0.782043 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
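A minimal sketch of that idea in Python follows. The page does not say which letters count as vowels; the set below, which includes y, is an assumption that happens to reproduce the distances listed above (for example duy and you at 2). The names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))
```

Under this scheme a vowel change such as dou to doy costs 1, while a consonant change such as dou to dob costs 2, because the consonant difference is counted in both passes.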
Time to execute SoundEx function - 0.027620 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
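For illustration, here is a small Python sketch of the standard American Soundex rules: keep the first letter, code the remaining consonants as digits, skip vowels, and pad to four characters. It is assumed, not confirmed, that the site's SoundEx function behaves the same way; the name `soundex` and the choice to ignore non a-z characters are decisions made for this example.

```python
# Digit codes for consonants; vowels, h, w and y have no code.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word: str) -> str:
    """Four-character Soundex code, or "" if the word has no a-z letters."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        # h and w do not separate repeated codes; vowels do.
        if ch not in "hw":
            prev = digit
        if len(code) == 4:
            break
    return code.ljust(4, "0")
```

With these rules dou, day, dowie and ddd all reduce to D000, which is why the SoundEx list above is so long for a short word.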
Time to execute MetaPhone function - 0.103089 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000786 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
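A lookup of this kind can be sketched as a plain dictionary. The structure below is hypothetical: only the dou to DOU pairing comes from the result shown above, the second entry is invented purely for illustration, and the names `LEXICON` and `lookup_lemma` are not taken from the site.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping spelling variants to a lemma.
LEXICON = {
    "dou": "DOU",  # pairing shown in the manually curated result above
    "doo": "DOU",  # invented entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the lemma for a word, or None if the lexicon does not cover it."""
    return LEXICON.get(word.lower())
```

A dictionary lookup does no string comparison at all, which fits the very small execution time reported for this method, and returning None reflects that not all words are covered.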