A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to do in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
do (0) - 837 freq
dvo (1) - 2 freq
dq (1) - 7 freq
'o (1) - 35 freq
dol (1) - 1 freq
io (1) - 7 freq
doa (1) - 1 freq
dao (1) - 1 freq
co (1) - 5195 freq
ydo (1) - 1 freq
db (1) - 2 freq
zo (1) - 5 freq
dop (1) - 1 freq
du (1) - 727 freq
de (1) - 260 freq
dof (1) - 1 freq
ro (1) - 5 freq
di (1) - 68 freq
vo (1) - 8 freq
qo (1) - 28 freq
dm (1) - 12 freq
do' (1) - 2 freq
fo (1) - 16 freq
dob (1) - 1 freq
o (1) - 56035 freq
do (0) - 837 freq
d (1) - 462 freq
duo (1) - 3 freq
da (1) - 9776 freq
di (1) - 68 freq
de (1) - 260 freq
doy (1) - 1 freq
dy (1) - 236 freq
ado (1) - 4 freq
dou (1) - 13 freq
doe (1) - 6 freq
doo (1) - 126 freq
du (1) - 727 freq
odo (1) - 1 freq
doa (1) - 1 freq
dao (1) - 1 freq
ydo (1) - 1 freq
oda (2) - 5 freq
dei (2) - 3 freq
dow (2) - 59 freq
so (2) - 4266 freq
dyi (2) - 1 freq
dui (2) - 8 freq
ad (2) - 126 freq
doea (2) - 1 freq
SoundEx code - D000
day - 5942 freq
dae - 4498 freq
'dae - 55 freq
dee - 1204 freq
da - 9776 freq
due - 177 freq
dao - 1 freq
die - 122 freq
d'ye - 109 freq
d - 462 freq
doo - 126 freq
day- - 8 freq
de - 260 freq
dowie - 119 freq
dou - 13 freq
deh - 5 freq
dau - 1 freq
dew - 29 freq
daw - 20 freq
do - 837 freq
'do - 13 freq
d-day - 3 freq
du - 727 freq
dowe - 1 freq
doh - 31 freq
'd - 10 freq
doe - 6 freq
de- - 2 freq
d' - 3 freq
'd'ye - 23 freq
di - 68 freq
d'you - 10 freq
''d - 3 freq
dewy - 2 freq
dow - 59 freq
doei - 1 freq
dha - 1 freq
dieu - 2 freq
deaw - 1 freq
dhu - 2 freq
d-dae - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
daiy - 1 freq
duo - 3 freq
die' - 1 freq
dye - 12 freq
dae' - 7 freq
day' - 7 freq
dy - 236 freq
dee' - 4 freq
diy - 9 freq
deu - 40 freq
d'yi - 22 freq
doy - 1 freq
dïd - 22 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
dyow - 6 freq
dö - 127 freq
'du' - 2 freq
'dey - 3 freq
öd - 3 freq
'du - 8 freq
'd' - 2 freq
d'eau - 1 freq
d'eau' - 2 freq
daa - 4 freq
'do' - 2 freq
'day - 2 freq
de'i - 1 freq
dæ - 4 freq
dææ - 1 freq
dieh - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
dø - 16 freq
ªd - 3 freq
«d - 2 freq
'de - 3 freq
-d - 8 freq
d'ae - 1 freq
dee-ye - 1 freq
daie - 3 freq
d'a - 1 freq
€™d - 1680 freq
€™da - 9 freq
duy - 3 freq
dü - 1 freq
€˜da - 4 freq
€œdo - 14 freq
€™dy - 23 freq
€˜dae - 10 freq
€œdae - 17 freq
€˜do - 4 freq
dé - 1 freq
dia- - 1 freq
€œd - 4 freq
€˜d - 2 freq
ddd - 13 freq
€œdaa - 3 freq
dah - 3 freq
dai - 2 freq
'd'you - 1 freq
€œday - 1 freq
diyi - 1 freq
€œdee - 1 freq
dewie - 1 freq
dyew - 1 freq
doa - 1 freq
doe-e - 1 freq
€Ÿd - 5 freq
-day - 2 freq
€œda - 10 freq
€œdey - 4 freq
€œdu - 8 freq
€œdy - 2 freq
da-a - 1 freq
€™dae - 1 freq
€™do - 3 freq
d” - 1 freq
dw - 4 freq
da- - 1 freq
dhe - 2 freq
dt - 3 freq
duu - 1 freq
doea - 1 freq
‘d’you - 1 freq
dÂ’ye - 2 freq
dei - 3 freq
daeÂ’e - 1 freq
ddewwy - 1 freq
doña - 1 freq
dd - 1 freq
d'day - 1 freq
dee” - 1 freq
“dee” - 1 freq
dii - 1 freq
da'y - 2 freq
dhowie - 20 freq
dh - 1 freq
dyi - 1 freq
duh - 2 freq
dwa - 1 freq
MetaPhone code - T
tae - 64038 freq
day - 5942 freq
dae - 4498 freq
'dae - 55 freq
tea - 560 freq
dee - 1204 freq
'tae - 47 freq
to - 4049 freq
da - 9776 freq
due - 177 freq
dao - 1 freq
die - 122 freq
t - 5646 freq
ta - 2534 freq
'ti - 3 freq
tt - 36 freq
tie - 88 freq
wyde - 11 freq
toy - 44 freq
ti - 4160 freq
too - 992 freq
taw - 9 freq
tue - 9 freq
d - 462 freq
doo - 126 freq
tae- - 5 freq
day- - 8 freq
tea- - 5 freq
de - 260 freq
dou - 13 freq
hyte - 3 freq
deh - 5 freq
dau - 1 freq
dew - 29 freq
dough - 18 freq
daw - 20 freq
tae-' - 2 freq
do - 837 freq
'do - 13 freq
'to - 6 freq
wyte - 82 freq
du - 727 freq
tee - 165 freq
tw - 6 freq
t¢ - 1 freq
doh - 31 freq
yt - 12 freq
toi - 34 freq
'd - 10 freq
tah - 2 freq
'toi - 1 freq
'yt - 1 freq
doe - 6 freq
hyde - 16 freq
toa - 2 freq
de- - 2 freq
'tt - 4 freq
d' - 3 freq
tow - 44 freq
t'ae - 1 freq
wyt - 2 freq
di - 68 freq
''d - 3 freq
toe - 21 freq
dewy - 2 freq
te - 1569 freq
dow - 59 freq
doei - 1 freq
tay - 185 freq
dieu - 2 freq
deaw - 1 freq
ta¢ - 5 freq
tae' - 3 freq
't - 21 freq
doughie - 1 freq
toue - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
yte - 1 freq
daiy - 1 freq
duo - 3 freq
'hd - 1 freq
to' - 2 freq
tai - 1 freq
w'it - 1 freq
die' - 1 freq
dae' - 7 freq
hd - 4 freq
t' - 2 freq
day' - 7 freq
dy - 236 freq
t'a - 17 freq
dee' - 4 freq
diy - 9 freq
ty - 7 freq
tthey - 1 freq
deu - 40 freq
teu - 29 freq
ht - 8 freq
taa - 3 freq
doy - 1 freq
ït - 331 freq
hïd - 6 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
dö - 127 freq
öt - 7 freq
tö - 19 freq
'du' - 2 freq
ta-' - 1 freq
'dey - 3 freq
ta- - 1 freq
tu - 23 freq
öd - 3 freq
'du - 8 freq
'ta - 2 freq
hyt - 1 freq
'd' - 2 freq
'to' - 1 freq
d'eau - 1 freq
d'eau' - 2 freq
doughy - 1 freq
daa - 4 freq
wd - 6 freq
'do' - 2 freq
wéit - 1 freq
'day - 2 freq
'tae' - 1 freq
tau - 2 freq
'too - 1 freq
de'i - 1 freq
dæ - 4 freq
dææ - 1 freq
tæ - 3 freq
dieh - 1 freq
wt' - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
ytt - 1 freq
taé - 1 freq
dø - 16 freq
™t - 2 freq
ªt - 6 freq
ªd - 3 freq
«d - 2 freq
'de - 3 freq
-d - 8 freq
þæt - 2 freq
žt - 1 freq
teh - 1 freq
tao - 2 freq
d'ae - 1 freq
daie - 3 freq
tey - 3 freq
tua - 2 freq
d'a - 1 freq
€™d - 1680 freq
€™da - 9 freq
duy - 3 freq
dü - 1 freq
tae'a - 1 freq
€˜da - 4 freq
tih - 7 freq
€œdo - 14 freq
tø - 1 freq
€™t - 688 freq
€™dy - 23 freq
€˜dae - 10 freq
€œdae - 17 freq
€˜ti - 2 freq
€˜do - 4 freq
€˜to - 17 freq
€˜t - 2 freq
€œtae - 6 freq
dé - 1 freq
dia- - 1 freq
taew - 1 freq
€œd - 4 freq
€œti - 2 freq
€œtu - 2 freq
€˜d - 2 freq
-to- - 1 freq
'tea' - 1 freq
'te - 1 freq
wt - 3 freq
ddd - 13 freq
€œdaa - 3 freq
dah - 3 freq
€œtoo - 2 freq
€¦to - 1 freq
hte - 1 freq
dai - 2 freq
toh - 4 freq
tui - 2 freq
hyde' - 1 freq
€¦tae - 1 freq
€œday - 1 freq
€œdee - 1 freq
w't - 1 freq
tei - 5 freq
€˜tae - 5 freq
doa - 1 freq
doe-e - 1 freq
€œto - 4 freq
€œt - 1 freq
€™tae - 1 freq
€Ÿd - 5 freq
€Ÿt - 3 freq
-day - 2 freq
€”tae - 1 freq
hyd - 7 freq
€˜too - 1 freq
€˜-t - 3 freq
€œda - 10 freq
€œdey - 4 freq
€œdu - 8 freq
€œdy - 2 freq
da-a - 1 freq
€™hd - 1 freq
doagh - 1 freq
'tooooo' - 1 freq
€™dae - 1 freq
€™do - 3 freq
d” - 1 freq
wwdh - 1 freq
dw - 4 freq
da- - 1 freq
t” - 1 freq
ytuo - 1 freq
‘to - 2 freq
ydo - 1 freq
tii - 1 freq
duu - 1 freq
wtt - 1 freq
doea - 1 freq
tooo - 1 freq
tuo - 1 freq
“to - 1 freq
“tae - 2 freq
dei - 3 freq
daeÂ’e - 1 freq
ddewwy - 1 freq
tiu - 1 freq
doña - 1 freq
dd - 1 freq
dee” - 1 freq
“dee” - 1 freq
“tea - 1 freq
dii - 1 freq
da'y - 2 freq
tè - 1 freq
htew - 1 freq
tth - 1 freq
tew - 1 freq
teaÂ’ - 1 freq
yto - 1 freq
tthe - 1 freq
tuw - 1 freq
dh - 1 freq
tuu - 1 freq
htt - 1 freq
'teu' - 1 freq
ta” - 1 freq
duh - 2 freq
teo - 1 freq
DO
dae - 4498 freq
do - 837 freq
does - 375 freq
did - 2817 freq
doin - 74 freq
daein - 860 freq
div - 506 freq
done - 797 freq
dee - 1204 freq
don't - 560 freq
didn't - 38 freq
dinna - 1817 freq
dinnae - 1921 freq
didnae - 1656 freq
didna - 1623 freq
daenae - 4 freq
disnae - 577 freq
doesnae - 166 freq
doesna - 90 freq
duin - 392 freq
doing - 82 freq
dain - 144 freq
dane - 55 freq
Time to execute Levenshtein function - 0.176711 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.318422 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027879 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.069115 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000930 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.