A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to da in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
da (0) - 9788 freq
ds (1) - 9 freq
sa (1) - 62 freq
dwa (1) - 1 freq
dah (1) - 3 freq
dab (1) - 58 freq
dna (1) - 16 freq
dax (1) - 1 freq
d' (1) - 3 freq
'da (1) - 27 freq
dv (1) - 5 freq
df (1) - 2 freq
ba (1) - 140 freq
aa (1) - 7151 freq
dam (1) - 29 freq
la (1) - 116 freq
db (1) - 2 freq
za (1) - 3 freq
dg (1) - 19 freq
oa (1) - 14 freq
dsa (1) - 1 freq
dal (1) - 2 freq
dx (1) - 3 freq
ga (1) - 29 freq
dl (1) - 2 freq
da (0) - 9788 freq
dy (1) - 236 freq
di (1) - 69 freq
ida (1) - 127 freq
dai (1) - 2 freq
daa (1) - 4 freq
do (1) - 861 freq
ada (1) - 4 freq
du (1) - 727 freq
day (1) - 6020 freq
doa (1) - 1 freq
dao (1) - 1 freq
d (1) - 462 freq
dau (1) - 1 freq
de (1) - 262 freq
dia (1) - 2 freq
dae (1) - 4565 freq
oda (1) - 5 freq
dye (2) - 13 freq
ta (2) - 2534 freq
dt (2) - 3 freq
dii (2) - 1 freq
ˆa (2) - 1 freq
-a (2) - 2 freq
dyi (2) - 1 freq
SoundEx code - D000
day - 6020 freq
dae - 4565 freq
'dae - 55 freq
dee - 1212 freq
da - 9788 freq
due - 177 freq
dao - 1 freq
die - 122 freq
d'ye - 110 freq
d - 462 freq
doo - 126 freq
day- - 8 freq
de - 262 freq
dowie - 119 freq
dou - 13 freq
deh - 5 freq
dau - 1 freq
dew - 30 freq
daw - 20 freq
do - 861 freq
'do - 14 freq
d-day - 3 freq
du - 727 freq
dowe - 1 freq
doh - 31 freq
'd - 10 freq
doe - 6 freq
de- - 2 freq
d' - 3 freq
'd'ye - 22 freq
di - 69 freq
d'you - 11 freq
''d - 4 freq
dewy - 2 freq
dow - 59 freq
doei - 1 freq
dha - 1 freq
dieu - 2 freq
deaw - 1 freq
dhu - 2 freq
d-dae - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
daiy - 1 freq
duo - 3 freq
dye - 13 freq
die' - 1 freq
dae' - 7 freq
day' - 7 freq
dy - 236 freq
dee' - 4 freq
diy - 9 freq
deu - 40 freq
d'yi - 22 freq
doy - 1 freq
dïd - 22 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
dyow - 6 freq
dö - 127 freq
'du' - 2 freq
'dey - 3 freq
öd - 3 freq
'du - 8 freq
'd' - 2 freq
d'eau - 1 freq
d'eau' - 2 freq
daa - 4 freq
'do' - 2 freq
'day - 2 freq
de'i - 1 freq
dæ - 4 freq
dææ - 1 freq
dieh - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
dø - 16 freq
ªd - 3 freq
«d - 2 freq
'de - 3 freq
-d - 8 freq
d'ae - 1 freq
dee-ye - 1 freq
daie - 3 freq
d'a - 1 freq
€™d - 1681 freq
€™da - 9 freq
duy - 3 freq
dü - 1 freq
€˜da - 4 freq
€œdo - 14 freq
€™dy - 25 freq
€˜dae - 10 freq
€œdae - 21 freq
€˜do - 4 freq
dé - 1 freq
dia- - 1 freq
€œd - 4 freq
€˜d - 2 freq
ddd - 13 freq
€œdaa - 3 freq
dah - 3 freq
dai - 2 freq
'd'you - 1 freq
€œday - 1 freq
diyi - 1 freq
€œdee - 1 freq
dewie - 1 freq
dyew - 1 freq
doa - 1 freq
doe-e - 1 freq
€Ÿd - 5 freq
-day - 2 freq
€œda - 10 freq
€œdey - 4 freq
€œdu - 8 freq
€œdy - 2 freq
da-a - 1 freq
€™dae - 1 freq
€™do - 3 freq
d” - 1 freq
dw - 4 freq
da- - 1 freq
dhe - 2 freq
dt - 3 freq
duu - 1 freq
doea - 1 freq
‘d’you - 1 freq
dÂ’ye - 2 freq
dei - 3 freq
daeÂ’e - 1 freq
ddewwy - 1 freq
doña - 1 freq
dd - 1 freq
d'day - 1 freq
dee” - 1 freq
“dee” - 1 freq
dii - 1 freq
da'y - 2 freq
dhowie - 20 freq
dh - 1 freq
dyi - 1 freq
duh - 2 freq
dwa - 1 freq
MetaPhone code - T
tae - 65006 freq
day - 6020 freq
dae - 4565 freq
'dae - 55 freq
tea - 573 freq
dee - 1212 freq
'tae - 47 freq
to - 4164 freq
da - 9788 freq
due - 177 freq
dao - 1 freq
die - 122 freq
t - 5648 freq
ta - 2534 freq
'ti - 3 freq
tt - 36 freq
tie - 88 freq
wyde - 11 freq
toy - 45 freq
ti - 4171 freq
too - 1030 freq
taw - 10 freq
tue - 9 freq
d - 462 freq
doo - 126 freq
tae- - 5 freq
day- - 8 freq
tea- - 5 freq
de - 262 freq
dou - 13 freq
hyte - 3 freq
deh - 5 freq
dau - 1 freq
dew - 30 freq
dough - 18 freq
daw - 20 freq
tae-' - 2 freq
do - 861 freq
'do - 14 freq
'to - 7 freq
wyte - 84 freq
du - 727 freq
tee - 168 freq
tw - 6 freq
t¢ - 1 freq
doh - 31 freq
yt - 12 freq
toi - 34 freq
'd - 10 freq
tah - 2 freq
'toi - 1 freq
'yt - 1 freq
doe - 6 freq
hyde - 16 freq
toa - 2 freq
de- - 2 freq
'tt - 4 freq
d' - 3 freq
tow - 44 freq
t'ae - 1 freq
wyt - 2 freq
di - 69 freq
''d - 4 freq
toe - 22 freq
dewy - 2 freq
te - 1570 freq
dow - 59 freq
doei - 1 freq
tay - 186 freq
dieu - 2 freq
deaw - 1 freq
ta¢ - 6 freq
tae' - 3 freq
't - 23 freq
doughie - 1 freq
toue - 1 freq
dey - 1241 freq
daia - 2 freq
'da - 27 freq
yte - 1 freq
daiy - 1 freq
duo - 3 freq
'hd - 1 freq
'ta' - 1 freq
tahhh - 1 freq
tih - 8 freq
toooo - 1 freq
wt - 5 freq
to' - 2 freq
tai - 1 freq
w'it - 1 freq
die' - 1 freq
dae' - 7 freq
hd - 4 freq
t' - 2 freq
day' - 7 freq
dy - 236 freq
t'a - 17 freq
dee' - 4 freq
diy - 9 freq
ty - 7 freq
tthey - 1 freq
deu - 40 freq
teu - 29 freq
ht - 8 freq
taa - 3 freq
doy - 1 freq
ït - 331 freq
hïd - 6 freq
do' - 2 freq
da' - 3 freq
dia - 2 freq
'dee - 1 freq
dö - 127 freq
öt - 7 freq
tö - 19 freq
'du' - 2 freq
ta-' - 1 freq
'dey - 3 freq
ta- - 1 freq
tu - 23 freq
öd - 3 freq
'du - 8 freq
'ta - 2 freq
hyt - 1 freq
'd' - 2 freq
'to' - 1 freq
d'eau - 1 freq
d'eau' - 2 freq
doughy - 1 freq
daa - 4 freq
wd - 6 freq
'do' - 2 freq
wéit - 1 freq
'day - 2 freq
'tae' - 1 freq
tau - 2 freq
'too - 1 freq
de'i - 1 freq
dæ - 4 freq
dææ - 1 freq
tæ - 3 freq
dieh - 1 freq
wt' - 1 freq
di'a - 2 freq
'dee' - 1 freq
dui - 8 freq
dowy - 1 freq
ytt - 1 freq
taé - 1 freq
dø - 16 freq
™t - 2 freq
ªt - 6 freq
ªd - 3 freq
«d - 2 freq
'de - 3 freq
-d - 8 freq
þæt - 2 freq
žt - 1 freq
teh - 1 freq
tao - 2 freq
d'ae - 1 freq
daie - 3 freq
tey - 3 freq
tua - 2 freq
d'a - 1 freq
€™d - 1681 freq
€™da - 9 freq
duy - 3 freq
dü - 1 freq
tae'a - 1 freq
€˜da - 4 freq
€œdo - 14 freq
tø - 1 freq
€™t - 693 freq
€™dy - 25 freq
€˜dae - 10 freq
€œdae - 21 freq
€˜ti - 2 freq
€˜do - 4 freq
€˜to - 17 freq
€˜t - 2 freq
€œtae - 6 freq
dé - 1 freq
dia- - 1 freq
taew - 1 freq
€œd - 4 freq
€œti - 2 freq
€œtu - 2 freq
€˜d - 2 freq
-to- - 1 freq
'tea' - 1 freq
'te - 1 freq
ddd - 13 freq
€œdaa - 3 freq
dah - 3 freq
€œtoo - 2 freq
€¦to - 1 freq
hte - 1 freq
dai - 2 freq
toh - 4 freq
tui - 2 freq
hyde' - 1 freq
€¦tae - 1 freq
€œday - 1 freq
€œdee - 1 freq
w't - 1 freq
tei - 5 freq
€˜tae - 5 freq
doa - 1 freq
doe-e - 1 freq
€œto - 4 freq
€œt - 1 freq
€™tae - 1 freq
€Ÿd - 5 freq
€Ÿt - 3 freq
-day - 2 freq
€”tae - 1 freq
hyd - 7 freq
€˜too - 1 freq
€˜-t - 3 freq
€œda - 10 freq
€œdey - 4 freq
€œdu - 8 freq
€œdy - 2 freq
da-a - 1 freq
€™hd - 1 freq
doagh - 1 freq
'tooooo' - 1 freq
€™dae - 1 freq
€™do - 3 freq
d” - 1 freq
wwdh - 1 freq
dw - 4 freq
da- - 1 freq
t” - 1 freq
ytuo - 1 freq
‘to - 2 freq
ydo - 1 freq
tii - 1 freq
duu - 1 freq
wtt - 1 freq
doea - 1 freq
tooo - 1 freq
tuo - 1 freq
“to - 1 freq
“tae - 2 freq
dei - 3 freq
daeÂ’e - 1 freq
ddewwy - 1 freq
tiu - 1 freq
doña - 1 freq
dd - 1 freq
dee” - 1 freq
“dee” - 1 freq
“tea - 1 freq
dii - 1 freq
da'y - 2 freq
tè - 1 freq
htew - 1 freq
tth - 1 freq
tew - 1 freq
teaÂ’ - 1 freq
yto - 1 freq
tthe - 1 freq
tuw - 1 freq
dh - 1 freq
tuu - 1 freq
htt - 1 freq
'teu' - 1 freq
ta” - 1 freq
duh - 2 freq
teo - 1 freq
DA
the - 157218 freq
da - 9788 freq
tha - 6295 freq
e - 4634 freq
th - 2479 freq
thi - 2576 freq
the' - 572 freq
Time to execute Levenshtein function - 0.198361 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.320586 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027861 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037331 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000918 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.