A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to td in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
td (0) - 9 freq
-d (1) - 8 freq
d (1) - 462 freq
thd (1) - 1 freq
tq (1) - 2 freq
tod (1) - 275 freq
jd (1) - 5 freq
gd (1) - 12 freq
tm (1) - 7 freq
id (1) - 597 freq
ftd (1) - 1 freq
d (1) - 3 freq
ts (1) - 41 freq
ta (1) - 2534 freq
cd (1) - 48 freq
fd (1) - 3 freq
ud (1) - 4 freq
ed (1) - 53 freq
ld (1) - 3 freq
tf (1) - 9 freq
sd (1) - 7 freq
tr (1) - 8 freq
tu (1) - 23 freq
tb (1) - 14 freq
std (1) - 1 freq
td (0) - 9 freq
tyd (1) - 4 freq
tod (1) - 275 freq
tad (1) - 54 freq
tid (1) - 25 freq
ted (1) - 13 freq
utd (1) - 11 freq
to (2) - 4049 freq
ltd (2) - 8 freq
ty (2) - 7 freq
tl (2) - 8 freq
dd (2) - 1 freq
tv (2) - 207 freq
vd (2) - 2 freq
th (2) - 2472 freq
kd (2) - 2 freq
tp (2) - 5 freq
nd (2) - 88 freq
tc (2) - 3 freq
tt (2) - 36 freq
od (2) - 8 freq
tg (2) - 7 freq
xd (2) - 4 freq
todo (2) - 1 freq
ootd (2) - 2 freq
SoundEx code - T000
the - 154319 freq
tae - 64038 freq
twa - 3470 freq
they - 11266 freq
tea - 560 freq
'tae - 47 freq
to - 4049 freq
two - 717 freq
thae - 1219 freq
tho - 1074 freq
'the - 348 freq
t - 5646 freq
ta - 2534 freq
tha - 6292 freq
'they - 48 freq
'thae - 1 freq
'ti - 3 freq
tt - 36 freq
tie - 88 freq
twae - 228 freq
thou - 95 freq
thay - 703 freq
towe - 21 freq
toy - 44 freq
ti - 4160 freq
th - 2472 freq
too - 992 freq
they' - 13 freq
taw - 9 freq
tue - 9 freq
thai - 445 freq
tae- - 5 freq
the- - 2 freq
tea- - 5 freq
two' - 1 freq
tae-' - 2 freq
tia - 2 freq
'to - 6 freq
'tho - 1 freq
'twa - 13 freq
tee - 165 freq
tw - 6 freq
- 1 freq
thy - 97 freq
toi - 34 freq
tah - 2 freq
'toi - 1 freq
thow - 2 freq
toa - 2 freq
t'd - 2 freq
'tt - 4 freq
they-eh - 1 freq
the' - 572 freq
tow - 44 freq
t'ae - 1 freq
tho' - 48 freq
twaa - 30 freq
th' - 105 freq
'the' - 6 freq
tee-hee - 1 freq
toe - 21 freq
thi - 2576 freq
te - 1569 freq
thay' - 1 freq
thee - 233 freq
thu - 23 freq
tay - 185 freq
thee' - 1 freq
ta¢ - 5 freq
thei - 5 freq
thé - 1 freq
tae' - 3 freq
't - 21 freq
toue - 1 freq
thaw - 11 freq
tha' - 11 freq
thoo - 277 freq
theiy - 6 freq
tye - 8 freq
'they' - 2 freq
to' - 2 freq
tai - 1 freq
they'u - 1 freq
t-t-twa - 1 freq
t' - 2 freq
thowe - 4 freq
'tha - 16 freq
t'a - 17 freq
thé - 1 freq
ty - 7 freq
theyaw - 1 freq
'th - 1 freq
twae' - 1 freq
tthey - 1 freq
too-whoo - 1 freq
teu - 29 freq
taa - 3 freq
twa' - 1 freq
td - 9 freq
'th- - 1 freq
ït - 331 freq
°tha - 1 freq
'two - 2 freq
twee - 5 freq
't'd - 1 freq
t'da - 5 freq
twi - 1 freq
öt - 7 freq
- 19 freq
ta-' - 1 freq
ta- - 1 freq
tu - 23 freq
'ta - 2 freq
twa-wey - 6 freq
'th' - 1 freq
'thou' - 2 freq
'thee' - 2 freq
'thy' - 2 freq
'to' - 1 freq
t-hah - 1 freq
thie - 8 freq
'tae' - 1 freq
tau - 2 freq
thoo' - 1 freq
'too - 1 freq
t'wo - 1 freq
t'tow - 1 freq
- 3 freq
'thy - 3 freq
t'die - 1 freq
thoa - 4 freq
two'why - 1 freq
taé - 1 freq
t - 2 freq
t - 6 freq
þæt - 2 freq
t - 1 freq
teh - 1 freq
tao - 2 freq
tey - 3 freq
tua - 2 freq
the - 177 freq
tae'a - 1 freq
tih - 7 freq
- 1 freq
t - 688 freq
theh - 1 freq
the - 108 freq
tho - 2 freq
ti - 2 freq
they - 24 freq
tho - 1 freq
th - 2 freq
to - 17 freq
thay - 2 freq
thew - 1 freq
t - 2 freq
the - 3 freq
tae - 6 freq
twa - 1 freq
the - 6 freq
the - 8 freq
töd - 1 freq
taew - 1 freq
they - 2 freq
thai - 6 freq
thae - 2 freq
ti - 2 freq
they - 47 freq
tu - 2 freq
thae - 6 freq
thay - 2 freq
-to- - 1 freq
'tea' - 1 freq
'te - 1 freq
thoo - 2 freq
thee - 1 freq
the - 4 freq
thoo - 30 freq
too - 2 freq
to - 1 freq
tha - 1 freq
the - 1 freq
toh - 4 freq
tui - 2 freq
two - 2 freq
thai - 2 freq
twa - 2 freq
tha - 3 freq
tae - 1 freq
tei - 5 freq
theii - 1 freq
tae - 5 freq
thy - 2 freq
to - 4 freq
t - 1 freq
tae - 1 freq
t - 3 freq
tae - 1 freq
too - 1 freq
-t - 3 freq
'tooooo' - 1 freq
they - 2 freq
twa - 1 freq
thaa - 1 freq
tuyu - 1 freq
- 1 freq
‘to - 2 freq
tii - 1 freq
‘the - 2 freq
theaa - 1 freq
thea - 1 freq
the… - 1 freq
the“i” - 1 freq
“the - 4 freq
tooo - 1 freq
tuo - 1 freq
“to - 1 freq
“they - 1 freq
“tae - 2 freq
tiu - 1 freq
“tea - 1 freq
- 1 freq
tth - 1 freq
tew - 1 freq
tea’ - 1 freq
tthe - 1 freq
tuw - 1 freq
tuu - 1 freq
'teu' - 1 freq
ta” - 1 freq
theo - 1 freq
teo - 1 freq
MetaPhone code - TT
did - 2817 freq
deid - 946 freq
'did - 51 freq
doot - 565 freq
tied - 119 freq
tide - 117 freq
tattoo - 12 freq
tait - 42 freq
tottie - 106 freq
daud - 78 freq
dee'd - 93 freq
tattie - 157 freq
dead - 272 freq
ted - 13 freq
dout - 167 freq
wydit - 1 freq
toot - 33 freq
totey - 20 freq
dodo - 79 freq
tidy - 50 freq
died - 152 freq
tae-tae - 2 freq
tit - 23 freq
deid-a - 1 freq
tid - 25 freq
dod - 129 freq
tote - 4 freq
dad - 261 freq
dotty - 1 freq
duty - 77 freq
dot - 47 freq
toaty - 8 freq
today - 154 freq
totie - 18 freq
d-day - 3 freq
dae't - 7 freq
dee't - 36 freq
'deed - 19 freq
deed - 235 freq
tod - 275 freq
deity - 2 freq
toatie - 6 freq
date - 160 freq
diet - 70 freq
teddy - 16 freq
tae-dae - 6 freq
t'd - 2 freq
tat - 16 freq
daddy - 124 freq
dautie - 5 freq
towt - 41 freq
tite - 2 freq
deid' - 3 freq
dae'd - 1 freq
wytit - 20 freq
deet - 24 freq
'deid - 3 freq
tate - 12 freq
dawtie - 6 freq
'deid' - 2 freq
wyted - 13 freq
tut - 67 freq
diddy - 5 freq
doddie - 4 freq
ditty - 4 freq
tot - 4 freq
ta-ta - 2 freq
dottie - 84 freq
d-dae - 1 freq
toad - 5 freq
daed - 19 freq
deat - 1 freq
day-oot - 1 freq
tutu - 3 freq
tae't - 7 freq
dowt - 6 freq
tyde - 24 freq
toty - 41 freq
totty - 12 freq
did' - 1 freq
deeit - 1 freq
tatty - 11 freq
todd - 7 freq
tad - 54 freq
'daddy - 7 freq
duddie - 1 freq
'tattie' - 2 freq
doyt - 1 freq
'tit - 1 freq
data - 68 freq
dei'd - 1 freq
dude - 20 freq
toate - 1 freq
dote - 2 freq
'dod - 1 freq
doad - 3 freq
taid - 2 freq
td - 9 freq
dïd - 22 freq
dutie - 1 freq
tottte - 1 freq
date' - 1 freq
'deed' - 1 freq
daad - 5 freq
tood - 1 freq
doughty - 3 freq
deit - 9 freq
'dad - 2 freq
toddy - 15 freq
't'd - 1 freq
t'da - 5 freq
dat - 1391 freq
dadd - 2 freq
toeht - 1 freq
taday - 2 freq
'dat - 2 freq
dadda - 1 freq
taut - 2 freq
deday - 5 freq
dud - 2 freq
totue - 1 freq
'tottie - 1 freq
ti'd - 1 freq
dewtie - 8 freq
tittie - 30 freq
'tod - 4 freq
tod' - 2 freq
'dee'd - 1 freq
'dee'd' - 1 freq
tead - 1 freq
tet - 1 freq
deeid - 15 freq
doit - 3 freq
t'tow - 1 freq
daday - 74 freq
tiyt - 1 freq
dee'at - 1 freq
de'ed - 7 freq
teedee - 1 freq
daid - 5 freq
duid - 1 freq
dowdy - 1 freq
day-daw - 1 freq
tee'd - 2 freq
teet - 13 freq
du'd - 2 freq
tyd - 4 freq
t'die - 1 freq
daddie - 3 freq
daddie' - 1 freq
ded - 4 freq
doute - 2 freq
dey'd - 4 freq
daatie - 6 freq
tedd - 1 freq
dyte - 2 freq
dede - 3 freq
dei't - 2 freq
díed - 1 freq
tide' - 1 freq
dodie - 18 freq
dat - 3 freq
dodie - 1 freq
did - 17 freq
doddy - 4 freq
doddy - 1 freq
duty - 1 freq
töd - 1 freq
tout - 3 freq
taed - 15 freq
dow'd - 1 freq
wyded - 1 freq
det - 3 freq
étude - 1 freq
to-day - 1 freq
tito - 2 freq
dad - 2 freq
dat - 8 freq
tat' - 1 freq
doot - 1 freq
didie - 5 freq
dide - 2 freq
did - 6 freq
date - 1 freq
daddy - 1 freq
daode - 1 freq
dyde - 1 freq
towt - 1 freq
dydie - 3 freq
didi - 1 freq
tad' - 1 freq
tayto - 1 freq
toto - 6 freq
tattie - 1 freq
deyd - 13 freq
dada - 12 freq
tad - 2 freq
dotie - 1 freq
dode - 2 freq
deud - 1 freq
today - 1 freq
did - 2 freq
doat - 1 freq
yytde - 1 freq
dohd - 1 freq
titi - 1 freq
“daddy - 1 freq
“daaaaaaaaaaaaaaaad” - 1 freq
ditto - 2 freq
dought - 1 freq
dt - 3 freq
‘did - 1 freq
totti - 8 freq
toadie - 1 freq
dado - 1 freq
teady - 1 freq
duyd - 1 freq
dwyhd - 1 freq
dht - 1 freq
diddie - 2 freq
'dido - 1 freq
teduio - 1 freq
tawtie - 1 freq
dood - 1 freq
tatt - 1 freq
d'day - 1 freq
“dey’d - 1 freq
tide” - 1 freq
todo - 1 freq
teat - 1 freq
twt - 1 freq
taeday - 1 freq
toht - 1 freq
TD
Time to execute Levenshtein function - 0.303009 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
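As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. The function name `levenshtein` is chosen for this example; nothing here is taken from the site's own code, which is not shown on this page.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in b so the working rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("td", "tod"))  # one insertion separates the two words
```

With this sketch, `td` and `tod` come out one edit apart, which agrees with the `tod (1)` entry in the listing.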
Time to execute Double Levenshtein function - 0.493394 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with their vowels removed, and adds the two distances together, giving double weight to consonants.
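The idea can be sketched in Python as follows. This is an illustration under assumptions, not the site's implementation: the names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set `aeiouy` is a guess. Treating `y` as a vowel is inferred only from the listing, where `tyd` scores 1 against `td`.

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel (see lead-in)


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for candidate in ("tod", "to", "ltd", "todo"):
    print(candidate, double_levenshtein("td", candidate))
```

A vowel-only difference such as `td` / `tod` costs 1, because the consonant skeletons are identical, while a consonant difference such as `td` / `to` costs 2, because it is counted in both passes.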
Time to execute SoundEx function - 0.031839 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
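To make the encoding concrete, here is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent letters that share a digit, and pad to four characters. The function name `soundex` and the handling of non-ASCII characters (simply skipped) are assumptions made for this example; the site's own function may differ in such details.

```python
# Letter groups that share a Soundex digit; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: anything outside a-z (apostrophes, hyphens, accents) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit can be coded again
    return result.ljust(4, "0")


print(soundex("td"), soundex("the"), soundex("tae"))
```

Because `t` and `d` share the digit 3, the `d` in `td` is absorbed into the leading `T`, which is why `td` lands in the same `T000` group as `the` and `tae` in the listing.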
Time to execute MetaPhone function - 0.040688 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000812 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
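A lookup of this kind can be sketched in Python as a plain dictionary. The entries below are hypothetical and exist only to show the shape of the table; they are not taken from the corpus lexicon, and the names `LEXICON` and `lemma_for` are invented for the example.

```python
from typing import Optional

# Hypothetical entries for illustration only: surface form -> lemma.
LEXICON = {
    "tae": "tae",
    "'tae": "tae",
    "tae'": "tae",
    "tae-": "tae",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("tae'"))   # found in the hypothetical table
print(lemma_for("xyzzy"))  # not covered, so the lookup returns None
```

Returning `None` for a missing word reflects the note above that not all words are covered, and a dictionary lookup needs no per-word computation, which fits it having the shortest execution time reported on this page.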