A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to today in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
today (0) - 154 freq
taday (1) - 2 freq
toddy (1) - 15 freq
to-day (1) - 1 freq
thray (2) - 2 freq
tod' (2) - 2 freq
total (2) - 96 freq
sodas (2) - 1 freq
stodgy (2) - 1 freq
yoda (2) - 3 freq
todds (2) - 1 freq
oda (2) - 5 freq
tod (2) - 275 freq
tods (2) - 48 freq
boay (2) - 199 freq
b'day (2) - 2 freq
totty (2) - 12 freq
th'day (2) - 4 freq
totsy (2) - 1 freq
'okay (2) - 7 freq
torty (2) - 1 freq
awday (2) - 2 freq
rody (2) - 1 freq
i'day (2) - 32 freq
to-an (2) - 1 freq
today (0) - 154 freq
taday (1) - 2 freq
tod (2) - 275 freq
tidy (2) - 50 freq
taeday (2) - 1 freq
toad (2) - 5 freq
todo (2) - 1 freq
toddy (2) - 15 freq
to-day (2) - 1 freq
bday (3) - 1 freq
tide (3) - 117 freq
soda (3) - 20 freq
cody (3) - 1 freq
toyal (3) - 1 freq
t'da (3) - 5 freq
teddy (3) - 16 freq
toley (3) - 1 freq
teady (3) - 1 freq
tood (3) - 1 freq
theday (3) - 57 freq
toga (3) - 2 freq
'day (3) - 2 freq
tovey (3) - 2 freq
tyde (3) - 24 freq
toap (3) - 37 freq
SoundEx code - T300
that - 26604 freq
'that - 72 freq
the-day - 32 freq
tied - 119 freq
thit - 566 freq
tide - 117 freq
tattoo - 12 freq
teeth - 316 freq
tait - 42 freq
tottie - 106 freq
they'd - 430 freq
'that' - 4 freq
thud - 17 freq
tattie - 157 freq
ted - 13 freq
toot - 33 freq
totey - 20 freq
that'd - 29 freq
tidy - 50 freq
that-ah - 1 freq
tae-tae - 2 freq
tit - 23 freq
tuith - 11 freq
tid - 25 freq
tweed - 25 freq
tote - 4 freq
thoat - 103 freq
tooth - 21 freq
toaty - 8 freq
today - 154 freq
totie - 18 freq
that' - 19 freq
tod - 275 freq
toothy - 4 freq
toatie - 6 freq
teddy - 16 freq
thowed - 3 freq
tae-dae - 6 freq
thoeht - 4 freq
-that - 1 freq
tat - 16 freq
towt - 41 freq
tite - 2 freq
''that - 1 freq
thay'd - 6 freq
thet - 9 freq
tate - 12 freq
tie-dye - 1 freq
theday - 57 freq
tut - 67 freq
tithe - 4 freq
tot - 4 freq
ta-ta - 2 freq
twit - 9 freq
theit - 2 freq
toad - 5 freq
thowt - 97 freq
taewatt - 1 freq
thate - 1 freq
tutu - 3 freq
tae't - 7 freq
thuddy - 1 freq
tyde - 24 freq
tyed - 2 freq
towed - 6 freq
theyd - 10 freq
tweet - 100 freq
toty - 41 freq
totty - 12 freq
tatty - 11 freq
todd - 7 freq
thaot - 1 freq
tad - 54 freq
téte-a-téte - 2 freq
thid - 1 freq
'tattie' - 2 freq
'tit - 1 freq
th'day - 4 freq
twat - 11 freq
thout - 2 freq
toate - 1 freq
that'ah - 1 freq
taid - 2 freq
the'd - 4 freq
tottte - 1 freq
tothe - 2 freq
'they'd - 2 freq
thut - 29 freq
tood - 1 freq
toddy - 15 freq
toeht - 1 freq
taday - 2 freq
taut - 2 freq
thoo'd - 2 freq
twathie - 8 freq
t'wud - 1 freq
thed - 1 freq
totue - 1 freq
'tottie - 1 freq
ti'd - 1 freq
tottiewie - 1 freq
towit - 1 freq
teith - 5 freq
tittie - 30 freq
'tod - 4 freq
tod' - 2 freq
tead - 1 freq
tet - 1 freq
they'ed - 1 freq
tiyt - 1 freq
teedee - 1 freq
tee'd - 2 freq
teet - 13 freq
tyd - 4 freq
that- - 1 freq
'twad - 4 freq
twad - 1 freq
thooht - 1 freq
tedd - 1 freq
tuatha - 1 freq
theat - 1 freq
tuithie - 3 freq
thai'd - 5 freq
thae'd - 1 freq
tide' - 1 freq
that - 52 freq
that - 79 freq
tweed - 1 freq
tout - 3 freq
taed - 15 freq
twid - 1 freq
they'd - 1 freq
étude - 1 freq
tête-a-tête - 1 freq
toyota - 1 freq
twa-twa - 1 freq
to-day - 1 freq
tito - 2 freq
twitty - 1 freq
t-that - 1 freq
twatt - 7 freq
tat' - 1 freq
towt - 1 freq
tad' - 1 freq
tae-haud - 1 freq
tayto - 1 freq
thote - 4 freq
toto - 6 freq
tattie - 1 freq
thud - 1 freq
that - 2 freq
thawed - 1 freq
that - 1 freq
that - 17 freq
tad - 2 freq
tuithy - 1 freq
teath - 1 freq
tweet - 1 freq
thot - 21 freq
today - 1 freq
titi - 1 freq
theehut - 1 freq
thetwo - 1 freq
they’d - 3 freq
totti - 8 freq
thatd - 1 freq
toadie - 1 freq
twittee - 1 freq
teady - 1 freq
tee-he'd - 1 freq
‘that - 1 freq
tweedie - 3 freq
that’d - 1 freq
that“ - 1 freq
thehead - 2 freq
teduio - 1 freq
tawtie - 1 freq
t’whit - 1 freq
twite - 2 freq
tatt - 1 freq
totd - 1 freq
tide” - 1 freq
todo - 1 freq
thd - 1 freq
thait - 3 freq
tuath - 1 freq
teat - 1 freq
twt - 1 freq
taeday - 1 freq
th’day - 1 freq
toht - 1 freq
MetaPhone code - TT
did - 2817 freq
deid - 946 freq
'did - 51 freq
doot - 565 freq
tied - 119 freq
tide - 117 freq
tattoo - 12 freq
tait - 42 freq
tottie - 106 freq
daud - 78 freq
dee'd - 93 freq
tattie - 157 freq
dead - 272 freq
ted - 13 freq
dout - 167 freq
wydit - 1 freq
toot - 33 freq
totey - 20 freq
dodo - 79 freq
tidy - 50 freq
died - 152 freq
tae-tae - 2 freq
tit - 23 freq
deid-a - 1 freq
tid - 25 freq
dod - 129 freq
tote - 4 freq
dad - 261 freq
dotty - 1 freq
duty - 77 freq
dot - 47 freq
toaty - 8 freq
today - 154 freq
totie - 18 freq
d-day - 3 freq
dae't - 7 freq
dee't - 36 freq
'deed - 19 freq
deed - 235 freq
tod - 275 freq
deity - 2 freq
toatie - 6 freq
date - 160 freq
diet - 70 freq
teddy - 16 freq
tae-dae - 6 freq
t'd - 2 freq
tat - 16 freq
daddy - 124 freq
dautie - 5 freq
towt - 41 freq
tite - 2 freq
deid' - 3 freq
dae'd - 1 freq
wytit - 20 freq
deet - 24 freq
'deid - 3 freq
tate - 12 freq
dawtie - 6 freq
'deid' - 2 freq
wyted - 13 freq
tut - 67 freq
diddy - 5 freq
doddie - 4 freq
ditty - 4 freq
tot - 4 freq
ta-ta - 2 freq
dottie - 84 freq
d-dae - 1 freq
toad - 5 freq
daed - 19 freq
deat - 1 freq
day-oot - 1 freq
tutu - 3 freq
tae't - 7 freq
dowt - 6 freq
tyde - 24 freq
toty - 41 freq
totty - 12 freq
did' - 1 freq
deeit - 1 freq
tatty - 11 freq
todd - 7 freq
tad - 54 freq
'daddy - 7 freq
duddie - 1 freq
'tattie' - 2 freq
doyt - 1 freq
'tit - 1 freq
data - 68 freq
dei'd - 1 freq
dude - 20 freq
toate - 1 freq
dote - 2 freq
'dod - 1 freq
doad - 3 freq
taid - 2 freq
td - 9 freq
dïd - 22 freq
dutie - 1 freq
tottte - 1 freq
date' - 1 freq
'deed' - 1 freq
daad - 5 freq
tood - 1 freq
doughty - 3 freq
deit - 9 freq
'dad - 2 freq
toddy - 15 freq
't'd - 1 freq
t'da - 5 freq
dat - 1391 freq
dadd - 2 freq
toeht - 1 freq
taday - 2 freq
'dat - 2 freq
dadda - 1 freq
taut - 2 freq
deday - 5 freq
dud - 2 freq
totue - 1 freq
'tottie - 1 freq
ti'd - 1 freq
dewtie - 8 freq
tittie - 30 freq
'tod - 4 freq
tod' - 2 freq
'dee'd - 1 freq
'dee'd' - 1 freq
tead - 1 freq
tet - 1 freq
deeid - 15 freq
doit - 3 freq
t'tow - 1 freq
daday - 74 freq
tiyt - 1 freq
dee'at - 1 freq
de'ed - 7 freq
teedee - 1 freq
daid - 5 freq
duid - 1 freq
dowdy - 1 freq
day-daw - 1 freq
tee'd - 2 freq
teet - 13 freq
du'd - 2 freq
tyd - 4 freq
t'die - 1 freq
daddie - 3 freq
daddie' - 1 freq
ded - 4 freq
doute - 2 freq
dey'd - 4 freq
daatie - 6 freq
tedd - 1 freq
dyte - 2 freq
dede - 3 freq
dei't - 2 freq
díed - 1 freq
tide' - 1 freq
dodie - 18 freq
dat - 3 freq
dodie - 1 freq
did - 17 freq
doddy - 4 freq
doddy - 1 freq
duty - 1 freq
töd - 1 freq
tout - 3 freq
taed - 15 freq
dow'd - 1 freq
wyded - 1 freq
det - 3 freq
étude - 1 freq
to-day - 1 freq
tito - 2 freq
dad - 2 freq
dat - 8 freq
tat' - 1 freq
doot - 1 freq
didie - 5 freq
dide - 2 freq
did - 6 freq
date - 1 freq
daddy - 1 freq
daode - 1 freq
dyde - 1 freq
towt - 1 freq
dydie - 3 freq
didi - 1 freq
tad' - 1 freq
tayto - 1 freq
toto - 6 freq
tattie - 1 freq
deyd - 13 freq
dada - 12 freq
tad - 2 freq
dotie - 1 freq
dode - 2 freq
deud - 1 freq
today - 1 freq
did - 2 freq
doat - 1 freq
yytde - 1 freq
dohd - 1 freq
titi - 1 freq
“daddy - 1 freq
“daaaaaaaaaaaaaaaad” - 1 freq
ditto - 2 freq
dought - 1 freq
dt - 3 freq
‘did - 1 freq
totti - 8 freq
toadie - 1 freq
dado - 1 freq
teady - 1 freq
duyd - 1 freq
dwyhd - 1 freq
dht - 1 freq
diddie - 2 freq
'dido - 1 freq
teduio - 1 freq
tawtie - 1 freq
dood - 1 freq
tatt - 1 freq
d'day - 1 freq
“dey’d - 1 freq
tide” - 1 freq
todo - 1 freq
teat - 1 freq
twt - 1 freq
taeday - 1 freq
toht - 1 freq
TODAY
Time to execute Levenshtein function - 0.479848 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.206608 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.099068 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.182541 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000743 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.