A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to doughty in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
doughty (0) - 3 freq
doughy (1) - 1 freq
dought (1) - 1 freq
douchty (1) - 3 freq
drouchty (2) - 1 freq
nought (2) - 32 freq
bought (2) - 83 freq
tought (2) - 3 freq
roughly (2) - 16 freq
foushty (2) - 1 freq
noughte (2) - 1 freq
dougray (2) - 1 freq
ought (2) - 12 freq
broughty (2) - 7 freq
doughie (2) - 1 freq
draughty (2) - 1 freq
rought (2) - 1 freq
sought (2) - 10 freq
haughty (2) - 3 freq
dough (2) - 18 freq
fought (2) - 19 freq
dighty (2) - 2 freq
drought (2) - 4 freq
naughty (2) - 9 freq
hough' (3) - 1 freq
doughty (0) - 3 freq
dought (1) - 1 freq
doughy (2) - 1 freq
dighty (2) - 2 freq
douchty (2) - 3 freq
sought (3) - 10 freq
draughty (3) - 1 freq
haughty (3) - 3 freq
rought (3) - 1 freq
naughty (3) - 9 freq
dight (3) - 3 freq
doughie (3) - 1 freq
drought (3) - 4 freq
fought (3) - 19 freq
dough (3) - 18 freq
nought (3) - 32 freq
tought (3) - 3 freq
noughte (3) - 1 freq
bought (3) - 83 freq
ought (3) - 12 freq
mighty (4) - 41 freq
oaght (4) - 1 freq
righty (4) - 1 freq
boght (4) - 2 freq
faught (4) - 2 freq
SoundEx code - D230
dicht - 96 freq
decide - 121 freq
dooked - 12 freq
doocot - 28 freq
dusty - 18 freq
douked - 4 freq
dust - 89 freq
dozed - 8 freq
decade - 30 freq
dist - 18 freq
dowiest - 1 freq
dazed - 4 freq
dogged - 3 freq
doukt - 3 freq
dockside - 9 freq
dookit - 10 freq
dick'd - 1 freq
dished - 10 freq
doused - 2 freq
deeside - 14 freq
dwight - 1 freq
dockhead - 1 freq
dash't - 2 freq
dowsed - 1 freq
docht - 4 freq
dashit - 2 freq
dousit - 1 freq
daoist - 2 freq
doukit - 3 freq
diseyd - 1 freq
deesyde - 1 freq
dukket - 1 freq
deceit - 5 freq
dickhead - 2 freq
duct - 4 freq
dashed - 14 freq
docket - 4 freq
dis't - 4 freq
decked - 8 freq
doacked - 1 freq
dichtt - 1 freq
ducked - 4 freq
dake-the - 1 freq
dost - 39 freq
duckit - 1 freq
deckit - 6 freq
dosed - 2 freq
dighty - 2 freq
doosht - 2 freq
duist - 1 freq
doughty - 3 freq
dish't - 1 freq
deckt - 3 freq
diced - 4 freq
decode - 2 freq
dight - 3 freq
dekkid - 1 freq
deshed - 1 freq
decayed - 1 freq
distie - 1 freq
disyde - 1 freq
dakota - 1 freq
decid - 1 freq
duggid - 1 freq
deeskit - 1 freq
dae-guid - 2 freq
douchty - 3 freq
dockit - 4 freq
duckweed - 1 freq
dossed - 1 freq
decait - 2 freq
dis-the - 1 freq
dochtie - 3 freq
dayset - 5 freq
doosit - 1 freq
daes't - 1 freq
dizzied - 1 freq
dioxide - 2 freq
diskythe - 2 freq
doosed - 1 freq
dog-shite - 1 freq
dasht - 1 freq
doocoot - 1 freq
daisy'd - 1 freq
duguid - 36 freq
docquet - 2 freq
doo-cot - 1 freq
decayit - 2 freq
dukit - 1 freq
dishit - 1 freq
dug-shite - 1 freq
docked - 2 freq
dichit - 1 freq
dogshit - 2 freq
dugged - 1 freq
dtjkiyd - 1 freq
dought - 1 freq
dogscott - 1 freq
‘dogged’ - 1 freq
decht - 1 freq
dzd - 1 freq
dquyda - 1 freq
djkd - 1 freq
dhgate - 1 freq
MetaPhone code - TT
did - 2859 freq
deid - 954 freq
'did - 52 freq
doot - 573 freq
tied - 120 freq
tide - 118 freq
tattoo - 12 freq
tait - 42 freq
tottie - 107 freq
daud - 79 freq
dee'd - 94 freq
tattie - 158 freq
dead - 273 freq
ted - 13 freq
dout - 167 freq
wydit - 1 freq
toot - 33 freq
totey - 20 freq
dodo - 79 freq
tidy - 50 freq
died - 158 freq
tae-tae - 2 freq
tit - 26 freq
deid-a - 1 freq
tid - 25 freq
dod - 130 freq
tote - 4 freq
dad - 266 freq
dotty - 1 freq
duty - 77 freq
dot - 47 freq
toaty - 8 freq
today - 161 freq
totie - 18 freq
d-day - 3 freq
dae't - 7 freq
dee't - 36 freq
'deed - 19 freq
deed - 236 freq
tod - 276 freq
deity - 3 freq
toatie - 6 freq
date - 162 freq
diet - 71 freq
teddy - 16 freq
tae-dae - 6 freq
t'd - 2 freq
tat - 16 freq
daddy - 140 freq
dautie - 5 freq
towt - 41 freq
tite - 2 freq
deid' - 3 freq
dae'd - 1 freq
wytit - 20 freq
deet - 24 freq
'deid - 3 freq
tate - 12 freq
dawtie - 6 freq
'deid' - 2 freq
wyted - 13 freq
tut - 67 freq
diddy - 5 freq
doddie - 4 freq
ditty - 4 freq
tot - 4 freq
ta-ta - 2 freq
dottie - 84 freq
d-dae - 1 freq
toad - 5 freq
daed - 19 freq
deat - 1 freq
day-oot - 1 freq
tutu - 3 freq
tae't - 7 freq
dowt - 6 freq
tyde - 24 freq
toty - 42 freq
totty - 12 freq
did' - 1 freq
deeit - 1 freq
doud - 1 freq
toud - 1 freq
tatty - 11 freq
todd - 7 freq
tad - 54 freq
'daddy - 7 freq
duddie - 1 freq
'tattie' - 2 freq
doyt - 1 freq
'tit - 1 freq
data - 68 freq
dei'd - 1 freq
dude - 20 freq
toate - 1 freq
dote - 2 freq
'dod - 1 freq
doad - 3 freq
taid - 2 freq
td - 9 freq
dïd - 22 freq
dutie - 1 freq
tottte - 1 freq
date' - 1 freq
'deed' - 1 freq
daad - 5 freq
tood - 1 freq
doughty - 3 freq
deit - 9 freq
'dad - 2 freq
toddy - 15 freq
't'd - 1 freq
t'da - 5 freq
dat - 1391 freq
dadd - 2 freq
toeht - 1 freq
taday - 2 freq
'dat - 2 freq
dadda - 1 freq
taut - 2 freq
deday - 5 freq
dud - 2 freq
totue - 1 freq
'tottie - 1 freq
ti'd - 1 freq
dewtie - 8 freq
tittie - 30 freq
'tod - 4 freq
tod' - 2 freq
'dee'd - 1 freq
'dee'd' - 1 freq
tead - 1 freq
tet - 1 freq
deeid - 15 freq
doit - 3 freq
t'tow - 1 freq
daday - 74 freq
tiyt - 1 freq
dee'at - 1 freq
de'ed - 7 freq
teedee - 1 freq
daid - 5 freq
duid - 1 freq
dowdy - 1 freq
day-daw - 1 freq
tee'd - 2 freq
teet - 13 freq
du'd - 2 freq
tyd - 4 freq
t'die - 1 freq
daddie - 3 freq
daddie' - 1 freq
ded - 4 freq
doute - 2 freq
dey'd - 4 freq
daatie - 6 freq
tedd - 1 freq
dyte - 2 freq
dede - 3 freq
dei't - 2 freq
díed - 1 freq
tide' - 1 freq
dodie - 18 freq
€˜dat - 3 freq
€˜dodie - 1 freq
€œdid - 17 freq
doddy - 4 freq
€œdoddy - 1 freq
€˜duty - 1 freq
töd - 1 freq
tout - 3 freq
taed - 15 freq
dow'd - 1 freq
wyded - 1 freq
det - 3 freq
étude - 1 freq
to-day - 1 freq
tito - 2 freq
€œdad - 2 freq
€œdat - 8 freq
tat' - 1 freq
€™doot - 1 freq
didie - 5 freq
dide - 2 freq
€˜did - 6 freq
€œdate - 1 freq
€œdaddy - 1 freq
daode - 1 freq
dyde - 1 freq
€˜towt - 1 freq
dydie - 3 freq
didi - 1 freq
tad' - 1 freq
tayto - 1 freq
toto - 6 freq
€œtattie - 1 freq
deyd - 13 freq
dada - 12 freq
€œtad - 2 freq
dotie - 1 freq
dode - 2 freq
deud - 1 freq
€œtoday - 1 freq
€™did - 2 freq
doat - 1 freq
yytde - 1 freq
dohd - 1 freq
titi - 1 freq
“daddy - 1 freq
“daaaaaaaaaaaaaaaad” - 1 freq
ditto - 2 freq
dought - 1 freq
dt - 3 freq
‘did - 1 freq
totti - 8 freq
toadie - 1 freq
dado - 1 freq
teady - 1 freq
duyd - 1 freq
dwyhd - 1 freq
dht - 1 freq
diddie - 2 freq
'dido - 1 freq
teduio - 1 freq
tawtie - 1 freq
dood - 1 freq
tatt - 1 freq
d'day - 1 freq
“dey’d - 1 freq
tide” - 1 freq
todo - 1 freq
teat - 1 freq
twt - 1 freq
taeday - 1 freq
toht - 1 freq
DOUGHTY
Time to execute Levenshtein function - 0.315294 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.460973 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029807 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040951 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001021 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.