A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dookit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dookit (0) - 10 freq
dockit (1) - 4 freq
dootit (1) - 2 freq
doomit (1) - 2 freq
jookit (1) - 3 freq
doosit (1) - 1 freq
pookit (1) - 5 freq
hookit (1) - 1 freq
rookit (1) - 5 freq
dookin (1) - 20 freq
doukit (1) - 3 freq
cookit (1) - 3 freq
lookit (1) - 338 freq
bookit (1) - 3 freq
sookit (1) - 27 freq
drookit (1) - 89 freq
nookie (2) - 2 freq
droit (2) - 1 freq
howkit (2) - 29 freq
cookie (2) - 15 freq
yokkit (2) - 4 freq
cookin (2) - 44 freq
dootin (2) - 5 freq
doozie (2) - 1 freq
dooked (2) - 12 freq
dookit (0) - 10 freq
doukit (1) - 3 freq
lookit (2) - 338 freq
cookit (2) - 3 freq
sookit (2) - 27 freq
drookit (2) - 89 freq
doukt (2) - 3 freq
dukit (2) - 1 freq
dookin (2) - 20 freq
bookit (2) - 3 freq
rookit (2) - 5 freq
dootit (2) - 2 freq
doomit (2) - 2 freq
dockit (2) - 4 freq
jookit (2) - 3 freq
hookit (2) - 1 freq
pookit (2) - 5 freq
doosit (2) - 1 freq
boakit (3) - 4 freq
docket (3) - 4 freq
yockit (3) - 5 freq
duckit (3) - 1 freq
droukit (3) - 21 freq
poukit (3) - 7 freq
coukit (3) - 1 freq
SoundEx code - D230
dicht - 96 freq
decide - 121 freq
dooked - 12 freq
doocot - 28 freq
dusty - 18 freq
douked - 4 freq
dust - 89 freq
dozed - 8 freq
decade - 30 freq
dist - 18 freq
dowiest - 1 freq
dazed - 4 freq
dogged - 3 freq
doukt - 3 freq
dockside - 9 freq
dookit - 10 freq
dick'd - 1 freq
dished - 10 freq
doused - 2 freq
deeside - 14 freq
dwight - 1 freq
dockhead - 1 freq
dash't - 2 freq
dowsed - 1 freq
docht - 4 freq
dashit - 2 freq
dousit - 1 freq
daoist - 2 freq
doukit - 3 freq
diseyd - 1 freq
deesyde - 1 freq
dukket - 1 freq
deceit - 5 freq
dickhead - 2 freq
duct - 4 freq
dashed - 14 freq
docket - 4 freq
dis't - 4 freq
decked - 8 freq
doacked - 1 freq
dichtt - 1 freq
ducked - 4 freq
dake-the - 1 freq
dost - 39 freq
duckit - 1 freq
deckit - 6 freq
dosed - 2 freq
dighty - 2 freq
doosht - 2 freq
duist - 1 freq
doughty - 3 freq
dish't - 1 freq
deckt - 3 freq
diced - 4 freq
decode - 2 freq
dight - 3 freq
dekkid - 1 freq
deshed - 1 freq
decayed - 1 freq
distie - 1 freq
disyde - 1 freq
dakota - 1 freq
decid - 1 freq
duggid - 1 freq
deeskit - 1 freq
dae-guid - 2 freq
douchty - 3 freq
dockit - 4 freq
duckweed - 1 freq
dossed - 1 freq
decait - 2 freq
dis-the - 1 freq
dochtie - 3 freq
dayset - 5 freq
doosit - 1 freq
daes't - 1 freq
dizzied - 1 freq
dioxide - 2 freq
diskythe - 2 freq
doosed - 1 freq
dog-shite - 1 freq
dasht - 1 freq
doocoot - 1 freq
daisy'd - 1 freq
duguid - 36 freq
docquet - 2 freq
doo-cot - 1 freq
decayit - 2 freq
dukit - 1 freq
dishit - 1 freq
dug-shite - 1 freq
docked - 2 freq
dichit - 1 freq
dogshit - 2 freq
dugged - 1 freq
dtjkiyd - 1 freq
dought - 1 freq
dogscott - 1 freq
‘dogged’ - 1 freq
decht - 1 freq
dzd - 1 freq
dquyda - 1 freq
djkd - 1 freq
dhgate - 1 freq
MetaPhone code - TKT
dooked - 12 freq
tigged - 2 freq
doocot - 28 freq
tucked - 36 freq
ticket - 172 freq
douked - 4 freq
decade - 30 freq
dogged - 3 freq
ticked - 5 freq
doukt - 3 freq
tuckt - 3 freq
dookit - 10 freq
tackety - 16 freq
tackity - 2 freq
dick'd - 1 freq
tuggit - 3 freq
tugged - 4 freq
teuked - 1 freq
doukit - 3 freq
dukket - 1 freq
duct - 4 freq
docket - 4 freq
takked - 1 freq
decked - 8 freq
ticket'' - 1 freq
doacked - 1 freq
tickit - 3 freq
togged - 1 freq
ducked - 4 freq
taakt - 10 freq
tukt - 2 freq
duckit - 1 freq
deckit - 6 freq
tacked - 2 freq
tackett - 1 freq
taakit - 6 freq
tak'ed - 1 freq
tuckit - 10 freq
toked - 1 freq
deckt - 3 freq
tact - 8 freq
taekit - 4 freq
decode - 2 freq
dekkid - 1 freq
taaked - 13 freq
dakota - 1 freq
tacketie - 2 freq
duggid - 1 freq
dae-guid - 2 freq
tacket - 1 freq
dockit - 4 freq
t'kut - 1 freq
decait - 2 freq
tiggit - 2 freq
ticketie - 1 freq
takkity - 1 freq
tackit - 3 freq
doocoot - 1 freq
duguid - 36 freq
takkit - 2 freq
toukit - 2 freq
tickity - 1 freq
doo-cot - 1 freq
tekked - 1 freq
dukit - 1 freq
t-o-c-h-t - 1 freq
docked - 2 freq
tackitie - 1 freq
taked - 1 freq
takd - 1 freq
dugged - 1 freq
‘dogged’ - 1 freq
ticket' - 4 freq
dquyda - 1 freq
dhgate - 1 freq
tickety - 5 freq
ticketty - 1 freq
toocute - 1 freq
ytgt - 1 freq
DOOKIT
Time to execute Levenshtein function - 0.199354 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.341948 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028662 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037167 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000922 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.