A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to dokken in Corpus

Levenshtein
dokken (0) - 1 freq
docken (1) - 30 freq
sokken (1) - 1 freq
lokken (1) - 1 freq
dokkens (1) - 1 freq
darken (2) - 3 freq
doreen (2) - 3 freq
tooken (2) - 1 freq
tikken (2) - 6 freq
pouken (2) - 1 freq
doke (2) - 3 freq
douked (2) - 4 freq
slokken (2) - 1 freq
doukin (2) - 3 freq
pikken (2) - 6 freq
drouken (2) - 1 freq
nokket (2) - 2 freq
donkey (2) - 16 freq
wikken (2) - 1 freq
doen (2) - 4 freq
dokens (2) - 2 freq
dokey (2) - 1 freq
yokkin (2) - 8 freq
wakken (2) - 2 freq
couken (2) - 3 freq
Double Levenshtein
dokken (0) - 1 freq
docken (2) - 30 freq
lokken (2) - 1 freq
dokkens (2) - 1 freq
sokken (2) - 1 freq
dookan (3) - 2 freq
dockin (3) - 3 freq
deken (3) - 4 freq
wakken (3) - 2 freq
dukket (3) - 1 freq
makken (3) - 14 freq
sikken (3) - 11 freq
gokkan (3) - 1 freq
dookin (3) - 20 freq
pakken (3) - 1 freq
takken (3) - 15 freq
yokkin (3) - 8 freq
drukken (3) - 12 freq
sukken (3) - 1 freq
derken (3) - 1 freq
darken (3) - 3 freq
doukin (3) - 3 freq
tikken (3) - 6 freq
drouken (3) - 1 freq
pikken (3) - 6 freq
SoundEx code - D250
dookin - 20 freq
disna - 401 freq
doesna - 90 freq
dozin - 8 freq
diggin - 44 freq
disnae - 586 freq
daikin - 1 freq
deekin - 2 freq
daesna - 71 freq
duisna - 13 freq
disown - 4 freq
doesnae - 176 freq
doggin - 10 freq
doukin - 3 freq
docken - 30 freq
duckin - 3 freq
dacin - 15 freq
dyke-an - 1 freq
dossin - 3 freq
dickson - 12 freq
dashin - 7 freq
dismay - 15 freq
dozen - 39 freq
daesnae - 29 freq
dizzen - 86 freq
diznae - 3 freq
deseen - 1 freq
dousin - 1 freq
deign - 1 freq
deacon - 6 freq
decin - 2 freq
dcein - 1 freq
''dassin'' - 1 freq
dassin - 23 freq
dassen - 28 freq
dizin - 3 freq
dismae - 1 freq
dosnae - 4 freq
dizzyin - 1 freq
daggum - 2 freq
deckin - 2 freq
dizen - 8 freq
dockin - 3 freq
disney - 19 freq
dishin - 12 freq
dizzna - 9 freq
disna¢ - 1 freq
dozin' - 1 freq
dosena - 1 freq
dcemie - 1 freq
dooshin - 1 freq
doosin - 2 freq
dsin - 1 freq
dusna - 18 freq
deusna - 1 freq
dashan - 1 freq
dösna - 1 freq
disno - 35 freq
doesnae-' - 1 freq
'disney - 1 freq
doesn - 11 freq
dookan - 2 freq
diggan - 6 freq
diggen - 1 freq
dusnae - 12 freq
day-suin - 1 freq
dowiesome - 2 freq
doosno - 1 freq
dozan - 1 freq
dowsin - 2 freq
døsna - 1 freq
dizzin - 1 freq
disjune - 2 freq
dokken - 1 freq
‘disnae - 1 freq
dizna - 1 freq
deusno - 3 freq
disny - 5 freq
decayin - 1 freq
doesne - 1 freq
daesno - 1 freq
disni - 7 freq
dosan - 1 freq
deism - 1 freq
“disna - 2 freq
“dsien - 1 freq
dyshin - 1 freq
dushin - 1 freq
diggin' - 1 freq
dookin' - 1 freq
dousin’ - 1 freq
dyson - 1 freq
duzni - 1 freq
deken - 4 freq
dizni - 3 freq
doesni - 1 freq
dsimmie - 1 freq
dickin - 3 freq
disna' - 1 freq
dyjmh - 1 freq
deekin' - 1 freq
dqm - 1 freq
dequinn - 3 freq
dusney - 1 freq
duisnae - 1 freq
duojum - 1 freq
desine - 1 freq
dysony - 1 freq
“disnae - 1 freq
'diagon - 1 freq
'dookin - 1 freq
dixon - 1 freq
MetaPhone code - TKN
dookin - 20 freq
takkin - 508 freq
takin - 453 freq
token - 12 freq
diggin - 44 freq
taken - 122 freq
daikin - 1 freq
takken - 15 freq
deekin - 2 freq
tiggin - 2 freq
doggin - 10 freq
doukin - 3 freq
docken - 30 freq
duckin - 3 freq
tookna - 1 freq
dyke-an - 1 freq
tokin - 4 freq
tuckin - 9 freq
tookin - 2 freq
takin' - 10 freq
takan - 10 freq
tuggin - 7 freq
taikin - 1 freq
deacon - 6 freq
tikken - 6 freq
taykeen - 1 freq
tak'n - 1 freq
deckin - 2 freq
dockin - 3 freq
tickin - 8 freq
taakin - 107 freq
taggin - 3 freq
tak'in - 1 freq
tackin - 1 freq
takkan - 14 freq
tuck-in - 1 freq
tacn - 2 freq
dookan - 2 freq
diggan - 6 freq
tickan - 2 freq
diggen - 1 freq
taakan - 2 freq
t'ken - 2 freq
takna - 2 freq
tooken - 1 freq
tackan - 1 freq
taiken - 7 freq
taukin - 3 freq
taken' - 1 freq
“takkin - 1 freq
dokken - 1 freq
diggin' - 1 freq
dookin' - 1 freq
deken - 4 freq
wdkenny - 1 freq
tacken - 2 freq
dickin - 3 freq
deekin' - 1 freq
dequinn - 3 freq
'diagon - 1 freq
'dookin - 1 freq
tkkun - 1 freq
hdycn - 1 freq
Manually curated
DOKKEN
Time to execute Levenshtein function - 0.228577 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
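The definition above can be sketched in Python as a standard dynamic-programming edit distance. This is a minimal illustration, not the code behind this page: the function names `levenshtein` and `nearest` and the `max_distance` parameter are assumptions made for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character replacements, insertions
    and deletions needed to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word, vocabulary, max_distance=2):
    """Hypothetical helper: (distance, word) pairs within max_distance, closest first."""
    scored = [(levenshtein(word, other), other) for other in vocabulary]
    return sorted((d, w) for d, w in scored if d <= max_distance)


# A few words taken from the list on this page.
print(nearest("dokken", ["docken", "sokken", "darken", "donkey", "disnae"]))
```

With the words above, docken and sokken come out at distance 1 and darken and donkey at distance 2, matching the figures in the Levenshtein list.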
Time to execute Double Levenshtein function - 0.432944 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants. For example, docken scores 1 + 1 = 2 against dokken.
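A minimal Python sketch of this double scoring follows. It assumes that "vowels" means the letters a, e, i, o and u; the page does not say which letters are stripped, and the names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the page does not list its vowel set


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ["docken", "dookan", "drukken"]:
    print(other, double_levenshtein("dokken", other))
```

Under that vowel assumption the sketch gives 2 for docken and 3 for dookan and drukken, the same scores shown in the Double Levenshtein list.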
Time to execute SoundEx function - 0.034404 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
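To show how a word ends up with a code such as D250, here is a hedged Python sketch of the common American Soundex rules: keep the first letter, encode the remaining consonants as digits, collapse adjacent repeats, ignore h and w, and pad to four characters. The page does not say which Soundex variant it runs, so the handling of h, w and non-letters here is an assumption.

```python
# Consonant groups used by classic Soundex; vowels and y carry no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Four-character Soundex code, or "" if the word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped and do not separate repeated codes
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeat after a vowel counts again
    return result.ljust(4, "0")


for word in ["dokken", "docken", "disnae", "dowiesome"]:
    print(word, soundex(word))
```

All four sample words, taken from the SoundEx list on this page, encode to D250 under these rules.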
Time to execute MetaPhone function - 0.054517 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.002460 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
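As a rough illustration of such a lookup, a dictionary from spelling to lemma is enough. The table contents below are hypothetical stand-ins apart from the dokken to DOKKEN pairing, which is assumed from the result shown on this page; `LEMMA_TABLE` and `lemma_for` are names invented for the sketch.

```python
from typing import Optional

# Hand-built lexicon: spelling variant -> lemma.
# Only "dokken" is taken from this page; the other entries are hypothetical examples.
LEMMA_TABLE = {
    "dokken": "DOKKEN",
    "dokkens": "DOKKEN",  # hypothetical variant
    "dokkin": "DOKKEN",   # hypothetical typo entry
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not in the lexicon."""
    return LEMMA_TABLE.get(word.lower())


print(lemma_for("dokken"))
print(lemma_for("ablow"))  # not covered by this toy table
```

A plain dictionary lookup takes constant time per word, which fits with this method having the shortest execution time reported above. Returning None for uncovered words keeps the gaps in coverage visible to the caller.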