A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dookan in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dookan (0) - 2 freq
sookan (1) - 3 freq
cookan (1) - 2 freq
lookan (1) - 30 freq
bookan (1) - 1 freq
dookin (1) - 20 freq
doosin (2) - 2 freq
dokken (2) - 1 freq
doorman (2) - 2 freq
dorian (2) - 32 freq
sooran (2) - 1 freq
doonin (2) - 1 freq
dodgan (2) - 1 freq
tooken (2) - 1 freq
smokan (2) - 2 freq
chokan (2) - 1 freq
howkan (2) - 2 freq
cookin (2) - 44 freq
hookah (2) - 29 freq
looman (2) - 1 freq
pookin (2) - 1 freq
sookin (2) - 52 freq
lookin (2) - 985 freq
dook (2) - 32 freq
hookin (2) - 2 freq
dookan (0) - 2 freq
dookin (1) - 20 freq
cookan (2) - 2 freq
doukin (2) - 3 freq
sookan (2) - 3 freq
bookan (2) - 1 freq
lookan (2) - 30 freq
lookn (3) - 1 freq
joukan (3) - 1 freq
kookin (3) - 1 freq
dozan (3) - 1 freq
drookin (3) - 3 freq
bookin (3) - 5 freq
dooin (3) - 2 freq
dockin (3) - 3 freq
dookin' (3) - 1 freq
sooken (3) - 1 freq
dookit (3) - 10 freq
docken (3) - 30 freq
dooron (3) - 5 freq
deekin (3) - 2 freq
deken (3) - 4 freq
daikin (3) - 1 freq
fookin (3) - 2 freq
cooken (3) - 1 freq
SoundEx code - D250
dookin - 20 freq
disna - 401 freq
doesna - 90 freq
dozin - 8 freq
diggin - 44 freq
disnae - 586 freq
daikin - 1 freq
deekin - 2 freq
daesna - 71 freq
duisna - 13 freq
disown - 4 freq
doesnae - 176 freq
doggin - 10 freq
doukin - 3 freq
docken - 30 freq
duckin - 3 freq
dacin - 15 freq
dyke-an - 1 freq
dossin - 3 freq
dickson - 12 freq
dashin - 7 freq
dismay - 15 freq
dozen - 39 freq
daesnae - 29 freq
dizzen - 86 freq
diznae - 3 freq
deseen - 1 freq
dousin - 1 freq
deign - 1 freq
deacon - 6 freq
decin - 2 freq
dcein - 1 freq
''dassin'' - 1 freq
dassin - 23 freq
dassen - 28 freq
dizin - 3 freq
dismae - 1 freq
dosnae - 4 freq
dizzyin - 1 freq
daggum - 2 freq
deckin - 2 freq
dizen - 8 freq
dockin - 3 freq
disney - 19 freq
dishin - 12 freq
dizzna - 9 freq
disna¢ - 1 freq
dozin' - 1 freq
dosena - 1 freq
dcemie - 1 freq
dooshin - 1 freq
doosin - 2 freq
dsin - 1 freq
dusna - 18 freq
deusna - 1 freq
dashan - 1 freq
dösna - 1 freq
disno - 35 freq
doesnae-' - 1 freq
'disney - 1 freq
doesn - 11 freq
dookan - 2 freq
diggan - 6 freq
diggen - 1 freq
dusnae - 12 freq
day-suin - 1 freq
dowiesome - 2 freq
doosno - 1 freq
dozan - 1 freq
dowsin - 2 freq
døsna - 1 freq
dizzin - 1 freq
disjune - 2 freq
dokken - 1 freq
€˜disnae - 1 freq
dizna - 1 freq
deusno - 3 freq
disny - 5 freq
decayin - 1 freq
doesne - 1 freq
daesno - 1 freq
disni - 7 freq
dosan - 1 freq
deism - 1 freq
€œdisna - 2 freq
€œdsien - 1 freq
dyshin - 1 freq
dushin - 1 freq
diggin' - 1 freq
dookin' - 1 freq
dousinÂ’ - 1 freq
dyson - 1 freq
duzni - 1 freq
deken - 4 freq
dizni - 3 freq
doesni - 1 freq
dsimmie - 1 freq
dickin - 3 freq
disna' - 1 freq
dyjmh - 1 freq
deekin' - 1 freq
dqm - 1 freq
dequinn - 3 freq
dusney - 1 freq
duisnae - 1 freq
duojum - 1 freq
desine - 1 freq
dysony - 1 freq
“disnae - 1 freq
'diagon - 1 freq
'dookin - 1 freq
dixon - 1 freq
MetaPhone code - TKN
dookin - 20 freq
takkin - 508 freq
takin - 453 freq
token - 12 freq
diggin - 44 freq
taken - 122 freq
daikin - 1 freq
takken - 15 freq
deekin - 2 freq
tiggin - 2 freq
doggin - 10 freq
doukin - 3 freq
docken - 30 freq
duckin - 3 freq
tookna - 1 freq
dyke-an - 1 freq
tokin - 4 freq
tuckin - 9 freq
tookin - 2 freq
takin' - 10 freq
takan - 10 freq
tuggin - 7 freq
taikin - 1 freq
deacon - 6 freq
tikken - 6 freq
taykeen - 1 freq
tak'n - 1 freq
deckin - 2 freq
dockin - 3 freq
tickin - 8 freq
taakin - 107 freq
taggin - 3 freq
tak'in - 1 freq
tackin - 1 freq
takkan - 14 freq
tuck-in - 1 freq
tacn - 2 freq
dookan - 2 freq
diggan - 6 freq
tickan - 2 freq
diggen - 1 freq
taakan - 2 freq
t'ken - 2 freq
takna - 2 freq
tooken - 1 freq
tackan - 1 freq
taiken - 7 freq
taukin - 3 freq
taken' - 1 freq
€œtakkin - 1 freq
dokken - 1 freq
diggin' - 1 freq
dookin' - 1 freq
deken - 4 freq
wdkenny - 1 freq
tacken - 2 freq
dickin - 3 freq
deekin' - 1 freq
dequinn - 3 freq
'diagon - 1 freq
'dookin - 1 freq
tkkun - 1 freq
hdycn - 1 freq
DOOKAN
Time to execute Levenshtein function - 0.207243 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373951 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029920 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037041 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000863 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.