A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to dokens in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in parentheses)
dokens (0) - 2 freq
dozens (1) - 3 freq
dokkens (1) - 1 freq
dokes (1) - 3 freq
dockens (1) - 5 freq
tokens (1) - 5 freq
doyens (1) - 1 freq
poke's (2) - 2 freq
dokken (2) - 1 freq
dores (2) - 1 freq
ovens (2) - 4 freq
dozen (2) - 39 freq
opens (2) - 69 freq
womens (2) - 3 freq
tockens (2) - 1 freq
docken (2) - 30 freq
dons (2) - 31 freq
pokes (2) - 22 freq
donkeys (2) - 9 freq
doves (2) - 2 freq
dockins (2) - 1 freq
oens (2) - 1 freq
dukes (2) - 9 freq
dens (2) - 21 freq
dorans (2) - 1 freq
Double Levenshtein (distance in parentheses)
dokens (0) - 2 freq
tokens (2) - 5 freq
dozens (2) - 3 freq
dockens (2) - 5 freq
doyens (2) - 1 freq
dokes (2) - 3 freq
dokkens (2) - 1 freq
dens (3) - 21 freq
dorans (3) - 1 freq
unkens (3) - 1 freq
dockins (3) - 1 freq
dukes (3) - 9 freq
dikes (3) - 3 freq
dizens (3) - 1 freq
dykes (3) - 44 freq
dookers (3) - 10 freq
doks (3) - 1 freq
douns (3) - 2 freq
derkens (3) - 1 freq
duke's (3) - 7 freq
dickens (3) - 13 freq
dakes (3) - 1 freq
doons (3) - 11 freq
doukers (3) - 1 freq
kens (3) - 532 freq
SoundEx code - D252
dizzens - 19 freq
dismissed - 16 freq
diagnosis - 14 freq
douceness - 2 freq
digging - 9 freq
decency - 10 freq
diagnosit - 1 freq
disconnect - 3 freq
deconstruct - 1 freq
dug-chains - 2 freq
diagnosed - 2 freq
dickens - 13 freq
dockens - 5 freq
dykeneuk - 1 freq
daesency - 1 freq
dashing - 4 freq
dacency - 2 freq
dishing - 3 freq
decaying - 2 freq
dizziness - 3 freq
doo's-neck - 1 freq
deacon's - 1 freq
dishonesty - 2 freq
dismissively - 4 freq
dismissive - 5 freq
dishonest - 5 freq
decommissionin - 1 freq
dockmaister - 2 freq
dozens - 3 freq
dismisst - 2 freq
disowns - 1 freq
dickinson - 1 freq
diagnose - 1 freq
desinged - 2 freq
desings - 1 freq
desing - 2 freq
disney's - 3 freq
daecency - 2 freq
djooie-emskit - 1 freq
diggings - 1 freq
disengage - 1 freq
dismiss - 4 freq
dicken's - 2 freq
dogems - 1 freq
disconnected - 2 freq
docken-strewn - 1 freq
dugs-'answers - 1 freq
disengaged - 1 freq
dissension - 2 freq
dysnochtifýin - 1 freq
dokens - 2 freq
decense - 1 freq
diggin's - 1 freq
dizens - 1 freq
disconnects - 1 freq
deconstruction - 1 freq
disconnectit - 1 freq
dismissin - 2 freq
disenchantit - 1 freq
decking - 1 freq
disingenuous - 2 freq
decommissioned - 1 freq
dogmas - 1 freq
dokkens - 1 freq
decommissiont - 1 freq
daicency - 1 freq
disconcerted - 1 freq
dockins - 1 freq
dowsing - 1 freq
descency - 1 freq
dougmcg - 5 freq
dismissal - 1 freq
dickwinchester - 3 freq
dzaimxfjgx - 1 freq
djmacdstv - 1 freq
djhenshall - 4 freq
djjennygreene - 1 freq
ducking - 2 freq
diagnosisdetectives - 1 freq
dismissed' - 2 freq
dickensian - 1 freq
dossing - 1 freq
deaconess - 1 freq
disconcertin - 1 freq
dgcouncil - 1 freq
dissing - 1 freq
doyzmkdcot - 1 freq
MetaPhone code - TKNS
dickens - 13 freq
dockens - 5 freq
tokens - 5 freq
deacon's - 1 freq
diagnose - 1 freq
takins - 2 freq
dicken's - 2 freq
takkins - 1 freq
taikens - 1 freq
dokens - 2 freq
diggin's - 1 freq
dokkens - 1 freq
dockins - 1 freq
tockens - 1 freq
deaconess - 1 freq
Manually curated - DOKENS
Time to execute Levenshtein function - 0.891492 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
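As a rough illustration of the definition above, the distance can be computed with the standard row-by-row dynamic programme. This is a minimal sketch in Python, not the code this page runs; the function name levenshtein is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions and
    replacements needed to turn a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or match
            ))
        previous = current
    return previous[-1]


print(levenshtein("dokens", "dozens"))
print(levenshtein("dokens", "dens"))
```

Under this sketch, dozens is one replacement away from dokens and dens is two deletions away, which agrees with the distances listed in the results.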
Time to execute Double Levenshtein function - 1.012634 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
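The description above can be sketched as follows. This is an assumption-laden illustration rather than the site's implementation: the helper names are invented here, and the page does not say which letters count as vowels, so the vowel set is a parameter that defaults to a, e, i, o, u.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insertions, deletions, replacements)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = "aeiou") -> str:
    """Drop every character found in the (assumed) vowel set."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str, vowels: str = "aeiou") -> int:
    """Distance on the full words plus distance on the consonant
    skeletons, so consonant differences are counted twice."""
    full = levenshtein(a, b)
    skeleton = levenshtein(strip_vowels(a, vowels), strip_vowels(b, vowels))
    return full + skeleton


print(double_levenshtein("dokens", "tokens"))
print(double_levenshtein("dokens", "dens"))
```

With these assumptions, tokens scores 1 + 1 = 2 and dens scores 2 + 1 = 3, in line with the Double Levenshtein column. Whether y is treated as a vowel is not stated on the page, which is why the sketch leaves the set configurable.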
Time to execute SoundEx function - 0.104880 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
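For readers who want to see how a code such as D252 arises, here is a minimal sketch of the classic American Soundex rules in Python. It is an illustration only: the page does not say which Soundex variant it uses, and the names SOUNDEX_CODES and soundex are chosen here.

```python
# Consonant groups and their digits in classic American Soundex.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        SOUNDEX_CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, padded with zeros."""
    # Only plain a-z letters are considered in this sketch.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets last, so a repeated code counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")


print(soundex("dokens"))
print(soundex("dismissed"))
```

Under these rules dokens, dozens, dickens and dismissed all reduce to the same code, which is why such dissimilar words share one SoundEx list.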
Time to execute MetaPhone function - 0.108272 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000877 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
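A lookup of this kind can be sketched as a plain dictionary from word form to lemma. Everything in the example is hypothetical: the names LEXICON and lookup_lemma are invented, and apart from dokens, whose result is shown above, the entries are made up for illustration and are not taken from the real lexicon.

```python
from typing import Dict, Optional

# Hypothetical hand-built lexicon: word form -> lemma.
LEXICON: Dict[str, str] = {
    "dokens": "DOKENS",   # the result shown on this page
    "dokkens": "DOKENS",  # invented entry, for illustration only
}


def lookup_lemma(word: str, lexicon: Dict[str, str] = LEXICON) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("dokens"))
print(lookup_lemma("sonsie"))
```

Because the lookup is a single dictionary access rather than a scan of the corpus vocabulary, a structure like this would be consistent with the very small execution time reported for this method, though the page does not describe how the table is stored.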