A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dybbuk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dybbuk's (2) - 1 freq
dyeuk (2) - 8 freq
bbuk (2) - 1 freq
cbbuk (2) - 1 freq
byeuk (3) - 7 freq
buk (3) - 2 freq
yauk (3) - 3 freq
dubbed (3) - 3 freq
dobbid (3) - 1 freq
dabble (3) - 2 freq
disuk (3) - 1 freq
dubby (3) - 12 freq
deuk (3) - 47 freq
dabber (3) - 3 freq
bsuk (3) - 3 freq
duk (3) - 2 freq
douk (3) - 8 freq
dibber (3) - 2 freq
kebbok (3) - 1 freq
dabbed (3) - 9 freq
ybius (3) - 1 freq
dibble (3) - 1 freq
bauk (3) - 2 freq
yeuk (3) - 6 freq
drouk (3) - 1 freq
bbuk (3) - 1 freq
cbbuk (3) - 1 freq
kebbok (4) - 1 freq
dibber (4) - 2 freq
debbie (4) - 2 freq
dabbed (4) - 9 freq
dubbie (4) - 7 freq
dibble (4) - 1 freq
dobbin (4) - 13 freq
dubbin (4) - 1 freq
dabbit (4) - 6 freq
dabber (4) - 3 freq
dabbin (4) - 8 freq
dobber (4) - 4 freq
dybbuk's (4) - 1 freq
dubble (4) - 2 freq
dyeuk (4) - 8 freq
dubbed (4) - 3 freq
dobbid (4) - 1 freq
dubby (4) - 12 freq
dobbie (4) - 1 freq
dabble (4) - 2 freq
kebbuck (5) - 5 freq
bruk (5) - 3 freq
dybin (5) - 1 freq
SoundEx code - D120
dips - 11 freq
dabs - 5 freq
daubs - 1 freq
davis - 7 freq
diffuse - 2 freq
div's - 1 freq
deeps - 7 freq
devious - 4 freq
deep-sea - 4 freq
davies - 13 freq
dives - 7 freq
dowfhike - 1 freq
dabs' - 1 freq
dubs - 84 freq
dobbies - 2 freq
device - 23 freq
daffs - 3 freq
davie's - 36 freq
devise - 2 freq
dybbuk's - 1 freq
depose - 1 freq
divvies - 1 freq
defuse - 2 freq
dubious - 3 freq
doffs - 1 freq
deep-sey - 1 freq
dfs - 2 freq
diffs - 2 freq
davy's - 1 freq
duffy's - 4 freq
deeves - 1 freq
daffik - 2 freq
doupies - 1 freq
deips - 1 freq
daffies - 4 freq
davoo's - 1 freq
dfc - 2 freq
devyse - 2 freq
daffiks - 1 freq
dpis - 1 freq
dowps - 6 freq
daffys - 1 freq
debauch - 1 freq
dpbc - 1 freq
doves - 2 freq
deives - 1 freq
defeck - 1 freq
duffs - 2 freq
dps - 1 freq
defies - 2 freq
dpbz - 1 freq
dpz - 1 freq
dbz - 1 freq
dfpz - 1 freq
dpke - 5 freq
debs - 1 freq
dbyg - 1 freq
daveg - 1 freq
dwbaqh - 1 freq
dfkxa - 1 freq
debbie's - 1 freq
ddaps - 1 freq
dbis - 1 freq
daves - 2 freq
MetaPhone code - TBK
tbc - 4 freq
teabag - 1 freq
taybag - 2 freq
dbyg - 1 freq
dwbaqh - 1 freq
tbq - 1 freq
DYBBUK
Time to execute Levenshtein function - 0.215180 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.337110 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027762 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036877 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.002509 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.