A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to kubby in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
kubby (0) - 1 freq
dubby (1) - 12 freq
cubby (1) - 1 freq
hubby (1) - 10 freq
tubby (1) - 1 freq
ruby (2) - 8 freq
hubbys (2) - 1 freq
webby (2) - 1 freq
tubey (2) - 1 freq
yubpy (2) - 4 freq
nebby (2) - 2 freq
bibby (2) - 3 freq
nobby (2) - 1 freq
cuby (2) - 1 freq
hubba (2) - 1 freq
bobby (2) - 45 freq
mubba (2) - 1 freq
lobby (2) - 87 freq
grubby (2) - 2 freq
kumbh (2) - 3 freq
kirby (2) - 3 freq
busby (2) - 1 freq
upby (2) - 9 freq
gibby (2) - 6 freq
abby (2) - 16 freq
kubby (0) - 1 freq
tubby (2) - 1 freq
hubby (2) - 10 freq
kabb (2) - 1 freq
dubby (2) - 12 freq
cubby (2) - 1 freq
kerby (3) - 2 freq
kubis (3) - 3 freq
rubb (3) - 1 freq
jobby (3) - 13 freq
abby (3) - 16 freq
bubba (3) - 1 freq
fabby (3) - 68 freq
robby (3) - 5 freq
tibby (3) - 16 freq
mibby (3) - 27 freq
tabby (3) - 15 freq
hibby (3) - 24 freq
habby (3) - 2 freq
babby (3) - 25 freq
gabby (3) - 1 freq
gibby (3) - 6 freq
libby (3) - 10 freq
hobby (3) - 14 freq
bobby (3) - 45 freq
SoundEx code - K100
keep - 1547 freq
kep - 136 freq
kip - 38 freq
keevee - 1 freq
'keep - 23 freq
keip - 4 freq
keepy - 1 freq
kap - 1 freq
'kappa' - 1 freq
kowp - 1 freq
kop - 2 freq
kepe - 4 freq
koffie - 1 freq
kep' - 2 freq
kaip - 2 freq
kaif - 1 freq
€™kiep - 1 freq
kypie - 1 freq
€œkeep - 8 freq
keppy - 1 freq
€˜keep - 2 freq
kappa - 1 freq
kb - 4 freq
kyoab - 1 freq
kee-vee - 2 freq
keb - 2 freq
€™keep - 1 freq
kubby - 1 freq
kgb - 3 freq
kv - 10 freq
kwf - 1 freq
ksjyvvf - 1 freq
kyf - 1 freq
kcf - 1 freq
kaypee - 2 freq
kf - 3 freq
kp - 6 freq
kev - 6 freq
kif - 1 freq
kwuavb - 1 freq
kkv - 1 freq
kabb - 1 freq
kufae - 1 freq
ko-fi - 1 freq
kxf - 1 freq
kcwphh - 1 freq
kgv - 1 freq
kpv - 1 freq
kffi - 1 freq
kjcsf - 2 freq
kbp - 1 freq
kapo - 1 freq
MetaPhone code - KB
gab - 32 freq
cowboy - 18 freq
gob - 15 freq
cb - 3 freq
gub - 30 freq
cube - 5 freq
gub' - 2 freq
gabbie - 1 freq
cub - 3 freq
cubbie - 2 freq
goab - 1 freq
cab - 16 freq
gbh - 3 freq
cowboay' - 1 freq
qub - 4 freq
go-by - 2 freq
wgbh - 1 freq
gaub - 1 freq
gabe - 1 freq
cabbie - 3 freq
kb - 4 freq
keb - 2 freq
kubby - 1 freq
cubby - 1 freq
qb - 2 freq
hkhb - 1 freq
gooby - 2 freq
qby - 1 freq
cbb - 4 freq
gb - 2 freq
gbbo - 1 freq
cuby - 1 freq
coyb - 1 freq
kabb - 1 freq
cbh - 1 freq
gabby - 1 freq
ckb - 1 freq
KUBBY
Time to execute Levenshtein function - 0.182909 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.349130 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027560 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037483 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000837 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.