A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to kubby in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
kubby (0) - 1 freq
dubby (1) - 12 freq
cubby (1) - 1 freq
tubby (1) - 1 freq
hubby (1) - 10 freq
grubby (2) - 2 freq
abby (2) - 16 freq
tibby (2) - 16 freq
jobby (2) - 10 freq
kubis (2) - 3 freq
kerby (2) - 2 freq
chubby (2) - 9 freq
nobby (2) - 1 freq
scubby (2) - 2 freq
webby (2) - 1 freq
nebby (2) - 2 freq
hobby (2) - 14 freq
bibby (2) - 3 freq
mibby (2) - 27 freq
kabb (2) - 1 freq
robby (2) - 5 freq
outby (2) - 20 freq
kumbh (2) - 3 freq
publy (2) - 1 freq
gibby (2) - 6 freq
Double Levenshtein
kubby (0) - 1 freq
kabb (2) - 1 freq
tubby (2) - 1 freq
hubby (2) - 10 freq
dubby (2) - 12 freq
cubby (2) - 1 freq
lobby (3) - 79 freq
hubba (3) - 1 freq
libby (3) - 10 freq
hibby (3) - 24 freq
bubba (3) - 1 freq
mubba (3) - 1 freq
fabby (3) - 68 freq
habby (3) - 2 freq
bobby (3) - 45 freq
kebab (3) - 6 freq
gabby (3) - 1 freq
kirby (3) - 3 freq
babby (3) - 25 freq
gibby (3) - 6 freq
rubb (3) - 1 freq
tabby (3) - 15 freq
tibby (3) - 16 freq
webby (3) - 1 freq
nobby (3) - 1 freq
SoundEx code - K100
keep - 1525 freq
kep - 136 freq
kip - 38 freq
keevee - 1 freq
'keep - 23 freq
keip - 4 freq
keepy - 1 freq
kap - 1 freq
'kappa' - 1 freq
kowp - 1 freq
kop - 2 freq
kepe - 4 freq
koffie - 1 freq
kep' - 2 freq
kaip - 2 freq
kaif - 1 freq
’kiep - 1 freq
kypie - 1 freq
“keep - 8 freq
keppy - 1 freq
‘keep - 2 freq
kappa - 1 freq
kb - 4 freq
kyoab - 1 freq
kee-vee - 2 freq
keb - 2 freq
’keep - 1 freq
kubby - 1 freq
kgb - 3 freq
kv - 10 freq
kwf - 1 freq
ksjyvvf - 1 freq
kyf - 1 freq
kcf - 1 freq
kaypee - 2 freq
kf - 3 freq
kp - 6 freq
kev - 6 freq
kif - 1 freq
kwuavb - 1 freq
kkv - 1 freq
kabb - 1 freq
kufae - 1 freq
ko-fi - 1 freq
kxf - 1 freq
kcwphh - 1 freq
kgv - 1 freq
kpv - 1 freq
kffi - 1 freq
kjcsf - 2 freq
kbp - 1 freq
kapo - 1 freq
MetaPhone code - KB
gab - 32 freq
cowboy - 16 freq
gob - 14 freq
cb - 3 freq
gub - 30 freq
cube - 4 freq
gub' - 2 freq
gabbie - 1 freq
cub - 3 freq
cubbie - 2 freq
gbh - 3 freq
cowboay' - 1 freq
cab - 15 freq
qub - 4 freq
go-by - 2 freq
wgbh - 1 freq
gaub - 1 freq
gabe - 1 freq
cabbie - 3 freq
kb - 4 freq
keb - 2 freq
kubby - 1 freq
cubby - 1 freq
qb - 2 freq
hkhb - 1 freq
gooby - 2 freq
qby - 1 freq
cbb - 4 freq
gb - 2 freq
gbbo - 1 freq
cuby - 1 freq
coyb - 1 freq
kabb - 1 freq
cbh - 1 freq
gabby - 1 freq
ckb - 1 freq
Manually curated
KUBBY
Time to execute Levenshtein function - 0.726359 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
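As a rough illustration of the definition above, here is a minimal Python sketch of the distance and of a nearest-neighbour listing. It is not the site's own code: the function names, the `max_distance` cut-off and the tie-breaking by alphabetical order are assumptions, and the sample frequencies are copied from the results listed above.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


def nearest(word, frequencies, max_distance=2):
    """Return (distance, word, freq) tuples within max_distance, closest first."""
    scored = [(levenshtein(word, w), w, f) for w, f in frequencies.items()]
    # Assumption: ties are broken alphabetically; the site's ordering may differ.
    return sorted(t for t in scored if t[0] <= max_distance)


# Frequencies taken from the listing above.
sample = {"kubby": 1, "dubby": 12, "cubby": 1, "grubby": 2, "kebab": 6}
for distance, w, freq in nearest("kubby", sample):
    print(f"{w} ({distance}) - {freq} freq")
```

With this sample, kebab is filtered out because it is three edits away from kubby.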
Time to execute Double Levenshtein function - 0.871943 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
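The double-weighting idea can be sketched in a few lines of Python. This is a guess at the approach rather than the site's implementation. In particular, treating y as a vowel is an assumption, inferred from the scores listed above (kabb scores 2 against kubby, which only works out if the y is dropped).

```python
VOWELS = set("aeiouy")  # assumption: y is stripped along with a, e, i, o, u


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution or match
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ["dubby", "kabb", "lobby", "kebab"]:
    print(other, double_levenshtein("kubby", other))
```

A consonant change such as kubby to dubby is counted in both passes, while a vowel change is counted only once, which is where the extra weight on consonants comes from.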
Time to execute SoundEx function - 0.092369 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
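For readers who want to see how a code such as K100 comes about, the following Python sketch implements the common American Soundex rules. Which variant the site actually uses is not stated, so the details here (H and W being skipped without separating equal codes, non-letters being dropped) are assumptions; the function name is illustrative.

```python
# Consonant groups that share a digit in American Soundex.
CODES = {}
for digit, group in enumerate(["BFPV", "CGJKQSXZ", "DT", "L", "MN", "R"], start=1):
    for letter in group:
        CODES[letter] = str(digit)


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if word has no letters."""
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]
    previous = CODES.get(letters[0], "")  # the first letter's own code counts
    for c in letters[1:]:
        if c in "HW":
            continue  # H and W are skipped and do not separate equal codes
        code = CODES.get(c, "")
        if code and code != previous:
            result += code
        previous = code  # vowels (and Y) reset this to "", allowing repeats
        if len(result) == 4:
            break
    return (result + "000")[:4]  # pad short codes with zeros


for w in ["kubby", "keep", "kgb", "koffie"]:
    print(w, soundex(w))
```

Under these rules every word in the SoundEx list above that was checked (kubby, keep, kgb, koffie) encodes to K100: the first letter is kept, and only one labial consonant contributes a digit.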
Time to execute MetaPhone function - 0.092650 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000815 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
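A lookup of this kind can be as simple as a dictionary. The Python sketch below is only illustrative: the `LEXICON` name and `lookup_lemma` function are hypothetical, the kubby entry mirrors the KUBBY result shown above (assuming that result is the lemma), and the second entry is invented to show how a spelling variant would be linked.

```python
from typing import Optional

# Hand-maintained table mapping word forms to lemmas.
LEXICON = {
    "kubby": "KUBBY",   # matches the result shown above
    "kubbie": "KUBBY",  # hypothetical spelling variant, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("kubby"))   # a covered word
print(lookup_lemma("sonsie"))  # not in this toy table, so None
```

Because the lookup is a single dictionary access with no string comparison, it would be expected to run much faster than the distance-based methods.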