A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to keep in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
keep (0) - 1454 freq
'keep (1) - 23 freq
neep (1) - 126 freq
eep (1) - 1 freq
keeps (1) - 151 freq
weep (1) - 14 freq
keel (1) - 9 freq
keip (1) - 1 freq
kep (1) - 135 freq
reep (1) - 1 freq
deep (1) - 533 freq
kecp (1) - 1 freq
meep (1) - 1 freq
keen (1) - 196 freq
beep (1) - 9 freq
jeep (1) - 4 freq
peep (1) - 44 freq
seep (1) - 5 freq
kemp (1) - 46 freq
kelp (1) - 15 freq
leep (1) - 2 freq
keek (1) - 195 freq
keepy (1) - 1 freq
sees (2) - 181 freq
ees (2) - 86 freq
keep (0) - 1454 freq
keepy (1) - 1 freq
kep (1) - 135 freq
keip (1) - 1 freq
keek (2) - 195 freq
leep (2) - 2 freq
kelp (2) - 15 freq
seep (2) - 5 freq
kop (2) - 2 freq
kp (2) - 6 freq
kip (2) - 38 freq
kepe (2) - 4 freq
peep (2) - 44 freq
kap (2) - 1 freq
kaip (2) - 2 freq
kemp (2) - 46 freq
jeep (2) - 4 freq
eep (2) - 1 freq
reep (2) - 1 freq
keel (2) - 9 freq
weep (2) - 14 freq
keeps (2) - 151 freq
neep (2) - 126 freq
'keep (2) - 23 freq
beep (2) - 9 freq
SoundEx code - K100
keep - 1454 freq
kep - 135 freq
kip - 38 freq
keevee - 1 freq
'keep - 23 freq
keip - 1 freq
keepy - 1 freq
kap - 1 freq
'kappa' - 1 freq
kowp - 1 freq
kop - 2 freq
kepe - 4 freq
koffie - 1 freq
kep' - 2 freq
kaip - 2 freq
kaif - 1 freq
€™kiep - 1 freq
kypie - 1 freq
€œkeep - 8 freq
keppy - 1 freq
€˜keep - 2 freq
kappa - 1 freq
kb - 4 freq
kyoab - 1 freq
kee-vee - 2 freq
keb - 2 freq
€™keep - 1 freq
kubby - 1 freq
kgb - 3 freq
kv - 10 freq
kwf - 1 freq
ksjyvvf - 1 freq
kyf - 1 freq
kcf - 1 freq
kaypee - 2 freq
kf - 3 freq
kp - 6 freq
kev - 6 freq
kif - 1 freq
kwuavb - 1 freq
kkv - 1 freq
kabb - 1 freq
kufae - 1 freq
ko-fi - 1 freq
kxf - 1 freq
kcwphh - 1 freq
kgv - 1 freq
kpv - 1 freq
kffi - 1 freq
kjcsf - 2 freq
kbp - 1 freq
kapo - 1 freq
MetaPhone code - KP
cup - 302 freq
keep - 1454 freq
cowp - 62 freq
kep - 135 freq
kip - 38 freq
gap - 48 freq
cap - 45 freq
cope - 34 freq
gawp - 9 freq
coup - 20 freq
copy - 96 freq
'keep - 23 freq
gaup - 4 freq
cuppa - 21 freq
cuppie - 25 freq
co-op - 15 freq
caip - 3 freq
quip - 2 freq
cop - 4 freq
keip - 1 freq
coapy - 2 freq
gpo - 4 freq
gowp - 65 freq
caup - 11 freq
cape - 11 freq
'cowp - 3 freq
gp - 9 freq
keepy - 1 freq
kap - 1 freq
'kappa' - 1 freq
kowp - 1 freq
kop - 2 freq
kepe - 4 freq
copie - 17 freq
copp - 1 freq
cappie - 3 freq
coapie - 4 freq
kep' - 2 freq
kaip - 2 freq
caap - 6 freq
gaip - 1 freq
gup - 2 freq
€™kiep - 1 freq
gaap - 1 freq
kypie - 1 freq
gape - 4 freq
coopie - 2 freq
€œkeep - 8 freq
coop - 1 freq
keppy - 1 freq
€˜keep - 2 freq
kappa - 1 freq
€˜copy - 1 freq
cuppy - 3 freq
€™keep - 1 freq
co-opy - 2 freq
co-oopie - 1 freq
cappy - 3 freq
kaypee - 2 freq
kp - 6 freq
qp - 7 freq
qpo - 1 freq
ykypy - 1 freq
hqpw - 1 freq
ckp - 1 freq
cp - 4 freq
kapo - 1 freq
KEEP
keep - 1454 freq
keeping - 42 freq
keepin - 211 freq
kept - 487 freq
Time to execute Levenshtein function - 0.188659 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.356004 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027731 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041357 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001152 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.