A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to kum in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
kum (0) - 8 freq
dum (1) - 16 freq
sum (1) - 416 freq
kuo (1) - 3 freq
kums (1) - 5 freq
kmm (1) - 1 freq
mum (1) - 181 freq
pum (1) - 3 freq
kim (1) - 10 freq
cum (1) - 643 freq
hum (1) - 45 freq
bum (1) - 44 freq
kuq (1) - 1 freq
gum (1) - 19 freq
lum (1) - 140 freq
eum (1) - 1 freq
kjm (1) - 1 freq
rum (1) - 32 freq
'kum (1) - 1 freq
kam (1) - 24 freq
km (1) - 4 freq
fum (1) - 1 freq
wum (1) - 1 freq
yum (1) - 22 freq
um (1) - 151 freq
kum (0) - 8 freq
kom (1) - 2 freq
kim (1) - 10 freq
kam (1) - 24 freq
km (1) - 4 freq
ku (2) - 3 freq
kua (2) - 1 freq
um (2) - 151 freq
fum (2) - 1 freq
wum (2) - 1 freq
yum (2) - 22 freq
'um (2) - 35 freq
koam (2) - 1 freq
akim (2) - 2 freq
kaam (2) - 1 freq
kame (2) - 6 freq
tum (2) - 17 freq
kaim (2) - 13 freq
jum (2) - 3 freq
kyam (2) - 1 freq
dum (2) - 16 freq
'kum (2) - 1 freq
pum (2) - 3 freq
sum (2) - 416 freq
kmm (2) - 1 freq
SoundEx code - K500
ken - 5071 freq
kinna - 281 freq
know - 1205 freq
keen - 205 freq
knee - 126 freq
kin - 1383 freq
knew - 227 freq
kan - 28 freq
kin- - 1 freq
kyn - 20 freq
knaw - 16 freq
kenna - 11 freq
ken-nae - 3 freq
ken- - 11 freq
knowe - 56 freq
kaim - 13 freq
kenny - 46 freq
'ken - 8 freq
kemnay - 4 freq
ken' - 3 freq
kinnae - 10 freq
'know - 2 freq
kine - 101 freq
k-ken - 1 freq
'kin - 7 freq
kynea - 1 freq
kcyna - 1 freq
kcyn - 1 freq
keyn - 5 freq
keyna - 4 freq
kyne - 11 freq
kinn - 4 freq
keno - 1 freq
kim - 10 freq
khayyam - 1 freq
kame - 6 freq
kenya - 6 freq
kane - 31 freq
knee-heh - 2 freq
kin' - 2 freq
kïssin - 1 freq
kanna - 1 freq
kmee - 1 freq
km - 4 freq
kyin - 1 freq
'ken' - 4 freq
k'nie - 1 freq
kinno - 24 freq
kin-' - 1 freq
kum - 8 freq
kam - 24 freq
kunna - 1 freq
kehm - 1 freq
'kum - 1 freq
kaam - 1 freq
kaen - 12 freq
know' - 1 freq
ken-no - 1 freq
-kin - 1 freq
-ken - 1 freq
«kin - 1 freq
'know' - 1 freq
'knowe' - 1 freq
khan - 3 freq
kennawha - 9 freq
kni - 1 freq
kenny' - 1 freq
kno - 16 freq
koam - 1 freq
€œken - 2 freq
kene - 1 freq
know-how - 1 freq
kian - 1 freq
kennae - 1 freq
€˜ken - 2 freq
€˜know - 3 freq
€™know - 6 freq
€˜kin - 1 freq
koen - 2 freq
'kinna - 1 freq
€˜kenny - 1 freq
€œkum - 1 freq
keyin - 1 freq
€œkin - 2 freq
kain - 2 freq
'keen - 2 freq
kannae - 1 freq
kahn - 2 freq
€™kin - 2 freq
€™ken - 1 freq
kÂ’in - 1 freq
‘kin - 1 freq
kxn - 1 freq
kgeyom - 1 freq
“ken - 1 freq
kein - 1 freq
kano - 1 freq
kkano - 1 freq
kyam - 1 freq
kahoona - 1 freq
kom - 2 freq
kjm - 1 freq
knei - 1 freq
keen' - 1 freq
kiyan - 1 freq
kanyou - 1 freq
kmm - 1 freq
‘ken’ - 1 freq
keano - 1 freq
kennie - 1 freq
knee' - 1 freq
knaa - 1 freq
MetaPhone code - KM
cam - 2629 freq
come - 3162 freq
game - 648 freq
gamie - 12 freq
cum - 643 freq
came - 899 freq
'come - 73 freq
combo - 5 freq
caum - 36 freq
'c'm - 1 freq
kaim - 13 freq
gum - 19 freq
gammy - 8 freq
com - 134 freq
gammie - 4 freq
'cum - 4 freq
caim - 58 freq
caam - 11 freq
cam' - 5 freq
comb - 19 freq
'caum - 1 freq
coma - 13 freq
kim - 10 freq
'gome - 1 freq
kame - 6 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
gam - 2 freq
co'm - 1 freq
gaime - 1 freq
come' - 2 freq
cammy - 10 freq
kmee - 1 freq
km - 4 freq
gaem - 1 freq
gummy - 6 freq
kum - 8 freq
kam - 24 freq
gm - 10 freq
gmb - 4 freq
kehm - 1 freq
'kum - 1 freq
kaam - 1 freq
goom - 1 freq
cama - 1 freq
gome - 1 freq
cameo - 4 freq
guiami - 1 freq
camm - 1 freq
€˜come - 8 freq
koam - 1 freq
€œcm - 1 freq
cumbie - 1 freq
caimb - 9 freq
€œcum - 4 freq
coum - 2 freq
gme - 45 freq
€œcome - 32 freq
gamma - 9 freq
gambo - 1 freq
comm - 2 freq
€˜cam - 1 freq
combe - 1 freq
€œkum - 1 freq
kumbh - 3 freq
como - 1 freq
comme - 3 freq
qmh - 1 freq
cmmh - 1 freq
cm - 13 freq
qmy - 1 freq
cmo - 1 freq
cammay - 1 freq
qwmi - 1 freq
game” - 1 freq
gaim - 1 freq
kom - 2 freq
'comma - 1 freq
comma - 1 freq
camby - 1 freq
qm - 1 freq
kmm - 1 freq
qqmw - 1 freq
game' - 1 freq
cùm - 1 freq
camo - 1 freq
KUM
Time to execute Levenshtein function - 0.233489 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.375052 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028035 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037310 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000944 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.