A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to kïss in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
kïss (0) - 3 freq
kïsts (1) - 1 freq
kïsst (1) - 2 freq
kïst (1) - 1 freq
kyiss (2) - 1 freq
fïts (2) - 1 freq
ïts (2) - 16 freq
kloss (2) - 4 freq
hïts (2) - 1 freq
sïns (2) - 12 freq
kness (2) - 1 freq
pïgs (2) - 7 freq
kïll (2) - 13 freq
kïng (2) - 26 freq
keess (2) - 10 freq
skïns (2) - 3 freq
kïssin (2) - 1 freq
bïds (2) - 1 freq
kïngs (2) - 3 freq
fïsh (2) - 3 freq
mïsst (2) - 1 freq
ïs (2) - 260 freq
fïgs (2) - 3 freq
sïts (2) - 1 freq
bïts (2) - 7 freq
kïss (0) - 3 freq
kïsts (2) - 1 freq
kïst (2) - 1 freq
kïsst (2) - 2 freq
kïssin (3) - 1 freq
mïsst (4) - 1 freq
ïs (4) - 260 freq
kïngs (4) - 3 freq
fïgs (4) - 3 freq
fïsh (4) - 3 freq
kiss (4) - 123 freq
kless (4) - 1 freq
kess (4) - 5 freq
hïs (4) - 331 freq
bïts (4) - 7 freq
sïts (4) - 1 freq
bïds (4) - 1 freq
kloss (4) - 4 freq
sïns (4) - 12 freq
ïts (4) - 16 freq
fïts (4) - 1 freq
kyiss (4) - 1 freq
kness (4) - 1 freq
hïts (4) - 1 freq
pïgs (4) - 7 freq
SoundEx code - K000
kye - 140 freq
ka - 12 freq
k - 205 freq
key - 219 freq
kg - 4 freq
ko - 7 freq
k's - 1 freq
kk - 4 freq
ky - 6 freq
ke - 18 freq
kaay - 1 freq
kiwi - 2 freq
kïss - 3 freq
kay - 67 freq
kiæie - 1 freq
kaa - 2 freq
kew - 1 freq
kau - 2 freq
'ke - 1 freq
k - 3 freq
k - 3 freq
k - 3 freq
kyo - 2 freq
ku - 3 freq
kye' - 1 freq
kko - 3 freq
kkaw - 1 freq
kh - 5 freq
kaye - 4 freq
kie - 16 freq
k - 3 freq
kuei - 2 freq
kso - 2 freq
kuo - 3 freq
ks - 3 freq
kye- - 1 freq
kke - 1 freq
kc - 7 freq
kx - 6 freq
kzeyi - 1 freq
kz - 4 freq
kczoe - 1 freq
kj - 4 freq
kxx - 1 freq
kca - 1 freq
kky - 1 freq
kyu - 1 freq
kqu - 1 freq
kge - 1 freq
ki - 3 freq
kqks - 1 freq
kci - 1 freq
ksc - 1 freq
kgs - 1 freq
kks - 1 freq
kqe - 1 freq
kjkz - 1 freq
kgq - 1 freq
kxo - 1 freq
kxiw - 1 freq
kwai - 1 freq
kjow - 1 freq
kua - 1 freq
kw - 1 freq
MetaPhone code - KS
gauze - 7 freq
cosy - 57 freq
goes - 331 freq
case - 447 freq
gaze - 44 freq
keys - 63 freq
kis - 142 freq
cause - 1186 freq
guess - 148 freq
goose - 28 freq
kiss - 123 freq
gausie - 3 freq
'cause - 18 freq
gas - 88 freq
queues - 9 freq
coos - 74 freq
cos - 456 freq
quiz - 19 freq
guys - 472 freq
guy's - 13 freq
causey - 25 freq
'quiz - 1 freq
caas - 44 freq
cuz - 8 freq
coz - 44 freq
'cos - 7 freq
cows - 5 freq
gaes - 173 freq
cou's - 1 freq
hkes - 1 freq
caws - 36 freq
'guess' - 1 freq
caus - 8 freq
caz - 2 freq
cassie - 10 freq
guse - 1 freq
quasi - 1 freq
ca's - 5 freq
gous - 1 freq
guys' - 1 freq
queue's - 1 freq
gus - 21 freq
queys - 1 freq
quoys - 24 freq
couse - 23 freq
cus - 130 freq
caise - 6 freq
coasee - 1 freq
gce - 4 freq
gass - 4 freq
guiy's - 1 freq
k's - 1 freq
casey - 7 freq
guise - 17 freq
gays - 2 freq
key's - 1 freq
cosie - 23 freq
causie - 9 freq
csi - 3 freq
cow's - 2 freq
cozy - 4 freq
casie - 3 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
caase - 29 freq
cozzie - 3 freq
coo's - 16 freq
gause - 4 freq
goss - 1 freq
kïss - 3 freq
caause - 2 freq
gawsie - 4 freq
gaius - 1 freq
caa's - 1 freq
cass - 14 freq
keess - 10 freq
kaiys - 1 freq
coose - 8 freq
gös - 2 freq
causay - 2 freq
kay's - 2 freq
gos - 1 freq
kiz - 18 freq
kess - 5 freq
cas - 4 freq
queasy - 2 freq
ga's - 1 freq
gauzy - 1 freq
ccea - 1 freq
cce - 1 freq
cues - 3 freq
cs - 5 freq
kaas - 2 freq
cais - 1 freq
caess - 3 freq
cows' - 1 freq
gøs - 1 freq
kissie - 1 freq
c's - 1 freq
cozie - 1 freq
gaws - 1 freq
queazy - 1 freq
guiss - 1 freq
gauss - 2 freq
cace - 1 freq
kace - 2 freq
kies - 1 freq
cusa - 1 freq
casey - 1 freq
goosey - 6 freq
kays - 1 freq
kehs - 1 freq
cos - 1 freq
cous - 1 freq
caese - 2 freq
case - 1 freq
kezia - 5 freq
cos - 1 freq
cause - 7 freq
cause - 2 freq
caius - 3 freq
cause - 2 freq
kso - 2 freq
guiss - 1 freq
ks - 3 freq
gows - 2 freq
cos - 4 freq
cus - 1 freq
czy - 1 freq
gz - 3 freq
cz - 5 freq
coo’s - 3 freq
qois - 1 freq
kz - 4 freq
khza - 1 freq
casa - 1 freq
gs - 3 freq
kaz - 3 freq
gaz - 1 freq
qcy - 1 freq
ckhhyys - 1 freq
gaza - 1 freq
hx - 5 freq
cossy - 1 freq
wx - 4 freq
qwz - 1 freq
ga’s - 1 freq
gazza - 3 freq
gsy - 1 freq
cos' - 1 freq
yx - 2 freq
hgggssss - 1 freq
wks - 1 freq
wxaw - 1 freq
kci - 1 freq
wxi - 1 freq
kazoo - 1 freq
kks - 1 freq
hhqs - 1 freq
ccyh - 1 freq
qzw - 1 freq
qs - 2 freq
güs - 1 freq
csw - 2 freq
hxi - 1 freq
qaz - 1 freq
qis - 1 freq
guz - 1 freq
czw - 1 freq
goz - 1 freq
kes - 1 freq
yxa - 1 freq
kiss' - 1 freq
gazzah - 3 freq
qz - 1 freq
csa - 1 freq
hhx - 1 freq
yxx - 1 freq
gassy - 1 freq
KÏSS
Time to execute Levenshtein function - 0.228192 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.385901 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028342 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041878 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001126 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.