A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cake in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cake (0) - 165 freq
eake (1) - 1 freq
cace (1) - 1 freq
caie (1) - 2 freq
caked (1) - 6 freq
crake (1) - 1 freq
fake (1) - 57 freq
kake (1) - 1 freq
cage (1) - 58 freq
cae (1) - 7 freq
coke (1) - 34 freq
cafe (1) - 31 freq
cave (1) - 79 freq
wake (1) - 92 freq
cape (1) - 11 freq
ake (1) - 7 freq
pake (1) - 1 freq
care (1) - 462 freq
cak (1) - 1 freq
case (1) - 440 freq
lake (1) - 111 freq
jake (1) - 71 freq
make (1) - 622 freq
came (1) - 890 freq
bake (1) - 37 freq
cake (0) - 165 freq
cak (1) - 1 freq
coke (1) - 34 freq
cane (2) - 22 freq
dake (2) - 4 freq
sake (2) - 241 freq
cakes (2) - 54 freq
make (2) - 622 freq
came (2) - 890 freq
bake (2) - 37 freq
cate (2) - 2 freq
take (2) - 676 freq
ck (2) - 14 freq
ckie (2) - 1 freq
ciky (2) - 1 freq
cauk (2) - 5 freq
cokey (2) - 1 freq
rake (2) - 31 freq
cok (2) - 1 freq
jake (2) - 71 freq
hake (2) - 3 freq
cace (2) - 1 freq
eake (2) - 1 freq
cage (2) - 58 freq
fake (2) - 57 freq
SoundEx code - C200
check - 202 freq
cosy - 55 freq
chucks - 4 freq
cheek - 137 freq
cheeks - 75 freq
case - 440 freq
cake - 165 freq
cook - 201 freq
cocky - 12 freq
cog - 3 freq
cause - 1187 freq
choosy-hoi - 1 freq
choke - 22 freq
cheeky - 91 freq
chuck - 29 freq
choice - 152 freq
cage - 58 freq
cease - 10 freq
chuise - 15 freq
'cause - 16 freq
cosh - 12 freq
cuzco - 1 freq
coos - 71 freq
chug - 2 freq
chaws - 1 freq
cash - 84 freq
couch - 67 freq
cos - 449 freq
chose - 60 freq
chessie - 1 freq
cock - 79 freq
cogs - 2 freq
causey - 25 freq
cheese - 138 freq
cheque - 20 freq
caas - 44 freq
chaise - 3 freq
chase - 48 freq
chis - 2 freq
chicks - 8 freq
cuz - 8 freq
cough - 59 freq
coz - 44 freq
'cos - 7 freq
cows - 5 freq
coach - 51 freq
choose - 79 freq
chowk - 6 freq
cack - 4 freq
chowks - 9 freq
cushie - 15 freq
cou's - 1 freq
chuckie - 10 freq
cushy - 3 freq
chik - 11 freq
cissie - 7 freq
caws - 36 freq
caus - 8 freq
caz - 2 freq
cassie - 10 freq
cuiks - 2 freq
coggie - 5 freq
czech - 7 freq
ciseau - 1 freq
cowk - 4 freq
chyce - 51 freq
chez - 2 freq
cocks - 11 freq
chack - 8 freq
ca's - 4 freq
cuik - 4 freq
cheesy - 15 freq
chookie - 66 freq
chakks - 1 freq
chico - 5 freq
czesc - 1 freq
chikk - 1 freq
cooks - 5 freq
chese - 1 freq
cogie - 6 freq
chaos - 24 freq
chooky - 6 freq
chuse - 9 freq
chess - 4 freq
cheesey - 3 freq
cauk - 5 freq
cock's - 3 freq
chooks - 18 freq
chice - 3 freq
couse - 23 freq
cus - 130 freq
cis - 26 freq
caise - 6 freq
coasee - 1 freq
ciss - 1 freq
chooch - 1 freq
chuk - 1 freq
choyse - 1 freq
coke - 34 freq
casey - 7 freq
choice' - 2 freq
chook - 4 freq
casks - 2 freq
chak - 5 freq
cookie - 15 freq
coco - 5 freq
'cash - 1 freq
cow's - 2 freq
ciggie - 4 freq
cagey - 4 freq
cozy - 4 freq
cosie - 22 freq
checky - 3 freq
'check - 6 freq
'cook - 1 freq
casie - 3 freq
chiky - 3 freq
chiks - 17 freq
cheege - 1 freq
choosie - 1 freq
'cocky' - 2 freq
cheus - 6 freq
caase - 29 freq
cox - 5 freq
chyach - 1 freq
cozzie - 3 freq
chazza - 6 freq
coo's - 16 freq
cokey - 1 freq
chick - 16 freq
cauge - 1 freq
coca - 1 freq
chuza - 2 freq
choc - 4 freq
cacao - 14 freq
choo's - 1 freq
caause - 2 freq
chews - 1 freq
cigs - 1 freq
caa's - 1 freq
cass - 14 freq
cooch - 7 freq
'chookie - 2 freq
chucky - 1 freq
coose - 8 freq
causay - 2 freq
checks - 12 freq
cask - 2 freq
'cheeky - 1 freq
cowie's - 2 freq
cowie''s - 1 freq
cox's - 1 freq
cak - 1 freq
chukkie - 1 freq
coks - 1 freq
cheke - 5 freq
cheuss - 2 freq
choss - 2 freq
chuggie - 2 freq
coax - 4 freq
cas - 4 freq
chough - 1 freq
cocoa - 3 freq
chic - 10 freq
cha's - 1 freq
chocs - 1 freq
chaas - 2 freq
cues - 3 freq
cheik - 1 freq
cecco - 18 freq
ciaes - 1 freq
cais - 1 freq
caskie - 4 freq
cowks - 1 freq
chock - 1 freq
caess - 3 freq
cows' - 1 freq
coax' - 1 freq
caa-caa - 1 freq
chouks - 5 freq
cheise - 1 freq
chouk - 1 freq
coq - 1 freq
cheesie - 1 freq
cookie' - 2 freq
chyaach - 1 freq
choosy - 1 freq
€˜cozie - 1 freq
€˜cacks - 1 freq
chasse - 2 freq
choiss - 2 freq
ckeck - 1 freq
chakk - 4 freq
chikks - 6 freq
cok - 1 freq
causie - 8 freq
cheis - 1 freq
cace - 1 freq
caig - 1 freq
cusa - 1 freq
€™casey - 1 freq
chows - 1 freq
cuckoo - 9 freq
causeway - 3 freq
€œcos - 1 freq
chuggy - 1 freq
chasie - 1 freq
cous - 1 freq
cac - 1 freq
cses - 2 freq
caese - 2 freq
chos - 1 freq
chugs - 1 freq
€œcuckoo - 4 freq
€˜case - 1 freq
€˜cock - 1 freq
€¦cos - 1 freq
€˜cause - 7 freq
€œcause - 2 freq
caius - 3 freq
€™cause - 2 freq
€œcheeky - 1 freq
€œcowk - 2 freq
chacks - 1 freq
chackie - 2 freq
chuko - 1 freq
chu-ko - 3 freq
chise - 2 freq
€˜cos - 4 freq
€™cheeky - 1 freq
cag - 1 freq
cheeeeese - 1 freq
€˜cus - 1 freq
ckykzz - 1 freq
chaz - 19 freq
cooÂ’s - 3 freq
czazo - 1 freq
casa - 1 freq
ciesx - 1 freq
chegs - 3 freq
ckhhyys - 1 freq
csez - 1 freq
cossye - 1 freq
cossy - 1 freq
coxÂ’ - 1 freq
cheezy - 1 freq
cxoj - 1 freq
cos' - 1 freq
ckwg - 1 freq
cashe - 1 freq
chacha - 1 freq
chase' - 1 freq
cheuch - 2 freq
cec - 1 freq
chek - 5 freq
'cheugh' - 1 freq
cig - 1 freq
ccxic - 1 freq
cheeko' - 2 freq
chck - 1 freq
ciky - 1 freq
cookoo - 1 freq
cuhj - 1 freq
chag - 1 freq
casc - 1 freq
cach - 1 freq
'cach - 1 freq
MetaPhone code - KK
quick - 371 freq
keek - 199 freq
gowk - 47 freq
cake - 165 freq
cook - 201 freq
cocky - 12 freq
cog - 3 freq
kick - 120 freq
cock - 79 freq
kg - 4 freq
guckie - 1 freq
guig - 1 freq
gaig - 1 freq
cack - 4 freq
gawkie - 1 freq
'quick - 5 freq
kowk - 1 freq
coggie - 5 freq
gq - 3 freq
cowk - 4 freq
cuik - 4 freq
c-cou - 1 freq
queek - 22 freq
gaga - 1 freq
cauk - 5 freq
cc - 19 freq
coke - 34 freq
quickie - 1 freq
cookie - 15 freq
quake - 7 freq
ga-ga - 1 freq
coco - 5 freq
'cook - 1 freq
quack - 7 freq
gawk - 3 freq
gok - 1 freq
'cocky' - 2 freq
keekiy - 1 freq
kock - 1 freq
cokey - 1 freq
goog - 1 freq
coca - 1 freq
keik - 13 freq
quïck - 6 freq
gaeg - 1 freq
cacao - 14 freq
gaawk - 1 freq
guik - 1 freq
cak - 1 freq
quik - 5 freq
kik - 1 freq
cocoa - 3 freq
koko - 1 freq
queeck - 1 freq
kek - 1 freq
quaak - 3 freq
caa-caa - 1 freq
keekie - 1 freq
guga - 1 freq
coq - 1 freq
cookie' - 2 freq
gog - 1 freq
kiek - 1 freq
€œquick - 3 freq
ckeck - 1 freq
cok - 1 freq
caig - 1 freq
kake - 1 freq
quyk - 1 freq
cuckoo - 9 freq
cac - 1 freq
€˜quick - 1 freq
€œcuckoo - 4 freq
€˜cock - 1 freq
€œcowk - 2 freq
gawky - 1 freq
keg - 2 freq
quäck - 1 freq
cag - 1 freq
gowkie - 1 freq
€œgowk - 1 freq
gag - 3 freq
kc - 7 freq
kyg - 1 freq
gk - 2 freq
cqah - 1 freq
qwqa - 1 freq
cg - 3 freq
qku - 1 freq
ygk - 1 freq
yqc - 1 freq
kuq - 1 freq
qg - 1 freq
ygq - 1 freq
gca - 1 freq
yqg - 1 freq
gki - 1 freq
qcu - 1 freq
cqu - 1 freq
wkkc - 1 freq
gkk - 1 freq
kca - 1 freq
kic - 1 freq
gco - 1 freq
qc - 2 freq
hcg - 1 freq
kqu - 1 freq
ckwg - 1 freq
gc - 3 freq
hqec - 1 freq
qgu - 1 freq
'keek' - 3 freq
'gowk' - 1 freq
kqe - 1 freq
ggc - 1 freq
ykg - 1 freq
gquo - 1 freq
kwc - 1 freq
gcuo - 1 freq
qk - 1 freq
cqe - 1 freq
gqe - 1 freq
cookoo - 1 freq
wcq - 1 freq
ygka - 1 freq
gkh - 1 freq
ckkg - 1 freq
gqo - 1 freq
qiki - 1 freq
hcc - 1 freq
CAKE
Time to execute Levenshtein function - 0.176399 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.329798 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027132 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036358 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000812 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.