A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cec in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cec (0) - 1 freq
cem (1) - 1 freq
rec (1) - 1 freq
ec (1) - 18 freq
ce (1) - 7 freq
mec (1) - 1 freq
cer (1) - 1 freq
eec (1) - 2 freq
cet (1) - 1 freq
wec (1) - 1 freq
cee (1) - 2 freq
cep (1) - 1 freq
cea (1) - 1 freq
cel (1) - 1 freq
clc (1) - 1 freq
nec (1) - 2 freq
cac (1) - 1 freq
dec (1) - 9 freq
lec (1) - 1 freq
cen (1) - 12 freq
cei (1) - 1 freq
cc (1) - 19 freq
cvc (1) - 2 freq
sec (1) - 10 freq
ceo (1) - 3 freq
cec (0) - 1 freq
cac (1) - 1 freq
cc (1) - 19 freq
cmc (2) - 1 freq
yec (2) - 1 freq
gec (2) - 1 freq
cvc (2) - 2 freq
cei (2) - 1 freq
cem (2) - 1 freq
ceo (2) - 3 freq
coco (2) - 5 freq
icic (2) - 1 freq
aecc (2) - 5 freq
ccea (2) - 1 freq
ecc (2) - 1 freq
coca (2) - 1 freq
cace (2) - 1 freq
cce (2) - 1 freq
cen (2) - 12 freq
sec (2) - 10 freq
eec (2) - 2 freq
lec (2) - 1 freq
wec (2) - 1 freq
cer (2) - 1 freq
mec (2) - 1 freq
SoundEx code - C200
check - 202 freq
cosy - 55 freq
chucks - 4 freq
cheek - 137 freq
cheeks - 75 freq
case - 440 freq
cake - 165 freq
cook - 201 freq
cocky - 12 freq
cog - 3 freq
cause - 1187 freq
choosy-hoi - 1 freq
choke - 22 freq
cheeky - 91 freq
chuck - 29 freq
choice - 152 freq
cage - 58 freq
cease - 10 freq
chuise - 15 freq
'cause - 16 freq
cosh - 12 freq
cuzco - 1 freq
coos - 71 freq
chug - 2 freq
chaws - 1 freq
cash - 84 freq
couch - 67 freq
cos - 449 freq
chose - 60 freq
chessie - 1 freq
cock - 79 freq
cogs - 2 freq
causey - 25 freq
cheese - 138 freq
cheque - 20 freq
caas - 44 freq
chaise - 3 freq
chase - 48 freq
chis - 2 freq
chicks - 8 freq
cuz - 8 freq
cough - 59 freq
coz - 44 freq
'cos - 7 freq
cows - 5 freq
coach - 51 freq
choose - 79 freq
chowk - 6 freq
cack - 4 freq
chowks - 9 freq
cushie - 15 freq
cou's - 1 freq
chuckie - 10 freq
cushy - 3 freq
chik - 11 freq
cissie - 7 freq
caws - 36 freq
caus - 8 freq
caz - 2 freq
cassie - 10 freq
cuiks - 2 freq
coggie - 5 freq
czech - 7 freq
ciseau - 1 freq
cowk - 4 freq
chyce - 51 freq
chez - 2 freq
cocks - 11 freq
chack - 8 freq
ca's - 4 freq
cuik - 4 freq
cheesy - 15 freq
chookie - 66 freq
chakks - 1 freq
chico - 5 freq
czesc - 1 freq
chikk - 1 freq
cooks - 5 freq
chese - 1 freq
cogie - 6 freq
chaos - 24 freq
chooky - 6 freq
chuse - 9 freq
chess - 4 freq
cheesey - 3 freq
cauk - 5 freq
cock's - 3 freq
chooks - 18 freq
chice - 3 freq
couse - 23 freq
cus - 130 freq
cis - 26 freq
caise - 6 freq
coasee - 1 freq
ciss - 1 freq
chooch - 1 freq
chuk - 1 freq
choyse - 1 freq
coke - 34 freq
casey - 7 freq
choice' - 2 freq
chook - 4 freq
casks - 2 freq
chak - 5 freq
cookie - 15 freq
coco - 5 freq
'cash - 1 freq
cow's - 2 freq
ciggie - 4 freq
cagey - 4 freq
cozy - 4 freq
cosie - 22 freq
checky - 3 freq
'check - 6 freq
'cook - 1 freq
casie - 3 freq
chiky - 3 freq
chiks - 17 freq
cheege - 1 freq
choosie - 1 freq
'cocky' - 2 freq
cheus - 6 freq
caase - 29 freq
cox - 5 freq
chyach - 1 freq
cozzie - 3 freq
chazza - 6 freq
coo's - 16 freq
cokey - 1 freq
chick - 16 freq
cauge - 1 freq
coca - 1 freq
chuza - 2 freq
choc - 4 freq
cacao - 14 freq
choo's - 1 freq
caause - 2 freq
chews - 1 freq
cigs - 1 freq
caa's - 1 freq
cass - 14 freq
cooch - 7 freq
'chookie - 2 freq
chucky - 1 freq
coose - 8 freq
causay - 2 freq
checks - 12 freq
cask - 2 freq
'cheeky - 1 freq
cowie's - 2 freq
cowie''s - 1 freq
cox's - 1 freq
cak - 1 freq
chukkie - 1 freq
coks - 1 freq
cheke - 5 freq
cheuss - 2 freq
choss - 2 freq
chuggie - 2 freq
coax - 4 freq
cas - 4 freq
chough - 1 freq
cocoa - 3 freq
chic - 10 freq
cha's - 1 freq
chocs - 1 freq
chaas - 2 freq
cues - 3 freq
cheik - 1 freq
cecco - 18 freq
ciaes - 1 freq
cais - 1 freq
caskie - 4 freq
cowks - 1 freq
chock - 1 freq
caess - 3 freq
cows' - 1 freq
coax' - 1 freq
caa-caa - 1 freq
chouks - 5 freq
cheise - 1 freq
chouk - 1 freq
coq - 1 freq
cheesie - 1 freq
cookie' - 2 freq
chyaach - 1 freq
choosy - 1 freq
€˜cozie - 1 freq
€˜cacks - 1 freq
chasse - 2 freq
choiss - 2 freq
ckeck - 1 freq
chakk - 4 freq
chikks - 6 freq
cok - 1 freq
causie - 8 freq
cheis - 1 freq
cace - 1 freq
caig - 1 freq
cusa - 1 freq
€™casey - 1 freq
chows - 1 freq
cuckoo - 9 freq
causeway - 3 freq
€œcos - 1 freq
chuggy - 1 freq
chasie - 1 freq
cous - 1 freq
cac - 1 freq
cses - 2 freq
caese - 2 freq
chos - 1 freq
chugs - 1 freq
€œcuckoo - 4 freq
€˜case - 1 freq
€˜cock - 1 freq
€¦cos - 1 freq
€˜cause - 7 freq
€œcause - 2 freq
caius - 3 freq
€™cause - 2 freq
€œcheeky - 1 freq
€œcowk - 2 freq
chacks - 1 freq
chackie - 2 freq
chuko - 1 freq
chu-ko - 3 freq
chise - 2 freq
€˜cos - 4 freq
€™cheeky - 1 freq
cag - 1 freq
cheeeeese - 1 freq
€˜cus - 1 freq
ckykzz - 1 freq
chaz - 19 freq
cooÂ’s - 3 freq
czazo - 1 freq
casa - 1 freq
ciesx - 1 freq
chegs - 3 freq
ckhhyys - 1 freq
csez - 1 freq
cossye - 1 freq
cossy - 1 freq
coxÂ’ - 1 freq
cheezy - 1 freq
cxoj - 1 freq
cos' - 1 freq
ckwg - 1 freq
cashe - 1 freq
chacha - 1 freq
chase' - 1 freq
cheuch - 2 freq
cec - 1 freq
chek - 5 freq
'cheugh' - 1 freq
cig - 1 freq
ccxic - 1 freq
cheeko' - 2 freq
chck - 1 freq
ciky - 1 freq
cookoo - 1 freq
cuhj - 1 freq
chag - 1 freq
casc - 1 freq
cach - 1 freq
'cach - 1 freq
MetaPhone code - SK
sky - 429 freq
seik - 44 freq
seek - 199 freq
sic - 1408 freq
sake - 241 freq
sec - 10 freq
sac - 14 freq
'sic - 3 freq
seeck - 23 freq
sick - 148 freq
saga - 12 freq
syke - 5 freq
seck - 16 freq
seick - 7 freq
souk - 14 freq
sock - 26 freq
sack - 43 freq
sik - 49 freq
soggy - 15 freq
sook - 63 freq
ski - 7 freq
sikk - 10 freq
sica - 2 freq
sca - 4 freq
zouk - 1 freq
soak - 15 freq
suck - 9 freq
sek - 4 freq
ciggie - 4 freq
soc - 3 freq
saig - 3 freq
siggy - 2 freq
'sky' - 1 freq
'sook - 1 freq
skie - 2 freq
seq - 1 freq
saek - 3 freq
seak - 2 freq
sikh - 8 freq
zig - 6 freq
zag - 7 freq
saiq - 2 freq
squ - 1 freq
sicka - 1 freq
sag - 4 freq
soeak - 1 freq
sg - 15 freq
'suki - 1 freq
seck' - 1 freq
sook' - 1 freq
seg - 1 freq
zac - 4 freq
'zac' - 1 freq
zak - 4 freq
zowk - 1 freq
sig- - 1 freq
sig - 3 freq
hysk - 1 freq
sooky - 2 freq
sookie - 1 freq
ssc - 7 freq
skeo - 1 freq
soack - 1 freq
suk - 2 freq
'soak' - 1 freq
sc - 14 freq
saick - 1 freq
seec - 1 freq
sco - 13 freq
sake' - 1 freq
sqe - 1 freq
sqa - 24 freq
€œsky - 2 freq
seke - 9 freq
sickie - 2 freq
sago - 1 freq
€œsco - 1 freq
skeh - 1 freq
sae-caa - 1 freq
€˜sake - 3 freq
€œsick - 1 freq
sic- - 1 freq
skew - 1 freq
€œsook - 1 freq
€˜sic - 1 freq
zc - 3 freq
syc - 1 freq
seek' - 1 freq
sk - 4 freq
hcycu - 1 freq
zg - 6 freq
zk - 3 freq
sq - 1 freq
zq - 6 freq
zeke - 1 freq
seggy - 2 freq
hzq - 1 freq
xwgo - 1 freq
hzqe - 1 freq
hsq - 1 freq
zzg - 1 freq
xc - 2 freq
sega - 1 freq
xk - 1 freq
sicwy - 1 freq
saggy - 1 freq
saag - 1 freq
swq - 1 freq
cec - 1 freq
xiq - 1 freq
hzc - 1 freq
xxxk - 1 freq
skaw - 1 freq
xq - 2 freq
cig - 1 freq
xuc - 1 freq
xqu - 2 freq
ciky - 1 freq
zzhq - 1 freq
zico - 1 freq
swk - 39 freq
xxg - 1 freq
swc - 1 freq
zoioigaa - 1 freq
wskea - 1 freq
yzek - 2 freq
zqe - 1 freq
xhg - 1 freq
CEC
Time to execute Levenshtein function - 0.189995 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.354879 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031059 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037518 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000887 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.