A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cim in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cim (0) - 5 freq
gim (1) - 1 freq
fim (1) - 4 freq
cin (1) - 306 freq
cam (1) - 2629 freq
wim (1) - 2 freq
caim (1) - 58 freq
cid (1) - 25 freq
cis (1) - 26 freq
cm (1) - 13 freq
im (1) - 370 freq
cit (1) - 4 freq
kim (1) - 10 freq
dim (1) - 51 freq
aim (1) - 51 freq
jim (1) - 235 freq
clim (1) - 33 freq
cims (1) - 1 freq
ci (1) - 4 freq
yim (1) - 1 freq
sim (1) - 43 freq
'im (1) - 5 freq
bim (1) - 2 freq
tim (1) - 47 freq
vim (1) - 1 freq
cim (0) - 5 freq
com (1) - 134 freq
cm (1) - 13 freq
caim (1) - 58 freq
cem (1) - 1 freq
cym (1) - 6 freq
cum (1) - 643 freq
cam (1) - 2629 freq
cbm (2) - 2 freq
ciu (2) - 1 freq
cif (2) - 4 freq
oim (2) - 14 freq
uim (2) - 1 freq
cih (2) - 2 freq
iim (2) - 2 freq
cig (2) - 1 freq
him (2) - 8459 freq
camo (2) - 1 freq
come (2) - 3162 freq
cmo (2) - 1 freq
como (2) - 1 freq
coum (2) - 2 freq
coma (2) - 13 freq
acum (2) - 1 freq
came (2) - 899 freq
SoundEx code - C500
cam - 2629 freq
can - 4783 freq
come - 3162 freq
canna - 1230 freq
cawin - 62 freq
cannae - 1702 freq
canny - 214 freq
'can - 37 freq
chum - 21 freq
cane - 22 freq
caain - 50 freq
cannie - 93 freq
cum - 643 freq
came - 899 freq
chain - 62 freq
chin - 151 freq
chimney - 22 freq
china - 53 freq
chyne - 10 freq
chaain - 7 freq
cin - 306 freq
cannae-' - 1 freq
chein - 2 freq
'cannae - 11 freq
chewin - 16 freq
'come - 73 freq
coun - 1 freq
chime - 6 freq
cuin - 8 freq
cheenae - 3 freq
'cuin - 2 freq
caum - 36 freq
cun- - 1 freq
con- - 3 freq
can- - 4 freq
caunny - 2 freq
caa'n - 1 freq
'c'm - 1 freq
'cawin - 1 freq
con - 21 freq
ca'in - 8 freq
conn - 14 freq
cunyie - 5 freq
canmnae - 1 freq
cen - 13 freq
chowin - 16 freq
chawin - 42 freq
com - 134 freq
chaine - 6 freq
coin - 46 freq
'cum - 4 freq
canne - 7 freq
caim - 58 freq
cheyn - 4 freq
cheyne - 4 freq
choon - 6 freq
chawen - 2 freq
cone - 16 freq
cn - 2 freq
caam - 11 freq
caaen - 4 freq
cna - 2 freq
caawen - 1 freq
cheeny - 4 freq
cann'a - 2 freq
caine - 1 freq
chan - 2 freq
caunie - 1 freq
cam' - 5 freq
cheena - 5 freq
caun - 5 freq
'caum - 1 freq
coma - 13 freq
chowan - 2 freq
caa'in - 5 freq
chaaen - 1 freq
chanee - 1 freq
cowan - 7 freq
chon - 3 freq
'cin - 1 freq
cyein - 1 freq
cain - 24 freq
cohen - 1 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
ciné - 2 freq
ca'n - 6 freq
caen - 4 freq
caan - 2 freq
co'm - 1 freq
chine - 2 freq
canae - 28 freq
come' - 2 freq
cammy - 10 freq
cöshin - 2 freq
cooin - 6 freq
cheinie - 1 freq
canno - 26 freq
cunyo - 2 freq
canno- - 1 freq
chemo - 2 freq
chem - 1 freq
chimhey - 1 freq
chewan - 1 freq
cunna - 1 freq
chewnie - 1 freq
cowin - 1 freq
coney - 2 freq
chean - 8 freq
cheen - 2 freq
cown - 2 freq
chen - 4 freq
cama - 1 freq
cameo - 4 freq
cheyenne - 1 freq
caa-an - 1 freq
camm - 1 freq
cana - 29 freq
cim - 5 freq
cannae' - 1 freq
ca'ain - 1 freq
come - 8 freq
cm - 1 freq
cawn - 1 freq
chainey - 1 freq
cum - 4 freq
coum - 2 freq
cann - 1 freq
can' - 1 freq
come - 32 freq
chinee - 1 freq
cheenie - 1 freq
cin - 1 freq
cheemo - 2 freq
cannae - 2 freq
''cannae - 1 freq
cun - 18 freq
can - 22 freq
cany - 5 freq
comm - 2 freq
chën - 1 freq
chewin' - 1 freq
cam - 1 freq
canni - 6 freq
caani - 1 freq
ceann - 1 freq
can - 1 freq
china - 1 freq
como - 1 freq
comme - 3 freq
cem - 1 freq
chowin' - 1 freq
chane - 1 freq
cunno - 1 freq
cheann - 1 freq
can - 1 freq
cjem - 1 freq
chawi'n - 1 freq
connie - 2 freq
cyn - 1 freq
cmmh - 1 freq
cm - 13 freq
chinny - 1 freq
cannaee - 1 freq
cani - 2 freq
connae - 1 freq
'choon' - 1 freq
cnn - 1 freq
cooney - 2 freq
cmo - 1 freq
cammay - 1 freq
cheyney - 1 freq
canio - 1 freq
ca’in - 1 freq
ccmw - 1 freq
ccwm - 3 freq
‘cone’ - 2 freq
conway - 1 freq
chummie - 1 freq
csum - 1 freq
cwyouim - 1 freq
'comma - 1 freq
comma - 1 freq
cauin - 1 freq
chiyan - 1 freq
chun - 1 freq
cannea - 10 freq
cym - 6 freq
ceen - 2 freq
co'en - 2 freq
cùm - 1 freq
camo - 1 freq
MetaPhone code - SM
some - 4208 freq
same - 1730 freq
seem - 404 freq
sum - 416 freq
sma - 268 freq
smaw - 49 freq
smue - 2 freq
sammy - 44 freq
'sammy - 1 freq
smaa - 123 freq
sea-maw - 2 freq
somme - 12 freq
semi - 22 freq
sam - 363 freq
syme - 22 freq
soum - 10 freq
soum-' - 1 freq
sim - 43 freq
soom - 10 freq
same' - 1 freq
zombie - 12 freq
zomb - 1 freq
'syme - 2 freq
'some - 10 freq
zoom - 19 freq
sma' - 15 freq
saim - 21 freq
smey' - 1 freq
'same - 4 freq
some' - 7 freq
'ssssssaaaaamy - 1 freq
saam - 2 freq
sume - 2 freq
symbo - 1 freq
simba - 1 freq
smmaa - 1 freq
som' - 2 freq
sum' - 1 freq
so'm - 1 freq
some- - 2 freq
sei'm - 2 freq
smoo - 1 freq
som - 2 freq
samba - 1 freq
sm - 4 freq
seam - 6 freq
sam' - 1 freq
zumba - 2 freq
saime - 14 freq
suomi - 2 freq
sámi - 1 freq
sie-maw - 1 freq
xoom - 1 freq
cim - 5 freq
somb - 1 freq
summ - 1 freq
some - 5 freq
soume - 1 freq
seme - 1 freq
sum - 1 freq
some - 11 freq
same - 1 freq
some - 1 freq
summa - 1 freq
same - 1 freq
saem - 4 freq
sommie - 1 freq
sæm - 4 freq
smou - 3 freq
smeu - 2 freq
cem - 1 freq
sime - 1 freq
xm - 1 freq
simmy - 1 freq
xoem - 1 freq
smo - 1 freq
hhsm - 1 freq
ssm - 1 freq
somewhy - 1 freq
smh - 3 freq
zma - 1 freq
zm - 1 freq
xxm - 1 freq
cym - 6 freq
sammi - 1 freq
sem - 1 freq
CIM
Time to execute Levenshtein function - 0.504159 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.877512 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027785 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.085332 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001335 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.