A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cim in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cim (0) - 5 freq
nim (1) - 9 freq
vim (1) - 1 freq
cfm (1) - 1 freq
'im (1) - 5 freq
gim (1) - 1 freq
cis (1) - 26 freq
cif (1) - 4 freq
cum (1) - 641 freq
iim (1) - 2 freq
cig (1) - 1 freq
cm (1) - 13 freq
dim (1) - 50 freq
cam (1) - 2618 freq
wim (1) - 2 freq
cih (1) - 2 freq
jim (1) - 234 freq
com (1) - 131 freq
oim (1) - 14 freq
clim (1) - 33 freq
rim (1) - 14 freq
ciu (1) - 1 freq
uim (1) - 1 freq
cims (1) - 1 freq
aim (1) - 50 freq
cim (0) - 5 freq
cm (1) - 13 freq
cam (1) - 2618 freq
caim (1) - 57 freq
com (1) - 131 freq
cum (1) - 641 freq
cem (1) - 1 freq
cym (1) - 6 freq
cit (2) - 4 freq
ci (2) - 4 freq
bim (2) - 2 freq
cin (2) - 289 freq
him (2) - 8386 freq
cbm (2) - 2 freq
cia (2) - 3 freq
kim (2) - 10 freq
mim (2) - 4 freq
como (2) - 1 freq
coma (2) - 13 freq
cama (2) - 1 freq
coum (2) - 2 freq
come (2) - 3111 freq
caam (2) - 11 freq
cmo (2) - 1 freq
acum (2) - 1 freq
SoundEx code - C500
cam - 2618 freq
can - 4706 freq
come - 3111 freq
canna - 1221 freq
cawin - 60 freq
cannae - 1640 freq
canny - 212 freq
'can - 36 freq
chum - 21 freq
cane - 22 freq
caain - 50 freq
cannie - 92 freq
cum - 641 freq
came - 890 freq
chain - 61 freq
chin - 149 freq
chimney - 22 freq
china - 53 freq
chyne - 10 freq
chaain - 7 freq
cin - 289 freq
cannae-' - 1 freq
chein - 2 freq
'cannae - 11 freq
chewin - 15 freq
'come - 73 freq
coun - 1 freq
chime - 6 freq
cuin - 8 freq
cheenae - 3 freq
'cuin - 2 freq
caum - 36 freq
cun- - 1 freq
con- - 3 freq
can- - 4 freq
caunny - 2 freq
caa'n - 1 freq
'c'm - 1 freq
'cawin - 1 freq
con - 21 freq
ca'in - 8 freq
conn - 14 freq
cunyie - 5 freq
canmnae - 1 freq
cen - 12 freq
chowin - 16 freq
chawin - 42 freq
com - 131 freq
chaine - 6 freq
coin - 46 freq
'cum - 4 freq
canne - 7 freq
caim - 57 freq
cheyn - 4 freq
cheyne - 4 freq
choon - 6 freq
chawen - 2 freq
cone - 16 freq
cn - 2 freq
caam - 11 freq
caaen - 4 freq
cna - 2 freq
caawen - 1 freq
cheeny - 4 freq
cann'a - 2 freq
caine - 1 freq
cheena - 5 freq
caun - 5 freq
'caum - 1 freq
coma - 13 freq
chowan - 2 freq
caa'in - 5 freq
chaaen - 1 freq
chanee - 1 freq
cowan - 7 freq
chon - 3 freq
'cin - 1 freq
cyein - 1 freq
cain - 24 freq
cohen - 1 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
ciné - 2 freq
ca'n - 6 freq
caen - 4 freq
caan - 2 freq
co'm - 1 freq
chine - 2 freq
canae - 28 freq
come' - 2 freq
cammy - 10 freq
cöshin - 2 freq
cam' - 4 freq
cooin - 6 freq
cheinie - 1 freq
canno - 26 freq
cunyo - 2 freq
canno- - 1 freq
chemo - 2 freq
chem - 1 freq
chimhey - 1 freq
chewan - 1 freq
cunna - 1 freq
chewnie - 1 freq
cowin - 1 freq
coney - 2 freq
chean - 8 freq
cheen - 2 freq
cown - 2 freq
chen - 4 freq
cama - 1 freq
cameo - 4 freq
cheyenne - 1 freq
caa-an - 1 freq
camm - 1 freq
cana - 29 freq
cim - 5 freq
cannae' - 1 freq
ca'ain - 1 freq
come - 8 freq
cm - 1 freq
cawn - 1 freq
chainey - 1 freq
cum - 4 freq
coum - 2 freq
cann - 1 freq
can' - 1 freq
come - 32 freq
chinee - 1 freq
cheenie - 1 freq
cheemo - 2 freq
cannae - 2 freq
''cannae - 1 freq
cun - 18 freq
can - 22 freq
cany - 5 freq
comm - 2 freq
chën - 1 freq
chewin' - 1 freq
cam - 1 freq
canni - 6 freq
caani - 1 freq
ceann - 1 freq
can - 1 freq
china - 1 freq
como - 1 freq
comme - 3 freq
cem - 1 freq
chowin' - 1 freq
chane - 1 freq
cunno - 1 freq
cheann - 1 freq
can - 1 freq
cjem - 1 freq
chawi'n - 1 freq
connie - 2 freq
cyn - 1 freq
cmmh - 1 freq
cm - 13 freq
chinny - 1 freq
cannaee - 1 freq
cani - 2 freq
connae - 1 freq
'choon' - 1 freq
cnn - 1 freq
cooney - 2 freq
cmo - 1 freq
cammay - 1 freq
cheyney - 1 freq
canio - 1 freq
ca’in - 1 freq
ccmw - 1 freq
ccwm - 3 freq
‘cone’ - 2 freq
conway - 1 freq
chummie - 1 freq
csum - 1 freq
cwyouim - 1 freq
'comma - 1 freq
comma - 1 freq
cauin - 1 freq
chiyan - 1 freq
chun - 1 freq
cannea - 10 freq
chan - 1 freq
cym - 6 freq
ceen - 2 freq
co'en - 2 freq
cùm - 1 freq
camo - 1 freq
MetaPhone code - SM
some - 4095 freq
same - 1698 freq
seem - 392 freq
sum - 416 freq
sma - 266 freq
smaw - 49 freq
smue - 2 freq
sammy - 44 freq
'sammy - 1 freq
smaa - 123 freq
sea-maw - 2 freq
somme - 12 freq
semi - 20 freq
sam - 364 freq
syme - 22 freq
soum - 10 freq
soum-' - 1 freq
sim - 43 freq
soom - 10 freq
same' - 1 freq
zombie - 11 freq
zomb - 1 freq
'syme - 2 freq
'some - 11 freq
zoom - 19 freq
sma' - 13 freq
saim - 21 freq
smey' - 1 freq
some' - 7 freq
'ssssssaaaaamy - 1 freq
saam - 2 freq
sume - 2 freq
symbo - 1 freq
simba - 1 freq
smmaa - 1 freq
som' - 2 freq
sum' - 1 freq
'same - 3 freq
so'm - 1 freq
some- - 2 freq
sei'm - 2 freq
smoo - 1 freq
som - 2 freq
samba - 1 freq
sm - 4 freq
seam - 6 freq
sam' - 1 freq
zumba - 2 freq
saime - 14 freq
suomi - 2 freq
sámi - 1 freq
sie-maw - 1 freq
xoom - 1 freq
cim - 5 freq
somb - 1 freq
summ - 1 freq
some - 5 freq
soume - 1 freq
seme - 1 freq
sum - 1 freq
some - 11 freq
same - 1 freq
some - 1 freq
summa - 1 freq
same - 1 freq
saem - 4 freq
sommie - 1 freq
sæm - 4 freq
smou - 3 freq
smeu - 2 freq
cem - 1 freq
sime - 1 freq
xm - 1 freq
simmy - 1 freq
xoem - 1 freq
smo - 1 freq
hhsm - 1 freq
ssm - 1 freq
somewhy - 1 freq
smh - 3 freq
zma - 1 freq
zm - 1 freq
xxm - 1 freq
cym - 6 freq
sammi - 1 freq
sem - 1 freq
CIM
Time to execute Levenshtein function - 0.178499 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.391560 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027403 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036539 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000839 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.