A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to csum in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
csum (0) - 1 freq
crum (1) - 1 freq
chum (1) - 21 freq
cum (1) - 643 freq
coum (1) - 2 freq
caum (1) - 36 freq
sum (1) - 416 freq
wum (2) - 1 freq
erum (2) - 1 freq
'hum (2) - 2 freq
spm (2) - 1 freq
cscd (2) - 1 freq
colum (2) - 2 freq
caul (2) - 79 freq
crux (2) - 5 freq
nfum (2) - 2 freq
tsgm (2) - 1 freq
clam (2) - 10 freq
crums (2) - 1 freq
sum (2) - 1 freq
cus (2) - 130 freq
cvu (2) - 1 freq
sua (2) - 1 freq
scrum (2) - 2 freq
chu (2) - 8 freq
Double Levenshtein
csum (0) - 1 freq
caum (2) - 36 freq
cosam (2) - 3 freq
coum (2) - 2 freq
sum (2) - 416 freq
chum (2) - 21 freq
cum (2) - 643 freq
crum (2) - 1 freq
ssm (3) - 1 freq
csr (3) - 1 freq
sume (3) - 2 freq
cem (3) - 1 freq
csp (3) - 2 freq
calm (3) - 133 freq
chem (3) - 1 freq
csd (3) - 11 freq
cram (3) - 2 freq
cfm (3) - 1 freq
caulm (3) - 5 freq
sim (3) - 43 freq
cses (3) - 2 freq
cvem (3) - 1 freq
comm (3) - 2 freq
caus (3) - 8 freq
msm (3) - 29 freq
SoundEx code - C500
cam - 2629 freq
can - 4783 freq
come - 3162 freq
canna - 1230 freq
cawin - 62 freq
cannae - 1702 freq
canny - 214 freq
'can - 37 freq
chum - 21 freq
cane - 22 freq
caain - 50 freq
cannie - 93 freq
cum - 643 freq
came - 899 freq
chain - 62 freq
chin - 151 freq
chimney - 22 freq
china - 53 freq
chyne - 10 freq
chaain - 7 freq
cin - 306 freq
cannae-' - 1 freq
chein - 2 freq
'cannae - 11 freq
chewin - 16 freq
'come - 73 freq
coun - 1 freq
chime - 6 freq
cuin - 8 freq
cheenae - 3 freq
'cuin - 2 freq
caum - 36 freq
cun- - 1 freq
con- - 3 freq
can- - 4 freq
caunny - 2 freq
caa'n - 1 freq
'c'm - 1 freq
'cawin - 1 freq
con - 21 freq
ca'in - 8 freq
conn - 14 freq
cunyie - 5 freq
canmnae - 1 freq
cen - 13 freq
chowin - 16 freq
chawin - 42 freq
com - 134 freq
chaine - 6 freq
coin - 46 freq
'cum - 4 freq
canne - 7 freq
caim - 58 freq
cheyn - 4 freq
cheyne - 4 freq
choon - 6 freq
chawen - 2 freq
cone - 16 freq
cn - 2 freq
caam - 11 freq
caaen - 4 freq
cna - 2 freq
caawen - 1 freq
cheeny - 4 freq
cann'a - 2 freq
caine - 1 freq
chan - 2 freq
caunie - 1 freq
cam' - 5 freq
cheena - 5 freq
caun - 5 freq
'caum - 1 freq
coma - 13 freq
chowan - 2 freq
caa'in - 5 freq
chaaen - 1 freq
chanee - 1 freq
cowan - 7 freq
chon - 3 freq
'cin - 1 freq
cyein - 1 freq
cain - 24 freq
cohen - 1 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
ciné - 2 freq
ca'n - 6 freq
caen - 4 freq
caan - 2 freq
co'm - 1 freq
chine - 2 freq
canae - 28 freq
come' - 2 freq
cammy - 10 freq
cöshin - 2 freq
cooin - 6 freq
cheinie - 1 freq
canno - 26 freq
cunyo - 2 freq
canno- - 1 freq
chemo - 2 freq
chem - 1 freq
chimhey - 1 freq
chewan - 1 freq
cunna - 1 freq
chewnie - 1 freq
cowin - 1 freq
coney - 2 freq
chean - 8 freq
cheen - 2 freq
cown - 2 freq
chen - 4 freq
cama - 1 freq
cameo - 4 freq
cheyenne - 1 freq
caa-an - 1 freq
camm - 1 freq
cana - 29 freq
cim - 5 freq
cannae' - 1 freq
ca'ain - 1 freq
come - 8 freq
cm - 1 freq
cawn - 1 freq
chainey - 1 freq
cum - 4 freq
coum - 2 freq
cann - 1 freq
can' - 1 freq
come - 32 freq
chinee - 1 freq
cheenie - 1 freq
cin - 1 freq
cheemo - 2 freq
cannae - 2 freq
''cannae - 1 freq
cun - 18 freq
can - 22 freq
cany - 5 freq
comm - 2 freq
chën - 1 freq
chewin' - 1 freq
cam - 1 freq
canni - 6 freq
caani - 1 freq
ceann - 1 freq
can - 1 freq
china - 1 freq
como - 1 freq
comme - 3 freq
cem - 1 freq
chowin' - 1 freq
chane - 1 freq
cunno - 1 freq
cheann - 1 freq
can - 1 freq
cjem - 1 freq
chawi'n - 1 freq
connie - 2 freq
cyn - 1 freq
cmmh - 1 freq
cm - 13 freq
chinny - 1 freq
cannaee - 1 freq
cani - 2 freq
connae - 1 freq
'choon' - 1 freq
cnn - 1 freq
cooney - 2 freq
cmo - 1 freq
cammay - 1 freq
cheyney - 1 freq
canio - 1 freq
ca’in - 1 freq
ccmw - 1 freq
ccwm - 3 freq
‘cone’ - 2 freq
conway - 1 freq
chummie - 1 freq
csum - 1 freq
cwyouim - 1 freq
'comma - 1 freq
comma - 1 freq
cauin - 1 freq
chiyan - 1 freq
chun - 1 freq
cannea - 10 freq
cym - 6 freq
ceen - 2 freq
co'en - 2 freq
cùm - 1 freq
camo - 1 freq
MetaPhone code - KSM
gzemm - 1 freq
cosam - 3 freq
khsmi - 1 freq
csmb - 1 freq
csum - 1 freq
qasiiimh - 2 freq
Manually curated
CSUM
Time to execute Levenshtein function - 0.218028 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
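The definition above can be sketched in a few lines of Python. This is a minimal dynamic-programming version written for illustration, not the function the site itself runs; the name `levenshtein` and the example words (taken from the result list) are the only assumptions.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep a match
            ))
        previous = current
    return previous[-1]


for word in ("csum", "cum", "scrum"):
    print(word, levenshtein("csum", word))
# prints:
# csum 0
# cum 1
# scrum 2
```

The three distances agree with the bracketed values shown for the same words in the Levenshtein results.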
Time to execute Double Levenshtein function - 0.367062 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
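A rough sketch of the doubled measure, assuming that "vowels" means the letters a, e, i, o and u only (the site may treat y or accented letters differently). The helper names `strip_vowels` and `double_levenshtein` are made up for this example.

```python
VOWELS = set("aeiou")  # assumption: y and accented vowels are not stripped


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with a, e, i, o, u removed (case-insensitive)."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("caum", "cosam", "ssm"):
    print(word, double_levenshtein("csum", word))
# prints:
# caum 2
# cosam 2
# ssm 3
```

Under that assumption the sums match the listed values: cosam differs from csum by two edits on the full words, but both reduce to csm once the vowels are gone, so the second pass adds nothing.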
Time to execute SoundEx function - 0.030988 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
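To show why so many different spellings share the code C500, here is a small sketch of the classic Soundex rules: keep the first letter, map the remaining consonants to digits, drop repeats of the same digit, and pad to four characters. It is an illustrative implementation, not the site's own; dropping non-ASCII characters before encoding is an assumption made here.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: anything outside a-z (apostrophes, accents) is simply ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit after it counts
    return result.ljust(4, "0")


for word in ("csum", "cam", "chimney", "cohen"):
    print(word, soundex(word))
# prints:
# csum C500
# cam C500
# chimney C500
# cohen C500
```

In csum the s is dropped because it has the same digit as the initial c, which is why the code is C500 rather than C250.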
Time to execute MetaPhone function - 0.038990 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001208 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
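A lookup of this kind might be sketched as a plain dictionary from word form to lemma. The entries below are hypothetical, invented purely to show the shape of the table; they are not taken from the site's lexicon, and the function name `curated_variants` is likewise an assumption.

```python
# Hypothetical entries for illustration only; the real lexicon is not shown here.
LEMMA_LOOKUP = {
    "canna": "cannae",
    "canae": "cannae",
    "cannie": "canny",
}


def curated_variants(word: str, lookup: dict = LEMMA_LOOKUP) -> list:
    """Return the lemma and every form linked to it, or [] if the word is not covered."""
    key = word.lower()
    if key in lookup:
        lemma = lookup[key]        # the word is a listed variant
    elif key in lookup.values():
        lemma = key                # the word is itself a lemma
    else:
        return []                  # not all words are covered
    forms = {form for form, target in lookup.items() if target == lemma}
    return sorted(forms | {lemma})


print(curated_variants("canna"))  # ['canae', 'canna', 'cannae']
print(curated_variants("csum"))   # []
```

A dictionary lookup involves no string comparison at all, which fits with this being the fastest of the five timings.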