A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to csi in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
csi (0) - 3 freq
csa (1) - 1 freq
chi (1) - 8 freq
cxi (1) - 1 freq
csp (1) - 2 freq
ci (1) - 4 freq
dsi (1) - 1 freq
cei (1) - 1 freq
csw (1) - 2 freq
cs (1) - 5 freq
cgi (1) - 3 freq
si (1) - 43 freq
csd (1) - 11 freq
ysi (1) - 1 freq
csr (1) - 1 freq
coi (1) - 1 freq
hsi (1) - 2 freq
asti (2) - 1 freq
caa (2) - 353 freq
msp (2) - 53 freq
cup (2) - 316 freq
cpg (2) - 20 freq
fsl (2) - 1 freq
cses (2) - 2 freq
sij (2) - 2 freq
csi (0) - 3 freq
cs (1) - 5 freq
csa (1) - 1 freq
cosy (2) - 57 freq
casa (2) - 1 freq
hsi (2) - 2 freq
casie (2) - 3 freq
cosie (2) - 23 freq
case (2) - 447 freq
cusa (2) - 1 freq
cis (2) - 26 freq
cos (2) - 456 freq
cais (2) - 1 freq
cus (2) - 130 freq
coi (2) - 1 freq
cas (2) - 4 freq
cgi (2) - 3 freq
si (2) - 43 freq
csp (2) - 2 freq
cxi (2) - 1 freq
csw (2) - 2 freq
ci (2) - 4 freq
dsi (2) - 1 freq
cei (2) - 1 freq
csr (2) - 1 freq
SoundEx code - C000
caa - 353 freq
chow - 12 freq
caw - 192 freq
ca - 194 freq
chaa - 7 freq
chaw - 24 freq
cow - 41 freq
chew - 14 freq
che - 8 freq
'ca - 2 freq
c - 465 freq
chou - 4 freq
'caw - 7 freq
coo-ee - 1 freq
co - 5195 freq
ce - 7 freq
coo - 190 freq
chey - 1 freq
cue - 10 freq
chihuahua - 4 freq
chi - 8 freq
chae - 7 freq
cou - 8 freq
c-cou - 1 freq
coy - 8 freq
coo' - 1 freq
cow' - 2 freq
chewy - 5 freq
choo - 3 freq
cih - 2 freq
chaw'y - 1 freq
cc - 19 freq
ca' - 24 freq
cau - 3 freq
csi - 3 freq
ceo - 3 freq
cj - 8 freq
'caa - 1 freq
cha - 19 freq
cae - 7 freq
ckie - 1 freq
ck - 14 freq
chihuaha - 1 freq
'cawa - 1 freq
«che - 1 freq
'co' - 1 freq
'c' - 1 freq
cu - 7 freq
cösh - 1 freq
ch - 38 freq
-c - 2 freq
'ch' - 2 freq
chowe - 4 freq
cia - 3 freq
cgi - 3 freq
c'wa - 6 freq
ccea - 1 freq
cce - 1 freq
cs - 5 freq
ca'a - 2 freq
c'é - 1 freq
ci - 4 freq
chu - 8 freq
chao-i - 1 freq
ch'u - 2 freq
chieh - 2 freq
chu-i - 3 freq
c's - 1 freq
°c - 2 freq
€˜caa - 2 freq
€œcow - 1 freq
cowe - 1 freq
caie - 2 freq
coe - 1 freq
€˜ch - 8 freq
€œcoh- - 1 freq
coia - 1 freq
ºc - 3 freq
€œc - 6 freq
€˜c - 5 freq
choie - 1 freq
€“c - 2 freq
cw - 6 freq
€œca - 1 freq
chü-i - 5 freq
€¦coo - 1 freq
chiao - 2 freq
chao - 2 freq
€œcoo - 2 freq
chiu - 1 freq
€˜-ch - 1 freq
chai - 1 freq
€œcaa - 1 freq
czy - 1 freq
€œch - 5 freq
€™c - 2 freq
cah - 1 freq
cx - 5 freq
czj - 1 freq
cz - 5 freq
cqah - 1 freq
cg - 3 freq
chy - 1 freq
“cei - 1 freq
cei - 1 freq
‘c’ - 1 freq
cqu - 1 freq
chh - 1 freq
chowie - 2 freq
cjq - 1 freq
caÂ’ - 1 freq
czxy - 1 freq
“ce - 1 freq
cxg - 1 freq
coi - 1 freq
cy - 4 freq
cxxg - 1 freq
ccyh - 1 freq
cxi - 1 freq
caiw - 1 freq
csw - 2 freq
czw - 1 freq
cxy - 1 freq
cqe - 1 freq
cghee - 1 freq
cjh - 1 freq
cccg - 1 freq
csco - 1 freq
cea - 1 freq
csihi - 1 freq
ciu - 1 freq
c- - 2 freq
caai - 1 freq
csa - 1 freq
cgsa - 1 freq
ckkg - 1 freq
cee - 2 freq
MetaPhone code - KS
gauze - 7 freq
cosy - 57 freq
goes - 331 freq
case - 447 freq
gaze - 44 freq
keys - 63 freq
kis - 142 freq
cause - 1186 freq
guess - 148 freq
goose - 28 freq
kiss - 123 freq
gausie - 3 freq
'cause - 18 freq
gas - 88 freq
queues - 9 freq
coos - 74 freq
cos - 456 freq
quiz - 19 freq
guys - 472 freq
guy's - 13 freq
causey - 25 freq
'quiz - 1 freq
caas - 44 freq
cuz - 8 freq
coz - 44 freq
'cos - 7 freq
cows - 5 freq
gaes - 173 freq
cou's - 1 freq
hkes - 1 freq
caws - 36 freq
'guess' - 1 freq
caus - 8 freq
caz - 2 freq
cassie - 10 freq
guse - 1 freq
quasi - 1 freq
ca's - 5 freq
gous - 1 freq
guys' - 1 freq
queue's - 1 freq
gus - 21 freq
queys - 1 freq
quoys - 24 freq
couse - 23 freq
cus - 130 freq
caise - 6 freq
coasee - 1 freq
gce - 4 freq
gass - 4 freq
guiy's - 1 freq
k's - 1 freq
casey - 7 freq
guise - 17 freq
gays - 2 freq
key's - 1 freq
cosie - 23 freq
causie - 9 freq
csi - 3 freq
cow's - 2 freq
cozy - 4 freq
casie - 3 freq
gsoh - 2 freq
'gsoh' - 1 freq
gsoh' - 1 freq
caase - 29 freq
cozzie - 3 freq
coo's - 16 freq
gause - 4 freq
goss - 1 freq
kïss - 3 freq
caause - 2 freq
gawsie - 4 freq
gaius - 1 freq
caa's - 1 freq
cass - 14 freq
keess - 10 freq
kaiys - 1 freq
coose - 8 freq
gös - 2 freq
causay - 2 freq
kay's - 2 freq
gos - 1 freq
kiz - 18 freq
kess - 5 freq
cas - 4 freq
queasy - 2 freq
ga's - 1 freq
gauzy - 1 freq
ccea - 1 freq
cce - 1 freq
cues - 3 freq
cs - 5 freq
kaas - 2 freq
cais - 1 freq
caess - 3 freq
cows' - 1 freq
gøs - 1 freq
kissie - 1 freq
c's - 1 freq
€˜cozie - 1 freq
gaws - 1 freq
queazy - 1 freq
€œguiss - 1 freq
gauss - 2 freq
cace - 1 freq
kace - 2 freq
kies - 1 freq
cusa - 1 freq
€™casey - 1 freq
goosey - 6 freq
kays - 1 freq
kehs - 1 freq
€œcos - 1 freq
cous - 1 freq
caese - 2 freq
€˜case - 1 freq
kezia - 5 freq
€¦cos - 1 freq
€˜cause - 7 freq
€œcause - 2 freq
caius - 3 freq
€™cause - 2 freq
kso - 2 freq
guiss - 1 freq
ks - 3 freq
gows - 2 freq
€˜cos - 4 freq
€˜cus - 1 freq
czy - 1 freq
gz - 3 freq
cz - 5 freq
cooÂ’s - 3 freq
qois - 1 freq
kz - 4 freq
khza - 1 freq
casa - 1 freq
gs - 3 freq
kaz - 3 freq
gaz - 1 freq
qcy - 1 freq
ckhhyys - 1 freq
gaza - 1 freq
hx - 5 freq
cossy - 1 freq
wx - 4 freq
qwz - 1 freq
gaÂ’s - 1 freq
gazza - 3 freq
gsy - 1 freq
cos' - 1 freq
yx - 2 freq
hgggssss - 1 freq
wks - 1 freq
wxaw - 1 freq
kci - 1 freq
wxi - 1 freq
kazoo - 1 freq
kks - 1 freq
hhqs - 1 freq
ccyh - 1 freq
qzw - 1 freq
qs - 2 freq
güs - 1 freq
csw - 2 freq
hxi - 1 freq
qaz - 1 freq
qis - 1 freq
guz - 1 freq
czw - 1 freq
goz - 1 freq
kes - 1 freq
yxa - 1 freq
kiss' - 1 freq
gazzah - 3 freq
qz - 1 freq
csa - 1 freq
hhx - 1 freq
yxx - 1 freq
gassy - 1 freq
CSI
Time to execute Levenshtein function - 0.466300 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.712176 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028111 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.092091 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001044 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.