A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to csd in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
csd (0) - 11 freq
sd (1) - 7 freq
csi (1) - 3 freq
cqd (1) - 1 freq
cnd (1) - 8 freq
cd (1) - 50 freq
isd (1) - 1 freq
csa (1) - 1 freq
usd (1) - 1 freq
esd (1) - 1 freq
cscd (1) - 1 freq
vsd (1) - 1 freq
cs (1) - 5 freq
lsd (1) - 2 freq
cud (1) - 955 freq
cod (1) - 20 freq
ssd (1) - 12 freq
csp (1) - 2 freq
cpd (1) - 3 freq
cbd (1) - 1 freq
cid (1) - 25 freq
cad (1) - 40 freq
csw (1) - 2 freq
csr (1) - 1 freq
asd (1) - 1 freq
csd (0) - 11 freq
csp (2) - 2 freq
ssd (2) - 12 freq
cud (2) - 955 freq
cpd (2) - 3 freq
cod (2) - 20 freq
cad (2) - 40 freq
asd (2) - 1 freq
csr (2) - 1 freq
csw (2) - 2 freq
lsd (2) - 2 freq
cid (2) - 25 freq
cbd (2) - 1 freq
cqd (2) - 1 freq
cd (2) - 50 freq
csi (2) - 3 freq
sd (2) - 7 freq
cs (2) - 5 freq
isd (2) - 1 freq
cnd (2) - 8 freq
esd (2) - 1 freq
vsd (2) - 1 freq
usd (2) - 1 freq
cscd (2) - 1 freq
csa (2) - 1 freq
SoundEx code - C300
cuddy - 55 freq
cut - 461 freq
city - 288 freq
cuid - 810 freq
caa'd - 64 freq
couthy - 23 freq
cawed - 263 freq
cuttie - 30 freq
coat - 165 freq
cathy - 146 freq
cawd - 33 freq
chat - 71 freq
cat - 569 freq
chatte - 6 freq
cute - 41 freq
cat'd - 1 freq
ceety - 28 freq
coud - 152 freq
cowed - 7 freq
cuddie - 43 freq
ca'd - 34 freq
coda - 3 freq
'coud - 1 freq
ceity - 22 freq
cd - 50 freq
'cuid - 7 freq
'cud - 2 freq
cud - 955 freq
caad - 306 freq
cad - 40 freq
chute - 15 freq
cheat - 6 freq
caad- - 1 freq
cheetie - 23 freq
cou-the - 1 freq
couthie - 71 freq
cutty - 33 freq
cott - 4 freq
cuitie - 1 freq
cot - 44 freq
cid - 25 freq
catty - 27 freq
cae'd - 1 freq
chewed - 6 freq
city' - 2 freq
chuddie - 5 freq
chide - 4 freq
chatty - 6 freq
coothie - 4 freq
chit - 4 freq
ca'ed - 76 freq
cattie - 2 freq
cwid - 99 freq
cod - 20 freq
ca-ed - 1 freq
caaed - 95 freq
coohide - 1 freq
ceetie - 3 freq
chowed - 9 freq
cood - 257 freq
cit - 4 freq
cyedeea - 1 freq
cooda - 3 freq
ct - 8 freq
chaad - 3 freq
chawed - 17 freq
code - 38 freq
caud - 7 freq
chowdie - 1 freq
cathai - 1 freq
'cathy - 13 freq
chad - 3 freq
'cut - 5 freq
chaaed - 6 freq
cawit - 1 freq
ca'ad - 11 freq
csd - 11 freq
cut' - 2 freq
caa't - 4 freq
caa'ed - 5 freq
couttie - 12 freq
chewit - 1 freq
couid - 1 freq
chowit - 1 freq
chat-e-a-u - 2 freq
cha-too - 2 freq
cutte - 1 freq
cïtie - 6 freq
citie - 10 freq
chait - 1 freq
cudda - 2 freq
cóat - 1 freq
chaithie - 1 freq
ca't - 1 freq
cat' - 1 freq
cath - 4 freq
cato - 1 freq
catia - 1 freq
cwyte - 3 freq
chett - 1 freq
cöst - 3 freq
'cood - 1 freq
cite - 2 freq
'city - 1 freq
château - 4 freq
'château - 4 freq
cowid - 3 freq
coot - 2 freq
'cutty - 1 freq
coit - 7 freq
cutt - 3 freq
codd - 7 freq
ceti - 1 freq
'cut' - 1 freq
cüst - 2 freq
caeth - 1 freq
cheatie - 1 freq
coatie - 5 freq
cuida - 2 freq
caat - 9 freq
choweit - 1 freq
cøt - 1 freq
cewid - 1 freq
ceitie - 19 freq
cathie - 1 freq
caed - 4 freq
cowt - 2 freq
cweed - 3 freq
€žcuddy - 1 freq
cootie - 1 freq
€™-cat - 1 freq
couth - 1 freq
cuit - 2 freq
chuddy - 2 freq
cude - 1 freq
caddie - 2 freq
cahute - 1 freq
€˜cat - 1 freq
cote - 4 freq
€œcuid - 2 freq
€˜city - 1 freq
cata - 1 freq
chateau - 2 freq
cathay - 1 freq
€œcatty - 1 freq
€˜catty - 2 freq
€œcat - 2 freq
€œcut - 1 freq
€˜cauthe - 1 freq
caddy - 2 freq
chattie - 1 freq
€˜cut - 1 freq
€œcud - 2 freq
cody - 1 freq
cede - 1 freq
€œcode - 1 freq
chitty - 1 freq
ceud - 1 freq
cscd - 1 freq
cto - 1 freq
“cwid - 1 freq
ceedee - 1 freq
cqhod - 1 freq
cgutwy - 1 freq
cscott - 5 freq
cittie - 2 freq
cctd - 1 freq
caÂ’d - 2 freq
cet - 1 freq
cutie - 4 freq
ctau - 1 freq
ctw - 1 freq
couddae - 1 freq
cate - 2 freq
cait - 2 freq
cqd - 1 freq
cowdie - 2 freq
cutey - 1 freq
MetaPhone code - KST
kissed - 66 freq
kist - 117 freq
cast - 227 freq
'kist - 1 freq
kistie - 14 freq
coast - 119 freq
caused - 78 freq
guessed - 31 freq
kisst - 6 freq
gast - 5 freq
cost - 114 freq
gust - 18 freq
cuist - 14 freq
gazed - 15 freq
gowstie - 7 freq
guest - 42 freq
gazette - 3 freq
kest - 18 freq
cassidy - 4 freq
keest - 3 freq
cassette - 3 freq
costa - 7 freq
quest - 18 freq
goustie - 4 freq
'goustie - 1 freq
caist - 4 freq
guisit - 1 freq
causit - 3 freq
gceid - 1 freq
gassed - 3 freq
gcid - 4 freq
goast - 1 freq
casset - 2 freq
gowsty - 5 freq
'c'est - 1 freq
caste - 2 freq
cassat - 1 freq
'cost - 1 freq
keistie - 1 freq
caased - 6 freq
guesst - 5 freq
csd - 11 freq
quayside - 3 freq
kïsst - 2 freq
kïst - 1 freq
cosset - 1 freq
gusto - 6 freq
cöst - 3 freq
c'est - 1 freq
kist' - 5 freq
coist - 4 freq
cüst - 2 freq
gaist - 2 freq
kist'' - 1 freq
kost - 1 freq
costo - 1 freq
coost - 1 freq
cosst - 1 freq
gussied - 1 freq
gustie - 3 freq
coasta - 1 freq
€˜kist - 1 freq
€œkist - 2 freq
gusset - 1 freq
cousteau - 1 freq
€˜kssst - 1 freq
gusd - 1 freq
goest - 1 freq
coste - 1 freq
kis't - 1 freq
quizzed - 1 freq
costie - 1 freq
qzd - 1 freq
qzitw - 1 freq
'cast' - 1 freq
qzt - 1 freq
gzet - 1 freq
kstew - 1 freq
yxd - 1 freq
CSD
Time to execute Levenshtein function - 0.184543 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.389733 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.064905 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.040606 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001076 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.