A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cds in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cds (0) - 20 freq
cus (1) - 130 freq
ids (1) - 63 freq
cods (1) - 2 freq
cd (1) - 50 freq
cs (1) - 5 freq
tds (1) - 3 freq
ads (1) - 9 freq
rds (1) - 1 freq
cdms (1) - 1 freq
eds (1) - 9 freq
ds (1) - 9 freq
cdb (1) - 1 freq
cts (1) - 1 freq
cd's (1) - 2 freq
cas (1) - 4 freq
c's (1) - 1 freq
cos (1) - 456 freq
cis (1) - 26 freq
cdv (1) - 1 freq
cvs (1) - 1 freq
dm (2) - 12 freq
casa (2) - 1 freq
ess (2) - 258 freq
ldq (2) - 1 freq
cds (0) - 20 freq
cods (1) - 2 freq
cos (2) - 456 freq
c's (2) - 1 freq
cd's (2) - 2 freq
cis (2) - 26 freq
cas (2) - 4 freq
cvs (2) - 1 freq
coads (2) - 1 freq
codes (2) - 4 freq
cts (2) - 1 freq
acids (2) - 1 freq
cdv (2) - 1 freq
cus (2) - 130 freq
cd (2) - 50 freq
ids (2) - 63 freq
cdb (2) - 1 freq
cs (2) - 5 freq
tds (2) - 3 freq
eds (2) - 9 freq
ds (2) - 9 freq
cdms (2) - 1 freq
rds (2) - 1 freq
ads (2) - 9 freq
cude (3) - 1 freq
SoundEx code - C320
cottage - 49 freq
catch - 353 freq
cities - 43 freq
city's - 10 freq
cats - 124 freq
cat's - 32 freq
ceeties - 7 freq
cuddies - 49 freq
cuddie's - 10 freq
codes - 4 freq
cuts - 45 freq
coats - 28 freq
chats - 3 freq
cotch - 8 freq
cds - 20 freq
cadiz - 2 freq
cute's - 1 freq
cd's - 2 freq
cothous - 9 freq
cot-hous - 2 freq
cahoots - 2 freq
chat's - 1 freq
cuits - 2 freq
cuddies' - 2 freq
cut's - 2 freq
cit's - 1 freq
cathoose - 1 freq
cautch - 1 freq
coat's - 4 freq
cathy's - 27 freq
'cathy's - 2 freq
'cuddies - 1 freq
citz - 2 freq
cautious - 8 freq
cheats - 2 freq
cats' - 2 freq
catties - 3 freq
c-c-d's - 1 freq
'cheats' - 2 freq
couttie's - 3 freq
chotce - 1 freq
cïties - 2 freq
cottage' - 1 freq
'catch - 2 freq
cuithes - 4 freq
cots - 2 freq
cadgy - 1 freq
châteaus - 1 freq
chates - 1 freq
cadgie - 2 freq
cahootchie - 2 freq
catchy - 3 freq
cotts - 3 freq
cöts - 3 freq
caddies - 2 freq
chutes - 1 freq
cutties - 3 freq
cyties - 1 freq
cities' - 1 freq
cits - 1 freq
'catchie' - 1 freq
cites - 12 freq
caats - 4 freq
codgie - 2 freq
cowtious - 1 freq
cahoutchy - 1 freq
cootch - 3 freq
ceities - 8 freq
codds - 2 freq
cadge - 2 freq
coits - 1 freq
coutch - 1 freq
cutesy - 1 freq
chits - 1 freq
cíties - 1 freq
coattage - 1 freq
cooties - 1 freq
chaotic - 3 freq
coots - 1 freq
cods - 2 freq
€˜cuddies - 1 freq
coads - 1 freq
catchie - 2 freq
cottige - 1 freq
ceuithes - 1 freq
cattie's - 1 freq
czdq - 1 freq
caddis - 1 freq
cts - 1 freq
cuddy's - 1 freq
cattyish - 20 freq
cuddys - 1 freq
chdk - 1 freq
cyatcy - 1 freq
czdxi - 1 freq
cuties - 2 freq
coutts - 1 freq
caithess - 1 freq
ctdg - 1 freq
cedk - 1 freq
catwawk - 1 freq
MetaPhone code - KTS
goad's - 11 freq
cats - 124 freq
cat's - 32 freq
kites - 6 freq
guts - 73 freq
gods - 73 freq
guids - 24 freq
cuddies - 49 freq
cuddie's - 10 freq
gutsy - 7 freq
gaits - 30 freq
god's - 127 freq
codes - 4 freq
cuts - 45 freq
kate's - 26 freq
gates - 104 freq
guides - 15 freq
quids' - 1 freq
kiddies - 2 freq
gutties - 11 freq
coats - 28 freq
kids - 87 freq
goats - 17 freq
goddess - 53 freq
quits - 3 freq
goods - 21 freq
gads - 8 freq
cds - 20 freq
quotes - 15 freq
cadiz - 2 freq
kitty's - 1 freq
cute's - 1 freq
good's - 3 freq
cd's - 2 freq
gtice - 1 freq
goads - 30 freq
'goads - 1 freq
goads' - 1 freq
goodies - 6 freq
cuits - 2 freq
cuddies' - 2 freq
goddis - 3 freq
gait's - 4 freq
cut's - 2 freq
gut's - 4 freq
gats - 1 freq
coat's - 4 freq
kits - 4 freq
guid's - 4 freq
gowdie's - 1 freq
'cuddies - 1 freq
gode's - 1 freq
'gutsie - 1 freq
queets - 6 freq
cats' - 2 freq
catties - 3 freq
ket's - 1 freq
kidz - 1 freq
couttie's - 3 freq
cïties - 2 freq
gaudi's - 1 freq
kytes - 2 freq
kat's - 15 freq
'kat's - 1 freq
cots - 2 freq
göd's - 2 freq
goat's - 2 freq
göds - 1 freq
gaets - 4 freq
godes - 2 freq
'god's - 2 freq
cotts - 3 freq
gude's - 1 freq
godds - 1 freq
cöts - 3 freq
caddies - 2 freq
cutties - 3 freq
gaetes - 1 freq
quotas - 4 freq
kuts - 1 freq
gate's - 1 freq
caats - 4 freq
'katze - 1 freq
gowdies - 2 freq
guidis - 1 freq
quytes - 1 freq
gutsie - 2 freq
'gad's - 1 freq
€˜quotes - 1 freq
guds - 1 freq
codds - 2 freq
gowds - 1 freq
coits - 1 freq
gudis - 3 freq
kidds - 2 freq
€œgaits - 1 freq
cutesy - 1 freq
cíties - 1 freq
wkds - 2 freq
cooties - 1 freq
kudos - 2 freq
gods' - 1 freq
coots - 1 freq
godddess - 1 freq
goddesss - 1 freq
cods - 2 freq
queats - 2 freq
€˜cuddies - 1 freq
€œgods - 2 freq
quats - 1 freq
coads - 1 freq
€œgoddis - 1 freq
€œgads - 1 freq
cattie's - 1 freq
gawds - 3 freq
gtz - 1 freq
caddis - 1 freq
qwwdci - 1 freq
godÂ’s - 3 freq
kdaz - 1 freq
cts - 1 freq
cuddy's - 1 freq
kdz - 1 freq
gadz - 1 freq
gouts - 1 freq
gout's - 1 freq
cuddys - 1 freq
kid's - 1 freq
wgd's - 1 freq
ketts - 1 freq
cuties - 2 freq
coutts - 1 freq
ktze - 1 freq
gatesÂ’ - 1 freq
CDS
Time to execute Levenshtein function - 0.163050 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.363126 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027731 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036930 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000939 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.