A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cab in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cab (0) - 15 freq
caa (1) - 353 freq
cad (1) - 40 freq
ca (1) - 194 freq
caw (1) - 189 freq
cb (1) - 3 freq
cag (1) - 1 freq
ckb (1) - 1 freq
cap (1) - 47 freq
caz (1) - 2 freq
sab (1) - 9 freq
cjb (1) - 5 freq
car (1) - 404 freq
cah (1) - 1 freq
scab (1) - 12 freq
cas (1) - 4 freq
cub (1) - 3 freq
nab (1) - 10 freq
-ab (1) - 1 freq
cak (1) - 1 freq
aab (1) - 3 freq
cav (1) - 1 freq
cac (1) - 1 freq
wab (1) - 126 freq
fab (1) - 18 freq
cab (0) - 15 freq
cb (1) - 3 freq
cub (1) - 3 freq
dab (2) - 58 freq
lab (2) - 29 freq
jab (2) - 20 freq
ab (2) - 25 freq
caf (2) - 2 freq
gab (2) - 32 freq
cbb (2) - 4 freq
ca' (2) - 23 freq
cae (2) - 7 freq
crab (2) - 32 freq
cau (2) - 2 freq
tab (2) - 6 freq
cuby (2) - 1 freq
coyb (2) - 1 freq
cube (2) - 4 freq
cdb (2) - 1 freq
cat (2) - 557 freq
cam (2) - 2618 freq
can (2) - 4706 freq
bab (2) - 13 freq
cal (2) - 29 freq
rab (2) - 169 freq
SoundEx code - C100
cup - 308 freq
chap - 125 freq
cowp - 63 freq
coff - 19 freq
chief - 60 freq
chop - 25 freq
cowboy - 16 freq
cove - 19 freq
coffee - 156 freq
cheap - 51 freq
cap - 47 freq
cope - 36 freq
chubby - 9 freq
coup - 25 freq
cuif - 5 freq
cafe - 31 freq
copy - 97 freq
'chap - 1 freq
cb - 3 freq
cif - 4 freq
cave - 79 freq
cube - 4 freq
chuffie - 3 freq
chip - 41 freq
cuff - 4 freq
chape - 6 freq
cheep - 13 freq
cuppa - 21 freq
cuppie - 27 freq
cauf - 10 freq
coaf - 1 freq
co-op - 15 freq
caip - 3 freq
cop - 4 freq
coapy - 2 freq
choob - 4 freq
chafe - 2 freq
coffey - 3 freq
chave - 3 freq
chippy - 38 freq
cub - 3 freq
choppy - 5 freq
cavie - 2 freq
cheif - 3 freq
chaav - 1 freq
couff - 2 freq
chaw'v - 1 freq
cubbie - 2 freq
chaip - 15 freq
choppie - 6 freq
cheyp - 1 freq
coap - 2 freq
chaff - 7 freq
caff - 13 freq
caif - 1 freq
caup - 11 freq
cape - 11 freq
cowboay' - 1 freq
chef - 13 freq
chappie - 6 freq
'cowp - 3 freq
chippie - 9 freq
chaave - 3 freq
caaf - 4 freq
café - 3 freq
cheap-o - 1 freq
café - 25 freq
cepp - 1 freq
coof - 7 freq
chib - 8 freq
chive - 1 freq
chauve - 7 freq
chaep - 4 freq
cfe - 4 freq
chufa - 1 freq
cab - 15 freq
copie - 17 freq
copp - 1 freq
chiefie - 5 freq
caavie - 2 freq
cv - 4 freq
chivvy - 1 freq
chob - 1 freq
cep - 1 freq
chaffy - 1 freq
cappie - 3 freq
coapie - 4 freq
civvy - 1 freq
caap - 6 freq
cf - 12 freq
chyave - 2 freq
csp - 2 freq
€˜coff - 1 freq
caffy - 1 freq
coopie - 2 freq
coop - 1 freq
€˜cave - 1 freq
€™chief - 1 freq
chouf - 1 freq
€˜copy - 1 freq
cabbie - 3 freq
cuppy - 3 freq
€œcoffee - 1 freq
cava - 1 freq
coaffay - 1 freq
cfp - 2 freq
chappy - 2 freq
caffee - 2 freq
co-opy - 2 freq
co-oopie - 1 freq
chuff - 1 freq
cubby - 1 freq
caffe - 1 freq
chapo - 1 freq
cauve - 1 freq
chapeau - 1 freq
cuv - 3 freq
cappy - 3 freq
caf - 2 freq
cqsveh - 1 freq
ckif - 1 freq
cuhf - 1 freq
coffee” - 1 freq
cav - 1 freq
chippa - 1 freq
cúiv - 1 freq
ccv - 1 freq
‘cove - 1 freq
chav - 3 freq
ckf - 1 freq
cbb - 4 freq
cheffy - 1 freq
cuby - 1 freq
cvu - 1 freq
coyb - 1 freq
chuva - 1 freq
coaffe - 1 freq
czp - 1 freq
cov - 1 freq
cva - 1 freq
cyp - 1 freq
'cfe - 1 freq
cfe' - 1 freq
cfai - 1 freq
ckp - 1 freq
cvh - 1 freq
cbh - 1 freq
coffee' - 1 freq
ccp - 3 freq
cjb - 5 freq
cp - 4 freq
covvy - 16 freq
ckb - 1 freq
ckhf - 1 freq
MetaPhone code - KB
gab - 32 freq
cowboy - 16 freq
gob - 14 freq
cb - 3 freq
gub - 30 freq
cube - 4 freq
gub' - 2 freq
gabbie - 1 freq
cub - 3 freq
cubbie - 2 freq
gbh - 3 freq
cowboay' - 1 freq
cab - 15 freq
qub - 4 freq
go-by - 2 freq
wgbh - 1 freq
gaub - 1 freq
gabe - 1 freq
cabbie - 3 freq
kb - 4 freq
keb - 2 freq
kubby - 1 freq
cubby - 1 freq
qb - 2 freq
hkhb - 1 freq
gooby - 2 freq
qby - 1 freq
cbb - 4 freq
gb - 2 freq
gbbo - 1 freq
cuby - 1 freq
coyb - 1 freq
kabb - 1 freq
cbh - 1 freq
gabby - 1 freq
ckb - 1 freq
CAB
Time to execute Levenshtein function - 0.321518 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.492810 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.068524 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037413 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000808 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.