A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to caap in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
caap (0) - 6 freq
chap (1) - 126 freq
haap (1) - 1 freq
caip (1) - 3 freq
oaap (1) - 1 freq
caan (1) - 2 freq
camp (1) - 53 freq
crap (1) - 85 freq
cap (1) - 47 freq
caaf (1) - 4 freq
aap (1) - 26 freq
caa (1) - 353 freq
caup (1) - 11 freq
caai (1) - 1 freq
caar (1) - 1 freq
caam (1) - 11 freq
carp (1) - 1 freq
caas (1) - 44 freq
caad (1) - 306 freq
caat (1) - 9 freq
clap (1) - 56 freq
coap (1) - 2 freq
gaap (1) - 1 freq
yaap (1) - 1 freq
caal (1) - 98 freq
Double Levenshtein
caap (0) - 6 freq
cap (1) - 47 freq
caip (1) - 3 freq
caup (1) - 11 freq
coap (1) - 2 freq
cop (2) - 4 freq
yaap (2) - 1 freq
gaap (2) - 1 freq
cep (2) - 1 freq
caal (2) - 98 freq
coapy (2) - 2 freq
cyp (2) - 1 freq
cup (2) - 316 freq
coop (2) - 1 freq
cape (2) - 14 freq
cp (2) - 4 freq
caat (2) - 9 freq
coup (2) - 25 freq
capo (2) - 2 freq
clap (2) - 56 freq
aap (2) - 26 freq
haap (2) - 1 freq
caaf (2) - 4 freq
crap (2) - 85 freq
camp (2) - 53 freq
SoundEx code - C100
cup - 316 freq
chap - 126 freq
cowp - 64 freq
coff - 19 freq
chief - 60 freq
chop - 25 freq
cowboy - 18 freq
cove - 20 freq
coffee - 173 freq
cheap - 55 freq
cap - 47 freq
cope - 36 freq
chubby - 9 freq
coup - 25 freq
cuif - 5 freq
cafe - 31 freq
copy - 100 freq
'chap - 1 freq
cb - 3 freq
cif - 4 freq
cave - 79 freq
cube - 5 freq
chuffie - 3 freq
chip - 42 freq
cuff - 4 freq
chape - 6 freq
cheep - 13 freq
cuppa - 21 freq
cuppie - 27 freq
cauf - 10 freq
coaf - 1 freq
co-op - 15 freq
caip - 3 freq
cop - 4 freq
coapy - 2 freq
choob - 4 freq
chafe - 2 freq
coffey - 3 freq
chave - 3 freq
chippy - 38 freq
cub - 3 freq
choppy - 6 freq
cavie - 2 freq
cheif - 3 freq
chaav - 1 freq
couff - 2 freq
chaw'v - 1 freq
cubbie - 2 freq
chaip - 15 freq
choppie - 6 freq
cheyp - 1 freq
coap - 2 freq
chaff - 8 freq
cape - 14 freq
capo - 2 freq
chippie - 10 freq
cab - 16 freq
caff - 13 freq
caif - 1 freq
caup - 11 freq
cowboay' - 1 freq
chef - 13 freq
chappie - 6 freq
'cowp - 3 freq
chaave - 3 freq
caaf - 4 freq
café - 3 freq
cheap-o - 1 freq
café - 25 freq
cepp - 1 freq
coof - 7 freq
chib - 8 freq
chive - 1 freq
chauve - 7 freq
chaep - 4 freq
cfe - 4 freq
chufa - 1 freq
copie - 17 freq
copp - 1 freq
chiefie - 5 freq
caavie - 2 freq
cv - 4 freq
chivvy - 1 freq
chob - 1 freq
cep - 1 freq
chaffy - 1 freq
cappie - 3 freq
coapie - 4 freq
civvy - 1 freq
caap - 6 freq
cf - 12 freq
chyave - 2 freq
csp - 2 freq
‘coff - 1 freq
caffy - 1 freq
coopie - 2 freq
coop - 1 freq
‘cave - 1 freq
’chief - 1 freq
chouf - 1 freq
‘copy - 1 freq
cabbie - 3 freq
cuppy - 3 freq
“coffee - 1 freq
cava - 1 freq
coaffay - 1 freq
cfp - 2 freq
chappy - 2 freq
caffee - 2 freq
co-opy - 2 freq
co-oopie - 1 freq
chuff - 1 freq
cubby - 1 freq
caffe - 1 freq
chapo - 1 freq
cauve - 1 freq
chapeau - 1 freq
cuv - 3 freq
cappy - 3 freq
caf - 2 freq
cqsveh - 1 freq
ckif - 1 freq
cuhf - 1 freq
coffee” - 1 freq
cav - 1 freq
chippa - 1 freq
cúiv - 1 freq
ccv - 1 freq
‘cove - 1 freq
chav - 3 freq
ckf - 1 freq
cbb - 4 freq
cheffy - 1 freq
cuby - 1 freq
cvu - 1 freq
coyb - 1 freq
chuva - 1 freq
coaffe - 1 freq
czp - 1 freq
cov - 1 freq
cva - 1 freq
cyp - 1 freq
'cfe - 1 freq
cfe' - 1 freq
cfai - 1 freq
ckp - 1 freq
cvh - 1 freq
cbh - 1 freq
coffee' - 1 freq
ccp - 3 freq
cjb - 5 freq
cp - 4 freq
covvy - 16 freq
ckb - 1 freq
ckhf - 1 freq
MetaPhone code - KP
cup - 316 freq
keep - 1547 freq
cowp - 64 freq
kep - 136 freq
kip - 38 freq
gap - 49 freq
cap - 47 freq
cope - 36 freq
gawp - 9 freq
coup - 25 freq
copy - 100 freq
'keep - 23 freq
gaup - 4 freq
cuppa - 21 freq
cuppie - 27 freq
co-op - 15 freq
caip - 3 freq
quip - 2 freq
cop - 4 freq
keip - 4 freq
coapy - 2 freq
gpo - 4 freq
coap - 2 freq
cape - 14 freq
capo - 2 freq
gowp - 65 freq
caup - 11 freq
'cowp - 3 freq
gp - 9 freq
keepy - 1 freq
kap - 1 freq
'kappa' - 1 freq
kowp - 1 freq
kop - 2 freq
kepe - 4 freq
copie - 17 freq
copp - 1 freq
cappie - 3 freq
coapie - 4 freq
kep' - 2 freq
kaip - 2 freq
caap - 6 freq
gaip - 1 freq
gup - 2 freq
’kiep - 1 freq
gaap - 1 freq
kypie - 1 freq
gape - 4 freq
coopie - 2 freq
“keep - 8 freq
coop - 1 freq
keppy - 1 freq
‘keep - 2 freq
kappa - 1 freq
‘copy - 1 freq
cuppy - 3 freq
’keep - 1 freq
co-opy - 2 freq
co-oopie - 1 freq
cappy - 3 freq
kaypee - 2 freq
kp - 6 freq
qp - 7 freq
qpo - 1 freq
ykypy - 1 freq
hqpw - 1 freq
ckp - 1 freq
cp - 4 freq
kapo - 1 freq
CAAP
Time to execute Levenshtein function - 0.195861 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
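As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is simply chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep a match)
            ))
        previous = current
    return previous[-1]


print(levenshtein("caap", "chap"))  # one replacement
print(levenshtein("caap", "cap"))   # one deletion
```

The two calls correspond to the entries chap (1) and cap (1) in the Levenshtein list for caap.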
Time to execute Double Levenshtein function - 0.327215 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
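The idea can be sketched in Python as follows. This is a guess at the approach, not the site's implementation: the names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set is an assumption. Treating y as a vowel is inferred from the listed scores (cyp only comes out at 2 for caap if y is dropped along with the other vowels).

```python
VOWELS = set("aeiouy")  # assumption: y counts as a vowel (inferred from the output)


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for other in ("cap", "cop", "camp", "cyp"):
    print(other, double_levenshtein("caap", other))
```

Under these assumptions cap scores 1 and cop, camp and cyp score 2, which agrees with the second list for caap. A vowel-only change such as caap to caup costs 1, while a consonant change such as caap to caal is counted in both passes and costs 2.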
Time to execute SoundEx function - 0.027996 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
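For readers who want to see how a code like C100 arises, here is a minimal Python sketch of the usual American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse runs of the same digit, ignore h and w, and pad to four characters. It is an illustration only, and skipping every character outside a-z is an assumption about how punctuation and accented letters are handled.

```python
# Digit classes of American Soundex; vowels and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]  # assumption: skip the rest
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")  # the first letter's digit also blocks repeats
    for ch in letters[1:]:
        if ch in "hw":
            continue                  # h and w do not separate equal digits
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code                   # a vowel resets the run
        if len(result) == 4:
            break
    return (result + "000")[:4]


for word in ("caap", "chap", "cowboy", "cqsveh"):
    print(word, soundex(word))
```

All four words come out as C100, which is why entries as different as cowboy and cqsveh share a list with caap: after the initial c, the only letter that contributes a digit is a single b, f, p or v.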
Time to execute MetaPhone function - 0.059165 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000864 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
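A lookup of this kind can be sketched in a few lines of Python. The table below is a stand-in, not the real lexicon: its single entry assumes that the upper-case CAAP shown above is the curated result for caap, and the names `LEXICON` and `curated_lemma` are hypothetical.

```python
from typing import Optional

# Hypothetical stand-in for the hand-built lexicon: surface form -> lemma.
LEXICON = {
    "caap": "CAAP",  # assumption: taken from the result shown for caap
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("caap"))
print(curated_lemma("ahint"))  # not in this toy table, so None
```

A plain dictionary lookup does no string comparison against the rest of the corpus, which would fit with this method having the smallest execution time of the five.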