A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to caw-cannie in Corpus

Levenshtein distance
caw-cannie (0) - 3 freq
'caw-cannie (1) - 1 freq
cawd-cannie (1) - 1 freq
tax-mannie (3) - 3 freq
snaw-mannie (3) - 1 freq
cowp-carlie (4) - 3 freq
uncannie (4) - 14 freq
tax-mannies (4) - 3 freq
pack-mannie (4) - 1 freq
cannie (4) - 92 freq
coomannie (4) - 1 freq
can-can (4) - 3 freq
no-cannin (4) - 1 freq
jaw-bane (4) - 1 freq
taxmannie (4) - 1 freq
crannie (4) - 11 freq
janeannie (4) - 1 freq
a-rinnin (5) - 1 freq
clannies (5) - 1 freq
coo-cake (5) - 1 freq
caw-ins (5) - 1 freq
winnie (5) - 80 freq
paw-phone (5) - 1 freq
connie (5) - 2 freq
cairnie (5) - 2 freq
Double Levenshtein distance
caw-cannie (0) - 3 freq
cawd-cannie (2) - 1 freq
'caw-cannie (2) - 1 freq
snaw-mannie (6) - 1 freq
can-can (6) - 3 freq
tax-mannie (6) - 3 freq
cowp-carlie (7) - 3 freq
crannie (7) - 11 freq
caw-ins (7) - 1 freq
no-cannin (7) - 1 freq
cocainynie (7) - 1 freq
jaw-bane (7) - 1 freq
pack-mannie (7) - 1 freq
uncannie (7) - 14 freq
cannie (7) - 92 freq
coomannie (7) - 1 freq
cocaine (8) - 6 freq
cannae (8) - 1640 freq
whunnie (8) - 1 freq
canni (8) - 6 freq
candy-cane (8) - 1 freq
twennie (8) - 1 freq
carcanet (8) - 2 freq
''cannae (8) - 1 freq
tin-canned (8) - 1 freq
SoundEx code - C250
chasin - 40 freq
cookin - 43 freq
caw-cannie - 3 freq
chokin - 28 freq
coaxin - 5 freq
chuckin - 17 freq
choosin - 13 freq
cushion - 28 freq
cushin - 3 freq
chosen - 48 freq
chuggin - 4 freq
checkin - 44 freq
chicken - 82 freq
'chicken - 1 freq
chuckie-hen - 1 freq
cousin - 100 freq
causin - 14 freq
'caw-cannie - 1 freq
cuikin - 7 freq
cuisin - 1 freq
caukin - 1 freq
cocaine - 6 freq
cassen - 27 freq
'cousin' - 1 freq
coughin - 14 freq
chucken - 17 freq
chasan - 2 freq
chusin - 4 freq
checkin' - 4 freq
chasin' - 1 freq
causen - 3 freq
chukkin - 1 freq
chukken - 4 freq
caasen - 1 freq
cooken - 1 freq
couken - 3 freq
cooshen - 1 freq
chasm - 2 freq
cizzin - 1 freq
cookin' - 3 freq
caizzen - 1 freq
'chasin - 1 freq
cuisine - 6 freq
cowkin - 2 freq
chuckeny - 1 freq
cashin - 1 freq
casino - 9 freq
chozen - 1 freq
caasin - 3 freq
casin - 3 freq
cosam - 3 freq
cockin - 8 freq
'chucken - 1 freq
chechen - 1 freq
cassin - 4 freq
chysen - 1 freq
coaxan - 3 freq
coagin - 1 freq
cohesion - 8 freq
chackin - 3 freq
cookeen - 3 freq
cocoon - 2 freq
cookan - 2 freq
checkan - 3 freq
chuckan - 1 freq
cuzzin - 1 freq
cuckin - 1 freq
cheussan - 2 freq
coseen - 1 freq
chossen - 1 freq
chowkin - 1 freq
chookie-hen - 1 freq
chusen - 2 freq
cockney - 8 freq
cuissen - 1 freq
choosan - 2 freq
chokan - 1 freq
chokkin - 1 freq
cheisen - 1 freq
chuisen - 1 freq
cassini - 2 freq
chuisin - 4 freq
‘chukken - 1 freq
cocainyie - 1 freq
chakkin - 1 freq
cessioun - 1 freq
“chicken - 3 freq
cackin - 1 freq
coogan - 1 freq
coushin - 1 freq
coccoon - 1 freq
cassino - 2 freq
chasni - 2 freq
chukin - 1 freq
chiichan - 1 freq
coachin - 1 freq
chejoanna - 2 freq
chegwin - 1 freq
cooshion - 1 freq
choakin - 1 freq
cooshin - 1 freq
MetaPhone code - KKN
keekin - 126 freq
cookin - 43 freq
caw-cannie - 3 freq
kickin - 64 freq
gawkin - 13 freq
kickin' - 2 freq
'caw-cannie - 1 freq
cuikin - 7 freq
caukin - 1 freq
keikin - 10 freq
cocaine - 6 freq
k-ken - 1 freq
gaggin - 7 freq
cooken - 1 freq
couken - 3 freq
cookin' - 3 freq
cowkin - 2 freq
quakin - 3 freq
quaik'an - 1 freq
keckin - 1 freq
cockin - 8 freq
quicken - 4 freq
cookeen - 3 freq
cocoon - 2 freq
cookan - 2 freq
cuckin - 1 freq
gokkan - 1 freq
gawkan - 1 freq
kickan - 2 freq
cockney - 8 freq
keekan - 2 freq
gowkin - 2 freq
kowkin - 1 freq
cackin - 1 freq
coogan - 1 freq
kookin - 1 freq
kickoan - 1 freq
keegan - 2 freq
kickin’ - 1 freq
CAW-CANNIE
Time to execute Levenshtein function - 0.354891 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
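As an illustration of the definition above, here is a minimal Python sketch of the distance and of a nearest-neighbour ranking built on it. It is not the code behind this page; the function names `levenshtein` and `nearest` and the tiny word list are assumptions made for the example.

```python
from typing import List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Count the single-character inserts, deletes and replacements needed to turn a into b."""
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            replace_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                 # delete char_a
                current[j - 1] + 1,              # insert char_b
                previous[j - 1] + replace_cost,  # replace (or keep) the character
            ))
        previous = current
    return previous[-1]


def nearest(query: str, vocabulary: List[str], limit: int = 25) -> List[Tuple[str, int]]:
    """Rank vocabulary words by distance to query, closest first (hypothetical helper)."""
    scored = [(word, levenshtein(query, word)) for word in vocabulary]
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))[:limit]


# Illustrative word list taken from the results shown on this page.
words = ["tax-mannie", "cawd-cannie", "cannie", "caw-cannie"]
print(nearest("caw-cannie", words, limit=3))
```

Only two rows of the distance table are kept at a time, so memory grows with the length of the second word rather than with the product of both lengths.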
Time to execute Double Levenshtein function - 0.399551 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
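The two-pass idea can be sketched in Python as follows. This is a hedged reconstruction from the description, not the page's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set (a, e, i, o, u, y) is an assumption.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: inserts, deletes and replacements of single characters."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            replace_cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + replace_cost))
        previous = current
    return previous[-1]


# Assumption: which letters count as vowels is a guess, not taken from the source.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop every vowel, keeping consonants and punctuation such as hyphens."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Sum of the distance on the full words and the distance on their vowel-less forms."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("caw-cannie", "cawd-cannie"))  # 2
print(double_levenshtein("caw-cannie", "cannie"))       # 7
```

A consonant change is counted in both passes while a vowel change is counted only in the first, which is why spelling variants that differ only in their vowels rank closer under this measure.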
Time to execute SoundEx function - 0.028889 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
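For readers who want to see how such a code is derived, below is a minimal Python sketch of the common American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse repeated digits, and pad to four characters. It is an assumption that the page uses this exact variant; the function name `soundex` and the choice to ignore non-letters such as hyphens and apostrophes are part of the sketch.

```python
# Digit assigned to each consonant group; vowels, h, w and y have no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or an empty string if word has no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last_digit = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w are skipped and do not separate repeated digits
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != last_digit:
            code += digit
        last_digit = digit  # a vowel resets this, so the same digit may repeat after it
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(soundex("caw-cannie"))  # C250
print(soundex("chicken"))     # C250
```

Because only the first letter and up to three consonant classes survive, very different words can share a code, which is consistent with the long and varied C250 list above.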
Time to execute MetaPhone function - 0.037059 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000863 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
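A lookup of this kind can be sketched in Python as a plain dictionary from surface forms to lemmas. Everything here is hypothetical: the `LEXICON` entries are illustrative and not taken from the corpus's real table, and the helper names `lookup_lemma` and `variants_of` are invented for the example.

```python
from typing import Dict, List, Optional

# Hypothetical hand-built lexicon: surface form -> lemma (illustrative entries only).
LEXICON: Dict[str, str] = {
    "caw-cannie": "CAW-CANNIE",
    "'caw-cannie": "CAW-CANNIE",
    "cawd-cannie": "CAW-CANNIE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the lemma for word, or None when the lexicon does not cover it."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> List[str]:
    """List the other surface forms that share a lemma with word."""
    lemma = lookup_lemma(word)
    if lemma is None:
        return []
    return sorted(form for form, target in LEXICON.items()
                  if target == lemma and form != word.lower())


print(lookup_lemma("cawd-cannie"))  # CAW-CANNIE
print(lookup_lemma("sonsie"))       # None, because this toy lexicon does not cover it
```

A dictionary lookup does no string comparison across the vocabulary, which fits the very short execution time reported for this method, at the cost of covering only the words someone has entered by hand.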