A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cousins in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cousins (0) - 46 freq
cousin's (1) - 15 freq
cousin (1) - 100 freq
coupin (2) - 7 freq
couping (2) - 1 freq
cursins (2) - 1 freq
causing (2) - 8 freq
causin (2) - 14 freq
consis (2) - 1 freq
crusin (2) - 1 freq
housing (2) - 4 freq
cousinÂ’s (2) - 1 freq
cosies (2) - 1 freq
coupons (2) - 10 freq
cousinly (2) - 1 freq
cuisins (2) - 1 freq
musins (2) - 2 freq
'cousin' (2) - 1 freq
coins (2) - 50 freq
causies (2) - 6 freq
consi's (2) - 1 freq
collins (2) - 3 freq
courin (2) - 1 freq
codlins (2) - 4 freq
chusin (2) - 4 freq
cousins (0) - 46 freq
cousin's (2) - 15 freq
cousin (2) - 100 freq
cuisins (2) - 1 freq
musins (3) - 2 freq
casinos (3) - 1 freq
cousinly (3) - 1 freq
causies (3) - 6 freq
comins (3) - 4 freq
cosies (3) - 1 freq
coins (3) - 50 freq
coupons (3) - 10 freq
causing (3) - 8 freq
causin (3) - 14 freq
cursins (3) - 1 freq
poisons (4) - 1 freq
crouns (4) - 4 freq
casino (4) - 9 freq
cains (4) - 1 freq
causeys (4) - 4 freq
risins (4) - 2 freq
cosying (4) - 1 freq
casin (4) - 3 freq
causays (4) - 1 freq
cons (4) - 4 freq
SoundEx code - C252
chickens - 30 freq
cousins - 46 freq
chuckneys - 2 freq
chuckie-hens - 4 freq
cousin's - 15 freq
cushions - 11 freq
casing - 1 freq
coggins - 1 freq
cessnock - 8 freq
cooking - 15 freq
ceacencw - 1 freq
chacking - 1 freq
chukin's - 1 freq
chasing - 5 freq
casinos - 1 freq
choking - 4 freq
cosmic - 11 freq
cake-makin - 2 freq
cognoscenti - 1 freq
chucknies - 1 freq
cosmos - 3 freq
coughing - 2 freq
chuckens - 5 freq
co-conspirator - 1 freq
causing - 8 freq
€œchukkens - 1 freq
chukkens - 3 freq
cognosce - 1 freq
cashing - 1 freq
cuisins - 1 freq
cockneys - 1 freq
chucking - 2 freq
checking - 4 freq
cocoons - 1 freq
cowgang - 1 freq
chicken-cocks - 1 freq
cosying - 1 freq
cojones - 1 freq
coo-scones - 3 freq
casxnk - 1 freq
choosing - 3 freq
chugging - 1 freq
coaching - 13 freq
chickmckenna - 3 freq
chasmically - 2 freq
cousinÂ’s - 1 freq
casino's - 1 freq
cockenzie - 1 freq
cizzens - 1 freq
MetaPhone code - KSNS
cousins - 46 freq
kizzens - 6 freq
cousin's - 15 freq
casinos - 1 freq
cuisins - 1 freq
cousinÂ’s - 1 freq
casino's - 1 freq
COUSINS
Time to execute Levenshtein function - 0.185963 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.342896 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027259 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036840 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.