A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cohesion in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cohesion (0) - 8 freq
coherin (2) - 1 freq
cohesive (2) - 9 freq
meesion (3) - 1 freq
cowerin (3) - 5 freq
cousin (3) - 100 freq
confession (3) - 10 freq
creesin (3) - 1 freq
comedian (3) - 4 freq
robeson (3) - 1 freq
cheriot (3) - 2 freq
collusion (3) - 1 freq
coursin (3) - 1 freq
veesion (3) - 25 freq
cjohnston (3) - 1 freq
collision (3) - 5 freq
chein (3) - 2 freq
commeision (3) - 1 freq
couerin (3) - 1 freq
cooshion (3) - 1 freq
conversion (3) - 3 freq
foosion (3) - 1 freq
johnston (3) - 18 freq
johnson (3) - 59 freq
coverin (3) - 29 freq
cohesion (0) - 8 freq
chusin (3) - 4 freq
chasin (3) - 40 freq
cohesive (3) - 9 freq
coherin (3) - 1 freq
chewin (4) - 15 freq
coorsin (4) - 2 freq
cheisen (4) - 1 freq
chosen (4) - 48 freq
chusen (4) - 2 freq
chasan (4) - 2 freq
chysen (4) - 1 freq
choosin (4) - 13 freq
chein (4) - 2 freq
chuisin (4) - 4 freq
cohen (4) - 1 freq
coulson (4) - 2 freq
creesin (4) - 1 freq
coursin (4) - 1 freq
cousin (4) - 100 freq
cheyn (5) - 4 freq
caasin (5) - 3 freq
chasie (5) - 1 freq
chuisen (5) - 1 freq
chasing (5) - 5 freq
SoundEx code - C250
chasin - 40 freq
cookin - 43 freq
caw-cannie - 3 freq
chokin - 28 freq
coaxin - 5 freq
chuckin - 17 freq
choosin - 13 freq
cushion - 28 freq
cushin - 3 freq
chosen - 48 freq
chuggin - 4 freq
checkin - 44 freq
chicken - 82 freq
'chicken - 1 freq
chuckie-hen - 1 freq
cousin - 100 freq
causin - 14 freq
'caw-cannie - 1 freq
cuikin - 7 freq
cuisin - 1 freq
caukin - 1 freq
cocaine - 6 freq
cassen - 27 freq
'cousin' - 1 freq
coughin - 14 freq
chucken - 17 freq
chasan - 2 freq
chusin - 4 freq
checkin' - 4 freq
chasin' - 1 freq
causen - 3 freq
chukkin - 1 freq
chukken - 4 freq
caasen - 1 freq
cooken - 1 freq
couken - 3 freq
cooshen - 1 freq
chasm - 2 freq
cizzin - 1 freq
cookin' - 3 freq
caizzen - 1 freq
'chasin - 1 freq
cuisine - 6 freq
cowkin - 2 freq
chuckeny - 1 freq
cashin - 1 freq
casino - 9 freq
chozen - 1 freq
caasin - 3 freq
casin - 3 freq
cosam - 3 freq
cockin - 8 freq
'chucken - 1 freq
chechen - 1 freq
cassin - 4 freq
chysen - 1 freq
coaxan - 3 freq
coagin - 1 freq
cohesion - 8 freq
chackin - 3 freq
cookeen - 3 freq
cocoon - 2 freq
cookan - 2 freq
checkan - 3 freq
chuckan - 1 freq
cuzzin - 1 freq
cuckin - 1 freq
cheussan - 2 freq
coseen - 1 freq
chossen - 1 freq
chowkin - 1 freq
chookie-hen - 1 freq
chusen - 2 freq
cockney - 8 freq
cuissen - 1 freq
choosan - 2 freq
chokan - 1 freq
chokkin - 1 freq
cheisen - 1 freq
chuisen - 1 freq
cassini - 2 freq
chuisin - 4 freq
€˜chukken - 1 freq
cocainyie - 1 freq
chakkin - 1 freq
cessioun - 1 freq
€œchicken - 3 freq
cackin - 1 freq
coogan - 1 freq
coushin - 1 freq
coccoon - 1 freq
cassino - 2 freq
chasni - 2 freq
chukin - 1 freq
chiichan - 1 freq
coachin - 1 freq
chejoanna - 2 freq
chegwin - 1 freq
cooshion - 1 freq
choakin - 1 freq
cooshin - 1 freq
MetaPhone code - KHXN
cohesion - 8 freq
COHESION
Time to execute Levenshtein function - 0.180442 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.367126 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.033521 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.046668 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000958 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.