A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to cohort in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cohort (0) - 4 freq
coort (1) - 125 freq
colourt (2) - 6 freq
clort (2) - 3 freq
court (2) - 153 freq
coor (2) - 4 freq
cloort (2) - 1 freq
oort (2) - 8 freq
chord (2) - 7 freq
comfort (2) - 95 freq
hort (2) - 5 freq
color (2) - 2 freq
chert (2) - 2 freq
moort (2) - 4 freq
couart (2) - 1 freq
coont (2) - 347 freq
chokt (2) - 2 freq
coot (2) - 2 freq
coorit (2) - 2 freq
athort (2) - 122 freq
eoort (2) - 1 freq
chork (2) - 1 freq
soort (2) - 2 freq
short (2) - 319 freq
poort (2) - 5 freq
cohort (0) - 4 freq
chert (2) - 2 freq
chart (2) - 4 freq
coort (2) - 125 freq
chairt (3) - 17 freq
chork (3) - 1 freq
athort (3) - 122 freq
coorit (3) - 2 freq
short (3) - 319 freq
schort (3) - 5 freq
cheert (3) - 1 freq
cowert (3) - 2 freq
culort (3) - 1 freq
chore (3) - 11 freq
covert (3) - 38 freq
caiort (3) - 1 freq
chokt (3) - 2 freq
hort (3) - 5 freq
chord (3) - 7 freq
cloort (3) - 1 freq
court (3) - 153 freq
clort (3) - 3 freq
colourt (3) - 6 freq
couart (3) - 1 freq
unhurt (4) - 1 freq
SoundEx code - C630
cried - 979 freq
carrot - 19 freq
cooried - 63 freq
cairt - 92 freq
cairried - 143 freq
carried - 35 freq
caird - 96 freq
cheered - 27 freq
crood - 111 freq
cairrit - 27 freq
crowd - 127 freq
court - 153 freq
creaut - 4 freq
coward - 12 freq
cryed - 97 freq
cairiet - 7 freq
carte - 4 freq
create - 50 freq
caird' - 1 freq
criet - 1 freq
corat - 1 freq
cared - 32 freq
cured - 29 freq
chord - 7 freq
coort - 125 freq
curd - 5 freq
courit - 7 freq
crawed - 7 freq
charity - 41 freq
crude - 5 freq
cryit - 35 freq
cheritie - 3 freq
cairriet - 70 freq
couardie - 3 freq
crathie - 1 freq
craad - 2 freq
cairit - 6 freq
crowdie - 4 freq
cardie - 2 freq
card - 61 freq
coor't - 1 freq
coo'ard - 1 freq
curate - 7 freq
charade - 2 freq
cord - 18 freq
coard - 3 freq
cairied - 12 freq
croud - 8 freq
chairt - 17 freq
croudie - 2 freq
cooriet - 8 freq
cheerit - 1 freq
charitie - 2 freq
chirrie-wuid - 1 freq
chariot - 4 freq
crooed - 1 freq
cure't - 2 freq
crowed - 2 freq
carret - 3 freq
caireyt - 1 freq
cheert - 1 freq
cardee - 1 freq
couriet - 1 freq
creed - 9 freq
chored - 9 freq
couried - 6 freq
crout - 1 freq
cert - 1 freq
cairtie - 12 freq
'cairry-oot' - 2 freq
cairdie - 1 freq
cairred - 2 freq
corrodi - 1 freq
cooardy - 3 freq
cairryit - 2 freq
cart - 7 freq
chart - 4 freq
crate - 6 freq
car'ied - 1 freq
cairret - 2 freq
cairry-oot - 4 freq
carred - 1 freq
cowert - 2 freq
cyaard - 35 freq
cairryt - 1 freq
chert - 2 freq
cooardie - 1 freq
c--road - 1 freq
cardi - 2 freq
cyard - 1 freq
cerried - 3 freq
cartie - 1 freq
carriet - 15 freq
chaired - 6 freq
croat - 1 freq
coured - 1 freq
certy - 3 freq
cooerd - 6 freq
cooard - 1 freq
coorit - 2 freq
certie - 4 freq
crait - 1 freq
corda - 1 freq
curt - 1 freq
crete - 14 freq
cheriot - 2 freq
croatia - 7 freq
curried - 2 freq
courie't - 1 freq
curdoo - 1 freq
curday - 1 freq
couart - 1 freq
'cyaard - 1 freq
cairry'd - 1 freq
coored - 2 freq
'cured' - 1 freq
cairie-oot - 1 freq
crehd - 2 freq
caiort - 1 freq
courried - 1 freq
cairrie-out - 1 freq
couered - 1 freq
€”croatia - 1 freq
chard - 1 freq
cardy - 1 freq
chairity - 3 freq
cardio - 1 freq
carruth - 1 freq
cígarette - 1 freq
chairiot - 2 freq
coordy - 1 freq
cohort - 4 freq
cerry-oot - 1 freq
craa-tae - 1 freq
charred - 1 freq
cerd - 6 freq
chord' - 1 freq
carryout - 1 freq
cairt” - 1 freq
'court - 1 freq
crit - 1 freq
charaid - 1 freq
carat - 1 freq
MetaPhone code - KHRT
cohort - 4 freq
COHORT
Time to execute Levenshtein function - 0.290414 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.407498 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.061539 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037430 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000856 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.