A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to cannot in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
cannot (0) - 19 freq
canno (1) - 26 freq
canno- (1) - 1 freq
lannot (1) - 4 freq
cannon (1) - 13 freq
cannit (1) - 1 freq
cannle (2) - 7 freq
cain't (2) - 1 freq
canton (2) - 2 freq
can’t (2) - 11 freq
hannet (2) - 2 freq
connor (2) - 5 freq
cannae (2) - 1702 freq
canns (2) - 1 freq
anno (2) - 5 freq
canst (2) - 2 freq
cantor (2) - 1 freq
canio (2) - 1 freq
cannen (2) - 3 freq
wannt (2) - 1 freq
cunno (2) - 1 freq
canna (2) - 1230 freq
mannt (2) - 1 freq
canned (2) - 2 freq
canner (2) - 1 freq
Double Levenshtein (distance in brackets)
cannot (0) - 19 freq
cannit (1) - 1 freq
cannon (2) - 13 freq
cainnt (2) - 1 freq
lannot (2) - 4 freq
canno (2) - 26 freq
canno- (2) - 1 freq
cant (3) - 40 freq
canyon (3) - 1 freq
cannin (3) - 3 freq
canne (3) - 7 freq
cann (3) - 1 freq
bannet (3) - 1 freq
can't (3) - 70 freq
cannel (3) - 18 freq
lannit (3) - 3 freq
bannit (3) - 2 freq
cantit (3) - 1 freq
mannit (3) - 2 freq
cannie (3) - 93 freq
canto (3) - 27 freq
caint (3) - 3 freq
canni (3) - 6 freq
canny (3) - 214 freq
cannes (3) - 1 freq
SoundEx code - C530
can't - 70 freq
comet - 12 freq
canty - 28 freq
coonty - 11 freq
cantie - 77 freq
comatie - 2 freq
count - 45 freq
cunt - 447 freq
comedy - 33 freq
committee - 25 freq
chant - 26 freq
canada - 39 freq
coont - 348 freq
cent - 20 freq
cannot - 19 freq
cant - 40 freq
comed - 12 freq
caumed - 3 freq
chenyit - 1 freq
commute - 4 freq
candy - 26 freq
commit - 17 freq
cometh - 6 freq
chimed - 8 freq
chaunt - 5 freq
cyanide - 1 freq
caamed - 1 freq
caam't - 1 freq
'cunt' - 1 freq
cindy - 6 freq
cind - 2 freq
comte - 1 freq
cain't - 1 freq
county - 38 freq
coontie - 8 freq
commuity - 1 freq
'cometh - 1 freq
caaaandy - 1 freq
canned - 2 freq
comatee - 104 freq
conned - 1 freq
commït - 2 freq
cundie - 10 freq
chuntie - 4 freq
chanty - 10 freq
chain'd - 1 freq
commita - 1 freq
cned - 1 freq
canto - 27 freq
chante - 1 freq
'coont - 1 freq
chummed - 1 freq
commïttee - 1 freq
canadaw - 1 freq
cheined - 1 freq
'count - 1 freq
chained - 3 freq
cumd - 1 freq
cummed - 1 freq
canute - 2 freq
cnd - 8 freq
chinned - 1 freq
chianti - 4 freq
chyint - 1 freq
chainéd - 1 freq
cowned - 9 freq
cainnt - 1 freq
chantie' - 1 freq
comedie - 3 freq
canadae - 6 freq
coohaund - 1 freq
chunty - 6 freq
comethe - 1 freq
“cantie - 1 freq
comeat - 1 freq
chantie - 1 freq
cunned - 1 freq
comete - 1 freq
‘cyanide - 1 freq
coined - 3 freq
coomed - 1 freq
cunto - 1 freq
cammatee - 1 freq
‘can-do - 1 freq
can’t - 11 freq
cnut - 1 freq
cnuty - 1 freq
coooooooooooont - 1 freq
cannit - 1 freq
count’ - 1 freq
caint - 3 freq
chuntey - 1 freq
cmtyea - 1 freq
cont - 2 freq
candy' - 1 freq
cahunt - 1 freq
chunt - 4 freq
MetaPhone code - KNT
kind - 691 freq
kent - 1832 freq
can't - 70 freq
canty - 28 freq
coonty - 11 freq
kindae - 23 freq
quaint - 5 freq
queen'd - 1 freq
cantie - 77 freq
kynd - 24 freq
kennt - 151 freq
count - 45 freq
cunt - 447 freq
gandy - 1 freq
kinda - 175 freq
canada - 39 freq
gant - 15 freq
coont - 348 freq
kenned - 33 freq
cannot - 19 freq
queen-it - 1 freq
cant - 40 freq
kint - 58 freq
gained - 15 freq
kant - 4 freq
candy - 26 freq
gannet - 10 freq
gantae - 6 freq
gaantae - 2 freq
kin't - 2 freq
'kind' - 1 freq
ken't - 19 freq
g'noot - 1 freq
'cunt' - 1 freq
gaunt - 8 freq
cain't - 1 freq
county - 38 freq
gontae - 7 freq
goantae - 1 freq
gunned - 1 freq
coontie - 8 freq
wykened - 1 freq
kinnd - 2 freq
caaaandy - 1 freq
canned - 2 freq
kindo - 32 freq
conned - 1 freq
cundie - 10 freq
gundy - 4 freq
quinty - 1 freq
gauntae - 1 freq
'kent - 1 freq
cned - 1 freq
canto - 27 freq
quanta - 1 freq
gaaned - 1 freq
'coont - 1 freq
canadaw - 1 freq
keind - 1 freq
k'nit - 2 freq
'count - 1 freq
kynda - 3 freq
canute - 2 freq
cnd - 8 freq
cowned - 9 freq
cainnt - 1 freq
kennedy - 23 freq
quando - 1 freq
gainit - 2 freq
kennet - 1 freq
canadae - 6 freq
kanada - 1 freq
kennedie - 1 freq
“cantie - 1 freq
kandy - 1 freq
cunned - 1 freq
‘quanto - 1 freq
kaint - 4 freq
gointy - 2 freq
gonty - 1 freq
kindey - 1 freq
kindy - 1 freq
kenn'd - 1 freq
'kind - 1 freq
gaen-oot - 1 freq
coined - 3 freq
gaint - 1 freq
cunto - 1 freq
‘can-do - 1 freq
keynote - 1 freq
“kinda - 1 freq
gond - 1 freq
kennit - 1 freq
ken’t - 1 freq
wknd - 2 freq
can’t - 11 freq
cnut - 1 freq
cnuty - 1 freq
coooooooooooont - 1 freq
cannit - 1 freq
‘kind - 1 freq
count’ - 1 freq
caint - 3 freq
wkend - 1 freq
kendo - 2 freq
qnd - 1 freq
cont - 2 freq
quint - 1 freq
candy' - 1 freq
'kent' - 1 freq
Manually curated - CANNOT
Time to execute Levenshtein function - 0.213686 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
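As a minimal sketch of how such a distance can be computed, the Python function below uses the standard dynamic-programming approach with two rows. It is an illustration only, not the code behind this page, and the name `levenshtein` is chosen here for convenience.

```python
def levenshtein(a, b):
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("cannot", "canno", "cannae"):
        print(word, levenshtein("cannot", word))
```

For the query word above, this gives 1 for canno and 2 for cannae, in line with the bracketed distances in the listing.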
Time to execute Double Levenshtein function - 0.380434 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
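The idea can be sketched in Python as follows. This is a reading of the description above rather than the page's own code: the helper names are made up for the example, and the vowel set `aeiou` is an assumption (the page may treat letters such as y differently).

```python
VOWELS = set("aeiou")  # assumption: the page's exact vowel set is not stated


def levenshtein(a, b):
    """Plain Levenshtein distance (two-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word):
    """Drop vowels, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a, b):
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("cannit", "cannon", "cant"):
        print(word, double_levenshtein("cannot", word))
```

Under these assumptions cannit scores 1 (only a vowel differs), cannon scores 2 and cant scores 3, matching the Double Levenshtein listing: a changed vowel is counted once, a changed consonant twice.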
Time to execute SoundEx function - 0.038062 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
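To show why cannot, comet and canada all land on C530, here is a small Python sketch of the classic American Soundex rules (keep the first letter, encode the following consonants as digits, skip vowels, collapse repeated codes, pad to four characters). It is an illustrative implementation, not the function this page calls, and edge-case behaviour may differ between libraries.

```python
import string

# Consonant groups of classic Soundex; vowels, h, w and y carry no code.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word):
    """Return a four-character Soundex code, or '' if word has no a-z letters."""
    letters = [c for c in word.lower() if c in string.ascii_lowercase]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
            if len(result) == 4:
                break
        prev = code  # a vowel resets prev, so a repeated code can recur
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("cannot", "comet", "canada", "chant"):
        print(word, soundex(word))
```

All four sample words print C530, which is why they appear together in the SoundEx list despite having little else in common.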
Time to execute MetaPhone function - 0.039127 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
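The full Metaphone rule set is too long to reproduce here, so the Python sketch below is deliberately a toy: a heavily reduced key that covers only the few sound merges needed to see why kind, gant, quaint and cannot share the code KNT. The function name and the letter map are hypothetical, and the result will differ from real Metaphone on many words.

```python
import string

VOWELS = set("aeiou")

# Hypothetical, heavily reduced map. NOT the full Metaphone rule set:
# hard c, g, k and q are merged into K, and d is merged into T.
TOY_MAP = {"c": "K", "g": "K", "k": "K", "q": "K", "d": "T", "t": "T"}


def toy_phonetic_key(word):
    """Toy consonant key in the spirit of Metaphone (illustration only)."""
    letters = [c for c in word.lower() if c in string.ascii_lowercase]
    key = []
    prev = ""
    for ch in letters:
        if ch == prev:
            continue  # an adjacent doubled letter counts once
        prev = ch
        if ch in VOWELS:
            continue  # this toy drops every vowel, even an initial one
        key.append(TOY_MAP.get(ch, ch.upper()))
    return "".join(key)


if __name__ == "__main__":
    for word in ("cannot", "kind", "gant", "quaint", "kennt"):
        print(word, toy_phonetic_key(word))
```

Each of the sample words prints KNT. Real Metaphone reaches the same code for them through a much richer set of context-sensitive rules (soft c, ch, gh, silent letters and so on) that this sketch ignores.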
Time to execute Manually curated function - 0.001135 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
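A lookup of this kind can be sketched in Python with a plain dictionary. Only the cannot to CANNOT pairing comes from the result shown above; the other entries, and the names `LEXICON` and `lookup_lemma`, are hypothetical and stand in for a table that is not shown on this page.

```python
# Hypothetical entries for illustration; only "cannot" -> "CANNOT"
# is taken from the result displayed on the page.
LEXICON = {
    "cannot": "CANNOT",
    "cannae": "CANNOT",  # assumed spelling variant
    "canna": "CANNOT",   # assumed spelling variant
}


def lookup_lemma(word, lexicon=LEXICON):
    """Return the curated lemma for word, or None if it is not covered."""
    return lexicon.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("cannot"))
    print(lookup_lemma("sonsie"))
```

A dictionary lookup needs no string comparison at all, which fits with this method having the shortest execution time reported above. The cost is coverage: a word missing from the table simply returns nothing.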