A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to canes in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
canes (0) - 2 freq
cakes (1) - 54 freq
canns (1) - 1 freq
canus (1) - 1 freq
cants (1) - 1 freq
banes (1) - 205 freq
lanes (1) - 11 freq
caves (1) - 17 freq
cafes (1) - 6 freq
kanes (1) - 2 freq
cans (1) - 55 freq
wanes (1) - 2 freq
cones (1) - 3 freq
panes (1) - 9 freq
cane (1) - 22 freq
cares (1) - 61 freq
cases (1) - 56 freq
can's (1) - 1 freq
cranes (1) - 7 freq
cages (1) - 9 freq
scanes (1) - 1 freq
vanes (1) - 2 freq
danes (1) - 2 freq
manes (1) - 11 freq
cannes (1) - 1 freq
canes (0) - 2 freq
canus (1) - 1 freq
cans (1) - 55 freq
cones (1) - 3 freq
vanes (2) - 2 freq
scanes (2) - 1 freq
cranes (2) - 7 freq
cages (2) - 9 freq
manes (2) - 11 freq
anes (2) - 222 freq
cons (2) - 4 freq
cuns (2) - 1 freq
cains (2) - 1 freq
coneys (2) - 1 freq
can's (2) - 1 freq
cannes (2) - 1 freq
danes (2) - 2 freq
cakes (2) - 54 freq
lanes (2) - 11 freq
canns (2) - 1 freq
cases (2) - 56 freq
banes (2) - 205 freq
cants (2) - 1 freq
cafes (2) - 6 freq
caves (2) - 17 freq
SoundEx code - C520
comes - 962 freq
chance - 548 freq
coins - 52 freq
chinese - 272 freq
chang - 5 freq
chaunce - 49 freq
chink - 13 freq
cheyns - 1 freq
cong' - 1 freq
chynge - 228 freq
cums - 111 freq
chinge - 23 freq
chimneys - 2 freq
chains - 38 freq
chunk - 17 freq
chancy - 6 freq
change - 308 freq
chunks - 9 freq
chimes - 8 freq
chunce - 82 freq
chinwag - 3 freq
ching - 14 freq
cans - 55 freq
chunky - 8 freq
chins - 5 freq
cumnock - 35 freq
cheynge - 15 freq
cams - 54 freq
coamic - 5 freq
cheenge - 42 freq
comic - 31 freq
chynes - 2 freq
cheinge - 12 freq
conn's - 4 freq
conshie - 2 freq
chainge - 50 freq
caaing - 1 freq
caw-ins - 1 freq
cons - 4 freq
chines - 7 freq
connach - 9 freq
cones - 3 freq
cammocks - 1 freq
cum's - 5 freq
chunse - 15 freq
cheyneese - 1 freq
cheynees - 1 freq
conk's - 1 freq
connie's - 2 freq
chiyns - 1 freq
comics - 20 freq
cinch - 4 freq
cence - 3 freq
can's - 1 freq
cynics - 3 freq
connecks - 3 freq
chns - 1 freq
canes - 2 freq
coing - 1 freq
cheynes - 1 freq
chinks - 1 freq
chan's' - 1 freq
chung's' - 1 freq
chung's - 3 freq
cumes - 1 freq
cannes - 1 freq
chinky - 1 freq
come's - 1 freq
coinneach - 1 freq
cynic - 3 freq
chenge - 11 freq
'chynge' - 1 freq
'comics - 1 freq
change' - 1 freq
'change' - 1 freq
cunyies - 8 freq
cannas - 2 freq
cheins - 2 freq
chaenge - 6 freq
conk - 2 freq
chums - 13 freq
chöns - 1 freq
chymic - 1 freq
cuns - 1 freq
cheans - 2 freq
chemise - 2 freq
chaynge - 1 freq
coinage - 8 freq
conneck - 3 freq
conneks - 2 freq
china's - 1 freq
cheens - 4 freq
cowan's - 1 freq
chuang - 2 freq
cain's - 1 freq
conchie - 1 freq
con's - 3 freq
cims - 1 freq
chinese' - 1 freq
cumms - 2 freq
chancie - 2 freq
congo - 1 freq
conns - 1 freq
cammy's - 1 freq
chymmnis - 1 freq
canns - 1 freq
cheenese - 1 freq
coneys - 1 freq
€˜comic - 2 freq
conc - 1 freq
canus - 1 freq
'chums' - 1 freq
chung-chou - 1 freq
chainsaw - 1 freq
cinq - 1 freq
cawing - 3 freq
chomsky - 1 freq
congee - 1 freq
cheenj - 1 freq
€œchinese - 1 freq
commas - 3 freq
chans - 1 freq
changey - 1 freq
chummys - 1 freq
cmsy - 1 freq
chuncey - 1 freq
cnag - 1 freq
consi - 8 freq
chinos - 1 freq
cmq - 1 freq
cains - 1 freq
cmc - 1 freq
cxonwk - 1 freq
cnoc - 1 freq
comms - 1 freq
chunnaic - 1 freq
cwuhmuso - 1 freq
chamoix - 1 freq
cmack - 1 freq
choons - 1 freq
cumnocks - 1 freq
chinook - 1 freq
camz - 1 freq
MetaPhone code - KNS
kens - 532 freq
coins - 52 freq
queen's - 55 freq
queens - 40 freq
guns - 73 freq
quine's - 18 freq
quines - 156 freq
gans - 51 freq
gansey - 17 freq
quines' - 2 freq
queyn's - 2 freq
kenzie - 49 freq
'kenzie - 2 freq
cans - 55 freq
gowns - 2 freq
guinness - 16 freq
conn's - 4 freq
kynes - 11 freq
ken's - 6 freq
gunns - 2 freq
caw-ins - 1 freq
cons - 4 freq
cones - 3 freq
kins - 15 freq
connie's - 2 freq
quinie's - 12 freq
queans - 5 freq
guineas - 4 freq
queanie's - 1 freq
kines - 21 freq
quinies - 7 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gauns - 30 freq
can's - 1 freq
gainsay - 2 freq
ganues - 1 freq
gayness - 1 freq
canes - 2 freq
cannes - 1 freq
'guinness - 1 freq
cannas - 2 freq
gunnie's - 8 freq
goonies - 3 freq
gonzo - 1 freq
ganzie - 14 freq
kenny's - 1 freq
cuns - 1 freq
gansie - 3 freq
gouns - 12 freq
quince - 2 freq
kyns - 2 freq
kynness - 1 freq
cain's - 1 freq
gains - 8 freq
con's - 3 freq
gaens - 1 freq
kaens - 2 freq
queenis - 1 freq
conns - 1 freq
gunes - 2 freq
canns - 1 freq
coneys - 1 freq
kains - 1 freq
gaeins - 1 freq
canus - 1 freq
goins - 2 freq
keenness - 1 freq
quynes - 1 freq
€˜guinness - 1 freq
quineÂ’s - 1 freq
guans - 1 freq
quins - 1 freq
consi - 8 freq
cains - 1 freq
guiness - 2 freq
kanes - 2 freq
queenies - 1 freq
kenÂ’s - 1 freq
goineasy - 1 freq
CANES
Time to execute Levenshtein function - 0.198528 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.380130 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029083 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037834 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000928 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.