A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to cans in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
cans (0) - 45 freq
can's (1) - 1 freq
canes (1) - 2 freq
ca's (1) - 4 freq
mans (1) - 21 freq
aans (1) - 1 freq
pans (1) - 23 freq
clans (1) - 26 freq
dans (1) - 3 freq
gans (1) - 51 freq
chans (1) - 1 freq
eans (1) - 10 freq
can (1) - 4706 freq
cane (1) - 22 freq
cany (1) - 5 freq
lans (1) - 15 freq
cams (1) - 54 freq
cant (1) - 40 freq
scans (1) - 4 freq
wans (1) - 167 freq
chns (1) - 1 freq
can- (1) - 4 freq
ans (1) - 2 freq
cani (1) - 2 freq
fans (1) - 141 freq
Double Levenshtein
cans (0) - 45 freq
cuns (1) - 1 freq
cains (1) - 1 freq
canus (1) - 1 freq
cons (1) - 4 freq
canes (1) - 2 freq
caws (2) - 36 freq
tans (2) - 4 freq
cana (2) - 29 freq
cann (2) - 1 freq
canns (2) - 1 freq
vans (2) - 22 freq
rans (2) - 1 freq
cars (2) - 91 freq
cais (2) - 1 freq
sans (2) - 7 freq
crans (2) - 1 freq
cats (2) - 122 freq
consi (2) - 8 freq
coins (2) - 50 freq
oceans (2) - 16 freq
cones (2) - 2 freq
caus (2) - 8 freq
bans (2) - 35 freq
caas (2) - 44 freq
SoundEx code - C520
comes - 926 freq
chance - 530 freq
coins - 50 freq
chinese - 270 freq
chang - 5 freq
chaunce - 49 freq
chink - 13 freq
cheyns - 1 freq
cong' - 1 freq
chynge - 223 freq
cums - 111 freq
chinge - 23 freq
chimneys - 2 freq
chains - 38 freq
chunk - 17 freq
chancy - 6 freq
change - 299 freq
chunks - 8 freq
chimes - 7 freq
chunce - 82 freq
chinwag - 3 freq
ching - 14 freq
cans - 45 freq
chunky - 7 freq
chins - 5 freq
cumnock - 35 freq
cheynge - 15 freq
cams - 54 freq
coamic - 5 freq
cheenge - 42 freq
comic - 31 freq
chynes - 2 freq
cheinge - 12 freq
conn's - 4 freq
conshie - 2 freq
chainge - 50 freq
caaing - 1 freq
caw-ins - 1 freq
cons - 4 freq
chines - 7 freq
connach - 9 freq
cones - 2 freq
cammocks - 1 freq
cum's - 5 freq
chunse - 15 freq
cheyneese - 1 freq
cheynees - 1 freq
conk's - 1 freq
connie's - 2 freq
cinch - 4 freq
cence - 3 freq
can's - 1 freq
cynics - 3 freq
connecks - 3 freq
chns - 1 freq
canes - 2 freq
coing - 1 freq
cheynes - 1 freq
chinks - 1 freq
chan's' - 1 freq
chung's' - 1 freq
chung's - 3 freq
cumes - 1 freq
cannes - 1 freq
chinky - 1 freq
come's - 1 freq
comics - 19 freq
coinneach - 1 freq
cynic - 3 freq
chenge - 11 freq
'chynge' - 1 freq
'comics - 1 freq
change' - 1 freq
'change' - 1 freq
cunyies - 8 freq
cannas - 2 freq
cheins - 2 freq
chaenge - 6 freq
conk - 2 freq
chums - 13 freq
chöns - 1 freq
chymic - 1 freq
cuns - 1 freq
cheans - 2 freq
chemise - 2 freq
chaynge - 1 freq
coinage - 8 freq
conneck - 3 freq
conneks - 2 freq
china's - 1 freq
cheens - 4 freq
cowan's - 1 freq
chuang - 2 freq
cain's - 1 freq
conchie - 1 freq
con's - 3 freq
cims - 1 freq
chinese' - 1 freq
cumms - 2 freq
chancie - 2 freq
congo - 1 freq
conns - 1 freq
cammy's - 1 freq
chymmnis - 1 freq
canns - 1 freq
cheenese - 1 freq
coneys - 1 freq
‘comic - 2 freq
conc - 1 freq
canus - 1 freq
'chums' - 1 freq
chung-chou - 1 freq
chainsaw - 1 freq
cinq - 1 freq
cawing - 3 freq
chomsky - 1 freq
congee - 1 freq
cheenj - 1 freq
“chinese - 1 freq
commas - 3 freq
chans - 1 freq
changey - 1 freq
chummys - 1 freq
cmsy - 1 freq
chuncey - 1 freq
cnag - 1 freq
consi - 8 freq
chinos - 1 freq
cmq - 1 freq
cains - 1 freq
cmc - 1 freq
cxonwk - 1 freq
cnoc - 1 freq
comms - 1 freq
chunnaic - 1 freq
cwuhmuso - 1 freq
chamoix - 1 freq
cmack - 1 freq
choons - 1 freq
cumnocks - 1 freq
chinook - 1 freq
camz - 1 freq
MetaPhone code - KNS
kens - 531 freq
coins - 50 freq
queen's - 52 freq
queens - 37 freq
guns - 71 freq
quine's - 18 freq
quines - 155 freq
gans - 51 freq
gansey - 17 freq
quines' - 2 freq
queyn's - 2 freq
kenzie - 49 freq
'kenzie - 2 freq
cans - 45 freq
gowns - 2 freq
guinness - 9 freq
conn's - 4 freq
kynes - 11 freq
ken's - 6 freq
gunns - 2 freq
caw-ins - 1 freq
cons - 4 freq
cones - 2 freq
kins - 15 freq
connie's - 2 freq
quinie's - 12 freq
queans - 5 freq
guineas - 4 freq
queanie's - 1 freq
kines - 21 freq
quinies - 7 freq
gun's - 1 freq
guns' - 1 freq
goons - 9 freq
gauns - 30 freq
can's - 1 freq
gainsay - 2 freq
ganues - 1 freq
gayness - 1 freq
canes - 2 freq
cannes - 1 freq
'guinness - 1 freq
cannas - 2 freq
gunnie's - 8 freq
goonies - 3 freq
gonzo - 1 freq
ganzie - 14 freq
kenny's - 1 freq
cuns - 1 freq
gansie - 3 freq
gouns - 12 freq
quince - 2 freq
kyns - 2 freq
kynness - 1 freq
cain's - 1 freq
gains - 8 freq
con's - 3 freq
gaens - 1 freq
kaens - 2 freq
queenis - 1 freq
conns - 1 freq
gunes - 2 freq
canns - 1 freq
coneys - 1 freq
kains - 1 freq
gaeins - 1 freq
canus - 1 freq
goins - 2 freq
keenness - 1 freq
quynes - 1 freq
‘guinness - 1 freq
quine’s - 1 freq
guans - 1 freq
quins - 1 freq
consi - 8 freq
cains - 1 freq
guiness - 2 freq
kanes - 2 freq
queenies - 1 freq
ken’s - 1 freq
goineasy - 1 freq
CANS
Time to execute Levenshtein function - 0.170596 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
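As a rough illustration of the definition above, the distance can be computed with the standard dynamic-programming recurrence. This is a minimal Python sketch, not the code behind this page; the function name `levenshtein` is chosen here for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["cans", "can's", "canes", "can"]:
        print(word, levenshtein("cans", word))
```

With the words from the list above, `cans` scores 0 against itself and 1 against `can's`, `canes` and `can`.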
Time to execute Double Levenshtein function - 0.316568 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
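A minimal Python sketch of this double-distance idea follows. It assumes the vowels removed are a, e, i, o and u (the page does not say which letters count as vowels), and the names `strip_vowels` and `double_levenshtein` are made up for illustration. Under that assumption it reproduces the scores listed for cans, such as canes (1) and caws (2).

```python
VOWELS = set("aeiou")  # assumption: y and accented letters count as consonants


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: insertions, deletions and replacements."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowel letters, keeping everything else in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    full = levenshtein(a, b)
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


if __name__ == "__main__":
    for word in ["cans", "canes", "caws", "oceans"]:
        print(word, double_levenshtein("cans", word))
```

A vowel-only difference (cans, canes) costs 1 because the consonant skeletons are identical, while a consonant difference (cans, caws) is counted in both passes and costs 2.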
Time to execute SoundEx function - 0.027368 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
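The encoding can be sketched in a few lines of Python. This follows the common American Soundex rules (keep the first letter, map consonants to digits, collapse adjacent repeats, pad to four characters) and is an illustration only; the exact variant used by this page is not stated, and the function name `soundex` is an assumption.

```python
# Digit classes for consonants; vowels, y, h and w carry no digit.
CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if no ASCII letters remain."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeated digit may follow
    return (result + "000")[:4]


if __name__ == "__main__":
    for word in ["cans", "comes", "chance", "chimneys"]:
        print(word, soundex(word))
```

Under these rules cans, comes, chance and chimneys all encode to C520, which is why such different-looking words appear together in the SoundEx list above.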
Time to execute MetaPhone function - 0.036605 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000831 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
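A lookup of this kind can be sketched as a plain dictionary from word form to lemma. The entries below are invented for illustration and are not taken from the corpus lexicon; `LEXICON`, `lemma_for` and `variants_of` are hypothetical names.

```python
from typing import List, Optional

# Hypothetical hand-made entries: word form -> lemma.
LEXICON = {
    "cans": "CANS",
    "canns": "CANS",
    "can's": "CANS",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


def variants_of(word: str) -> List[str]:
    """List every curated form that shares a lemma with the given word."""
    lemma = lemma_for(word)
    if lemma is None:
        return []
    return sorted(form for form, target in LEXICON.items() if target == lemma)


if __name__ == "__main__":
    print(lemma_for("cans"))
    print(variants_of("canns"))
    print(lemma_for("sonsie"))  # not in this toy lexicon
```

A dictionary lookup does no distance calculation at all, which fits its being the fastest of the five methods timed above. It can only return what someone has entered by hand.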