A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to coll in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
coll (0) - 7 freq
toll (1) - 16 freq
cool (1) - 138 freq
boll (1) - 1 freq
woll (1) - 1 freq
oll (1) - 1 freq
voll (1) - 1 freq
coln (1) - 2 freq
roll (1) - 108 freq
coull (1) - 27 freq
colt (1) - 1 freq
cull (1) - 7 freq
colly (1) - 1 freq
holl (1) - 5 freq
cole (1) - 1 freq
poll (1) - 42 freq
colz (1) - 10 freq
cell (1) - 38 freq
coul (1) - 56 freq
colls (1) - 1 freq
cold (1) - 45 freq
cola (1) - 10 freq
coil (1) - 1 freq
call (1) - 220 freq
doll (1) - 40 freq
coll (0) - 7 freq
cull (1) - 7 freq
colly (1) - 1 freq
call (1) - 220 freq
cell (1) - 38 freq
coull (1) - 27 freq
coil (2) - 1 freq
caall (2) - 2 freq
cola (2) - 10 freq
calle (2) - 2 freq
cold (2) - 45 freq
doll (2) - 40 freq
cello (2) - 2 freq
coal (2) - 134 freq
cowl (2) - 8 freq
colls (2) - 1 freq
coli (2) - 1 freq
coolly (2) - 2 freq
col (2) - 11 freq
collie (2) - 27 freq
toll (2) - 16 freq
boll (2) - 1 freq
woll (2) - 1 freq
oll (2) - 1 freq
coul (2) - 56 freq
SoundEx code - C400
cool - 138 freq
cal - 29 freq
chiel - 456 freq
caal - 98 freq
coal - 134 freq
cloy - 1 freq
clue - 87 freq
caul - 79 freq
coyle - 53 freq
cheil - 85 freq
cell - 38 freq
call - 220 freq
coolie - 3 freq
chill - 45 freq
'chill - 2 freq
collie - 27 freq
cola - 10 freq
cuil - 9 freq
chilly - 12 freq
claw - 27 freq
cley - 23 freq
cheel - 11 freq
chile - 15 freq
coll - 7 freq
claa - 14 freq
coul - 56 freq
cowl - 8 freq
cello - 2 freq
clay - 21 freq
cloo - 1 freq
coil - 1 freq
clew - 3 freq
clie - 2 freq
coolly - 2 freq
'cal' - 1 freq
caall - 2 freq
coli - 1 freq
cele - 1 freq
cel - 1 freq
clye - 2 freq
chailie - 1 freq
coul' - 2 freq
coelho - 1 freq
'cool - 1 freq
chilli - 12 freq
cöl - 2 freq
col - 11 freq
coallie - 4 freq
cl - 5 freq
cclew - 7 freq
calle - 2 freq
chello - 1 freq
clowe - 2 freq
chloe - 10 freq
ceol - 1 freq
celia - 2 freq
cheoil - 1 freq
cule - 1 freq
colly - 1 freq
claie - 1 freq
coull - 27 freq
cleyey - 1 freq
cöllie - 1 freq
cweel - 3 freq
cull - 7 freq
chowl - 1 freq
clio - 1 freq
caul' - 5 freq
€˜coyle - 3 freq
caley - 4 freq
€œcall - 2 freq
€˜cool - 2 freq
€˜cooool - 1 freq
coalie - 1 freq
cowley - 1 freq
cooly - 1 freq
caaaal - 1 freq
cole - 1 freq
cala - 1 freq
caulÂ’ - 1 freq
‘cool’ - 1 freq
chielie - 1 freq
cali - 3 freq
cgl - 2 freq
cxlw - 1 freq
cla - 1 freq
cklly - 1 freq
clo - 1 freq
coley - 1 freq
calloo - 2 freq
cyuliea - 1 freq
cxihl - 1 freq
'call' - 1 freq
callie - 1 freq
MetaPhone code - KL
cool - 138 freq
cal - 29 freq
glow - 51 freq
gale - 64 freq
glue - 20 freq
caal - 98 freq
coal - 134 freq
cloy - 1 freq
clue - 87 freq
gowl - 7 freq
kuil - 1 freq
caul - 79 freq
coyle - 53 freq
gala - 15 freq
kill - 216 freq
keelie - 7 freq
quelle - 1 freq
call - 220 freq
glee - 30 freq
keel - 12 freq
coolie - 3 freq
quile - 1 freq
kail - 42 freq
quill - 15 freq
collie - 27 freq
cola - 10 freq
gullie - 9 freq
cuil - 9 freq
quell - 5 freq
gaol - 4 freq
claw - 27 freq
goal - 90 freq
cley - 23 freq
gull - 7 freq
kool - 1 freq
gluh - 1 freq
coll - 7 freq
killie - 26 freq
kale - 26 freq
claa - 14 freq
kyle - 80 freq
gully - 16 freq
gall - 4 freq
coul - 56 freq
cowl - 8 freq
goul - 3 freq
clay - 21 freq
kille - 1 freq
gowlie - 2 freq
keil - 3 freq
cloo - 1 freq
coil - 1 freq
clew - 3 freq
gul - 4 freq
qual - 3 freq
yclla - 1 freq
clie - 2 freq
quail - 5 freq
gal - 4 freq
coolly - 2 freq
'kill - 1 freq
guile - 4 freq
'cal' - 1 freq
caall - 2 freq
coli - 1 freq
gail - 3 freq
kelly - 144 freq
keily - 1 freq
goalie - 11 freq
kilo - 1 freq
kïll - 13 freq
coul' - 2 freq
gol' - 1 freq
galley - 4 freq
'cool - 1 freq
goal' - 1 freq
'kail - 2 freq
killy - 9 freq
cöl - 2 freq
gly - 1 freq
col - 11 freq
gael - 2 freq
gl - 2 freq
'kylie' - 1 freq
coallie - 4 freq
cl - 5 freq
calle - 2 freq
kael - 5 freq
kill- - 3 freq
quyhle - 1 freq
cule - 1 freq
kell - 1 freq
kiyl - 2 freq
gool - 1 freq
glaw - 2 freq
golly - 4 freq
kloo - 1 freq
colly - 1 freq
gloy - 1 freq
claie - 1 freq
gallae - 1 freq
galla - 2 freq
coull - 27 freq
kühl - 1 freq
gla - 1 freq
cöllie - 1 freq
galle - 1 freq
go-o-o-o-o-al - 1 freq
go-o-o-o-o-o-o-o-al - 1 freq
kül - 1 freq
kellie - 1 freq
keele - 1 freq
cull - 7 freq
kuala - 2 freq
kele - 1 freq
glie - 1 freq
gle - 4 freq
€œqually - 1 freq
glee- - 1 freq
goolie - 1 freq
clio - 1 freq
caul' - 5 freq
€˜coyle - 3 freq
caley - 4 freq
€œcall - 2 freq
€˜cool - 2 freq
€˜cooool - 1 freq
coalie - 1 freq
koala - 2 freq
cowley - 1 freq
kull - 3 freq
kali - 1 freq
kula - 1 freq
cooly - 1 freq
caaaal - 1 freq
glu - 1 freq
qlw - 1 freq
cole - 1 freq
cala - 1 freq
ycl - 1 freq
‘glow’ - 1 freq
caulÂ’ - 1 freq
kl - 4 freq
‘goal’ - 1 freq
‘cool’ - 1 freq
gaal - 1 freq
kyl - 1 freq
cali - 3 freq
cla - 1 freq
qually - 1 freq
cklly - 1 freq
clo - 1 freq
coley - 1 freq
khloe - 1 freq
kylie - 1 freq
calloo - 2 freq
ykal - 1 freq
'call' - 1 freq
callie - 1 freq
kwlu - 1 freq
glé - 1 freq
kleo - 1 freq
COLL
Time to execute Levenshtein function - 0.514411 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.182773 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029248 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.166634 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000927 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.