A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to college in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
college (0) - 114 freq
coillege (1) - 1 freq
colledge (1) - 1 freq
colleges (1) - 4 freq
collage (1) - 1 freq
colleg (1) - 2 freq
collogue (2) - 62 freq
coalese (2) - 1 freq
fecollege (2) - 1 freq
allege (2) - 1 freq
colleague (2) - 15 freq
college's (2) - 2 freq
collect (2) - 29 freq
cortege (2) - 1 freq
colline (2) - 1 freq
collate (2) - 1 freq
colleck (2) - 11 freq
collie (2) - 27 freq
colled (2) - 2 freq
collages (2) - 1 freq
tollie (3) - 3 freq
cole (3) - 1 freq
coolie (3) - 3 freq
liege (3) - 4 freq
collared (3) - 1 freq
college (0) - 114 freq
collage (1) - 1 freq
colleg (1) - 2 freq
coillege (1) - 1 freq
colleague (2) - 15 freq
colledge (2) - 1 freq
colleges (2) - 4 freq
collogue (2) - 62 freq
collate (3) - 1 freq
colline (3) - 1 freq
colled (3) - 2 freq
collages (3) - 1 freq
collie (3) - 27 freq
fecollege (3) - 1 freq
allege (3) - 1 freq
collar (4) - 72 freq
cleg (4) - 12 freq
collum (4) - 1 freq
colliery (4) - 3 freq
collogued (4) - 4 freq
collier (4) - 2 freq
colly (4) - 1 freq
village (4) - 163 freq
ceolbeg (4) - 1 freq
collab (4) - 1 freq
SoundEx code - C420
cleg - 12 freq
claes - 582 freq
clock - 197 freq
close - 499 freq
claws - 31 freq
cleek - 31 freq
cless - 131 freq
claggy - 12 freq
collogue - 62 freq
cloak - 30 freq
coils - 11 freq
clase - 8 freq
cleuks - 18 freq
cleck - 10 freq
cleuch - 18 freq
cailleach - 9 freq
cellic - 11 freq
click - 132 freq
class - 452 freq
claik - 24 freq
cleuk - 3 freq
clocks - 18 freq
clock's - 4 freq
clack - 9 freq
college - 114 freq
chiels - 112 freq
clyes - 6 freq
calls - 35 freq
chalk - 25 freq
coals - 18 freq
chiel's - 17 freq
clash - 41 freq
claiss - 3 freq
cleys - 4 freq
cheils - 33 freq
cloack - 20 freq
clyse - 10 freq
clues - 7 freq
claggie - 4 freq
clak - 2 freq
cleik - 9 freq
cools - 6 freq
cells - 17 freq
claise - 11 freq
cluke - 1 freq
closs - 61 freq
clooks - 4 freq
cheels - 2 freq
colleague - 15 freq
clag - 4 freq
clos - 14 freq
colic - 4 freq
claick - 1 freq
cloase - 2 freq
clossie - 1 freq
claosie - 1 freq
chile's - 1 freq
cliché - 14 freq
chills - 7 freq
cliche - 5 freq
clique - 3 freq
coalesce - 5 freq
'coalesce - 2 freq
clogs - 10 freq
clok - 2 freq
claus - 17 freq
class' - 3 freq
chalky - 2 freq
cloacks - 5 freq
clais - 2 freq
clese - 1 freq
claas - 29 freq
cleg's - 1 freq
caalls - 1 freq
colleck - 11 freq
clause - 13 freq
coulg - 1 freq
classy - 3 freq
'coleeks' - 1 freq
cloaks - 8 freq
class's - 2 freq
colleg - 2 freq
clasg - 1 freq
clicky - 41 freq
cleiks - 4 freq
clicks - 8 freq
cleeks - 12 freq
cloke - 3 freq
clegs - 2 freq
callous - 2 freq
cauls - 1 freq
culls - 2 freq
cluck - 2 freq
cello's - 1 freq
cleggs - 3 freq
closey - 1 freq
claase - 1 freq
cllsca - 2 freq
calais - 1 freq
cheil's - 4 freq
collage - 1 freq
chloe's - 1 freq
claggey - 1 freq
cell's - 1 freq
clews - 1 freq
clays - 2 freq
caals - 2 freq
clog - 2 freq
coalese - 1 freq
closse - 1 freq
closs' - 1 freq
'close' - 1 freq
claag - 1 freq
clusks - 1 freq
closie - 5 freq
'cheils' - 1 freq
coles - 4 freq
colls - 1 freq
calico - 1 freq
chiels' - 1 freq
clug - 1 freq
collecks - 1 freq
culloes - 1 freq
coalhoose - 3 freq
clecks - 1 freq
callus - 1 freq
clyack - 6 freq
coillege - 1 freq
collie's - 1 freq
clouse - 1 freq
clek - 1 freq
chelsea - 1 freq
€™clock - 12 freq
colas - 1 freq
clish - 1 freq
cliquey - 1 freq
clook - 3 freq
clize - 2 freq
cheales - 1 freq
clegg - 2 freq
€™cloak - 3 freq
€™cloack - 5 freq
clooky - 1 freq
click's - 1 freq
clesh - 1 freq
'click' - 1 freq
claiggs - 1 freq
cuils - 1 freq
€™cksheils - 1 freq
challice - 1 freq
cloos - 1 freq
€œclaes - 1 freq
collies - 1 freq
clc - 1 freq
cligeey - 90 freq
clouseau - 1 freq
colz - 10 freq
clash' - 1 freq
cullough - 1 freq
chloegee - 1 freq
celic - 1 freq
cellik - 1 freq
cliq - 1 freq
cwilk - 2 freq
claes' - 1 freq
clx - 1 freq
clue's - 1 freq
clokkie - 1 freq
clouise - 10 freq
cklj - 1 freq
cwilso - 1 freq
MetaPhone code - KLJ
college - 114 freq
cludgie - 34 freq
cludge - 1 freq
gaeilge - 2 freq
collage - 1 freq
gaelige - 2 freq
colledge - 1 freq
gledge - 1 freq
coillege - 1 freq
cludgy - 1 freq
cligeey - 90 freq
cludgey - 1 freq
cklj - 1 freq
COLLEGE
Time to execute Levenshtein function - 0.195624 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.386365 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030203 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037513 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000912 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.