A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to combs in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
combs (0) - 7 freq
combe (1) - 1 freq
bombs (1) - 26 freq
comms (1) - 1 freq
combo (1) - 5 freq
tombs (1) - 9 freq
comes (1) - 962 freq
comb (1) - 19 freq
cobs (1) - 1 freq
cous (2) - 1 freq
coafs (2) - 1 freq
coils (2) - 11 freq
conns (2) - 1 freq
comics (2) - 20 freq
bomb (2) - 31 freq
tom's (2) - 5 freq
sobs (2) - 13 freq
coma (2) - 13 freq
domms (2) - 2 freq
compo (2) - 2 freq
codds (2) - 2 freq
homes (2) - 10 freq
corps (2) - 13 freq
costs (2) - 46 freq
camby (2) - 1 freq
combs (0) - 7 freq
comes (2) - 962 freq
comb (2) - 19 freq
tombs (2) - 9 freq
cambus (2) - 3 freq
cobs (2) - 1 freq
combo (2) - 5 freq
combe (2) - 1 freq
bombs (2) - 26 freq
comms (2) - 1 freq
chibs (3) - 5 freq
comins (3) - 4 freq
cum's (3) - 5 freq
combat (3) - 5 freq
crubs (3) - 2 freq
clubs (3) - 104 freq
comets (3) - 12 freq
choobs (3) - 2 freq
combin (3) - 1 freq
jambs (3) - 1 freq
crumbs (3) - 44 freq
cumms (3) - 2 freq
come's (3) - 1 freq
mbs (3) - 2 freq
lambs (3) - 57 freq
SoundEx code - C512
canvas - 35 freq
canvases - 2 freq
confesst - 4 freq
confused - 49 freq
confusion - 41 freq
confusin - 13 freq
convick - 5 freq
convicks' - 1 freq
compass - 16 freq
'champagne - 1 freq
chimps - 2 freq
confess - 21 freq
confession - 10 freq
convoys - 8 freq
comeback - 8 freq
confuse - 8 freq
conviction - 11 freq
compassion - 6 freq
campaign - 103 freq
campsie - 2 freq
compescet - 2 freq
canvassers - 1 freq
canvassed - 2 freq
canvased - 1 freq
champagne - 15 freq
confessed - 8 freq
convicted - 3 freq
composin - 3 freq
confuision - 2 freq
composed - 15 freq
cunfuchlt - 1 freq
cunfusen - 1 freq
cunfeyoosed - 1 freq
cumpoased - 1 freq
cunfuchult - 1 freq
combs - 7 freq
campsites - 1 freq
campsies - 1 freq
confessin - 3 freq
come-affs - 1 freq
come-ups - 1 freq
caunopies - 1 freq
campaignin - 19 freq
compose - 6 freq
combust - 4 freq
composure - 4 freq
compassionless - 1 freq
compasssionate - 1 freq
compact - 7 freq
confuses - 3 freq
compos-mentis - 1 freq
confection - 1 freq
camps - 15 freq
campaigns - 15 freq
canvis - 1 freq
composer - 3 freq
campsite - 1 freq
cannabis - 1 freq
canapés - 1 freq
camp-site - 1 freq
confiscated - 2 freq
convictions - 1 freq
champs - 2 freq
confesses - 1 freq
campus - 5 freq
composit - 2 freq
composietion - 1 freq
cambus - 3 freq
co-investin - 1 freq
confaise - 2 freq
compasses - 2 freq
confesso - 1 freq
campos - 1 freq
confusan - 3 freq
champaign - 1 freq
composeition - 2 freq
compaignion - 1 freq
confuise - 1 freq
coanvicts - 1 freq
coanvict - 3 freq
convickit - 1 freq
coanvics - 1 freq
cinfession - 1 freq
composeetions - 1 freq
cambuslang - 2 freq
campie's - 1 freq
compostela - 1 freq
compost - 7 freq
campaiging - 1 freq
campaigning - 4 freq
confaised - 1 freq
€œcnapag - 1 freq
confessouris - 1 freq
confuised - 1 freq
confuisin - 1 freq
campaigner - 3 freq
compassionate - 1 freq
comfiest - 1 freq
canvassin - 3 freq
convictit - 2 freq
compositional - 1 freq
composition - 3 freq
configuration - 1 freq
compostit - 1 freq
canopies - 2 freq
campaignt - 1 freq
confiscating - 1 freq
canapes - 1 freq
compis - 1 freq
composing - 1 freq
confucius - 1 freq
€˜convicts - 1 freq
convicts - 1 freq
canvas-covert - 1 freq
campaigned - 1 freq
campaigners - 7 freq
confiscate - 1 freq
canvass - 1 freq
chin-ups - 1 freq
confessions - 2 freq
chimpscum - 1 freq
campusprjo - 1 freq
composureÂ… - 1 freq
comfies - 2 freq
compston - 2 freq
canvassing - 1 freq
ccampbauslangs - 1 freq
campesina - 1 freq
cammvgwg - 1 freq
confusing - 1 freq
champagnes - 1 freq
cumback - 1 freq
convos - 1 freq
cambuslangsteve - 1 freq
MetaPhone code - KMS
comes - 962 freq
gamie's - 2 freq
cums - 111 freq
gums - 16 freq
goams - 2 freq
games - 248 freq
cams - 54 freq
kames - 7 freq
gumsy - 5 freq
game's - 8 freq
gams - 1 freq
kaims - 2 freq
gummies - 2 freq
cum's - 5 freq
combs - 7 freq
kemis - 1 freq
gooms - 4 freq
games' - 2 freq
'games - 2 freq
gamies - 11 freq
'gamies' - 1 freq
cumes - 1 freq
come's - 1 freq
kums - 5 freq
kimsey - 1 freq
cambus - 3 freq
cumms - 2 freq
cammy's - 1 freq
gmse - 1 freq
kemes - 1 freq
commas - 3 freq
cmsy - 1 freq
kms - 1 freq
kmze - 1 freq
qyms - 1 freq
comms - 1 freq
gomez - 1 freq
camz - 1 freq
COMBS
Time to execute Levenshtein function - 0.213114 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.397232 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028458 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038631 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000930 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.