A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to combo in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
combo (0) - 5 freq
combs (1) - 6 freq
compo (1) - 2 freq
combe (1) - 1 freq
como (1) - 1 freq
comb (1) - 18 freq
comet (2) - 12 freq
crambo (2) - 1 freq
comm (2) - 2 freq
coma (2) - 13 freq
clomb (2) - 1 freq
citbo (2) - 1 freq
lambo (2) - 1 freq
combed (2) - 3 freq
gambo (2) - 1 freq
bombs (2) - 25 freq
bombe (2) - 1 freq
csmb (2) - 1 freq
comms (2) - 1 freq
comyn (2) - 3 freq
compt (2) - 1 freq
comon (2) - 1 freq
bimbo (2) - 1 freq
cowboy (2) - 16 freq
camby (2) - 1 freq
combo (0) - 5 freq
combe (1) - 1 freq
comb (1) - 18 freq
como (2) - 1 freq
camby (2) - 1 freq
compo (2) - 2 freq
combs (2) - 6 freq
come (3) - 3111 freq
caimb (3) - 9 freq
comic (3) - 31 freq
cameo (3) - 4 freq
bomb (3) - 30 freq
zomb (3) - 1 freq
jimbo (3) - 1 freq
omb (3) - 3 freq
colombo (3) - 1 freq
cmo (3) - 1 freq
comer (3) - 4 freq
comber (3) - 2 freq
comed (3) - 12 freq
womb (3) - 8 freq
comim (3) - 1 freq
tomboy (3) - 1 freq
cosby (3) - 1 freq
jumbo (3) - 13 freq
SoundEx code - C510
comfy - 46 freq
canopy - 10 freq
camp - 53 freq
compo - 2 freq
combo - 5 freq
convo - 2 freq
champ - 17 freq
chomp - 3 freq
convoy - 12 freq
comp - 3 freq
convey - 5 freq
canopie - 1 freq
cump - 1 freq
comb - 18 freq
comfae - 1 freq
cumfae - 2 freq
camphe - 1 freq
cnvey - 1 freq
connive - 1 freq
'camp - 1 freq
canfoo - 1 freq
campie - 16 freq
cumbie - 1 freq
caimb - 9 freq
cum-by - 1 freq
combe - 1 freq
chimp - 3 freq
csmb - 1 freq
compy - 1 freq
czmpf - 1 freq
conf - 1 freq
camby - 1 freq
MetaPhone code - KM
cam - 2618 freq
come - 3111 freq
game - 642 freq
gamie - 12 freq
cum - 641 freq
came - 890 freq
'come - 73 freq
combo - 5 freq
caum - 36 freq
'c'm - 1 freq
kaim - 13 freq
gum - 18 freq
gammy - 8 freq
com - 131 freq
gammie - 4 freq
'cum - 4 freq
caim - 57 freq
caam - 11 freq
'caum - 1 freq
coma - 13 freq
kim - 10 freq
'gome - 1 freq
comb - 18 freq
kame - 6 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
gam - 2 freq
co'm - 1 freq
gaime - 1 freq
come' - 2 freq
cammy - 10 freq
kmee - 1 freq
cam' - 4 freq
km - 4 freq
gaem - 1 freq
gummy - 6 freq
kum - 8 freq
kam - 24 freq
gm - 10 freq
gmb - 4 freq
kehm - 1 freq
'kum - 1 freq
kaam - 1 freq
goom - 1 freq
cama - 1 freq
gome - 1 freq
cameo - 4 freq
guiami - 1 freq
camm - 1 freq
€˜come - 8 freq
koam - 1 freq
€œcm - 1 freq
cumbie - 1 freq
caimb - 9 freq
€œcum - 4 freq
coum - 2 freq
gme - 45 freq
€œcome - 32 freq
gamma - 9 freq
gambo - 1 freq
comm - 2 freq
€˜cam - 1 freq
combe - 1 freq
€œkum - 1 freq
kumbh - 3 freq
como - 1 freq
comme - 3 freq
qmh - 1 freq
cmmh - 1 freq
cm - 13 freq
qmy - 1 freq
cmo - 1 freq
cammay - 1 freq
qwmi - 1 freq
game” - 1 freq
gaim - 1 freq
kom - 2 freq
'comma - 1 freq
comma - 1 freq
camby - 1 freq
qm - 1 freq
kmm - 1 freq
qqmw - 1 freq
game' - 1 freq
cùm - 1 freq
camo - 1 freq
COMBO
Time to execute Levenshtein function - 0.202413 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.343434 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027212 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.044144 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000821 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.