A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow


Similar words to combo in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
combo (0) - 5 freq
combs (1) - 7 freq
comb (1) - 19 freq
combe (1) - 1 freq
compo (1) - 2 freq
como (1) - 1 freq
coman (2) - 41 freq
csmb (2) - 1 freq
convo (2) - 2 freq
comms (2) - 1 freq
coyb (2) - 1 freq
tomboy (2) - 1 freq
cmo (2) - 1 freq
crimbo (2) - 6 freq
congo (2) - 1 freq
common (2) - 301 freq
comp (2) - 3 freq
comin (2) - 1066 freq
costo (2) - 1 freq
com (2) - 134 freq
dumbo (2) - 2 freq
clomb (2) - 1 freq
come (2) - 3162 freq
comfor (2) - 1 freq
tomb (2) - 31 freq
Double Levenshtein (distance in brackets)
combo (0) - 5 freq
combe (1) - 1 freq
comb (1) - 19 freq
camby (2) - 1 freq
como (2) - 1 freq
compo (2) - 2 freq
combs (2) - 7 freq
combed (3) - 3 freq
bombe (3) - 1 freq
rambo (3) - 1 freq
jumbo (3) - 13 freq
cowboy (3) - 18 freq
compy (3) - 1 freq
omb (3) - 3 freq
colombo (3) - 1 freq
comes (3) - 962 freq
combat (3) - 5 freq
comber (3) - 2 freq
citbo (3) - 1 freq
comim (3) - 1 freq
bimbo (3) - 1 freq
crambo (3) - 1 freq
comte (3) - 1 freq
symbo (3) - 1 freq
comon (3) - 1 freq
SoundEx code - C510
comfy - 48 freq
canopy - 10 freq
camp - 53 freq
compo - 2 freq
combo - 5 freq
convo - 2 freq
champ - 17 freq
chomp - 3 freq
convoy - 12 freq
comp - 3 freq
convey - 6 freq
canopie - 1 freq
cump - 1 freq
canapie - 1 freq
comb - 19 freq
comfae - 1 freq
cumfae - 2 freq
camphe - 1 freq
cnvey - 1 freq
connive - 1 freq
'camp - 1 freq
canfoo - 1 freq
campie - 16 freq
cumbie - 1 freq
caimb - 9 freq
cum-by - 1 freq
combe - 1 freq
chimp - 3 freq
csmb - 1 freq
compy - 1 freq
czmpf - 1 freq
conf - 1 freq
camby - 1 freq
MetaPhone code - KM
cam - 2629 freq
come - 3162 freq
game - 648 freq
gamie - 12 freq
cum - 643 freq
came - 899 freq
'come - 73 freq
combo - 5 freq
caum - 36 freq
'c'm - 1 freq
kaim - 13 freq
gum - 19 freq
gammy - 8 freq
com - 134 freq
gammie - 4 freq
'cum - 4 freq
caim - 58 freq
caam - 11 freq
cam' - 5 freq
comb - 19 freq
'caum - 1 freq
coma - 13 freq
kim - 10 freq
'gome - 1 freq
kame - 6 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
gam - 2 freq
co'm - 1 freq
gaime - 1 freq
come' - 2 freq
cammy - 10 freq
kmee - 1 freq
km - 4 freq
gaem - 1 freq
gummy - 6 freq
kum - 8 freq
kam - 24 freq
gm - 10 freq
gmb - 4 freq
kehm - 1 freq
'kum - 1 freq
kaam - 1 freq
goom - 1 freq
cama - 1 freq
gome - 1 freq
cameo - 4 freq
guiami - 1 freq
camm - 1 freq
‘come - 8 freq
koam - 1 freq
“cm - 1 freq
cumbie - 1 freq
caimb - 9 freq
“cum - 4 freq
coum - 2 freq
gme - 45 freq
“come - 32 freq
gamma - 9 freq
gambo - 1 freq
comm - 2 freq
‘cam - 1 freq
combe - 1 freq
“kum - 1 freq
kumbh - 3 freq
como - 1 freq
comme - 3 freq
qmh - 1 freq
cmmh - 1 freq
cm - 13 freq
qmy - 1 freq
cmo - 1 freq
cammay - 1 freq
qwmi - 1 freq
game” - 1 freq
gaim - 1 freq
kom - 2 freq
'comma - 1 freq
comma - 1 freq
camby - 1 freq
qm - 1 freq
kmm - 1 freq
qqmw - 1 freq
game' - 1 freq
cùm - 1 freq
camo - 1 freq
Manually curated - COMBO
Time to execute Levenshtein function - 0.502885 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
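As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this page runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] is the distance between the part of a processed so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a, or keep it if equal
            ))
        previous = current
    return previous[-1]


for word in ("combs", "compo", "coman", "tomboy"):
    print(word, levenshtein("combo", word))
```

Run against combo, this sketch gives 1 for combs and compo and 2 for coman and tomboy, matching the distances listed in the results.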
Time to execute Double Levenshtein function - 0.718612 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with their vowels removed, then adds the two distances together, giving double weight to consonants.
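The idea can be sketched in Python as below. This is a guess at the approach, not the site's own code: the names `strip_vowels` and `double_levenshtein` are made up for illustration, and the vowel set is an assumption. Treating y as a vowel is what reproduces listed values such as camby (2), cowboy (3) and symbo (3).

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("combe", "combs", "camby", "cowboy"):
    print(word, double_levenshtein("combo", word))
```

A vowel-only difference such as combe costs 1, while a consonant difference such as combs is counted in both passes and costs 2.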
Time to execute SoundEx function - 0.067531 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
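For illustration, a minimal Python sketch of the classic Soundex rules follows. It is an assumption that the page uses this standard variant, though the sketch does reproduce the code C510 for the words listed. The function name `soundex` and the handling of non-letters (they are simply skipped) are choices made here.

```python
# Digit for each consonant group; vowels (and y) carry no digit.
SOUNDEX_CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """First letter plus three digits, padded with zeros."""
    # Keep plain a-z letters only, so apostrophes and hyphens are ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same code
        code = SOUNDEX_CODES.get(ch, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated code is counted again
    return result.ljust(4, "0")


for word in ("combo", "comfy", "canopy", "csmb", "'camp"):
    print(word, soundex(word))
```

Each of these prints C510, which is why words as different as comfy and canopy end up in the same SoundEx list as combo.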
Time to execute MetaPhone function - 0.039076 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.035996 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
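A lookup of this kind can be sketched in Python as a plain dictionary. The entries and the name `lookup_lemma` below are hypothetical and only illustrate the shape of such a table; the sole pairing taken from this page is combo giving COMBO.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
# Only "combo" -> "COMBO" comes from the results on this page.
LEXICON = {
    "combo": "COMBO",
    "combos": "COMBO",  # hypothetical inflected form
    "combbo": "COMBO",  # hypothetical typo
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Combo"))
print(lookup_lemma("ablow"))
```

Returning None for a missing word keeps the "not all words are covered" case explicit, so a caller can fall back to one of the algorithmic methods.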