A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to caimb in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
caimb (0) - 9 freq
climb (1) - 58 freq
caim (1) - 58 freq
caif (2) - 1 freq
chimp (2) - 3 freq
caip (2) - 3 freq
limb (2) - 14 freq
maim (2) - 7 freq
clamb (2) - 2 freq
hairb (2) - 1 freq
haims (2) - 4 freq
caist (2) - 4 freq
aib (2) - 2 freq
camp (2) - 53 freq
caimbin (2) - 1 freq
climbt (2) - 1 freq
gaim (2) - 1 freq
caimbed (2) - 3 freq
caird (2) - 96 freq
cim (2) - 5 freq
cairo (2) - 2 freq
camo (2) - 1 freq
cab (2) - 16 freq
damb (2) - 1 freq
kaimt (2) - 1 freq
caimb (0) - 9 freq
comb (2) - 19 freq
climb (2) - 58 freq
camby (2) - 1 freq
caim (2) - 58 freq
cams (3) - 54 freq
cims (3) - 1 freq
craib (3) - 5 freq
crimbo (3) - 6 freq
cam (3) - 2629 freq
caiman (3) - 1 freq
clomb (3) - 1 freq
csmb (3) - 1 freq
caam (3) - 11 freq
caum (3) - 36 freq
cam' (3) - 5 freq
camz (3) - 1 freq
crumb (3) - 7 freq
damb (3) - 1 freq
cama (3) - 1 freq
lamb (3) - 63 freq
cammy (3) - 10 freq
cab (3) - 16 freq
jamb (3) - 2 freq
crib (3) - 7 freq
SoundEx code - C510
comfy - 48 freq
canopy - 10 freq
camp - 53 freq
compo - 2 freq
combo - 5 freq
convo - 2 freq
champ - 17 freq
chomp - 3 freq
convoy - 12 freq
comp - 3 freq
convey - 6 freq
canopie - 1 freq
cump - 1 freq
canapie - 1 freq
comb - 19 freq
comfae - 1 freq
cumfae - 2 freq
camphe - 1 freq
cnvey - 1 freq
connive - 1 freq
'camp - 1 freq
canfoo - 1 freq
campie - 16 freq
cumbie - 1 freq
caimb - 9 freq
cum-by - 1 freq
combe - 1 freq
chimp - 3 freq
csmb - 1 freq
compy - 1 freq
czmpf - 1 freq
conf - 1 freq
camby - 1 freq
MetaPhone code - KM
cam - 2629 freq
come - 3162 freq
game - 648 freq
gamie - 12 freq
cum - 643 freq
came - 899 freq
'come - 73 freq
combo - 5 freq
caum - 36 freq
'c'm - 1 freq
kaim - 13 freq
gum - 19 freq
gammy - 8 freq
com - 134 freq
gammie - 4 freq
'cum - 4 freq
caim - 58 freq
caam - 11 freq
cam' - 5 freq
comb - 19 freq
'caum - 1 freq
coma - 13 freq
kim - 10 freq
'gome - 1 freq
kame - 6 freq
cum' - 2 freq
©cum - 1 freq
'cam - 9 freq
gam - 2 freq
co'm - 1 freq
gaime - 1 freq
come' - 2 freq
cammy - 10 freq
kmee - 1 freq
km - 4 freq
gaem - 1 freq
gummy - 6 freq
kum - 8 freq
kam - 24 freq
gm - 10 freq
gmb - 4 freq
kehm - 1 freq
'kum - 1 freq
kaam - 1 freq
goom - 1 freq
cama - 1 freq
gome - 1 freq
cameo - 4 freq
guiami - 1 freq
camm - 1 freq
€˜come - 8 freq
koam - 1 freq
€œcm - 1 freq
cumbie - 1 freq
caimb - 9 freq
€œcum - 4 freq
coum - 2 freq
gme - 45 freq
€œcome - 32 freq
gamma - 9 freq
gambo - 1 freq
comm - 2 freq
€˜cam - 1 freq
combe - 1 freq
€œkum - 1 freq
kumbh - 3 freq
como - 1 freq
comme - 3 freq
qmh - 1 freq
cmmh - 1 freq
cm - 13 freq
qmy - 1 freq
cmo - 1 freq
cammay - 1 freq
qwmi - 1 freq
game” - 1 freq
gaim - 1 freq
kom - 2 freq
'comma - 1 freq
comma - 1 freq
camby - 1 freq
qm - 1 freq
kmm - 1 freq
qqmw - 1 freq
game' - 1 freq
cùm - 1 freq
camo - 1 freq
CAIMB
Time to execute Levenshtein function - 0.189020 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.341415 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029112 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039807 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000876 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.