A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to combine in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
combine (0) - 19 freq
combined (1) - 10 freq
combines (1) - 15 freq
coming (2) - 116 freq
combinin (2) - 1 freq
commune (2) - 3 freq
combe (2) - 1 freq
combine's (2) - 1 freq
columbine (2) - 2 freq
comin (2) - 1056 freq
colline (2) - 1 freq
bombing (2) - 2 freq
comlie (2) - 1 freq
zombie (2) - 11 freq
comin' (2) - 19 freq
codeine (2) - 1 freq
cumbie (2) - 1 freq
commin (2) - 8 freq
bombin (2) - 4 freq
corbie (2) - 27 freq
comins (2) - 4 freq
commone (2) - 2 freq
domine (2) - 2 freq
clmbie (2) - 2 freq
cocaine (2) - 6 freq
combine (0) - 19 freq
combines (2) - 15 freq
combined (2) - 10 freq
commone (3) - 2 freq
caimbin (3) - 1 freq
commin (3) - 8 freq
comin (3) - 1056 freq
cumbie (3) - 1 freq
columbine (3) - 2 freq
commune (3) - 3 freq
combinin (3) - 1 freq
bombin (3) - 4 freq
combe (3) - 1 freq
chmbin (3) - 1 freq
combo (4) - 5 freq
climbin (4) - 18 freq
coman (4) - 41 freq
combat (4) - 5 freq
corbyn (4) - 8 freq
comunn (4) - 1 freq
cobyn (4) - 1 freq
cabin (4) - 20 freq
common (4) - 295 freq
campaine (4) - 14 freq
comeen (4) - 1 freq
SoundEx code - C515
convention - 31 freq
company - 206 freq
convenor - 3 freq
convent - 3 freq
confined - 9 freq
campioun - 1 freq
chmbin - 1 freq
combine - 19 freq
convinced - 36 freq
combination - 22 freq
'chumpion - 1 freq
convenshun - 1 freq
companion - 26 freq
chimpanzee - 3 freq
companies - 34 freq
comeuppance - 6 freq
companion' - 1 freq
convincin - 19 freq
convince - 40 freq
company's - 2 freq
conventicle - 10 freq
conventicles - 11 freq
companions - 6 freq
companie - 29 freq
champion - 33 freq
confoun - 1 freq
champin - 7 freq
cumpanee - 2 freq
convintion - 1 freq
comeeventually - 2 freq
conveniently - 8 freq
campin - 4 freq
convinces - 1 freq
convenient - 5 freq
conveniences - 3 freq
canavan's - 1 freq
conveyance - 1 freq
connivin - 2 freq
combined - 10 freq
compensation - 10 freq
component - 2 freq
champion's - 2 freq
conventions - 15 freq
cumpany' - 1 freq
combine's - 1 freq
combines - 15 freq
companionship' - 1 freq
'companionship' - 1 freq
confinement - 3 freq
confound - 1 freq
conveenced - 2 freq
come-uppance - 2 freq
championship - 12 freq
comeuppance' - 1 freq
campion - 4 freq
convener - 18 freq
confoondin - 1 freq
confoondit - 2 freq
comventions - 1 freq
compensations - 1 freq
compensate - 2 freq
convenerie - 4 freq
convenerie's - 2 freq
conveneries - 6 freq
compaingen - 3 freq
cumpanie - 4 freq
convinceen - 2 freq
convenience - 3 freq
components - 3 freq
compounds - 8 freq
champions - 16 freq
championschip - 1 freq
championschips - 1 freq
champenoise - 1 freq
conveniece - 1 freq
convene - 3 freq
confinin - 1 freq
champion-eys - 2 freq
combinations - 5 freq
combinin - 1 freq
compound - 9 freq
componit - 1 freq
cammavine - 1 freq
convoyin - 4 freq
componed - 2 freq
compaingens - 3 freq
campainin - 1 freq
campain - 3 freq
campainers - 1 freq
comp'ny - 1 freq
championships - 3 freq
compoonds - 3 freq
campaine - 14 freq
conveinces - 1 freq
confounding - 1 freq
convened - 3 freq
conveence - 1 freq
conventionis - 1 freq
caimbin - 1 freq
compensautioun - 1 freq
cumpanies - 3 freq
compoond - 5 freq
confoundit - 1 freq
cumpany - 1 freq
conventiclers - 1 freq
compoondin - 1 freq
componer - 6 freq
componers - 13 freq
componin - 4 freq
convincinly - 2 freq
conveying - 1 freq
confines - 2 freq
compoondid - 1 freq
convenes - 1 freq
conveyin - 2 freq
companionable - 1 freq
canavan - 2 freq
championin - 2 freq
chumpioney - 1 freq
championey - 1 freq
camping - 3 freq
cwmnfn - 1 freq
comefindme - 1 freq
camping' - 1 freq
compensating - 1 freq
MetaPhone code - KMN
comin - 1056 freq
common - 295 freq
'c'moan - 20 freq
c'mon - 84 freq
c'moan - 34 freq
cowmon - 1 freq
'c'mon - 7 freq
combine - 19 freq
comena - 2 freq
commin - 8 freq
caumin - 2 freq
cumin - 70 freq
'commen' - 1 freq
'commen - 1 freq
comin' - 19 freq
kemnay - 4 freq
coman - 41 freq
cumman - 11 freq
comyn - 3 freq
gamyn - 1 freq
cumen - 1 freq
cummen - 26 freq
kiemon - 1 freq
cmoan - 1 freq
gammon - 5 freq
'comin - 3 freq
cmon - 18 freq
comman - 4 freq
camin - 4 freq
comeen - 1 freq
cummin - 30 freq
gummin - 1 freq
cuman - 2 freq
coemmaan - 1 freq
'coman - 1 freq
gomin - 21 freq
commone - 2 freq
comunn - 1 freq
comon - 1 freq
caimbin - 1 freq
commoun - 3 freq
€œcaman - 1 freq
commune - 3 freq
coamon - 1 freq
coman' - 1 freq
gamin - 3 freq
commaun - 1 freq
komin - 1 freq
cÂ’mon - 6 freq
cumoan - 2 freq
gameon - 1 freq
‘c’mon - 1 freq
kemonn - 1 freq
coomannie - 1 freq
caiman - 1 freq
cayman - 1 freq
COMBINE
Time to execute Levenshtein function - 0.232629 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.399549 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.029623 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.042571 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001175 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.