A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example "ahint".


Similar words to combine in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
combine (0) - 21 freq
combined (1) - 11 freq
combin (1) - 1 freq
combines (1) - 15 freq
commone (2) - 2 freq
bombing (2) - 2 freq
bombin (2) - 4 freq
commune (2) - 3 freq
comin' (2) - 19 freq
cumbie (2) - 1 freq
cocaine (2) - 6 freq
combinin (2) - 1 freq
comrie (2) - 1 freq
clmbie (2) - 2 freq
zombie (2) - 12 freq
corbie (2) - 27 freq
combe (2) - 1 freq
commin (2) - 8 freq
chmbin (2) - 1 freq
coming (2) - 118 freq
codeine (2) - 1 freq
comlie (2) - 1 freq
comin (2) - 1066 freq
comins (2) - 4 freq
combine's (2) - 1 freq
Double Levenshtein (summed distance in brackets)
combine (0) - 21 freq
combin (1) - 1 freq
combined (2) - 11 freq
combines (2) - 15 freq
comin (3) - 1066 freq
caimbin (3) - 1 freq
columbine (3) - 2 freq
combinin (3) - 1 freq
chmbin (3) - 1 freq
commin (3) - 8 freq
combe (3) - 1 freq
cumbie (3) - 1 freq
bombin (3) - 4 freq
commone (3) - 2 freq
commune (3) - 3 freq
lambin (4) - 19 freq
comman (4) - 4 freq
comon (4) - 1 freq
comb (4) - 19 freq
hambane (4) - 1 freq
cobyn (4) - 1 freq
combs (4) - 7 freq
comber (4) - 2 freq
cabin (4) - 20 freq
campaine (4) - 14 freq
SoundEx code - C515
convention - 31 freq
company - 207 freq
convenor - 3 freq
convent - 3 freq
confined - 9 freq
campioun - 1 freq
chmbin - 1 freq
combine - 21 freq
convinced - 39 freq
combination - 23 freq
'chumpion - 1 freq
convenshun - 1 freq
companion - 26 freq
chimpanzee - 3 freq
companies - 35 freq
comeuppance - 6 freq
companion' - 1 freq
convincin - 19 freq
convince - 41 freq
company's - 2 freq
conventicle - 10 freq
conventicles - 11 freq
companions - 6 freq
companie - 32 freq
champion - 34 freq
confoun - 1 freq
champin - 7 freq
cumpanee - 2 freq
convintion - 1 freq
conveniently - 9 freq
combined - 11 freq
champions - 17 freq
combin - 1 freq
campin - 4 freq
convinces - 1 freq
convenient - 5 freq
conveniences - 3 freq
canavan's - 1 freq
conveyance - 1 freq
connivin - 2 freq
compensation - 10 freq
component - 2 freq
champion's - 2 freq
conventions - 15 freq
cumpany' - 1 freq
combine's - 1 freq
combines - 15 freq
companionship' - 1 freq
'companionship' - 1 freq
confinement - 3 freq
confound - 1 freq
conveenced - 2 freq
come-uppance - 2 freq
championship - 12 freq
comeuppance' - 1 freq
campion - 4 freq
convener - 18 freq
confoondin - 1 freq
confoondit - 2 freq
comventions - 1 freq
compensations - 1 freq
compensate - 2 freq
convenerie - 4 freq
convenerie's - 2 freq
conveneries - 6 freq
compaingen - 3 freq
cumpanie - 4 freq
convinceen - 2 freq
convenience - 3 freq
components - 3 freq
compounds - 8 freq
championschip - 1 freq
championschips - 1 freq
champenoise - 1 freq
conveniece - 1 freq
convene - 3 freq
confinin - 1 freq
champion-eys - 2 freq
combinations - 5 freq
combinin - 1 freq
compound - 9 freq
componit - 1 freq
cammavine - 1 freq
convoyin - 4 freq
componed - 2 freq
compaingens - 3 freq
campainin - 1 freq
campain - 3 freq
campainers - 1 freq
comp'ny - 1 freq
championships - 3 freq
compoonds - 3 freq
campaine - 14 freq
conveinces - 1 freq
confounding - 1 freq
convened - 3 freq
conveence - 1 freq
conventionis - 1 freq
caimbin - 1 freq
compensautioun - 1 freq
cumpanies - 3 freq
compoond - 5 freq
confoundit - 1 freq
cumpany - 1 freq
conventiclers - 1 freq
compoondin - 1 freq
componer - 6 freq
componers - 13 freq
componin - 4 freq
convincinly - 2 freq
conveying - 1 freq
confines - 2 freq
compoondid - 1 freq
convenes - 1 freq
conveyin - 2 freq
companionable - 1 freq
canavan - 2 freq
championin - 2 freq
chumpioney - 1 freq
championey - 1 freq
camping - 3 freq
cwmnfn - 1 freq
comefindme - 1 freq
camping' - 1 freq
compensating - 1 freq
MetaPhone code - KMN
comin - 1066 freq
common - 301 freq
'c'moan - 20 freq
c'mon - 87 freq
c'moan - 34 freq
cowmon - 1 freq
'c'mon - 7 freq
combine - 21 freq
comena - 2 freq
commin - 8 freq
caumin - 2 freq
cumin - 70 freq
'commen' - 1 freq
'commen - 1 freq
comin' - 19 freq
kemnay - 4 freq
coman - 41 freq
cumman - 11 freq
comyn - 3 freq
gamyn - 1 freq
cumen - 1 freq
cummen - 26 freq
c''mon - 1 freq
cummin' - 1 freq
combin - 1 freq
kiemon - 1 freq
cmoan - 1 freq
gammon - 5 freq
'comin - 3 freq
cmon - 18 freq
comman - 4 freq
camin - 4 freq
comeen - 1 freq
cummin - 30 freq
gummin - 1 freq
cuman - 2 freq
coemmaan - 1 freq
'coman - 1 freq
gomin - 21 freq
commone - 2 freq
comunn - 1 freq
comon - 1 freq
caimbin - 1 freq
commoun - 3 freq
“caman - 1 freq
commune - 3 freq
coamon - 1 freq
coman' - 1 freq
gamin - 3 freq
commaun - 1 freq
komin - 1 freq
c’mon - 6 freq
cumoan - 2 freq
gameon - 1 freq
‘c’mon - 1 freq
kemonn - 1 freq
coomannie - 1 freq
caiman - 1 freq
cayman - 1 freq
Manually curated lemma - COMBINE
Time to execute Levenshtein function - 0.188056 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
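As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. The function name `levenshtein` is chosen for this example, and this is not necessarily the code the site itself runs.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] holds the distance between the prefix of a handled so far
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


for word in ("combine", "combined", "comin", "zombie"):
    print(word, levenshtein("combine", word))
```

With "combine" as the query, this gives 0 for combine, 1 for combined, and 2 for comin and zombie, in line with the distances shown in the listing.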
Time to execute Double Levenshtein function - 0.341557 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
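The idea can be sketched in a few lines of Python. This is an illustration only: the helper names are invented for the example, and the vowel set (a, e, i, o, u) is an assumption, since the page does not say which letters the site strips.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace), row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + (ca != cb)))
        previous = current
    return previous[-1]


def strip_vowels(word: str, vowels: str = "aeiou") -> str:
    """Drop vowels; the default vowel set is an assumption of this sketch."""
    return "".join(ch for ch in word if ch.lower() not in vowels)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("combin", "combined", "comin", "lambin"):
    print(word, double_levenshtein("combine", word))
```

Under these assumptions the sketch gives 1 for combin, 2 for combined, 3 for comin and 4 for lambin, the same values as in the listing. A missing vowel costs one point (combin), while a missing or extra consonant costs two (combined).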
Time to execute SoundEx function - 0.027846 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
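To show how very different spellings end up with the same code, here is a minimal Python sketch of the common American Soundex rules. The function name and the handling of non-letters (they are skipped) are choices made for this example, and the site's own implementation may differ in detail.

```python
# Consonant groups and their Soundex digits; vowels, y, h and w have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Four-character code: first letter plus up to three digits."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit is kept
    return result.ljust(4, "0")  # pad short codes with zeros


for word in ("combine", "company", "champion", "cwmnfn"):
    print(word, soundex(word))
```

All four words come out as C515, which is why the SoundEx list groups forms as far apart as company and champion with combine.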
Time to execute MetaPhone function - 0.037314 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000859 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
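A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical, made up for illustration from forms that appear in the listings, and the names `LEMMA_TABLE`, `lookup_lemma` and `variants_of` are not taken from the site.

```python
from typing import Optional

# Hypothetical hand-curated entries: surface form -> lemma.
LEMMA_TABLE = {
    "combine": "COMBINE",
    "combined": "COMBINE",
    "combines": "COMBINE",
    "combinin": "COMBINE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None if the word is not in the table."""
    return LEMMA_TABLE.get(word.lower())


def variants_of(lemma: str) -> list:
    """All surface forms in the table that point at the given lemma."""
    return sorted(w for w, l in LEMMA_TABLE.items() if l == lemma)


print(lookup_lemma("Combined"))  # a word that is covered
print(lookup_lemma("corbie"))    # not covered in this toy table, so None
print(variants_of("COMBINE"))
```

A dictionary lookup does no distance calculation at all, which fits with this method having by far the shortest execution time of the five.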