A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to colombia in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
colombia (0) - 3 freq
columba (2) - 2 freq
colombo (2) - 1 freq
columbae (3) - 2 freq
clmbie (3) - 2 freq
columbus (3) - 1 freq
colonial (3) - 8 freq
combin (3) - 1 freq
clomb (3) - 1 freq
colourin (3) - 3 freq
olympia (3) - 1 freq
columbine (3) - 2 freq
climbit (3) - 3 freq
columbo (3) - 2 freq
climbin (3) - 18 freq
colonic (3) - 1 freq
colourit (3) - 1 freq
coimbra (3) - 1 freq
coortin (4) - 18 freq
bolotana (4) - 1 freq
solomons (4) - 2 freq
clamber (4) - 5 freq
climbing (4) - 3 freq
clootin (4) - 2 freq
clourin (4) - 1 freq

Double Levenshtein (combined distance in brackets)
colombia (0) - 3 freq
colombo (2) - 1 freq
columba (2) - 2 freq
columbo (3) - 2 freq
clomb (3) - 1 freq
clmbie (3) - 2 freq
columbae (3) - 2 freq
climbit (4) - 3 freq
columbine (4) - 2 freq
climbin (4) - 18 freq
clamb (4) - 2 freq
columbus (4) - 1 freq
climb (4) - 58 freq
aplomb (5) - 1 freq
column (5) - 20 freq
limbie (5) - 1 freq
calmit (5) - 2 freq
combe (5) - 1 freq
calmin (5) - 3 freq
collab (5) - 1 freq
climin (5) - 6 freq
colum (5) - 2 freq
colab (5) - 1 freq
combo (5) - 5 freq
climbt (5) - 1 freq

SoundEx code - C451
climbed - 33 freq
clumpit - 1 freq
clampit - 4 freq
climb - 58 freq
clamp - 3 freq
climbin - 18 freq
clamberin - 4 freq
columbo - 2 freq
clump - 18 freq
clamped - 7 freq
clamberit - 1 freq
climbit - 3 freq
climbin' - 1 freq
clump's - 1 freq
clumps - 8 freq
climbing - 3 freq
clampin - 3 freq
columbine - 2 freq
climped - 1 freq
climpt - 1 freq
clamp't - 1 freq
clambert - 2 freq
clampetts - 1 freq
clampt - 1 freq
clomb - 1 freq
'climb - 1 freq
clamber - 5 freq
clump-clump - 1 freq
climb'd - 1 freq
calumpniat - 1 freq
climban - 2 freq
clean-forgot - 1 freq
clampin-up's - 1 freq
columba - 2 freq
claimbert - 1 freq
columbae - 2 freq
clamb - 2 freq
climbs - 2 freq
colombo - 1 freq
clomph - 1 freq
clumpin - 1 freq
climbers - 2 freq
clambers - 2 freq
clambering - 1 freq
clambered - 3 freq
climbt - 1 freq
columbus - 1 freq
clampdoon - 1 freq
colinforman - 1 freq
colinphoenix - 1 freq
colinburnett - 28 freq
colinbell - 13 freq
colombia - 3 freq
clmbie - 2 freq
columbaheritage - 2 freq

MetaPhone code - KLM
clim - 33 freq
climb - 58 freq
glum - 10 freq
calm - 133 freq
clammy - 4 freq
'calm - 10 freq
gloam - 5 freq
clam - 10 freq
claim - 80 freq
columbo - 2 freq
caulm - 5 freq
clime - 7 freq
gloom - 32 freq
callum - 81 freq
gloomy - 6 freq
glaum - 4 freq
glim - 8 freq
glam - 3 freq
qualm - 2 freq
'claim - 4 freq
calum - 12 freq
caalm - 1 freq
gleam - 7 freq
claim' - 1 freq
clomb - 1 freq
'climb - 1 freq
collum - 1 freq
clame - 1 freq
colum - 2 freq
clem - 4 freq
gloum - 1 freq
glaim - 4 freq
golem - 1 freq
gaulum - 1 freq
gaulum' - 2 freq
columba - 2 freq
climm - 5 freq
glame - 1 freq
columbae - 2 freq
clamb - 2 freq
cleem - 1 freq
colombo - 1 freq
“calm - 1 freq
gollum - 1 freq
‘calm - 2 freq
claem - 2 freq
‘gloom’ - 1 freq
colombia - 3 freq
clmbie - 2 freq

Manually curated
COLOMBIA
Time to execute Levenshtein function - 0.240216 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
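As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code behind this page; the function name is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("colombia", "columba"))
```

For the query word on this page, the sketch gives 2 for columba and colombo and 3 for columbus, in line with the Levenshtein results listed.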
Time to execute Double Levenshtein function - 0.415011 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
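The idea can be sketched in Python as follows. This is an illustrative reconstruction, not the site's code: the helper names are invented for the example, and the vowel set (a, e, i, o, u only) is an assumption that happens to reproduce the scores listed for colombia.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("colombia", "columbus"))
```

Because colombia and colombo share the skeleton clmb, only their vowel differences count, giving 2. For columbus the extra s is counted twice, once in each pass, which lifts the score from 3 to 4.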
Time to execute SoundEx function - 0.029419 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
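A minimal Python sketch of the classic American Soundex rules shows how such a code is built. It is an assumption that the site uses these exact rules; a library implementation may differ in edge cases such as punctuation or non-English letters.

```python
# Consonant groups of classic Soundex; vowels, y, h and w carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two equal codes
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated code is kept
    return result.ljust(4, "0")  # pad short codes with zeros


print(soundex("colombia"))
```

Under these rules colombia, climb and columbus all reduce to C451, which is why such different-looking words appear together in the SoundEx results.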
Time to execute MetaPhone function - 0.039640 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001025 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
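In code, such a lookup can be as simple as a dictionary. The sketch below is hypothetical: the table name, the function name and the second entry are invented for illustration, and only the colombia to COLOMBIA pairing comes from the result shown on this page.

```python
from typing import Optional

# Hypothetical hand-built lexicon: spelling variant -> lemma.
LEXICON = {
    "colombia": "COLOMBIA",  # pairing shown in the results on this page
    "columbia": "COLOMBIA",  # invented typo entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


print(lookup_lemma("Colombia"))
print(lookup_lemma("sonsie"))
```

A single dictionary lookup needs no string comparison against the rest of the corpus, which would fit with this method having the shortest execution time reported above.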