A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to colonies in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
colonies (0) - 7 freq
colonised (2) - 3 freq
cojones (2) - 1 freq
cronies (2) - 41 freq
coories (2) - 5 freq
colonists (2) - 1 freq
colonials (2) - 1 freq
colonise (2) - 2 freq
colonial (2) - 8 freq
goonies (2) - 3 freq
polonie (2) - 1 freq
colonisers (2) - 1 freq
calories (2) - 11 freq
colonel (2) - 8 freq
colonels (2) - 1 freq
collies (2) - 1 freq
cookies (2) - 10 freq
boonies (2) - 1 freq
clones (2) - 1 freq
loonies (2) - 3 freq
felonies (2) - 1 freq
toonies (2) - 1 freq
coonties (2) - 5 freq
colonic (2) - 1 freq
cooties (2) - 1 freq
colonies (0) - 7 freq
colonise (2) - 2 freq
clones (2) - 1 freq
collies (3) - 1 freq
colonel (3) - 8 freq
colonels (3) - 1 freq
felonies (3) - 1 freq
colonsay (3) - 1 freq
clynes (3) - 1 freq
colonic (3) - 1 freq
calories (3) - 11 freq
loonies (3) - 3 freq
colonials (3) - 1 freq
colonial (3) - 8 freq
colonised (3) - 3 freq
cronies (3) - 41 freq
cojones (3) - 1 freq
collins (4) - 3 freq
colourins (4) - 1 freq
cones (4) - 3 freq
colin's (4) - 5 freq
clans (4) - 26 freq
closes (4) - 27 freq
cornes (4) - 1 freq
cloves (4) - 4 freq
SoundEx code - C452
clingin - 21 freq
clingfilm - 2 freq
challenge - 79 freq
clings - 9 freq
cleans - 9 freq
clangin - 6 freq
columcille - 2 freq
clanjamfrie - 9 freq
clowns - 16 freq
clancy - 24 freq
ceilings - 1 freq
clincher - 2 freq
clanked - 1 freq
clancy's - 5 freq
clamjamfry - 4 freq
clanjamfry - 1 freq
columns - 9 freq
clamjamfrae - 1 freq
clung - 14 freq
clinked - 1 freq
clancy'll - 1 freq
clinkumbell - 2 freq
clankin - 6 freq
clinics - 1 freq
calms - 2 freq
clink - 18 freq
clenched - 18 freq
clamjafry - 1 freq
ceiling - 11 freq
callum's - 5 freq
clunkertonies - 1 freq
claims - 32 freq
climactic - 1 freq
clans - 26 freq
clanking - 1 freq
challenged - 19 freq
clang - 12 freq
calling - 15 freq
cleansed - 5 freq
ceilins - 8 freq
clanjamfirie - 1 freq
colonsay - 1 freq
clinkin - 5 freq
clinkers - 2 freq
cuilness - 1 freq
climmcen - 1 freq
clinker - 3 freq
clean-shaven - 2 freq
colonisers - 1 freq
climes - 6 freq
clunk - 8 freq
clumsy - 10 freq
clinical - 6 freq
cling - 7 freq
colonisation - 2 freq
clench - 3 freq
cleansin - 6 freq
clinched - 1 freq
claumjamfrie - 1 freq
challenges - 8 freq
clumsily - 4 freq
clinically - 3 freq
'clink' - 1 freq
'clamjamfrie' - 1 freq
clims - 1 freq
clane-shaved - 1 freq
challengin - 10 freq
clankit - 2 freq
cleanse - 3 freq
colonic - 1 freq
cleanser - 2 freq
colonies - 7 freq
'colonists' - 1 freq
clansmen - 5 freq
columnists - 1 freq
columnist - 1 freq
clencht - 1 freq
clenchit - 1 freq
colin's - 5 freq
clams - 1 freq
cleanest - 1 freq
collins - 3 freq
ceilin's - 4 freq
clingeth - 1 freq
calmness - 3 freq
clan's - 3 freq
clinic - 3 freq
clank - 2 freq
challance - 1 freq
collums - 2 freq
culmas - 2 freq
clenkit - 1 freq
colum's - 5 freq
clangan - 1 freq
clankan - 1 freq
clamjamfrie - 5 freq
clannish - 1 freq
clinky - 1 freq
clingan - 2 freq
challengers - 1 freq
challenging - 4 freq
coulness - 1 freq
clanjamphrie - 1 freq
climacteric - 1 freq
clamjafrie - 1 freq
colonisin - 1 freq
colonised - 3 freq
clonk - 2 freq
clunkarts - 1 freq
clanjamphray - 1 freq
chaillenge - 4 freq
chaillenger - 2 freq
callanish - 1 freq
clunkin - 2 freq
cleansing - 2 freq
clunk-clunky - 1 freq
clunkety - 1 freq
clunk-clunk - 1 freq
colonise - 2 freq
clean's - 1 freq
clamjamfrey - 3 freq
clonca - 1 freq
colonists - 1 freq
colonski - 1 freq
clensing - 1 freq
climax - 2 freq
clink-clatter - 1 freq
clim's - 1 freq
clannies - 1 freq
cooling - 1 freq
clones - 1 freq
clenches - 1 freq
clunked - 1 freq
clinkit - 1 freq
clean-sheet - 1 freq
chulemaster - 1 freq
cling-film - 1 freq
chillingly - 1 freq
clenchin - 2 freq
colonaisan - 1 freq
colinclyne - 8 freq
clynes - 1 freq
clankin' - 1 freq
colinmcgeechan - 2 freq
culling - 2 freq
calliemac - 4 freq
colmcille - 1 freq
callinicos - 1 freq
clangered - 1 freq
clangers - 1 freq
clanger - 1 freq
colinmccredie - 1 freq
callumc - 46 freq
colinmacalba - 1 freq
callumcarson - 1 freq
colinjherd - 1 freq
chileans - 1 freq
callangranny - 1 freq
callumstweets - 2 freq
colinmachamster - 1 freq
cullensewan - 1 freq
clinging - 1 freq
clinician - 1 freq
callumcarsonwlc - 3 freq
clmike - 1 freq
calmac - 1 freq
MetaPhone code - KLNS
glance - 47 freq
glence - 4 freq
cleans - 9 freq
glen's - 4 freq
clowns - 16 freq
clancy - 24 freq
gallons - 14 freq
glenn's - 1 freq
clans - 26 freq
glens - 44 freq
killins - 1 freq
colonsay - 1 freq
cuilness - 1 freq
cleanse - 3 freq
colonies - 7 freq
colin's - 5 freq
collins - 3 freq
clan's - 3 freq
gleans - 1 freq
coulness - 1 freq
kilns - 1 freq
glency - 1 freq
colonise - 2 freq
clean's - 1 freq
clannies - 1 freq
clones - 1 freq
clynes - 1 freq
'glance' - 1 freq
klines - 1 freq
COLONIES
Time to execute Levenshtein function - 0.192290 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.377860 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027497 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038562 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000924 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.