A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

Similar words to clumsy in Corpus

Results by method: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
clumsy (0) - 10 freq
clumps (2) - 7 freq
clubby (2) - 1 freq
clubs (2) - 102 freq
crums (2) - 1 freq
chums (2) - 13 freq
cluny (2) - 2 freq
cums (2) - 111 freq
glumly (2) - 1 freq
plums (2) - 7 freq
cmsy (2) - 1 freq
cludgy (2) - 1 freq
clams (2) - 1 freq
flimsy (2) - 4 freq
crumey (2) - 2 freq
lumpy (2) - 4 freq
classy (2) - 3 freq
clammy (2) - 4 freq
cluds (2) - 9 freq
lums (2) - 34 freq
clims (2) - 1 freq
caumly (2) - 8 freq
slums (2) - 8 freq
clump (2) - 18 freq
gumsy (2) - 5 freq
Double Levenshtein (summed distance in brackets)
clumsy (0) - 10 freq
clams (2) - 1 freq
clims (2) - 1 freq
clubs (3) - 102 freq
lums (3) - 34 freq
cluds (3) - 9 freq
clammy (3) - 4 freq
slums (3) - 8 freq
clump (3) - 18 freq
calms (3) - 2 freq
clumps (3) - 7 freq
clumsily (3) - 4 freq
clues (3) - 7 freq
climes (3) - 6 freq
classy (3) - 3 freq
cums (3) - 111 freq
chums (3) - 13 freq
crums (3) - 1 freq
cmsy (3) - 1 freq
plums (3) - 7 freq
claims (3) - 32 freq
flimsy (3) - 4 freq
clem (4) - 4 freq
claiss (4) - 3 freq
clouds (4) - 84 freq
SoundEx code - C452
clingin - 21 freq
clingfilm - 2 freq
challenge - 79 freq
clings - 9 freq
cleans - 9 freq
clangin - 6 freq
columcille - 2 freq
clanjamfrie - 9 freq
clowns - 16 freq
clancy - 24 freq
ceilings - 1 freq
clincher - 2 freq
clanked - 1 freq
clancy's - 5 freq
clamjamfry - 4 freq
clanjamfry - 1 freq
columns - 9 freq
clamjamfrae - 1 freq
clung - 14 freq
clinked - 1 freq
clancy'll - 1 freq
clinkumbell - 2 freq
clankin - 6 freq
clinics - 1 freq
calms - 2 freq
clink - 18 freq
clenched - 17 freq
clamjafry - 1 freq
ceiling - 11 freq
callum's - 5 freq
clunkertonies - 1 freq
claims - 32 freq
climactic - 1 freq
clans - 26 freq
clanking - 1 freq
challenged - 19 freq
clang - 12 freq
calling - 15 freq
cleansed - 5 freq
ceilins - 8 freq
clanjamfirie - 1 freq
colonsay - 1 freq
clinkin - 5 freq
clinkers - 2 freq
cuilness - 1 freq
climmcen - 1 freq
clinker - 3 freq
clean-shaven - 2 freq
colonisers - 1 freq
climes - 6 freq
clunk - 8 freq
clumsy - 10 freq
clinched - 1 freq
claumjamfrie - 1 freq
challenges - 8 freq
clumsily - 4 freq
clench - 2 freq
clinically - 3 freq
'clink' - 1 freq
'clamjamfrie' - 1 freq
clims - 1 freq
clane-shaved - 1 freq
challengin - 10 freq
clankit - 2 freq
cleanse - 3 freq
cleansin - 5 freq
colonic - 1 freq
cleanser - 2 freq
colonies - 7 freq
'colonists' - 1 freq
cling - 6 freq
clansmen - 5 freq
columnists - 1 freq
columnist - 1 freq
clencht - 1 freq
clenchit - 1 freq
colin's - 5 freq
clams - 1 freq
cleanest - 1 freq
collins - 3 freq
ceilin's - 4 freq
clingeth - 1 freq
calmness - 3 freq
clan's - 3 freq
clinic - 3 freq
clank - 2 freq
challance - 1 freq
collums - 2 freq
culmas - 2 freq
clenkit - 1 freq
colum's - 5 freq
clangan - 1 freq
clankan - 1 freq
clamjamfrie - 5 freq
clannish - 1 freq
clinky - 1 freq
clingan - 2 freq
challengers - 1 freq
challenging - 4 freq
coulness - 1 freq
clanjamphrie - 1 freq
climacteric - 1 freq
clamjafrie - 1 freq
colonisin - 1 freq
colonised - 3 freq
clonk - 2 freq
clunkarts - 1 freq
clanjamphray - 1 freq
chaillenge - 4 freq
chaillenger - 2 freq
callanish - 1 freq
clunkin - 2 freq
cleansing - 2 freq
clunk-clunky - 1 freq
clunkety - 1 freq
clunk-clunk - 1 freq
colonise - 2 freq
clean's - 1 freq
clamjamfrey - 3 freq
clonca - 1 freq
colonists - 1 freq
colonski - 1 freq
clensing - 1 freq
climax - 2 freq
clink-clatter - 1 freq
clim's - 1 freq
clannies - 1 freq
colonisation - 1 freq
cooling - 1 freq
clones - 1 freq
clenches - 1 freq
clunked - 1 freq
clinkit - 1 freq
clinical - 4 freq
clean-sheet - 1 freq
chulemaster - 1 freq
cling-film - 1 freq
chillingly - 1 freq
clenchin - 2 freq
colonaisan - 1 freq
colinclyne - 8 freq
clynes - 1 freq
clankin' - 1 freq
colinmcgeechan - 2 freq
culling - 2 freq
calliemac - 4 freq
colmcille - 1 freq
callinicos - 1 freq
clangered - 1 freq
clangers - 1 freq
clanger - 1 freq
colinmccredie - 1 freq
callumc - 46 freq
colinmacalba - 1 freq
callumcarson - 1 freq
colinjherd - 1 freq
chileans - 1 freq
callangranny - 1 freq
callumstweets - 2 freq
colinmachamster - 1 freq
cullensewan - 1 freq
clinging - 1 freq
clinician - 1 freq
callumcarsonwlc - 3 freq
clmike - 1 freq
calmac - 1 freq
MetaPhone code - KLMS
calms - 2 freq
callum's - 5 freq
claims - 32 freq
glamis - 6 freq
climes - 6 freq
clumsy - 10 freq
clims - 1 freq
clams - 1 freq
gleems - 1 freq
qualms - 3 freq
gleams - 2 freq
collums - 2 freq
culmas - 2 freq
colum's - 5 freq
glims - 8 freq
glooms - 2 freq
klyms - 1 freq
gloums - 1 freq
gloams - 1 freq
glaims - 1 freq
climbs - 2 freq
clim's - 1 freq
columbus - 1 freq
Manually curated - CLUMSY
Time to execute Levenshtein function - 0.186514 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
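As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen only for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character replacements,
    insertions or deletions needed to turn `a` into `b`."""
    # previous[j] holds the distance between the first i-1 characters
    # of `a` and the first j characters of `b`.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep it
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for neighbour in ("clumsy", "clumps", "flimsy", "clump"):
        print(neighbour, levenshtein("clumsy", neighbour))
```

Only one row of the table is kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.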
Time to execute Double Levenshtein function - 0.374924 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
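The idea can be sketched in Python as below. This is a hedged reconstruction, not the site's own code: the names `levenshtein`, `strip_vowels` and `double_levenshtein` are made up for the example, and treating "y" as a vowel is an assumption, chosen because it reproduces the distances listed above (for instance clams at 2 and clubs at 3).

```python
VOWELS = set("aeiouy")  # assumption: "y" is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed one table row at a time."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Keep only the characters that are not in VOWELS."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant
    skeletons, so consonant changes are counted twice."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for neighbour in ("clams", "clubs", "clumsily", "clouds"):
        print(neighbour, double_levenshtein("clumsy", neighbour))
```

A vowel-only change such as clumsy to clams costs nothing on the second pass, which is why such pairs rank closer here than under plain Levenshtein.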
Time to execute SoundEx function - 0.033310 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
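To show how a word such as clumsy ends up with the code C452, here is a minimal Python sketch of the classic American Soundex rules. It is an illustration only and may differ in edge cases from whatever implementation this site uses; the function name `soundex` and the choice to skip non-letters such as apostrophes and hyphens are assumptions made for the example.

```python
# Consonant groups and their Soundex digits.
_GROUPS = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
           ("l", "4"), ("mn", "5"), ("r", "6"))
_CODES = {ch: digit for letters, digit in _GROUPS for ch in letters}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if `word` has no letters."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = _CODES.get(letters[0], "")  # code of the previous coded letter
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        digit = _CODES.get(ch, "")  # vowels and other letters get no digit
        if digit and digit != last:
            result += digit
            if len(result) == 4:
                break
        last = digit  # a vowel resets `last`, so a repeated code counts again
    return result.ljust(4, "0")  # pad short codes with zeros


if __name__ == "__main__":
    for word in ("clumsy", "challenge", "columns", "callum's"):
        print(word, soundex(word))
```

Because only the first letter and up to three consonant codes survive, long words such as clanjamfrie collapse onto the same code as clumsy, which explains the length of the SoundEx list.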
Time to execute MetaPhone function - 0.037776 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000861 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
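A lookup of this kind can be sketched in Python as a plain dictionary. Everything here is illustrative: the names `LEXICON` and `lookup_lemma` are hypothetical, and apart from clumsy mapping to CLUMSY, which is the result shown above, the entries are invented examples rather than rows from the real lexicon.

```python
from typing import Optional

# Hand-built table mapping word forms to lemmas.
LEXICON = {
    "clumsy": "CLUMSY",    # result shown on this page
    "clumsily": "CLUMSY",  # hypothetical entry
    "clumsie": "CLUMSY",   # hypothetical spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for `word`, or None if it is not covered."""
    return LEXICON.get(word.strip().lower())


if __name__ == "__main__":
    print(lookup_lemma("Clumsy"))
    print(lookup_lemma("ahint"))  # not in this toy table, so None
```

A dictionary lookup does no string comparison across the vocabulary, which is consistent with this method having the shortest execution time reported above.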