A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to clunk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
clunk (0) - 8 freq
clink (1) - 18 freq
crunk (1) - 5 freq
plunk (1) - 3 freq
cluck (1) - 2 freq
cluny (1) - 2 freq
clung (1) - 14 freq
clank (1) - 2 freq
slunk (1) - 8 freq
chunk (1) - 17 freq
clonk (1) - 2 freq
bunk (2) - 28 freq
cln (2) - 1 freq
cunt (2) - 447 freq
clunked (2) - 1 freq
glurk (2) - 1 freq
blun (2) - 1 freq
hunk (2) - 5 freq
trunk (2) - 18 freq
clusks (2) - 1 freq
cleck (2) - 10 freq
cauk (2) - 5 freq
luck (2) - 327 freq
lank (2) - 6 freq
alunt (2) - 1 freq
clunk (0) - 8 freq
clonk (1) - 2 freq
clank (1) - 2 freq
clink (1) - 18 freq
slunk (2) - 8 freq
clinky (2) - 1 freq
chunk (2) - 17 freq
plunk (2) - 3 freq
clung (2) - 14 freq
crunk (2) - 5 freq
cluck (2) - 2 freq
cluny (2) - 2 freq
conk (3) - 2 freq
cling (3) - 7 freq
chink (3) - 13 freq
blink (3) - 91 freq
cleek (3) - 31 freq
cluke (3) - 1 freq
flank (3) - 12 freq
cline (3) - 4 freq
clans (3) - 26 freq
cleuk (3) - 3 freq
clone (3) - 5 freq
clack (3) - 9 freq
plank (3) - 29 freq
SoundEx code - C452
clingin - 21 freq
clingfilm - 2 freq
challenge - 79 freq
clings - 9 freq
cleans - 9 freq
clangin - 6 freq
columcille - 2 freq
clanjamfrie - 9 freq
clowns - 16 freq
clancy - 24 freq
ceilings - 1 freq
clincher - 2 freq
clanked - 1 freq
clancy's - 5 freq
clamjamfry - 4 freq
clanjamfry - 1 freq
columns - 9 freq
clamjamfrae - 1 freq
clung - 14 freq
clinked - 1 freq
clancy'll - 1 freq
clinkumbell - 2 freq
clankin - 6 freq
clinics - 1 freq
calms - 2 freq
clink - 18 freq
clenched - 18 freq
clamjafry - 1 freq
ceiling - 11 freq
callum's - 5 freq
clunkertonies - 1 freq
claims - 32 freq
climactic - 1 freq
clans - 26 freq
clanking - 1 freq
challenged - 19 freq
clang - 12 freq
calling - 15 freq
cleansed - 5 freq
ceilins - 8 freq
clanjamfirie - 1 freq
colonsay - 1 freq
clinkin - 5 freq
clinkers - 2 freq
cuilness - 1 freq
climmcen - 1 freq
clinker - 3 freq
clean-shaven - 2 freq
colonisers - 1 freq
climes - 6 freq
clunk - 8 freq
clumsy - 10 freq
clinical - 6 freq
cling - 7 freq
colonisation - 2 freq
clench - 3 freq
cleansin - 6 freq
clinched - 1 freq
claumjamfrie - 1 freq
challenges - 8 freq
clumsily - 4 freq
clinically - 3 freq
'clink' - 1 freq
'clamjamfrie' - 1 freq
clims - 1 freq
clane-shaved - 1 freq
challengin - 10 freq
clankit - 2 freq
cleanse - 3 freq
colonic - 1 freq
cleanser - 2 freq
colonies - 7 freq
'colonists' - 1 freq
clansmen - 5 freq
columnists - 1 freq
columnist - 1 freq
clencht - 1 freq
clenchit - 1 freq
colin's - 5 freq
clams - 1 freq
cleanest - 1 freq
collins - 3 freq
ceilin's - 4 freq
clingeth - 1 freq
calmness - 3 freq
clan's - 3 freq
clinic - 3 freq
clank - 2 freq
challance - 1 freq
collums - 2 freq
culmas - 2 freq
clenkit - 1 freq
colum's - 5 freq
clangan - 1 freq
clankan - 1 freq
clamjamfrie - 5 freq
clannish - 1 freq
clinky - 1 freq
clingan - 2 freq
challengers - 1 freq
challenging - 4 freq
coulness - 1 freq
clanjamphrie - 1 freq
climacteric - 1 freq
clamjafrie - 1 freq
colonisin - 1 freq
colonised - 3 freq
clonk - 2 freq
clunkarts - 1 freq
clanjamphray - 1 freq
chaillenge - 4 freq
chaillenger - 2 freq
callanish - 1 freq
clunkin - 2 freq
cleansing - 2 freq
clunk-clunky - 1 freq
clunkety - 1 freq
clunk-clunk - 1 freq
colonise - 2 freq
clean's - 1 freq
clamjamfrey - 3 freq
clonca - 1 freq
colonists - 1 freq
colonski - 1 freq
clensing - 1 freq
climax - 2 freq
clink-clatter - 1 freq
clim's - 1 freq
clannies - 1 freq
cooling - 1 freq
clones - 1 freq
clenches - 1 freq
clunked - 1 freq
clinkit - 1 freq
clean-sheet - 1 freq
chulemaster - 1 freq
cling-film - 1 freq
chillingly - 1 freq
clenchin - 2 freq
colonaisan - 1 freq
colinclyne - 8 freq
clynes - 1 freq
clankin' - 1 freq
colinmcgeechan - 2 freq
culling - 2 freq
calliemac - 4 freq
colmcille - 1 freq
callinicos - 1 freq
clangered - 1 freq
clangers - 1 freq
clanger - 1 freq
colinmccredie - 1 freq
callumc - 46 freq
colinmacalba - 1 freq
callumcarson - 1 freq
colinjherd - 1 freq
chileans - 1 freq
callangranny - 1 freq
callumstweets - 2 freq
colinmachamster - 1 freq
cullensewan - 1 freq
clinging - 1 freq
clinician - 1 freq
callumcarsonwlc - 3 freq
clmike - 1 freq
calmac - 1 freq
MetaPhone code - KLNK
clung - 14 freq
clink - 18 freq
killing - 7 freq
clang - 12 freq
calling - 15 freq
clunk - 8 freq
glencoe - 6 freq
cling - 7 freq
'clink' - 1 freq
colonic - 1 freq
clinic - 3 freq
clank - 2 freq
clinky - 1 freq
kelng - 1 freq
glink - 1 freq
clonk - 2 freq
qualunque - 1 freq
clonca - 1 freq
cooling - 1 freq
culling - 2 freq
CLUNK
Time to execute Levenshtein function - 0.204807 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.346068 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030794 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037867 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000925 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.