A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to clunk in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
clunk (0) - 8 freq
cluck (1) - 2 freq
cluny (1) - 2 freq
clink (1) - 18 freq
clonk (1) - 2 freq
crunk (1) - 5 freq
clank (1) - 2 freq
slunk (1) - 8 freq
chunk (1) - 17 freq
clung (1) - 14 freq
plunk (1) - 3 freq
clud (2) - 9 freq
flung (2) - 152 freq
bunk (2) - 28 freq
clubs (2) - 102 freq
chuk (2) - 1 freq
conk (2) - 2 freq
blenk (2) - 12 freq
luna (2) - 2 freq
clure (2) - 1 freq
slung (2) - 14 freq
clerk (2) - 24 freq
cleck (2) - 10 freq
drunk (2) - 92 freq
chunks (2) - 8 freq
Double Levenshtein (combined distance in brackets)
clunk (0) - 8 freq
clonk (1) - 2 freq
clank (1) - 2 freq
clink (1) - 18 freq
clinky (2) - 1 freq
clung (2) - 14 freq
plunk (2) - 3 freq
chunk (2) - 17 freq
slunk (2) - 8 freq
cluck (2) - 2 freq
cluny (2) - 2 freq
crunk (2) - 5 freq
clang (3) - 12 freq
blank (3) - 59 freq
clek (3) - 1 freq
click (3) - 129 freq
link (3) - 70 freq
clone (3) - 5 freq
plenk (3) - 3 freq
cling (3) - 6 freq
claik (3) - 24 freq
lank (3) - 3 freq
glink (3) - 1 freq
clunked (3) - 1 freq
clark (3) - 32 freq
SoundEx code - C452
clingin - 21 freq
clingfilm - 2 freq
challenge - 79 freq
clings - 9 freq
cleans - 9 freq
clangin - 6 freq
columcille - 2 freq
clanjamfrie - 9 freq
clowns - 16 freq
clancy - 24 freq
ceilings - 1 freq
clincher - 2 freq
clanked - 1 freq
clancy's - 5 freq
clamjamfry - 4 freq
clanjamfry - 1 freq
columns - 9 freq
clamjamfrae - 1 freq
clung - 14 freq
clinked - 1 freq
clancy'll - 1 freq
clinkumbell - 2 freq
clankin - 6 freq
clinics - 1 freq
calms - 2 freq
clink - 18 freq
clenched - 17 freq
clamjafry - 1 freq
ceiling - 11 freq
callum's - 5 freq
clunkertonies - 1 freq
claims - 32 freq
climactic - 1 freq
clans - 26 freq
clanking - 1 freq
challenged - 19 freq
clang - 12 freq
calling - 15 freq
cleansed - 5 freq
ceilins - 8 freq
clanjamfirie - 1 freq
colonsay - 1 freq
clinkin - 5 freq
clinkers - 2 freq
cuilness - 1 freq
climmcen - 1 freq
clinker - 3 freq
clean-shaven - 2 freq
colonisers - 1 freq
climes - 6 freq
clunk - 8 freq
clumsy - 10 freq
clinched - 1 freq
claumjamfrie - 1 freq
challenges - 8 freq
clumsily - 4 freq
clench - 2 freq
clinically - 3 freq
'clink' - 1 freq
'clamjamfrie' - 1 freq
clims - 1 freq
clane-shaved - 1 freq
challengin - 10 freq
clankit - 2 freq
cleanse - 3 freq
cleansin - 5 freq
colonic - 1 freq
cleanser - 2 freq
colonies - 7 freq
'colonists' - 1 freq
cling - 6 freq
clansmen - 5 freq
columnists - 1 freq
columnist - 1 freq
clencht - 1 freq
clenchit - 1 freq
colin's - 5 freq
clams - 1 freq
cleanest - 1 freq
collins - 3 freq
ceilin's - 4 freq
clingeth - 1 freq
calmness - 3 freq
clan's - 3 freq
clinic - 3 freq
clank - 2 freq
challance - 1 freq
collums - 2 freq
culmas - 2 freq
clenkit - 1 freq
colum's - 5 freq
clangan - 1 freq
clankan - 1 freq
clamjamfrie - 5 freq
clannish - 1 freq
clinky - 1 freq
clingan - 2 freq
challengers - 1 freq
challenging - 4 freq
coulness - 1 freq
clanjamphrie - 1 freq
climacteric - 1 freq
clamjafrie - 1 freq
colonisin - 1 freq
colonised - 3 freq
clonk - 2 freq
clunkarts - 1 freq
clanjamphray - 1 freq
chaillenge - 4 freq
chaillenger - 2 freq
callanish - 1 freq
clunkin - 2 freq
cleansing - 2 freq
clunk-clunky - 1 freq
clunkety - 1 freq
clunk-clunk - 1 freq
colonise - 2 freq
clean's - 1 freq
clamjamfrey - 3 freq
clonca - 1 freq
colonists - 1 freq
colonski - 1 freq
clensing - 1 freq
climax - 2 freq
clink-clatter - 1 freq
clim's - 1 freq
clannies - 1 freq
colonisation - 1 freq
cooling - 1 freq
clones - 1 freq
clenches - 1 freq
clunked - 1 freq
clinkit - 1 freq
clinical - 4 freq
clean-sheet - 1 freq
chulemaster - 1 freq
cling-film - 1 freq
chillingly - 1 freq
clenchin - 2 freq
colonaisan - 1 freq
colinclyne - 8 freq
clynes - 1 freq
clankin' - 1 freq
colinmcgeechan - 2 freq
culling - 2 freq
calliemac - 4 freq
colmcille - 1 freq
callinicos - 1 freq
clangered - 1 freq
clangers - 1 freq
clanger - 1 freq
colinmccredie - 1 freq
callumc - 46 freq
colinmacalba - 1 freq
callumcarson - 1 freq
colinjherd - 1 freq
chileans - 1 freq
callangranny - 1 freq
callumstweets - 2 freq
colinmachamster - 1 freq
cullensewan - 1 freq
clinging - 1 freq
clinician - 1 freq
callumcarsonwlc - 3 freq
clmike - 1 freq
calmac - 1 freq
MetaPhone code - KLNK
clung - 14 freq
clink - 18 freq
killing - 7 freq
clang - 12 freq
calling - 15 freq
clunk - 8 freq
glencoe - 5 freq
'clink' - 1 freq
colonic - 1 freq
cling - 6 freq
clinic - 3 freq
clank - 2 freq
clinky - 1 freq
kelng - 1 freq
glink - 1 freq
clonk - 2 freq
qualunque - 1 freq
clonca - 1 freq
cooling - 1 freq
culling - 2 freq
Manually curated - CLUNK
Time to execute Levenshtein function - 0.200205 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
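As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # deleting the first i characters of a
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("clunk", "clink"))   # one replacement: u -> i
print(levenshtein("clunk", "chunks"))  # one replacement plus one insertion
```

The two example pairs are taken from the Levenshtein results listed for clunk.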
Time to execute Double Levenshtein function - 0.368461 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
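The idea can be sketched in Python as follows. This is an illustration under assumptions, not the site's own implementation: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set (including y) is a guess that happens to be consistent with the score of 2 listed for clinky.

```python
# Assumed vowel set; the site's actual list of vowels is not documented.
VOWELS = set("aeiouy")


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance, computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Return the word with every assumed vowel removed."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("clunk", "clonk"))  # vowel change only
print(double_levenshtein("clunk", "cluck"))  # consonant change counts twice
```

A vowel-only difference such as clunk/clonk is counted once, while a consonant difference such as clunk/cluck shows up in both passes, which is why consonants carry double weight.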
Time to execute SoundEx function - 0.028058 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
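For readers who want to see how a code such as C452 arises, here is a minimal Python sketch of the standard American Soundex rules. It is an assumption that the site follows these rules exactly; the function name `soundex` and the handling of punctuation (non-letters are simply dropped) are choices made for this sketch.

```python
def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus three digits."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: anything outside a-z (hyphens, apostrophes) is ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same code
        code = codes.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeated code counts again
        if len(result) == 4:
            break
    return result.ljust(4, "0")  # pad short codes with zeros


print(soundex("clunk"))      # C452
print(soundex("challenge"))  # C452
```

Both words reduce to C, then l (4), n (5) and k or g (2), which is why challenge appears alongside clunk in the SoundEx results.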
Time to execute MetaPhone function - 0.047718 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001135 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
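A lookup of this kind might be sketched in Python as below. The table contents and the names `LEXICON` and `lookup_lemma` are hypothetical; only the clunk to CLUNK pairing comes from the results on this page, and the other entries are invented purely to show the shape of the data.

```python
from typing import Optional

# Hypothetical hand-built lexicon mapping spellings to lemmas.
LEXICON = {
    "clunk": "CLUNK",    # pairing shown in the results on this page
    "clunked": "CLUNK",  # invented entry, for illustration only
    "clunkin": "CLUNK",  # invented entry, for illustration only
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


print(lookup_lemma("clunk"))  # CLUNK
print(lookup_lemma("ahint"))  # None, since this sketch has no entry for it
```

A plain dictionary lookup takes constant time on average, which fits with this method having the shortest execution time of the five.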