A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to challenge in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
challenge (0) - 79 freq
challenged (1) - 19 freq
chaillenge (1) - 4 freq
challenges (1) - 8 freq
challengin (2) - 10 freq
chaenge (2) - 6 freq
charlene (2) - 519 freq
challengers (2) - 1 freq
challance (2) - 1 freq
chaillenger (2) - 2 freq
chenge (3) - 11 freq
college (3) - 113 freq
coillege (3) - 1 freq
allege (3) - 1 freq
charlene' (3) - 1 freq
colledge (3) - 1 freq
chainge (3) - 50 freq
unchallenged (3) - 2 freq
challice (3) - 1 freq
cheenge (3) - 42 freq
callender (3) - 1 freq
chaynge (3) - 1 freq
halleve (3) - 2 freq
challenging (3) - 4 freq
change (3) - 299 freq
challenge (0) - 79 freq
chaillenge (1) - 4 freq
challenged (2) - 19 freq
challenges (2) - 8 freq
challance (3) - 1 freq
chaillenger (3) - 2 freq
challengin (3) - 10 freq
challengers (4) - 1 freq
charlene (4) - 519 freq
chaenge (4) - 6 freq
calling (4) - 15 freq
chellegl (5) - 1 freq
charlen (5) - 1 freq
challenging (5) - 4 freq
change (5) - 299 freq
culling (5) - 2 freq
chaynge (5) - 1 freq
chillin (5) - 6 freq
shilling (5) - 3 freq
schilling (5) - 1 freq
chillingly (5) - 1 freq
chenge (5) - 11 freq
chainge (5) - 50 freq
colledge (5) - 1 freq
coillege (5) - 1 freq
SoundEx code - C452
clingin - 21 freq
clingfilm - 2 freq
challenge - 79 freq
clings - 9 freq
cleans - 9 freq
clangin - 6 freq
columcille - 2 freq
clanjamfrie - 9 freq
clowns - 16 freq
clancy - 24 freq
ceilings - 1 freq
clincher - 2 freq
clanked - 1 freq
clancy's - 5 freq
clamjamfry - 4 freq
clanjamfry - 1 freq
columns - 9 freq
clamjamfrae - 1 freq
clung - 14 freq
clinked - 1 freq
clancy'll - 1 freq
clinkumbell - 2 freq
clankin - 6 freq
clinics - 1 freq
calms - 2 freq
clink - 18 freq
clenched - 17 freq
clamjafry - 1 freq
ceiling - 11 freq
callum's - 5 freq
clunkertonies - 1 freq
claims - 32 freq
climactic - 1 freq
clans - 26 freq
clanking - 1 freq
challenged - 19 freq
clang - 12 freq
calling - 15 freq
cleansed - 5 freq
ceilins - 8 freq
clanjamfirie - 1 freq
colonsay - 1 freq
clinkin - 5 freq
clinkers - 2 freq
cuilness - 1 freq
climmcen - 1 freq
clinker - 3 freq
clean-shaven - 2 freq
colonisers - 1 freq
climes - 6 freq
clunk - 8 freq
clumsy - 10 freq
clinched - 1 freq
claumjamfrie - 1 freq
challenges - 8 freq
clumsily - 4 freq
clench - 2 freq
clinically - 3 freq
'clink' - 1 freq
'clamjamfrie' - 1 freq
clims - 1 freq
clane-shaved - 1 freq
challengin - 10 freq
clankit - 2 freq
cleanse - 3 freq
cleansin - 5 freq
colonic - 1 freq
cleanser - 2 freq
colonies - 7 freq
'colonists' - 1 freq
cling - 6 freq
clansmen - 5 freq
columnists - 1 freq
columnist - 1 freq
clencht - 1 freq
clenchit - 1 freq
colin's - 5 freq
clams - 1 freq
cleanest - 1 freq
collins - 3 freq
ceilin's - 4 freq
clingeth - 1 freq
calmness - 3 freq
clan's - 3 freq
clinic - 3 freq
clank - 2 freq
challance - 1 freq
collums - 2 freq
culmas - 2 freq
clenkit - 1 freq
colum's - 5 freq
clangan - 1 freq
clankan - 1 freq
clamjamfrie - 5 freq
clannish - 1 freq
clinky - 1 freq
clingan - 2 freq
challengers - 1 freq
challenging - 4 freq
coulness - 1 freq
clanjamphrie - 1 freq
climacteric - 1 freq
clamjafrie - 1 freq
colonisin - 1 freq
colonised - 3 freq
clonk - 2 freq
clunkarts - 1 freq
clanjamphray - 1 freq
chaillenge - 4 freq
chaillenger - 2 freq
callanish - 1 freq
clunkin - 2 freq
cleansing - 2 freq
clunk-clunky - 1 freq
clunkety - 1 freq
clunk-clunk - 1 freq
colonise - 2 freq
clean's - 1 freq
clamjamfrey - 3 freq
clonca - 1 freq
colonists - 1 freq
colonski - 1 freq
clensing - 1 freq
climax - 2 freq
clink-clatter - 1 freq
clim's - 1 freq
clannies - 1 freq
colonisation - 1 freq
cooling - 1 freq
clones - 1 freq
clenches - 1 freq
clunked - 1 freq
clinkit - 1 freq
clinical - 4 freq
clean-sheet - 1 freq
chulemaster - 1 freq
cling-film - 1 freq
chillingly - 1 freq
clenchin - 2 freq
colonaisan - 1 freq
colinclyne - 8 freq
clynes - 1 freq
clankin' - 1 freq
colinmcgeechan - 2 freq
culling - 2 freq
calliemac - 4 freq
colmcille - 1 freq
callinicos - 1 freq
clangered - 1 freq
clangers - 1 freq
clanger - 1 freq
colinmccredie - 1 freq
callumc - 46 freq
colinmacalba - 1 freq
callumcarson - 1 freq
colinjherd - 1 freq
chileans - 1 freq
callangranny - 1 freq
callumstweets - 2 freq
colinmachamster - 1 freq
cullensewan - 1 freq
clinging - 1 freq
clinician - 1 freq
callumcarsonwlc - 3 freq
clmike - 1 freq
calmac - 1 freq
MetaPhone code - XLNJ
challenge - 79 freq
chaillenge - 4 freq
CHALLENGE
Time to execute Levenshtein function - 0.204891 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.370010 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030514 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.047322 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000771 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.