A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to conneryscottishwalks in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
conneryscottishwalks (0) - 1 freq
conversationally (10) - 3 freq
scottishweek (10) - 1 freq
indyscotwales (10) - 7 freq
scottishlass (10) - 16 freq
lokiscottishrap (10) - 15 freq
weescottishmamm (10) - 1 freq
conversational (10) - 4 freq
scottishbooks (10) - 3 freq
connections (11) - 35 freq
scottishgaelic (11) - 1 freq
scottishisms (11) - 1 freq
connotational (11) - 1 freq
wearescottishfb (11) - 32 freq
fillyscottish (11) - 1 freq
scottishlit (11) - 1 freq
scottishcilt (11) - 6 freq
scottishhs (11) - 1 freq
scottishmam (11) - 1 freq
a-seen-scottish (11) - 1 freq
scottishseas (11) - 1 freq
conversationalists (11) - 1 freq
anti-scottish (11) - 1 freq
scottishjill (11) - 13 freq
constitutionals (11) - 1 freq
conneryscottishwalks (0) - 1 freq
scottishbooks (16) - 3 freq
scottishlass (16) - 16 freq
scottishweek (16) - 1 freq
cerysmatthews (17) - 5 freq
indyscotwales (17) - 7 freq
scottishseas (18) - 1 freq
scottishhs (18) - 1 freq
anti-scottish (18) - 1 freq
a-seen-scottish (18) - 1 freq
scottishness (18) - 4 freq
conorscott (18) - 1 freq
scottishlit (18) - 1 freq
constitutionals (18) - 1 freq
scottishjill (18) - 13 freq
scottishcilt (18) - 6 freq
scottishisms (18) - 1 freq
scottishgaelic (18) - 1 freq
conversational (18) - 4 freq
weescottishmamm (18) - 1 freq
lokiscottishrap (18) - 15 freq
conversationally (18) - 3 freq
wearescottishfb (18) - 32 freq
coiningscots (19) - 1 freq
countesswells (19) - 1 freq
SoundEx code - C562
chaumers - 14 freq
cinners - 4 freq
cameras - 10 freq
comers - 2 freq
commercial - 16 freq
commercialise - 1 freq
connerik - 2 freq
commerce - 4 freq
conract - 1 freq
camras - 1 freq
commercially - 1 freq
chaumer's - 1 freq
chaummers - 1 freq
chaamer-quine - 1 freq
connery's - 1 freq
conneryscottishwalks - 1 freq
cumers - 1 freq
chimaeras - 1 freq
cummersum - 1 freq
conorsharkeysc - 1 freq
conorkelly - 1 freq
commercialism - 1 freq
connorjbyrne - 1 freq
comer's - 1 freq
camera's - 1 freq
cymrawes - 3 freq
conorscott - 1 freq
cymrogav - 1 freq
MetaPhone code - KNRSKTXW
CONNERYSCOTTISHWALKS
Time to execute Levenshtein function - 0.430779 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.512427 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030219 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038337 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000871 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.