A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find its nearest neighbouring words, for example: ablow

Words similar to "genres" in the corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein - distance in parentheses
genres (0) - 13 freq
genre (1) - 9 freq
genes (1) - 10 freq
senses (2) - 50 freq
gerss (2) - 5 freq
feres (2) - 52 freq
geer's (2) - 1 freq
pences (2) - 3 freq
lenses (2) - 10 freq
ganges (2) - 1 freq
metres (2) - 15 freq
fences (2) - 22 freq
centres (2) - 38 freq
ceres (2) - 11 freq
gemmes (2) - 74 freq
gentles (2) - 1 freq
gear's (2) - 1 freq
gengs (2) - 15 freq
gene (2) - 2 freq
'eres (2) - 1 freq
gees (2) - 41 freq
georges (2) - 2 freq
geddes (2) - 22 freq
henries (2) - 1 freq
tenures (2) - 1 freq
Double Levenshtein - distance in parentheses
genres (0) - 13 freq
genes (2) - 10 freq
genre (2) - 9 freq
ganues (3) - 1 freq
gengis (3) - 1 freq
genus (3) - 2 freq
gears (3) - 14 freq
tenures (3) - 1 freq
genomes (3) - 1 freq
genie's (3) - 1 freq
gents (3) - 7 freq
glares (3) - 7 freq
gentis (3) - 1 freq
gers (3) - 8 freq
gener (3) - 1 freq
genral (3) - 4 freq
geirs (3) - 1 freq
henries (3) - 1 freq
genius (3) - 31 freq
genyus (3) - 1 freq
gunes (3) - 2 freq
ignores (3) - 10 freq
ganges (3) - 1 freq
generous (3) - 21 freq
gunrays (3) - 1 freq
SoundEx code - G562
gauners - 1 freq
gimmers - 3 freq
gunrays - 1 freq
gunnar's - 6 freq
generous - 21 freq
gumrie's - 1 freq
generosity - 4 freq
genres - 13 freq
generously - 1 freq
generic - 1 freq
gey-near-gothic - 1 freq
goners - 1 freq
gunners - 1 freq
generic-an-tae-wastren-een-interchangeable - 1 freq
gaynorkane - 10 freq
gamerstorm - 1 freq
gamerstormsco - 1 freq
gamergran - 1 freq
MetaPhone code - JNRS
jyners' - 3 freq
jyners - 26 freq
joiners - 7 freq
jyner's - 5 freq
generous - 21 freq
jiners - 6 freq
juniors - 16 freq
jenners - 6 freq
genres - 13 freq
jenners' - 1 freq
jonar's - 1 freq
Manually curated - GENRES
Time to execute Levenshtein function - 0.207746 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
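As a minimal sketch of that definition, the distance can be computed with the usual dynamic-programming table, kept one row at a time. The Python below is illustrative only; the function name `levenshtein` is an assumption and this is not necessarily how the site computes its results.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    for word in ("genres", "genre", "genes", "senses"):
        print(word, levenshtein("genres", word))
```

Run against the query word, this gives distances in line with the Levenshtein list above, for example 1 for "genre" and 2 for "senses".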
Time to execute Double Levenshtein function - 0.347409 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
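The idea can be sketched in a few lines of Python. The names `strip_vowels` and `double_levenshtein` are hypothetical, and the vowel set is an assumption: treating "y" as a vowel appears to be consistent with the score listed for "gunrays", but the site's actual rule is not stated.

```python
VOWELS = set("aeiouy")  # assumption: "y" counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels, leaving the consonant skeleton (and any punctuation)."""
    return "".join(c for c in word.lower() if c not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-free words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("genes", "generous", "gunrays"):
        print(word, double_levenshtein("genres", word))
```

Under these assumptions "generous" scores 3: three edits on the full words and none on the consonant skeletons, since both reduce to "gnrs".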
Time to execute SoundEx function - 0.028672 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
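A minimal sketch of the classic American Soundex rules in Python follows. It is an illustration under the usual textbook rules (keep the first letter, map consonants to digits, collapse adjacent duplicates, pad to four characters), not necessarily the exact routine used on this site; the function name `soundex` is an assumption.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if no letters are present."""
    letters = [c for c in word.lower() if "a" <= c <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
            if len(code) == 4:
                break
        prev = digit  # a vowel resets prev, so a repeated digit counts again
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("genres", "generous", "gunnar's"):
        print(word, soundex(word))
```

With these rules "genres", "generous" and "gunnar's" all map to G562, matching the SoundEx code shown above.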
Time to execute MetaPhone function - 0.038160 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000909 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
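In code, a lookup of this kind amounts to a dictionary from word forms to lemmas. The Python sketch below is purely illustrative: `LEXICON` and `lemma_for` are hypothetical names, and the entries are made-up examples rather than the contents of the real table.

```python
from typing import Optional

# Hypothetical entries for illustration only; the real lexicon is not shown here.
LEXICON = {
    "genres": "GENRES",
    "genre": "GENRES",  # assumed variant form, not confirmed by the source
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lemma_for("Genres"))
    print(lemma_for("ablow"))  # not in this toy table
```

A plain dictionary lookup involves no distance computation, which would fit the very short execution time reported for this method.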