A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to literate in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
literate (0) - 9 freq
liberate (1) - 3 freq
leiterate (1) - 1 freq
literati (1) - 3 freq
literal (2) - 7 freq
literature (2) - 94 freq
literacy (2) - 34 freq
líterate (2) - 1 freq
nitrate (2) - 1 freq
literarie (2) - 1 freq
literar (2) - 3 freq
obliterate (2) - 1 freq
leeterate (2) - 9 freq
literary (2) - 63 freq
liberated (2) - 3 freq
illiterate (2) - 3 freq
hibernate (3) - 6 freq
libert (3) - 1 freq
leeterati (3) - 1 freq
leiberat (3) - 1 freq
latinate (3) - 21 freq
nidebate (3) - 1 freq
lateral (3) - 3 freq
obliterated (3) - 1 freq
liberatin (3) - 1 freq
literate (0) - 9 freq
leiterate (1) - 1 freq
literati (1) - 3 freq
leeterate (2) - 9 freq
liberate (2) - 3 freq
literary (3) - 63 freq
literal (3) - 7 freq
leeterati (3) - 1 freq
literar (3) - 3 freq
illiterate (3) - 3 freq
obliterate (3) - 1 freq
literacy (3) - 34 freq
nitrate (3) - 1 freq
literarie (3) - 1 freq
literature (3) - 94 freq
liberty (4) - 31 freq
libertie (4) - 3 freq
alternate (4) - 3 freq
leiterarie (4) - 43 freq
strate (4) - 1 freq
liter (4) - 1 freq
litre (4) - 11 freq
leiterary (4) - 3 freq
leiteratur (4) - 21 freq
trate (4) - 2 freq
SoundEx code - L363
leather't - 1 freq
leiterate - 1 freq
leiteratur - 21 freq
leathered - 4 freq
littered - 9 freq
literature - 94 freq
leeteratur - 21 freq
literate - 9 freq
loitered - 1 freq
leeterature - 15 freq
leeterate - 9 freq
latter-day - 1 freq
leathert - 1 freq
ïll-traitit - 1 freq
leeter-ature - 1 freq
laddert - 1 freq
leetratur - 6 freq
lettert - 8 freq
literature' - 3 freq
looderit - 1 freq
leiterature - 12 freq
leitratur - 5 freq
leitratrur - 1 freq
letterit - 1 freq
literati - 3 freq
literatureinlearning - 1 freq
leeterati - 1 freq
leeterature's - 1 freq
literatures - 2 freq
líterate - 1 freq
letterheids - 1 freq
lydiareidyes - 1 freq
MetaPhone code - LTRT
leiterate - 1 freq
littered - 9 freq
literate - 9 freq
loitered - 1 freq
leeterate - 9 freq
latter-day - 1 freq
laddert - 1 freq
lettert - 8 freq
looderit - 1 freq
letterit - 1 freq
literati - 3 freq
leeterati - 1 freq
líterate - 1 freq
LITERATE
Time to execute Levenshtein function - 0.191263 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.333256 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027404 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036751 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000843 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.