A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to literacy in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein distance
literacy (0) - 34 freq
literary (1) - 63 freq
leiteracy (1) - 3 freq
literate (2) - 9 freq
leeteracy (2) - 15 freq
literal (2) - 7 freq
literati (2) - 3 freq
lit'rary (2) - 1 freq
literally (2) - 42 freq
literar (2) - 3 freq
leiterary (2) - 4 freq
litany (3) - 1 freq
lateral (3) - 3 freq
tracy (3) - 13 freq
literarie (3) - 1 freq
leiterate (3) - 1 freq
literacies (3) - 1 freq
liberate (3) - 3 freq
liberal (3) - 26 freq
aikeray (3) - 2 freq
limerick (3) - 7 freq
numeracy (3) - 3 freq
livery (3) - 13 freq
riteday (3) - 1 freq
‘literary (3) - 1 freq
Double Levenshtein distance
literacy (0) - 34 freq
leiteracy (1) - 3 freq
leeteracy (2) - 15 freq
literary (2) - 63 freq
leiterary (3) - 4 freq
literar (3) - 3 freq
literati (3) - 3 freq
literate (3) - 9 freq
literal (3) - 7 freq
literacies (4) - 1 freq
liturgy (4) - 3 freq
liter (4) - 1 freq
leeterary (4) - 19 freq
leiterate (4) - 1 freq
tracy (4) - 13 freq
literarie (4) - 1 freq
literally (4) - 42 freq
lateral (4) - 3 freq
lit'rary (4) - 1 freq
litres (5) - 3 freq
trace (5) - 37 freq
retrace (5) - 1 freq
later (5) - 554 freq
leeteral (5) - 2 freq
laters (5) - 12 freq
SoundEx code - L362
letters - 142 freq
leaders - 30 freq
ledder's - 1 freq
literacy - 34 freq
luther's - 1 freq
letturs - 3 freq
ladders - 7 freq
leader's - 2 freq
leather's - 1 freq
loiters - 1 freq
leadership - 21 freq
lettérs - 1 freq
lethargic - 1 freq
liturgy - 3 freq
ledders - 2 freq
leeteracy - 15 freq
leeteracies - 2 freq
litres - 3 freq
lettèrs - 7 freq
'letters - 1 freq
lettirs - 1 freq
ledars - 1 freq
leaderschip - 3 freq
lettres - 1 freq
liturgical - 4 freq
letteris - 1 freq
leiteracy - 3 freq
literacies - 1 freq
letter's - 1 freq
lethers - 1 freq
leddrach - 50 freq
leddrach's - 1 freq
leather-jeckets - 1 freq
‘leaders - 1 freq
leaders-aff - 2 freq
leadersdebate - 1 freq
lethargy - 1 freq
'laters' - 1 freq
laters - 12 freq
literacypkc - 1 freq
lldrkmiuw - 1 freq
MetaPhone code - LTRS
letters - 142 freq
leaders - 30 freq
ledder's - 1 freq
literacy - 34 freq
letturs - 3 freq
ladders - 7 freq
leader's - 2 freq
loiters - 1 freq
lettérs - 1 freq
ledders - 2 freq
leeteracy - 15 freq
litres - 3 freq
lettèrs - 7 freq
'letters - 1 freq
lettirs - 1 freq
ledars - 1 freq
lettres - 1 freq
letteris - 1 freq
leiteracy - 3 freq
letter's - 1 freq
‘leaders - 1 freq
'laters' - 1 freq
laters - 12 freq
Manually curated - LITERACY
Time to execute Levenshtein function - 0.249336 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
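As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in b so the working rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("literacy", "literary"))   # 1
print(levenshtein("literacy", "leiteracy"))  # 1
print(levenshtein("literacy", "tracy"))      # 3
```

The three values printed agree with the scores listed for those words in the Levenshtein results above.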
Time to execute Double Levenshtein function - 0.564607 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
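A rough Python sketch of this idea follows. The helper names are hypothetical, and the vowel set is an assumption: treating `y` as a vowel happens to reproduce the scores listed above (for example literate at 3), but the site's actual vowel list is not stated.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y is counted as a vowel; the real list is not documented here.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop vowels, keeping consonants and any other characters."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("literacy", "leiteracy"))  # 1 (1 + 0)
print(double_levenshtein("literacy", "literary"))   # 2 (1 + 1)
print(double_levenshtein("literacy", "literally"))  # 4 (2 + 2)
```

Because a pure vowel change leaves the consonant skeleton untouched, spelling variants such as leiteracy keep a low score while words that differ in a consonant are pushed further down the list.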
Time to execute SoundEx function - 0.027703 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
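For illustration, a minimal Python sketch of the classic American Soundex rules is shown below. It is an assumption that the site follows these exact rules; in particular, how it handles accented letters and punctuation is not known, and this sketch simply ignores anything outside a-z.

```python
def soundex(word: str) -> str:
    """Classic four-character Soundex code, e.g. 'literacy' -> 'L362'."""
    groups = (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
              ("l", "4"), ("mn", "5"), ("r", "6"))
    codes = {ch: digit for letters, digit in groups for ch in letters}

    # Assumption: characters outside a-z are dropped before encoding.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""

    result = letters[0].upper()
    previous = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        digit = codes.get(ch, "")
        if digit and digit != previous:
            result += digit
        previous = digit  # a vowel resets this, so a repeat is coded again
    # Pad with zeros or truncate to one letter plus three digits.
    return (result + "000")[:4]


print(soundex("literacy"))    # L362
print(soundex("letters"))     # L362
print(soundex("leadership"))  # L362
```

All three words share the code L362, which is why letters and leadership appear in the SoundEx results for literacy.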
Time to execute MetaPhone function - 0.074738 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001113 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
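A lookup of this kind can be sketched in Python as a plain dictionary. The entries and the function name `curated_lemma` below are hypothetical, made up for the example; the real lexicon is not shown on this page.

```python
from typing import Optional

# Hypothetical entries for illustration only; not the site's actual lexicon.
LEXICON = {
    "literacy": "LITERACY",
    "leiteracy": "LITERACY",
    "leeteracy": "LITERACY",
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the hand-assigned lemma, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(curated_lemma("leeteracy"))  # LITERACY
print(curated_lemma("ablow"))      # None (not in this toy table)
```

A dictionary lookup is a single hash access, which fits with this method having the shortest execution time reported above.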