A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eaceducation in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
eaceducation (0) - 17 freq
bbceducation (2) - 1 freq
education (3) - 421 freq
'education (3) - 2 freq
neducation (3) - 4 freq
re-education (3) - 3 freq
ejaculation (4) - 1 freq
abduction (4) - 1 freq
adulation (4) - 1 freq
malediction (4) - 2 freq
reduction (4) - 3 freq
educatioun (4) - 7 freq
seduction (4) - 2 freq
eddication (4) - 51 freq
valediction (4) - 1 freq
medication (4) - 8 freq
dedication (4) - 8 freq
evacuation (4) - 1 freq
aduration (4) - 1 freq
edication (4) - 3 freq
predication (4) - 4 freq
accusation (4) - 8 freq
abdication (4) - 1 freq
uofgeducation (4) - 2 freq
educatin (4) - 4 freq
eaceducation (0) - 17 freq
neducation (4) - 4 freq
'education (4) - 2 freq
bbceducation (4) - 1 freq
education (4) - 421 freq
medication (5) - 8 freq
eddication (5) - 51 freq
dedication (5) - 8 freq
abdication (5) - 1 freq
seduction (5) - 2 freq
edication (5) - 3 freq
educatioun (5) - 7 freq
educatin (5) - 4 freq
abduction (5) - 1 freq
re-education (5) - 3 freq
reduction (5) - 3 freq
eddicatin (6) - 1 freq
induction (6) - 2 freq
dedicatin (6) - 2 freq
eddicaetion (6) - 2 freq
addiction (6) - 9 freq
indication (6) - 5 freq
eductaion (6) - 1 freq
dedicatioun (6) - 1 freq
abductin (6) - 1 freq
SoundEx code - E232
ecat's - 1 freq
ex-sodjers - 1 freq
exudes - 2 freq
exotic - 17 freq
echty-echt - 1 freq
eejits - 50 freq
ecstasy - 11 freq
eighties - 11 freq
eightsome - 6 freq
ects - 4 freq
eggheads - 1 freq
eichty-six - 1 freq
eichty-seeven - 2 freq
eichts - 4 freq
exotick - 1 freq
exits - 2 freq
exodus - 6 freq
'eightsome - 1 freq
echty-six - 1 freq
eichtsome - 2 freq
echtsome - 2 freq
exoticism - 1 freq
easthoose - 1 freq
eastis - 1 freq
easthouse - 1 freq
extistence - 1 freq
eichties - 3 freq
east-wast - 1 freq
ecatched - 1 freq
ecstacie - 1 freq
echtie-sax - 1 freq
eye-catchin - 1 freq
ectos - 1 freq
ee-catchin - 1 freq
eejets - 1 freq
ecwads - 1 freq
esgdkqgdi - 1 freq
eustacemcnally - 4 freq
esdc - 1 freq
eijits - 1 freq
easydoesit - 3 freq
ewhsdzzxye - 1 freq
equates - 1 freq
ehgdiszjke - 1 freq
eegits' - 1 freq
eastscotlandfa - 57 freq
easthouses - 3 freq
easthousesafc - 3 freq
eastspacetweets - 1 freq
easthouseslily - 4 freq
eaceducation - 17 freq
eistok - 1 freq
MetaPhone code - ESTKXN
eaceducation - 17 freq
EACEDUCATION
Time to execute Levenshtein function - 0.301610 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.721509 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.100979 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.087689 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000965 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.