A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to encyclopaedia in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
encyclopaedia (0) - 5 freq
encyclopedia (1) - 4 freq
cyclopaean (5) - 1 freq
enclosed (6) - 5 freq
enveloped (6) - 1 freq
nicomedia (6) - 1 freq
andyclosee (7) - 1 freq
accoardin (7) - 13 freq
recycled (7) - 2 freq
unloadit (7) - 1 freq
enclosin (7) - 1 freq
envelope (7) - 72 freq
encompassin (7) - 3 freq
encooraged (7) - 1 freq
episcopalian (7) - 3 freq
eclipsed (7) - 2 freq
envelopes (7) - 6 freq
nacboadie (7) - 1 freq
unloadin (7) - 6 freq
ensconsed (7) - 1 freq
anclapped (7) - 1 freq
unclouded (7) - 1 freq
enclooden (7) - 1 freq
envelopin (7) - 1 freq
encleetic (7) - 1 freq
encyclopaedia (0) - 5 freq
encyclopedia (1) - 4 freq
cyclopaean (7) - 1 freq
enveloped (8) - 1 freq
enclosed (8) - 5 freq
encircled (9) - 1 freq
unclouded (9) - 1 freq
anclapped (9) - 1 freq
cyclops (9) - 2 freq
recycled (9) - 2 freq
nicomedia (9) - 1 freq
cycled (9) - 4 freq
enunclated (9) - 1 freq
accolade (10) - 1 freq
cyclonic (10) - 1 freq
enchiladas (10) - 1 freq
ensconced (10) - 1 freq
inclined (10) - 6 freq
uncalled (10) - 2 freq
occupeed (10) - 1 freq
clyped (10) - 8 freq
cclop (10) - 1 freq
scalped (10) - 3 freq
enceladus (10) - 1 freq
included (10) - 19 freq
SoundEx code - E522
enjoys - 17 freq
enough's - 3 freq
engaged - 19 freq
enjoys-an - 1 freq
eemages - 15 freq
enseeists - 1 freq
engagements - 3 freq
encaise - 1 freq
encyclopedia - 4 freq
engaged' - 1 freq
engage - 40 freq
engish - 2 freq
ehanges - 1 freq
emojis - 1 freq
engagin - 16 freq
enxious - 4 freq
eunuchs - 2 freq
enjoys't - 1 freq
encased - 2 freq
engaging - 4 freq
engagement - 28 freq
encyclopaedia - 5 freq
enjoses - 1 freq
enjose - 3 freq
eimages - 8 freq
enjosed - 5 freq
'enough's - 2 freq
engagin' - 4 freq
engagerment - 1 freq
engages - 3 freq
enseignies - 1 freq
enjoy's - 1 freq
ensues - 2 freq
enjogh - 1 freq
€˜engage - 1 freq
eeemages - 1 freq
engaigne - 1 freq
enjosin - 1 freq
ensignie - 1 freq
engisl - 1 freq
eunisjassemi - 2 freq
euanmcgachie - 1 freq
MetaPhone code - ENSKLPT
encyclopedia - 4 freq
encyclopaedia - 5 freq
ENCYCLOPAEDIA
Time to execute Levenshtein function - 0.325258 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.615135 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.063793 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038063 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000936 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.