Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to eemis in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
eemis (0) - 10 freq semis (1) - 4 freq eesin (2) - 1 freq eamin (2) - 1 freq exis (2) - 1 freq seems (2) - 517 freq emit (2) - 4 freq deemies (2) - 1 freq permis (2) - 1 freq eeses (2) - 1 freq denis (2) - 11 freq leis (2) - 1 freq feemin (2) - 1 freq feemit (2) - 1 freq nevis (2) - 4 freq jeesis (2) - 2 freq semi (2) - 20 freq eerie (2) - 34 freq penis (2) - 1 freq eekin (2) - 1 freq ermie (2) - 1 freq eerins (2) - 6 freq deemit (2) - 2 freq demit (2) - 2 freq eet's (2) - 58 freq	eemis (0) - 10 freq ems (2) - 4 freq semis (2) - 4 freq emus (2) - 1 freq mis (2) - 3 freq nemos (3) - 1 freq eemur (3) - 1 freq eens (3) - 131 freq eenies (3) - 15 freq teems (3) - 1 freq dems (3) - 6 freq nems (3) - 19 freq memes (3) - 4 freq exems (3) - 1 freq enemies (3) - 36 freq mos (3) - 9 freq feeis (3) - 1 freq erms (3) - 42 freq eomin (3) - 1 freq geis (3) - 8 freq deems (3) - 6 freq hems (3) - 8 freq jeems (3) - 91 freq eyewis (3) - 5 freq weems (3) - 1 freq	SoundEx code - E520 enough - 883 freq ens - 16 freq enns - 11 freq eneuch - 748 freq eence - 316 freq eens - 131 freq enjoay - 1 freq enjoy - 331 freq eneugh - 49 freq ense - 15 freq eemage - 18 freq emmma's - 1 freq enuch - 89 freq eemis - 10 freq een's - 13 freq eyn's - 1 freq eyns - 7 freq enjey - 11 freq eense - 16 freq eneaise - 1 freq enn's - 1 freq enough-he - 1 freq enouch - 4 freq enugh - 2 freq enc - 1 freq enic's - 1 freq 'enjoy - 2 freq eunice - 1 freq enoch - 17 freq enosh - 3 freq emmaus - 6 freq eemock - 8 freq einas - 1 freq eins - 2 freq enyoch - 36 freq eens-shö - 1 freq eans - 10 freq emus - 1 freq eimage - 17 freq enogh - 20 freq enjye - 1 freq enough-a - 1 freq 'enough - 1 freq eng - 10 freq eneuch- - 1 freq eyeing - 1 freq eines - 3 freq enack - 1 freq eince - 1 freq eneoch - 2 freq ��eence - 1 freq eenies - 15 freq ��eneuch - 1 freq ems - 4 freq enschew - 1 freq enjy - 2 freq emma's - 1 freq ewing - 6 freq enes - 2 freq ��enoch - 1 freq enyoch' - 1 freq enes - 1 freq emms - 1 freq emosh - 1 freq engy - 2 freq emaaq - 1 freq emz - 4 freq emoji - 3 freq euang - 1 freq e'en's - 1 freq eyemask - 1 freq eimsj - 1 freq enoug - 1 freq enj - 1 freq euankay - 1 freq	MetaPhone code - EMS emmma's - 1 freq eemis - 10 freq emmaus - 6 freq emus - 1 freq embassie - 1 freq embassy - 1 freq ems - 4 freq emma's - 1 freq emms - 1 freq emz - 4 freq	EEMIS
Time to execute Levenshtein function - 0.185257 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.363952 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.028366 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.038864 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000823 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics