Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to anytime in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
anytime (0) - 29 freq onytime (1) - 11 freq daytime (2) - 7 freq aftime (2) - 4 freq aw-time (2) - 1 freq airtime (2) - 1 freq flytime (2) - 2 freq meytime (2) - 1 freq wan-time (2) - 1 freq ae-time (2) - 5 freq langtime (2) - 1 freq taytime (2) - 1 freq nytimes (2) - 1 freq ane-time (2) - 1 freq native (3) - 94 freq atotie (3) - 1 freq i'time (3) - 22 freq bedtime (3) - 17 freq anyfin (3) - 8 freq cantie (3) - 77 freq bigtime (3) - 1 freq amitie (3) - 5 freq awtie (3) - 1 freq jigtime (3) - 1 freq anything (3) - 138 freq	anytime (0) - 29 freq onytime (1) - 11 freq anatomy (3) - 5 freq taytime (3) - 1 freq ae-time (3) - 5 freq onietime (3) - 1 freq naetime (3) - 1 freq ane-time (3) - 1 freq nytimes (3) - 1 freq meytime (3) - 1 freq aftime (3) - 4 freq airtime (3) - 1 freq daytime (3) - 7 freq antifa (4) - 1 freq entire (4) - 32 freq teatime (4) - 13 freq naime (4) - 7 freq aname (4) - 1 freq anti (4) - 11 freq antiek (4) - 1 freq taetime (4) - 2 freq auntie (4) - 157 freq nyte (4) - 1 freq antrim (4) - 28 freq meantime (4) - 29 freq	SoundEx code - A535 anythin - 145 freq anything - 138 freq anytime - 29 freq ane-time - 1 freq anthony - 12 freq antennae - 2 freq anythin' - 2 freq antimatter - 1 freq anyt'n - 1 freq amadan - 1 freq antnin - 1 freq a-wantin - 1 freq anthems - 5 freq anti-union - 1 freq ane-tae-ane - 4 freq anti-englishness - 1 freq antonine - 2 freq anti-nuclear - 1 freq an-thon's - 1 freq anatomy - 5 freq anthem - 13 freq antonia - 8 freq antommy - 1 freq antenna - 4 freq aumtums - 1 freq antoni - 1 freq anodynes - 1 freq antoinine - 1 freq antonine's - 1 freq andaman - 1 freq antonines - 1 freq antonio - 1 freq ahint-haun - 1 freq anathema - 1 freq anatomists - 1 freq antunett - 1 freq ��anatomy - 1 freq anthum - 1 freq ��anything - 1 freq anatomical - 1 freq ��anatomical - 1 freq anti-militarists - 1 freq anti-english - 1 freq ��auntient - 2 freq anti-infection - 1 freq ��anything - 1 freq antanddec - 12 freq anduinnineach - 1 freq anthonyrjoseph - 1 freq aanwtnsoj - 1 freq anythn - 1 freq anathemastan - 1 freq anthemsprinter - 1 freq andean - 1 freq antonjtmcc - 1 freq anti-onybuddy - 1 freq antonymciver - 1 freq anthonyfjoshua - 1 freq andyhunter - 6 freq andymacmillan - 1 freq andymc - 1 freq	MetaPhone code - ANTM anytime - 29 freq ane-time - 1 freq anatomy - 5 freq antommy - 1 freq ��anatomy - 1 freq	ANYTIME
Time to execute Levenshtein function - 0.416123 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.570374 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.027637 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.074140 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.001080 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics