Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts


Levenshtein Distance


Similar words to scotticism in Corpus

Levenshtein

Time to execute Levenshtein function - 0.281400 milliseconds

The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.

scotticism (0) - 9 freq
scotticisms (1) - 12 freq
scotticise (1) - 1 freq
scepticism (2) - 4 freq
scottiscisms (2) - 2 freq
criticism (3) - 19 freq
scottis (3) - 52 freq
'scotticisms' (3) - 5 freq
skepticism (3) - 1 freq
scottish (3) - 1225 freq
exoticism (3) - 1 freq
stoicism (3) - 2 freq
scottishisms (3) - 1 freq
scotties (3) - 1 freq
creeticism (4) - 1 freq
scottishfa (4) - 23 freq
scottish- (4) - 1 freq
scottishsun (4) - 3 freq
scott's (4) - 15 freq
scotitie (4) - 1 freq
scottm (4) - 1 freq
scotlit's (4) - 2 freq
optimism (4) - 6 freq
activism (4) - 5 freq
scoattish (4) - 2 freq

Double Levenshtein

Time to execute Double Levenshtein function - 0.581965 milliseconds

In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.

scotticism (0) - 9 freq
scotticise (2) - 1 freq
scotticisms (2) - 12 freq
scepticism (3) - 4 freq
scottiscisms (4) - 2 freq
skepticism (5) - 1 freq
scotties (5) - 1 freq
stoicism (5) - 2 freq
scottish (5) - 1225 freq
scottis (5) - 52 freq
criticism (5) - 19 freq
creiticism (6) - 5 freq
scoattish (6) - 2 freq
scottm (6) - 1 freq
sceptisim (6) - 1 freq
scottyc (6) - 1 freq
scotts (6) - 11 freq
sceptics (6) - 3 freq
scottories (6) - 21 freq
scott's (6) - 15 freq
creeticism (6) - 1 freq
scottishisms (6) - 1 freq
exoticism (6) - 1 freq
'scotticisms' (6) - 5 freq
scatters (7) - 2 freq

SoundEx

Time to execute SoundEx function - 0.027009 milliseconds

Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

SoundEx code - S322

stooshies - 2 freq
stookies - 3 freq
suitcase - 13 freq
stokes - 2 freq
stashes - 1 freq
stages - 16 freq
scots-accented - 1 freq
stooges' - 2 freq
stooges - 3 freq
stake's - 2 freq
stuckies - 2 freq
stcik's - 1 freq
stcik - 1 freq
stiches - 2 freq
suitcases - 6 freq
scottish-swiss - 1 freq
'scottish-swiss - 1 freq
scotchies - 11 freq
sadducees - 9 freq
sadducees' - 1 freq
swatches - 16 freq
scutches - 1 freq
switches - 7 freq
scotsis - 1 freq
scotticisms - 12 freq
stoushies - 1 freq
scotticism - 9 freq
stagecoach - 3 freq
swaatches - 10 freq
sketches - 3 freq
stakes - 6 freq
stgaight - 1 freq
staggie's - 1 freq
scottishking - 8 freq
scottishsport - 3 freq
stushies - 2 freq
scottishcorpus - 2 freq
stagecoaches - 1 freq
scottiscisms - 2 freq
scottishisms - 1 freq
stuikies - 2 freq
swatchis - 1 freq
stages - 1 freq
stoicism - 2 freq
stocious - 2 freq
scotshoose - 1 freq
sidekick - 1 freq
stookie's - 1 freq
scotticise - 1 freq
scotticisation - 1 freq
steuchs - 1 freq
scottishjill - 13 freq
scottishsun - 3 freq
scottishweek - 1 freq
staceyjaneadam - 1 freq
scottishcup - 15 freq
stockist - 1 freq
stacey's - 1 freq
scotswikipedia - 2 freq
scottishgreens - 2 freq
scottishgaelic - 1 freq
scottishspyro - 1 freq
stushie-ish - 1 freq
scottishhs - 1 freq
stagecoachescot - 6 freq
stagecoachwscot - 7 freq
scottishcilt - 6 freq
scotgeoclass - 1 freq
scotsisterphoto - 1 freq
sutchkins - 1 freq
'scotticisms' - 5 freq
stokecity - 4 freq
scotshistory - 2 freq
stjoacad - 1 freq
scottishseas - 1 freq
scottishcritter - 1 freq
suitcasetrains - 1 freq

MetaPhone

Time to execute MetaPhone function - 0.069814 milliseconds

Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.

MetaPhone code - SKTSSM

scotticism - 9 freq

Manually curated

Time to execute Manually curated function - 0.000786 milliseconds

Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.

SCOTTICISM
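
The Levenshtein, Double Levenshtein and SoundEx descriptions above can be sketched in Python roughly as follows. This is an illustration under stated assumptions, not the code the site runs: the function names are made up for the example, the vowel set (a, e, i, o, u) used by the doubled distance is a guess, and the Soundex shown is a simplified variant in which h and w break up runs of consonants just as vowels do (standard implementations treat h and w slightly differently).

```python
import re


def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn a into b (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        previous = current
    return previous[-1]


# Assumption: "vowels" means a, e, i, o, u only; the page does not say.
VOWELS = re.compile(r"[aeiou]", re.IGNORECASE)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(VOWELS.sub("", a), VOWELS.sub("", b))


# Soundex digit for each consonant group; every other letter has no digit.
SOUNDEX_CODES = {
    ch: digit
    for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                           ("l", "4"), ("mn", "5"), ("r", "6"))
    for ch in letters
}


def soundex(word: str) -> str:
    """Simplified Soundex: first letter plus up to three digits, zero padded."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for c in letters[1:]:
        digit = SOUNDEX_CODES.get(c, "")
        if digit and digit != last:  # skip uncoded letters and repeated codes
            code += digit
        last = digit                 # an uncoded letter resets the repeat check
        if len(code) == 4:
            break
    return code.ljust(4, "0")


print(levenshtein("scotticism", "scepticism"))         # 2
print(double_levenshtein("scotticism", "scepticism"))  # 3
print(soundex("scotticism"))                           # S322
```

With these assumptions the sketch agrees with the listed values for the pairs checked here: scepticism is at distance 2 (3 when doubled), scottis at 3 (5 when doubled), and scotticism, suitcase and scottishsun all encode to S322.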
