Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to script in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
script (0) - 30 freq stript (1) - 4 freq scripts (1) - 3 freq scrift (1) - 2 freq scrit (1) - 6 freq scrip (1) - 3 freq skript (1) - 1 freq scripter (2) - 6 freq crept (2) - 33 freq scriptit (2) - 1 freq scribe (2) - 7 freq scrive (2) - 1 freq crypt (2) - 2 freq scraipet (2) - 2 freq skrit (2) - 6 freq scrat (2) - 31 freq scries (2) - 1 freq scraps (2) - 22 freq scirt (2) - 1 freq sprit (2) - 2 freq sclimt (2) - 9 freq chipt (2) - 1 freq scrape (2) - 30 freq scriba (2) - 1 freq criet (2) - 1 freq	script (0) - 30 freq scrapit (2) - 10 freq scraipet (2) - 2 freq scrip (2) - 3 freq skript (2) - 1 freq scrit (2) - 6 freq scrift (2) - 2 freq scripts (2) - 3 freq stript (2) - 4 freq scraip (3) - 1 freq scrap (3) - 33 freq scairt (3) - 6 freq scrunt (3) - 7 freq scriped (3) - 1 freq scriptur (3) - 2 freq scrapy (3) - 1 freq sculpt (3) - 1 freq scriptir (3) - 4 freq scrimpit (3) - 4 freq scripted (3) - 2 freq sacrist (3) - 1 freq scrievt (3) - 1 freq scrape (3) - 30 freq scraps (3) - 22 freq scrat (3) - 31 freq	SoundEx code - S613 served - 75 freq sairved - 4 freq skreivit - 11 freq screivit - 83 freq scraped - 14 freq scripture - 15 freq servit - 8 freq servt - 1 freq scrievit - 104 freq script - 30 freq servetus - 2 freq screived - 39 freq sereived - 1 freq 'sherbit - 1 freq sherbet - 7 freq shirpit - 2 freq scrieved - 47 freq shrovetide - 1 freq shrift - 2 freq surveyed - 5 freq scribed - 1 freq scrubbed - 6 freq scrapit - 10 freq scriptures - 11 freq scriptorium - 5 freq scripturs - 4 freq scriptur - 2 freq scrift - 2 freq scraipet - 2 freq scrappet - 1 freq scrived - 3 freq sherp-tuithed - 1 freq scripted - 2 freq 'served - 1 freq scruifed - 1 freq sorbet - 2 freq scrubbit - 1 freq swerved - 2 freq scrubbid - 2 freq shiropidist - 12 freq scrabbed - 3 freq scrïptures - 1 freq serepta - 1 freq scrïpture - 1 freq serviette - 1 freq sharp-edged - 1 freq scripter - 6 freq scriptir - 4 freq scriptirs - 6 freq sarepta - 1 freq scripters - 4 freq scrievitleid - 1 freq skribbit - 1 freq screeved - 2 freq surreptitiously - 2 freq servitude - 2 freq screevit - 3 freq scrievt - 1 freq scrybit - 1 freq scriftin - 4 freq servitors - 1 freq serift - 2 freq scriftit - 2 freq scriffed - 1 freq servitours - 1 freq squarebottle' - 1 freq scriptur-readin - 1 freq scrubbit-doon - 1 freq scripteral - 1 freq shairp-toothed - 1 freq scripts - 3 freq squarebottle - 1 freq skrievit - 1 freq schirefdomes - 1 freq skreived - 2 freq surfeits - 1 freq scriptvs - 1 freq scriped - 1 freq servitor - 2 freq scrieved-on - 1 freq surfeit - 1 freq ��served - 1 freq survation - 4 freq skript - 1 freq sharped - 1 freq sherbets - 1 freq scriptit - 1 freq	MetaPhone code - SKRPT scraped - 14 freq script - 30 freq scrapit - 10 freq scraipet - 2 freq scrappet - 1 freq scriped - 1 freq skript - 1 freq	SCRIPT
Time to execute Levenshtein function - 0.190133 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.337045 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.027521 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.038921 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000905 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics