Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to od in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
od (0) - 8 freq rd (1) - 63 freq ld (1) - 3 freq oq (1) - 4 freq ov (1) - 59 freq ok (1) - 116 freq odo (1) - 1 freq oj (1) - 2 freq td (1) - 9 freq oad (1) - 2 freq md (1) - 10 freq -d (1) - 8 freq cod (1) - 20 freq god (1) - 941 freq zd (1) - 1 freq �d (1) - 3 freq ovd (1) - 3 freq or (1) - 9206 freq bd (1) - 6 freq vd (1) - 2 freq ond (1) - 1 freq mod (1) - 9 freq ot (1) - 18 freq oid (1) - 1 freq oh (1) - 1175 freq	od (0) - 8 freq ud (1) - 4 freq oda (1) - 5 freq oid (1) - 1 freq ed (1) - 53 freq d (1) - 462 freq id (1) - 597 freq ode (1) - 13 freq oad (1) - 2 freq ad (1) - 126 freq oed (1) - 6 freq odo (1) - 1 freq om (2) - 6 freq ob (2) - 2 freq o (2) - 56035 freq qd (2) - 1 freq jd (2) - 5 freq yud (2) - 1 freq ou (2) - 17 freq uid (2) - 1 freq doy (2) - 1 freq eid (2) - 4 freq kd (2) - 2 freq oy (2) - 5 freq og (2) - 10 freq	SoundEx code - O300 oot - 13735 freq out - 773 freq othe - 4 freq o't - 272 freq ooty - 21 freq oot-d'ye - 1 freq owt - 10 freq oath - 13 freq odd - 134 freq 'oot - 12 freq ode - 13 freq od't - 4 freq ot - 18 freq oat - 12 freq o'd - 13 freq ootae - 83 freq -odd - 3 freq oota - 53 freq ootwi - 17 freq oawthe - 1 freq owed - 13 freq owet - 1 freq oit - 1 freq ooadaa - 1 freq owte - 2 freq od - 8 freq oottae - 1 freq ooto - 85 freq oed - 6 freq oot' - 4 freq ootd - 2 freq owd - 1 freq o'tay - 1 freq out' - 2 freq 'out - 2 freq ootdae - 5 freq o't' - 1 freq owid - 7 freq oatae - 1 freq ott - 4 freq oda - 5 freq oot-o'-e-way - 2 freq oot- - 1 freq ��oot - 5 freq oo'd - 1 freq ��out - 2 freq out-the-wey - 1 freq out-waw - 1 freq ��out - 1 freq ��oot - 45 freq oot-the-wey - 1 freq oeht - 1 freq 'oot' - 1 freq odo - 1 freq ootta - 2 freq ��oot - 1 freq othha - 1 freq oad - 2 freq outta - 4 freq ��ootwi - 2 freq ��ode - 1 freq outty - 2 freq oudey - 1 freq oid - 1 freq o'the - 7 freq othe - 8 freq o'dee - 1 freq o'at - 1 freq oodie - 1 freq outwi - 1 freq oto - 1 freq ot - 2 freq oot - 1 freq 'ootwi' - 1 freq outa - 6 freq	MetaPhone code - OT oot - 13735 freq out - 773 freq o't - 272 freq ooty - 21 freq owt - 10 freq odd - 134 freq 'oot - 12 freq ode - 13 freq ot - 18 freq oat - 12 freq o'd - 13 freq ootae - 83 freq -odd - 3 freq oota - 53 freq oit - 1 freq ooadaa - 1 freq owte - 2 freq od - 8 freq oottae - 1 freq ooto - 85 freq oed - 6 freq oot' - 4 freq owd - 1 freq o'tay - 1 freq out' - 2 freq 'out - 2 freq o't' - 1 freq oatae - 1 freq ott - 4 freq oda - 5 freq oot- - 1 freq ��oot - 5 freq oo'd - 1 freq ��out - 2 freq ��out - 1 freq ��oot - 45 freq oeht - 1 freq 'oot' - 1 freq odo - 1 freq ootta - 2 freq ��oot - 1 freq oad - 2 freq outta - 4 freq ��ode - 1 freq outty - 2 freq oudey - 1 freq oid - 1 freq o'dee - 1 freq o'at - 1 freq oodie - 1 freq oto - 1 freq ot - 2 freq oot - 1 freq outa - 6 freq	OD
Time to execute Levenshtein function - 0.502176 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.980450 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.090841 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.110119 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.001012 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics