Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to ah-thin in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
ah-thin (0) - 3 freq ah-hin (1) - 2 freq a'thin (2) - 2 freq anythin (2) - 145 freq aa'thin (2) - 7 freq ahthin' (2) - 1 freq aathin (2) - 203 freq athin (2) - 226 freq awthin (2) - 133 freq anything (3) - 138 freq nacthin (3) - 3 freq chattin (3) - 22 freq shitein (3) - 2 freq thatchin (3) - 2 freq kitthin (3) - 1 freq swithin (3) - 1 freq athins (3) - 2 freq weethin (3) - 28 freq shuttin (3) - 39 freq 'aathin (3) - 1 freq somthin (3) - 7 freq nothin (3) - 194 freq sumthin (3) - 97 freq ahein (3) - 1 freq awhin (3) - 1 freq	ah-thin (0) - 3 freq ah-hin (2) - 2 freq aathin (4) - 203 freq awthin (4) - 133 freq ahthin' (4) - 1 freq athin (4) - 226 freq aa'thin (4) - 7 freq anythin (4) - 145 freq a'thin (4) - 2 freq anythn (5) - 1 freq ah-ta (5) - 1 freq naecthin (5) - 1 freq xanthin (5) - 1 freq ah-hah (5) - 1 freq hushin (5) - 1 freq seithin (5) - 1 freq naithin (5) - 3 freq hitchin (5) - 1 freq teethin (5) - 1 freq ah-ah (5) - 1 freq 'nothin (5) - 4 freq naeithin (5) - 1 freq moothin (5) - 3 freq gethin (5) - 1 freq vrythin (5) - 1 freq	SoundEx code - A350 atween - 1053 freq awthin - 133 freq aathin - 203 freq ae-time - 5 freq addin - 39 freq autumn - 60 freq adoun - 2 freq athin - 226 freq adam - 189 freq aetin - 27 freq ae-them - 1 freq aeten - 7 freq atwein - 46 freq atein - 4 freq a-team - 2 freq aiten - 4 freq 'adam - 2 freq auduma - 2 freq athein - 1 freq awthein - 9 freq adden - 1 freq aitten - 2 freq awthin' - 7 freq aa'thin - 7 freq 'atween - 1 freq awaitin - 6 freq admm - 1 freq atoun - 1 freq aidin - 1 freq ah-thin - 3 freq aetan - 3 freq atone - 2 freq aitin - 21 freq atwain - 2 freq atom - 5 freq addin' - 1 freq a-daein - 1 freq 'aathin - 1 freq 'awthin - 4 freq aeteen - 1 freq aden - 2 freq atin - 3 freq at'm - 1 freq adom - 1 freq atwen - 1 freq aatin - 1 freq aatheen - 1 freq aw-time - 1 freq adem - 1 freq a'dyn - 1 freq atwien - 2 freq a'diein - 1 freq ahtween - 2 freq addan - 3 freq aten - 2 freq attain - 2 freq aatum - 1 freq aiteen - 1 freq aa-deen - 2 freq 'aa-deen - 1 freq 'aa-deen' - 1 freq athena - 26 freq 'autumn - 1 freq a'thin - 2 freq atomie - 3 freq autmn - 1 freq atten - 2 freq aittin - 1 freq adomnán - 2 freq Éadaoin - 1 freq ��aathin - 1 freq aidam - 1 freq athene - 1 freq aidan - 5 freq ��aathin - 1 freq aidom - 1 freq atheen - 1 freq atween - 1 freq a'tween - 1 freq aathin' - 1 freq atmo - 1 freq adna - 1 freq add-on - 1 freq ahthin' - 1 freq aidanmo - 5 freq atm - 3 freq adn - 1 freq	MetaPhone code - A0N awthin - 133 freq aathin - 203 freq athin - 226 freq athein - 1 freq awthein - 9 freq awthin' - 7 freq aa'thin - 7 freq ah-thin - 3 freq 'aathin - 1 freq 'awthin - 4 freq aatheen - 1 freq athena - 26 freq a'thin - 2 freq ��aathin - 1 freq athene - 1 freq ��aathin - 1 freq atheen - 1 freq aathin' - 1 freq ahthin' - 1 freq	AH-THIN
Time to execute Levenshtein function - 0.195472 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.333397 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.027669 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.037130 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000928 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics