Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts


Levenshtein Distance


Similar words to taisten in Corpus

Levenshtein
taisten (0) - 1 freq
faisten (1) - 1 freq
taistet (1) - 1 freq
tainted (2) - 3 freq
torsten (2) - 1 freq
waistin (2) - 1 freq
taisted- (2) - 1 freq
aiten (2) - 4 freq
austen (2) - 2 freq
tistet (2) - 2 freq
tastes (2) - 37 freq
existen (2) - 2 freq
laisted (2) - 3 freq
waisted (2) - 1 freq
cuisten (2) - 2 freq
haister (2) - 2 freq
twistin (2) - 14 freq
leisten (2) - 3 freq
caistin (2) - 1 freq
taised (2) - 1 freq
taisled (2) - 1 freq
tistin (2) - 1 freq
hairten (2) - 1 freq
taisin (2) - 1 freq
twister (2) - 2 freq

Double Levenshtein
taisten (0) - 1 freq
tistin (2) - 1 freq
tastin (2) - 13 freq
faisten (2) - 1 freq
taistet (2) - 1 freq
tasted (3) - 20 freq
taistit (3) - 1 freq
tairtan (3) - 2 freq
tostin (3) - 1 freq
taste (3) - 197 freq
tristan (3) - 15 freq
tostan (3) - 1 freq
neisten (3) - 1 freq
hasten (3) - 1 freq
taist (3) - 2 freq
traistin (3) - 1 freq
taistie (3) - 1 freq
toastin (3) - 2 freq
testan (3) - 1 freq
tisen (3) - 1 freq
taislin (3) - 3 freq
tastet (3) - 1 freq
casten (3) - 2 freq
fasten (3) - 2 freq
aitten (3) - 2 freq

SoundEx
SoundEx code - T235
twistin - 14 freq
tichtened - 9 freq
testament - 42 freq
testimonials - 1 freq
tichtent - 3 freq
testin - 15 freq
textin - 8 freq
tight-mouthed - 1 freq
testimony - 5 freq
tastin - 13 freq
tightens - 2 freq
testimonie - 4 freq
toastin - 2 freq
taisten - 1 freq
ticht-and - 1 freq
twustin - 3 freq
taxation - 3 freq
tichten - 3 freq
tighten - 4 freq
testing - 12 freq
'testin - 1 freq
thocht-on - 1 freq
tightened - 2 freq
tweistin - 1 freq
testan - 1 freq
tea-stained - 1 freq
tightenin - 2 freq
tostan - 1 freq
tightnan - 2 freq
testaments - 4 freq
ticht-nailed - 1 freq
thochtiness - 1 freq
tectonic - 2 freq
testimonial - 3 freq
tichtens - 4 freq
tichtness - 2 freq
tostin - 1 freq
tichtenin - 2 freq
��testament - 1 freq
tasting - 5 freq
tistin - 1 freq
tightness - 1 freq
toastin' - 1 freq
twisting - 1 freq
toasting - 3 freq
tightening - 1 freq
texting - 1 freq
thesatinepheonixcardisgorgeous - 1 freq
tasteohame - 1 freq

MetaPhone
MetaPhone code - TSTN
decidin - 17 freq
testin - 15 freq
destiny - 12 freq
dustin - 12 freq
tastin - 13 freq
disdain - 10 freq
toastin - 2 freq
taisten - 1 freq
distain - 1 freq
destinie - 3 freq
decidein - 1 freq
distanee - 2 freq
'testin - 1 freq
testan - 1 freq
tostan - 1 freq
desydin - 1 freq
tostin - 1 freq
tistin - 1 freq
��destiny - 1 freq
deistin - 3 freq
toastin' - 1 freq

Manually curated
TAISTEN
Time to execute Levenshtein function - 0.215382 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.

Time to execute Double Levenshtein function - 0.394149 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.

Time to execute SoundEx function - 0.032951 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

Time to execute MetaPhone function - 0.039093 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.

Time to execute Manually curated function - 0.000810 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
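
The two Levenshtein measures and the SoundEx code described above can be sketched in a few lines of Python. This is an illustration only, not the site's own implementation: the function names are hypothetical, the vowel set (a, e, i, o, u) used for the second pass is an assumption, and the Soundex rules are the standard American ones, which may differ in detail from whatever the site runs.

```python
def levenshtein(a: str, b: str) -> int:
    """Number of single-character replacements, insertions or deletions
    needed to turn a into b (classic dynamic-programming version)."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


VOWELS = set("aeiou")  # assumption: the page does not say which letters count


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words,
    so that consonant differences are counted twice."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# Standard American Soundex digit groups (assumed, not taken from the site).
SOUNDEX_CODES = {}
for group, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                     ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in group:
        SOUNDEX_CODES[letter] = digit


def soundex(word: str) -> str:
    """First letter plus up to three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        last = digit  # vowels reset the "last code", so repeats are kept
    return (code + "000")[:4]
```

Under these assumptions the sketch agrees with the results listed for taisten: `levenshtein("taisten", "faisten")` is 1, `double_levenshtein("taisten", "tasted")` is 3, and `soundex("taisten")` is `T235`.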
