Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to waitit in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
waitit (0) - 81 freq waitet (1) - 20 freq waftit (1) - 4 freq waitin (1) - 411 freq wastit (1) - 21 freq saitit (1) - 2 freq wantit (1) - 280 freq wautit (1) - 1 freq waistit (1) - 4 freq wailit (1) - 1 freq awaitit (1) - 3 freq maitit (1) - 1 freq whitit (1) - 1 freq haitit (1) - 1 freq waukit (2) - 4 freq waith (2) - 2 freq waist (2) - 45 freq whitrit (2) - 6 freq wanlit (2) - 4 freq airtit (2) - 42 freq sautit (2) - 2 freq swattit (2) - 2 freq vaikit (2) - 1 freq wakkit (2) - 2 freq daivit (2) - 4 freq	waitit (0) - 81 freq wautit (1) - 1 freq awaitit (1) - 3 freq waitet (1) - 20 freq haitit (2) - 1 freq waitin (2) - 411 freq watt (2) - 27 freq wattie (2) - 24 freq witt (2) - 2 freq maitit (2) - 1 freq wytit (2) - 20 freq whitit (2) - 1 freq wantit (2) - 280 freq saitit (2) - 2 freq wastit (2) - 21 freq waftit (2) - 4 freq waistit (2) - 4 freq wailit (2) - 1 freq wipit (3) - 5 freq editit (3) - 6 freq waakit (3) - 18 freq waitan (3) - 27 freq saetit (3) - 1 freq wabiti (3) - 1 freq awaitin (3) - 6 freq	SoundEx code - W330 waited - 109 freq wydit - 1 freq waitit - 81 freq withoot - 413 freq whitit - 1 freq whittie-whattie - 2 freq wud-at - 1 freq watehd - 1 freq wide-e'ed - 1 freq wide-eed - 2 freq wytit - 20 freq without - 100 freq wyted - 13 freq wheetie-whattie - 1 freq weedowed - 3 freq weedowhood - 1 freq wattet - 1 freq widowed - 1 freq waddit - 3 freq whited - 2 freq whitied - 1 freq wuidit - 1 freq whetted - 1 freq wide-eyed - 2 freq with-oot - 1 freq waded - 6 freq waitet - 20 freq wootwoodo - 1 freq widid - 2 freq widit - 1 freq wadded - 1 freq weeda't - 1 freq wedded - 4 freq wideeyed - 2 freq 'withoot - 1 freq withhaud - 1 freq 'without' - 1 freq wuidid - 1 freq widdit - 2 freq watedoo - 1 freq wautit - 1 freq wadit - 1 freq witoot - 1 freq wyded - 1 freq weeted - 1 freq weedit - 1 freq white-hot - 1 freq wuddit - 1 freq wedowheid - 1 freq witout - 2 freq wated - 1 freq widded - 1 freq weeded - 1 freq	MetaPhone code - WTT waited - 109 freq waitit - 81 freq whit'd - 2 freq whitit - 1 freq wud-at - 1 freq watehd - 1 freq wide-e'ed - 1 freq wide-eed - 2 freq wattet - 1 freq waddit - 3 freq whited - 2 freq whitied - 1 freq wuidit - 1 freq whetted - 1 freq waded - 6 freq waitet - 20 freq widid - 2 freq widit - 1 freq wadded - 1 freq weeda't - 1 freq wedded - 4 freq witd - 1 freq wuidid - 1 freq widdit - 2 freq watedoo - 1 freq wautit - 1 freq wadit - 1 freq witoot - 1 freq weeted - 1 freq weedit - 1 freq wit'd - 1 freq what'd - 1 freq wuddit - 1 freq witout - 2 freq wated - 1 freq widded - 1 freq weeded - 1 freq	WAITIT wait - 481 freq waitin - 411 freq waited - 109 freq waitit - 81 freq waiting - 49 freq waits - 27 freq
Time to execute Levenshtein function - 0.179386 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.346964 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.027619 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.037836 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000990 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics