Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to stooges in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
stooges (0) - 3 freq stooges' (1) - 2 freq storees (2) - 1 freq stoops (2) - 4 freq shoogs (2) - 1 freq shooge (2) - 1 freq stormes (2) - 1 freq stooter (2) - 1 freq sponges (2) - 8 freq stories (2) - 359 freq stonger (2) - 1 freq stoored (2) - 4 freq stoor's (2) - 1 freq stooped (2) - 6 freq stoons (2) - 3 freq stodge (2) - 3 freq toogs (2) - 6 freq stokes (2) - 2 freq stooks (2) - 16 freq stovies (2) - 37 freq stones (2) - 29 freq stages (2) - 16 freq shoogles (2) - 8 freq stools (2) - 2 freq stookies (2) - 3 freq	stooges (0) - 3 freq stages (2) - 16 freq stooges' (2) - 2 freq stools (3) - 2 freq stovies (3) - 37 freq stooks (3) - 16 freq stookies (3) - 3 freq stones (3) - 29 freq stowes (3) - 1 freq stoves (3) - 3 freq stores (3) - 30 freq stoots (3) - 2 freq toogs (3) - 6 freq stags (3) - 1 freq stokes (3) - 2 freq stugs (3) - 1 freq stories (3) - 359 freq stoops (3) - 4 freq shoogs (3) - 1 freq stoons (3) - 3 freq storees (3) - 1 freq stanes (4) - 277 freq stons (4) - 13 freq stakes (4) - 6 freq stobs (4) - 8 freq	SoundEx code - S322 stooshies - 2 freq stookies - 3 freq suitcase - 14 freq stokes - 2 freq stashes - 1 freq stages - 16 freq scots-accented - 1 freq stooges' - 2 freq stooges - 3 freq stake's - 2 freq stuckies - 2 freq stcik's - 1 freq stcik - 1 freq suitcasees - 1 freq sketches - 4 freq stiches - 2 freq suitcases - 6 freq scottish-swiss - 1 freq 'scottish-swiss - 1 freq scotchies - 11 freq sadducees - 9 freq sadducees' - 1 freq swatches - 16 freq scutches - 1 freq switches - 7 freq scotsis - 1 freq scotticisms - 12 freq stoushies - 1 freq scotticism - 9 freq stagecoach - 3 freq swaatches - 10 freq stakes - 6 freq stgaight - 1 freq staggie's - 1 freq scottishking - 8 freq scottishsport - 3 freq stushies - 2 freq scottishcorpus - 2 freq stagecoaches - 1 freq scottiscisms - 2 freq scottishisms - 1 freq stuikies - 2 freq swatchis - 1 freq ��stages - 1 freq stoicism - 2 freq stocious - 2 freq scotshoose - 1 freq sidekick - 1 freq stookie's - 1 freq scotticise - 1 freq scotticisation - 1 freq steuchs - 1 freq scottishjill - 13 freq scottishsun - 3 freq scottishweek - 1 freq staceyjaneadam - 1 freq scottishcup - 15 freq stockist - 1 freq stacey's - 1 freq scotswikipedia - 2 freq scottishgreens - 2 freq scottishgaelic - 1 freq scottishspyro - 1 freq stushie-ish - 1 freq scottishhs - 1 freq stagecoachescot - 6 freq stagecoachwscot - 7 freq scottishcilt - 6 freq scotgeoclass - 1 freq scotsisterphoto - 1 freq sutchkins - 1 freq 'scotticisms' - 5 freq stokecity - 4 freq scotshistory - 2 freq stjoacad - 1 freq scottishseas - 1 freq scottishcritter - 1 freq suitcasetrains - 1 freq	MetaPhone code - STJS stages - 16 freq stooges' - 2 freq stooges - 3 freq ��stages - 1 freq	STOOGES
Time to execute Levenshtein function - 0.256808 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.372471 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.031165 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.039244 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000934 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics