Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Levenshtein Distance

Words similar to "titania" in the corpus

Levenshtein (distance in parentheses):
titania (0) - 1 freq, titanic (1) - 27 freq, italia (2) - 1 freq, totonia (2) - 1 freq, titan (2) - 5 freq, titanium (2) - 2 freq, titans (2) - 1 freq, titanic' (2) - 1 freq, titnt (3) - 1 freq, ttands (3) - 1 freq, tittin (3) - 2 freq, milanda (3) - 1 freq, kirtana (3) - 1 freq, vitamin (3) - 34 freq, tiltan (3) - 1 freq, miranda (3) - 7 freq, ritan (3) - 5 freq, idaia (3) - 27 freq, litany (3) - 1 freq, leitanie (3) - 1 freq, tirnin (3) - 5 freq, natalia (3) - 2 freq, mithna (3) - 2 freq, stanie (3) - 2 freq, itsna (3) - 1 freq

Double Levenshtein (distance in parentheses):
titania (0) - 1 freq, titan (2) - 5 freq, titanic (2) - 27 freq, totonia (2) - 1 freq, titian (3) - 1 freq, titans (3) - 1 freq, titanium (3) - 2 freq, fitna (4) - 40 freq, eitan (4) - 1 freq, tanya (4) - 1 freq, tetanus (4) - 1 freq, titnes (4) - 1 freq, tinnie (4) - 7 freq, tisan (4) - 2 freq, tittie (4) - 30 freq, tiftan (4) - 1 freq, antonia (4) - 8 freq, itan (4) - 1 freq, teitin (4) - 1 freq, aittin (4) - 1 freq, attain (4) - 2 freq, italian (4) - 62 freq, titi (4) - 1 freq, bitan (4) - 1 freq, petunia (4) - 3 freq

SoundEx (code T350):
tea-time - 8 freq, tidyin - 6 freq, teatime - 13 freq, tuition - 7 freq, 'twadna - 1 freq, tidn - 1 freq, tooten - 2 freq, tootin - 2 freq, thythm - 2 freq, tweetin - 20 freq, totonia - 1 freq, teetin - 10 freq, tattooin - 3 freq, taytim - 1 freq, tay-tim - 1 freq, tuttin - 5 freq, tittin - 2 freq, thuddin - 3 freq, taetime - 2 freq, two-tone - 1 freq, tuttan - 1 freq, teuton - 2 freq, teitin - 1 freq, teeden - 1 freq, titan - 5 freq, titania - 1 freq, teethin - 1 freq, titian - 1 freq, tae-time - 1 freq, taytime - 1 freq, toadyin - 1 freq

MetaPhone (code TTN):
didna - 1636 freq, didnae - 1693 freq, doutna - 1 freq, 'didnae - 2 freq, daudin - 2 freq, dauden - 1 freq, dootin - 5 freq, didein - 1 freq, tidn - 1 freq, tooten - 2 freq, tootin - 2 freq, dittaino - 1 freq, datin - 11 freq, didny - 16 freq, totonia - 1 freq, dotin - 2 freq, daednae - 3 freq, teetin - 10 freq, tattooin - 3 freq, didno - 17 freq, deudno - 4 freq, tuttin - 5 freq, dïdnae - 24 freq, did'nae - 1 freq, tittin - 2 freq, deayton - 1 freq, datten - 8 freq, did'n - 1 freq, dottan - 2 freq, dateen - 1 freq, tuttan - 1 freq, 'didna - 1 freq, teuton - 2 freq, dottin - 1 freq, teitin - 1 freq, deddin - 1 freq, teeden - 1 freq, datn - 4 freq, didn - 25 freq, doitin - 3 freq, dittan - 1 freq, dautin - 2 freq, doutin - 1 freq, datna - 2 freq, titan - 5 freq, titania - 1 freq, didni - 21 freq, ��didn - 1 freq, didne - 1 freq, ��didn - 1 freq, did-an - 1 freq, daddin - 1 freq, duettin - 1 freq, dudno - 3 freq, didnay - 2 freq, dtn - 1 freq, didna' - 1 freq, dadden - 1 freq

Manually curated:
TITANIA

Time to execute Levenshtein function: 0.665712 milliseconds. The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.

Time to execute Double Levenshtein function: 0.957271 milliseconds. In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.

Time to execute SoundEx function: 0.072505 milliseconds. Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

Time to execute MetaPhone function: 0.085972 milliseconds. Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.

Time to execute Manually curated function: 0.001216 milliseconds. Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
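
The first three measures described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code this site runs: the function names are made up for the example, the set of letters stripped as "vowels" is a guess (y is included because that happens to reproduce the score of 4 shown for tanya), and the Soundex routine follows the common American Soundex rules and simply ignores non-ASCII letters.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance where each insert, delete or replace costs 1."""
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep a match)
            ))
        previous = current
    return previous[-1]


# Assumption: which letters count as vowels is not stated on the page.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


# Standard Soundex digit groups; vowels and h, w, y have no digit.
SOUNDEX_CODES = {
    **dict.fromkeys("bfpv", "1"),
    **dict.fromkeys("cgjkqsxz", "2"),
    **dict.fromkeys("dt", "3"),
    "l": "4",
    **dict.fromkeys("mn", "5"),
    "r": "6",
}


def soundex(word: str) -> str:
    """Four-character Soundex code: first letter plus up to three digits."""
    # Simplification: keep only ASCII letters (drops hyphens, apostrophes, accents).
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    last = SOUNDEX_CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        if ch not in "hw":  # h and w do not separate repeated codes
            last = digit
    return (code + "000")[:4]


if __name__ == "__main__":
    print(levenshtein("titania", "titanic"))
    print(double_levenshtein("titania", "titanic"))
    print(soundex("titania"))
```

Under these assumptions the sketch agrees with several of the figures listed on this page: titanic scores 1 under Levenshtein and 2 under Double Levenshtein, titan scores 2 under both, and titania, tea-time and thuddin all map to the Soundex code T350.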
