Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hint in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
hint (0) - 78 freq wint (1) - 629 freq ahint (1) - 748 freq hinn (1) - 1 freq fint (1) - 2 freq hent (1) - 6 freq hine (1) - 26 freq hilt (1) - 2 freq haint (1) - 3 freq hunt (1) - 62 freq hin't (1) - 1 freq dint (1) - 10 freq bint (1) - 10 freq kint (1) - 58 freq rint (1) - 4 freq mint (1) - 64 freq hin (1) - 36 freq pint (1) - 188 freq hixt (1) - 1 freq hink (1) - 436 freq hind (1) - 14 freq hina (1) - 2 freq nint (1) - 4 freq hant (1) - 6 freq hing (1) - 604 freq	hint (0) - 78 freq haint (1) - 3 freq hant (1) - 6 freq hent (1) - 6 freq hunt (1) - 62 freq ahint (1) - 748 freq int (2) - 31 freq hit (2) - 1229 freq hing (2) - 604 freq aint (2) - 13 freq nint (2) - 4 freq hina (2) - 2 freq jint (2) - 6 freq tint (2) - 218 freq haunt (2) - 28 freq ahent (2) - 102 freq hind (2) - 14 freq sint (2) - 24 freq hints (2) - 9 freq hainit (2) - 6 freq lint (2) - 17 freq hilt (2) - 2 freq dint (2) - 10 freq hinn (2) - 1 freq hine (2) - 26 freq	SoundEx code - H530 haund - 384 freq hummed - 10 freq handy - 56 freq haundie - 1 freq haunt - 28 freq hin't - 1 freq haunit - 2 freq hint - 78 freq hoond - 2 freq hand - 319 freq hant - 6 freq hunt - 62 freq honed - 5 freq him-it - 1 freq hent - 6 freq haundy - 18 freq hannit - 20 freq hantie - 4 freq hamewith - 9 freq hained - 27 freq honey-dew - 1 freq haunmaid - 1 freq handee - 4 freq handie - 3 freq hind - 14 freq haun't - 4 freq hemmed - 3 freq 'haund - 3 freq hem't - 1 freq hannet - 2 freq hummit - 1 freq haand - 104 freq hindu - 8 freq him-hit - 1 freq honeyed - 1 freq hunda - 1 freq haun-med - 1 freq hainit - 6 freq him-id - 1 freq houmit - 1 freq hunde - 1 freq hinnied - 1 freq hamada - 1 freq haint - 3 freq haunnit - 2 freq hende - 1 freq heymouthe - 1 freq hynd - 1 freq ��hunty - 1 freq hound - 11 freq ��hand - 1 freq hnd - 3 freq hindi - 2 freq hond - 3 freq henwudie - 1 freq henwuddie - 3 freq haun-made - 1 freq hammett - 1 freq heymooth - 1 freq huntie - 1 freq heynd - 1 freq hmt - 1 freq hand - 1 freq hnuty - 1 freq hmdt - 1 freq honeat - 1 freq hund - 1 freq hnid - 1 freq hunt' - 1 freq handw - 1 freq	MetaPhone code - HNT haund - 384 freq handy - 56 freq haundie - 1 freq haunt - 28 freq hin't - 1 freq haunit - 2 freq hint - 78 freq hoond - 2 freq hand - 319 freq hant - 6 freq hunt - 62 freq honed - 5 freq hent - 6 freq haundy - 18 freq hannit - 20 freq hantie - 4 freq hained - 27 freq honey-dew - 1 freq handee - 4 freq handie - 3 freq hind - 14 freq haun't - 4 freq 'haund - 3 freq hannet - 2 freq haand - 104 freq hindu - 8 freq hunda - 1 freq hainit - 6 freq hunde - 1 freq hinnied - 1 freq haint - 3 freq haunnit - 2 freq hende - 1 freq ��hunty - 1 freq hound - 11 freq ��hand - 1 freq hindi - 2 freq hond - 3 freq huntie - 1 freq heynd - 1 freq hand - 1 freq honeat - 1 freq hund - 1 freq hunt' - 1 freq handw - 1 freq	HINT
Time to execute Levenshtein function - 0.176657 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.315617 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.032952 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.040077 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000952 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics