Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hent in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
hent (0) - 6 freq hert (1) - 770 freq sent (1) - 546 freq helt (1) - 5 freq ahent (1) - 102 freq henk (1) - 3 freq heat (1) - 163 freq gent (1) - 10 freq tent (1) - 460 freq shent (1) - 3 freq cent (1) - 20 freq fent (1) - 11 freq ment (1) - 18 freq hens (1) - 85 freq hint (1) - 78 freq hant (1) - 6 freq kent (1) - 1832 freq hen' (1) - 2 freq ent (1) - 3 freq pent (1) - 39 freq heng (1) - 4 freq heft (1) - 111 freq dent (1) - 4 freq hett (1) - 35 freq lent (1) - 38 freq	hent (0) - 6 freq ahent (1) - 102 freq hant (1) - 6 freq hunt (1) - 62 freq hint (1) - 78 freq hen (2) - 415 freq heet (2) - 2 freq yent (2) - 1 freq hett (2) - 35 freq lent (2) - 38 freq dent (2) - 4 freq went (2) - 1923 freq ahint (2) - 748 freq haint (2) - 3 freq haunt (2) - 28 freq bent (2) - 104 freq het (2) - 262 freq heft (2) - 111 freq rent (2) - 48 freq vent (2) - 6 freq heni (2) - 2 freq heat (2) - 163 freq gent (2) - 10 freq tent (2) - 460 freq hert (2) - 770 freq	SoundEx code - H530 haund - 384 freq hummed - 10 freq handy - 56 freq haundie - 1 freq haunt - 28 freq hin't - 1 freq haunit - 2 freq hint - 78 freq hoond - 2 freq hand - 319 freq hant - 6 freq hunt - 62 freq honed - 5 freq him-it - 1 freq hent - 6 freq haundy - 18 freq hannit - 20 freq hantie - 4 freq hamewith - 9 freq hained - 27 freq honey-dew - 1 freq haunmaid - 1 freq handee - 4 freq handie - 3 freq hind - 14 freq haun't - 4 freq hemmed - 3 freq 'haund - 3 freq hem't - 1 freq hannet - 2 freq hummit - 1 freq haand - 104 freq hindu - 8 freq him-hit - 1 freq honeyed - 1 freq hunda - 1 freq haun-med - 1 freq hainit - 6 freq him-id - 1 freq houmit - 1 freq hunde - 1 freq hinnied - 1 freq hamada - 1 freq haint - 3 freq haunnit - 2 freq hende - 1 freq heymouthe - 1 freq hynd - 1 freq ��hunty - 1 freq hound - 11 freq ��hand - 1 freq hnd - 3 freq hindi - 2 freq hond - 3 freq henwudie - 1 freq henwuddie - 3 freq haun-made - 1 freq hammett - 1 freq heymooth - 1 freq huntie - 1 freq heynd - 1 freq hmt - 1 freq hand - 1 freq hnuty - 1 freq hmdt - 1 freq honeat - 1 freq hund - 1 freq hnid - 1 freq hunt' - 1 freq handw - 1 freq	MetaPhone code - HNT haund - 384 freq handy - 56 freq haundie - 1 freq haunt - 28 freq hin't - 1 freq haunit - 2 freq hint - 78 freq hoond - 2 freq hand - 319 freq hant - 6 freq hunt - 62 freq honed - 5 freq hent - 6 freq haundy - 18 freq hannit - 20 freq hantie - 4 freq hained - 27 freq honey-dew - 1 freq handee - 4 freq handie - 3 freq hind - 14 freq haun't - 4 freq 'haund - 3 freq hannet - 2 freq haand - 104 freq hindu - 8 freq hunda - 1 freq hainit - 6 freq hunde - 1 freq hinnied - 1 freq haint - 3 freq haunnit - 2 freq hende - 1 freq ��hunty - 1 freq hound - 11 freq ��hand - 1 freq hindi - 2 freq hond - 3 freq huntie - 1 freq heynd - 1 freq hand - 1 freq honeat - 1 freq hund - 1 freq hunt' - 1 freq handw - 1 freq	HENT
Time to execute Levenshtein function - 0.304480 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.540607 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.046038 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.069376 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000917 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics