Corpus of 21st Century Scots Texts - Levenshtein



Levenshtein Distance


Similar words to huvtae in Corpus

Levenshtein

The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings. The distance is shown in brackets after each word.

Time to execute Levenshtein function - 0.304083 milliseconds

huvtae (0) - 6 freq, hudtae (1) - 1 freq, huviae (1) - 1 freq, hivtae (1) - 1 freq, huvnae (1) - 59 freq,
huntie (2) - 1 freq, huttie (2) - 1 freq, yuptae (2) - 4 freq, husnae (2) - 29 freq, haetae (2) - 7 freq,
huv'nae (2) - 1 freq, uptae (2) - 19 freq, hudnae (2) - 58 freq, mustae (2) - 6 freq, havnae (2) - 22 freq,
ustae (2) - 2 freq, huvna (2) - 1 freq, huntan (2) - 4 freq, hidtae (2) - 4 freq, huvane (2) - 1 freq,
hurtle (2) - 1 freq, hustle (2) - 1 freq, furtae (2) - 57 freq, hivnae (2) - 42 freq, untae (2) - 13 freq

Double Levenshtein

In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.

Time to execute Double Levenshtein function - 0.590866 milliseconds

huvtae (0) - 6 freq, hivtae (1) - 1 freq, huvnae (2) - 59 freq, huviae (2) - 1 freq, hudtae (2) - 1 freq,
huvna (3) - 1 freq, havti (3) - 1 freq, huvane (3) - 1 freq, havnae (3) - 22 freq, hidtae (3) - 4 freq,
hivnae (3) - 42 freq, huttie (3) - 1 freq, haetae (3) - 7 freq, huntie (3) - 1 freq, haivt (4) - 1 freq,
hevan (4) - 10 freq, hyste (4) - 9 freq, hivna (4) - 48 freq, hivan (4) - 7 freq, have (4) - 1198 freq,
havenae (4) - 20 freq, haute (4) - 1 freq, hte (4) - 1 freq, hivvie (4) - 18 freq, hut (4) - 85 freq

SoundEx

Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.

Time to execute SoundEx function - 0.061113 milliseconds

SoundEx code - H130

habit - 49 freq, happit - 102 freq, heapt - 1 freq, hoped - 42 freq, howpit - 19 freq,
hivtae - 1 freq, haufwit - 2 freq, huvtae - 6 freq, howped - 14 freq, haffet - 5 freq,
haiped - 1 freq, hft - 1 freq, happt - 14 freq, hapt - 5 freq, heft - 111 freq,
heaved - 6 freq, happed - 35 freq, haft - 2 freq, haeved - 2 freq, hoobeit - 6 freq,
heavit - 2 freq, heapit - 4 freq, howbeit - 2 freq, hivved - 1 freq, hefty - 8 freq,
haived - 10 freq, hope't - 1 freq, hobbit - 2 freq, hopt - 1 freq, haipit - 1 freq,
huppity - 1 freq, höved - 2 freq, hoved - 4 freq, hiv't - 1 freq, hopit - 2 freq,
hufft - 1 freq, hoppit - 2 freq, houbeit - 7 freq, hippit - 7 freq, huffed - 3 freq,
havti - 1 freq, hauf-day - 1 freq, heaped - 3 freq, haffit - 1 freq, haivt - 1 freq,
haufed - 2 freq, howpt - 1 freq, hopped - 1 freq, hyped - 2 freq, hooped - 1 freq,
hoofed - 3 freq, hevved - 1 freq, hvd - 1 freq, hepped - 1 freq, hfwaet - 1 freq

MetaPhone

Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.

Time to execute MetaPhone function - 0.036489 milliseconds

MetaPhone code - HFT

hivtae - 1 freq, high-doh - 4 freq, huvtae - 6 freq, haffet - 5 freq, heft - 111 freq,
heaved - 6 freq, haft - 2 freq, haeved - 2 freq, hight - 2 freq, heavit - 2 freq,
hivved - 1 freq, hefty - 8 freq, haived - 10 freq, hoved - 4 freq, hiv't - 1 freq,
hufft - 1 freq, huffed - 3 freq, havti - 1 freq, hauf-day - 1 freq, haffit - 1 freq,
haivt - 1 freq, haufed - 2 freq, hoofed - 3 freq, hevved - 1 freq

Manually curated

Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.

Time to execute Manually curated function - 0.000790 milliseconds

HUVTAE
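
The Levenshtein and Double Levenshtein descriptions above can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function names `levenshtein`, `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set (a, e, i, o, u, y) is an assumption, since the page does not say which letters it removes.

```python
# Minimal sketch of the two edit-distance measures described above.
# Names and the vowel set are assumptions made for illustration only.

VOWELS = set("aeiouy")  # assumed vowel set; the page does not specify it


def levenshtein(a: str, b: str) -> int:
    """Number of single-character replacements, insertions or deletions
    needed to turn a into b (classic dynamic-programming version)."""
    previous = list(range(len(b) + 1))  # distances from "" to prefixes of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in the assumed vowel set."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant
    skeletons, so consonant changes are counted twice."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for other in ("hivtae", "huvnae", "havti"):
        print(other, levenshtein("huvtae", other),
              double_levenshtein("huvtae", other))
```

Under these assumptions a vowel swap such as huvtae / hivtae costs 1 in both measures, while a consonant swap such as huvtae / huvnae costs 1 in plain Levenshtein and 2 in Double Levenshtein, which agrees with the scores listed for those two words.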
