Corpus of 21st Century Scots Texts - Levenshtein

A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to huvtae in Corpus

Levenshtein	Double Levenshtein	SoundEx	MetaPhone	Manually curated
huvtae (0) - 6 freq huvnae (1) - 61 freq huviae (1) - 1 freq hivtae (1) - 1 freq hudtae (1) - 1 freq hudnae (2) - 62 freq yuptae (2) - 4 freq furtae (2) - 57 freq huvane (2) - 1 freq huv'nae (2) - 1 freq haetae (2) - 7 freq havnae (2) - 23 freq hurtle (2) - 3 freq hurtan (2) - 1 freq huntan (2) - 4 freq hidtae (2) - 4 freq husnae (2) - 32 freq mustae (2) - 6 freq huttie (2) - 1 freq untae (2) - 13 freq uptae (2) - 22 freq ustae (2) - 2 freq huvna (2) - 1 freq hivnae (2) - 42 freq huntie (2) - 1 freq	huvtae (0) - 6 freq hivtae (1) - 1 freq hudtae (2) - 1 freq huvnae (2) - 61 freq huviae (2) - 1 freq havnae (3) - 23 freq huvna (3) - 1 freq hidtae (3) - 4 freq haetae (3) - 7 freq huvane (3) - 1 freq havti (3) - 1 freq hivnae (3) - 42 freq huntie (3) - 1 freq huttie (3) - 1 freq hantie (4) - 4 freq hut (4) - 85 freq hivna (4) - 48 freq hudty (4) - 1 freq huv (4) - 1243 freq vitae (4) - 3 freq hive (4) - 29 freq hivvie (4) - 18 freq hertie (4) - 11 freq havna (4) - 2 freq hivan (4) - 7 freq	SoundEx code - H130 habit - 50 freq happit - 104 freq heapt - 1 freq hoped - 42 freq howpit - 19 freq hivtae - 1 freq haufwit - 2 freq huvtae - 6 freq howped - 14 freq haffet - 5 freq haiped - 1 freq hft - 1 freq happt - 14 freq hapt - 5 freq heft - 111 freq heaved - 6 freq happed - 35 freq haft - 2 freq haeved - 2 freq hoobeit - 6 freq heavit - 2 freq heapit - 4 freq howbeit - 2 freq hivved - 1 freq hopit - 6 freq hefty - 8 freq haived - 10 freq hope't - 1 freq hobbit - 2 freq hopt - 1 freq haipit - 1 freq huppity - 1 freq höved - 2 freq hoved - 4 freq hiv't - 1 freq hufft - 1 freq hoppit - 2 freq houbeit - 7 freq hippit - 7 freq huffed - 3 freq havti - 1 freq hauf-day - 1 freq heaped - 3 freq haffit - 1 freq haivt - 1 freq haufed - 2 freq howpt - 1 freq hopped - 1 freq hyped - 2 freq hooped - 1 freq hoofed - 3 freq hevved - 1 freq hvd - 1 freq hepped - 1 freq hfwaet - 1 freq	MetaPhone code - HFT hivtae - 1 freq high-doh - 4 freq huvtae - 6 freq haffet - 5 freq heft - 111 freq heaved - 6 freq haft - 2 freq haeved - 2 freq hight - 2 freq heavit - 2 freq hivved - 1 freq hefty - 8 freq haived - 10 freq hoved - 4 freq hiv't - 1 freq hufft - 1 freq huffed - 3 freq havti - 1 freq hauf-day - 1 freq haffit - 1 freq haivt - 1 freq haufed - 2 freq hoofed - 3 freq hevved - 1 freq	HUVTAE
Time to execute Levenshtein function - 0.207848 milliseconds The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings	Time to execute Double Levenshtein function - 0.395882 milliseconds In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.	Time to execute SoundEx function - 0.038772 milliseconds Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.	Time to execute MetaPhone function - 0.043979 milliseconds Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.	Time to execute Manually curated function - 0.000931 milliseconds Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.

Web Analytics