A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hard-on in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hard-on (0) - 1 freq
hard-won (1) - 2 freq
hard-oan (1) - 1 freq
hardon (1) - 2 freq
add-on (2) - 1 freq
hard-goin (2) - 1 freq
hard-wan (2) - 1 freq
pardon (2) - 31 freq
haldon (2) - 1 freq
cardoon (2) - 1 freq
harden (2) - 21 freq
heid-on (2) - 1 freq
harpoon (2) - 8 freq
hardmen (2) - 2 freq
gordon (3) - 123 freq
marion (3) - 42 freq
sharron (3) - 3 freq
harnpan (3) - 6 freq
marlon (3) - 1 freq
herdan (3) - 1 freq
carson (3) - 1 freq
handin (3) - 15 freq
carbon (3) - 7 freq
hardenin (3) - 2 freq
hardcore (3) - 2 freq
hard-on (0) - 1 freq
hard-oan (1) - 1 freq
hardon (2) - 2 freq
hard-won (2) - 2 freq
harden (3) - 21 freq
heid-on (3) - 1 freq
hardmen (3) - 2 freq
hard-goin (3) - 1 freq
hard-wan (3) - 1 freq
hurdlin (4) - 1 freq
herdan (4) - 1 freq
herdin (4) - 15 freq
hoardin (4) - 6 freq
heard- (4) - 1 freq
hirdin (4) - 4 freq
haurd-up (4) - 1 freq
herdmen (4) - 2 freq
hardenin (4) - 2 freq
harpoon (4) - 8 freq
haldon (4) - 1 freq
cardoon (4) - 1 freq
pardon (4) - 31 freq
add-on (4) - 1 freq
yird-an (4) - 1 freq
fardin (5) - 1 freq
SoundEx code - H635
hardenin - 2 freq
hertnin - 5 freq
hardon - 2 freq
hard-oan - 1 freq
hurtin - 17 freq
hardened - 6 freq
hoardin - 6 freq
hertened - 1 freq
harrowtn - 1 freq
herdin - 15 freq
herdan - 1 freq
hard-won - 2 freq
hirdin's - 1 freq
hertenin - 8 freq
herten - 3 freq
hirdin - 4 freq
hard-enough - 1 freq
hortensia - 31 freq
hard-wan - 1 freq
hard-eamed - 1 freq
herdmen - 2 freq
heartenan - 1 freq
hardmen - 2 freq
hairdnes - 1 freq
herthungry - 1 freq
hurtan - 1 freq
hairtmaist - 1 freq
hoardings - 1 freq
huirdin - 1 freq
hairdened - 1 freq
hairten't - 1 freq
herdent - 1 freq
hard-neckit - 1 freq
hurtyng - 2 freq
harden - 21 freq
harden's - 3 freq
hert-hungir - 1 freq
haurd-thinkin - 1 freq
horatian - 1 freq
hardanger - 1 freq
hard-on - 1 freq
hairdness - 1 freq
hurting - 3 freq
hardness - 1 freq
hoordin - 2 freq
herdins - 3 freq
hardiness - 2 freq
haurdent - 1 freq
€™hurtin - 1 freq
hairten - 1 freq
hardening - 1 freq
heartened - 1 freq
hardneck - 1 freq
harding' - 1 freq
hairt-hunger - 1 freq
herding - 1 freq
hartnett - 1 freq
howardmrw - 1 freq
MetaPhone code - HRTN
hardon - 2 freq
hard-oan - 1 freq
hurtin - 17 freq
hoardin - 6 freq
harrowtn - 1 freq
herdin - 15 freq
herdan - 1 freq
herten - 3 freq
hirdin - 4 freq
hurtan - 1 freq
huirdin - 1 freq
harden - 21 freq
hard-on - 1 freq
hoordin - 2 freq
€™hurtin - 1 freq
hairten - 1 freq
HARD-ON
Time to execute Levenshtein function - 0.237626 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.369631 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030497 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037343 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000828 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.