A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to het-houss in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
het-houss (0) - 1 freq
tea-houss (2) - 1 freq
het-hoose (2) - 1 freq
hethoose (3) - 1 freq
cot-hous (3) - 2 freq
hen-hooses (3) - 6 freq
hen-hoose (3) - 13 freq
henhouse (3) - 1 freq
methods (4) - 8 freq
hothooses (4) - 1 freq
fermhous (4) - 4 freq
haw-buss (4) - 1 freq
heh-class (4) - 1 freq
weir-horss (4) - 1 freq
keb-hoose (4) - 1 freq
oot-hauds (4) - 1 freq
hert-touch (4) - 1 freq
retours (4) - 4 freq
hert-holl (4) - 5 freq
cert-horse (4) - 1 freq
pot-holes (4) - 1 freq
ex-boss (4) - 1 freq
shit-hoos (4) - 1 freq
cothous (4) - 9 freq
het-trods (4) - 1 freq
het-houss (0) - 1 freq
het-hoose (3) - 1 freq
tea-houss (3) - 1 freq
hen-hooses (4) - 6 freq
oot-hooses (5) - 1 freq
hothooses (5) - 1 freq
hen-hoose (5) - 13 freq
cot-hous (5) - 2 freq
hethoose (5) - 1 freq
haa-hoose (6) - 1 freq
outhouses (6) - 1 freq
yill-houss (6) - 1 freq
shit-hoos (6) - 1 freq
henhooses (6) - 1 freq
ho-ho's (6) - 1 freq
pot-holes (6) - 1 freq
hot-press (6) - 1 freq
tea-hoose (6) - 1 freq
oot-hoose (6) - 1 freq
henhouse (6) - 1 freq
oot-hauds (6) - 1 freq
haw-buss (6) - 1 freq
hatches (7) - 4 freq
hert-heisin (7) - 1 freq
deid-house (7) - 1 freq
SoundEx code - H320
heids - 461 freq
hedge - 40 freq
hate-c - 2 freq
heid's - 41 freq
hedgie - 3 freq
heidache - 7 freq
hauds - 118 freq
hotch - 5 freq
hates - 35 freq
hitch - 3 freq
hits - 145 freq
heads - 38 freq
hotdogs - 3 freq
hides - 14 freq
'hoots - 2 freq
hoods - 5 freq
heeds - 29 freq
heid-gie - 1 freq
heidy's - 1 freq
huds - 6 freq
hauts - 1 freq
haddock - 13 freq
hadg - 1 freq
hideous - 7 freq
houts - 1 freq
hoodies - 5 freq
hid's - 202 freq
hids - 86 freq
hats - 45 freq
hatch - 12 freq
hat's - 3 freq
heed's - 6 freq
huts - 6 freq
haddies - 2 freq
heats - 2 freq
howdie's - 1 freq
hideyoshi - 2 freq
hit's - 280 freq
heidie's - 5 freq
heidies - 4 freq
hood's - 1 freq
hoots - 9 freq
het-hoose - 1 freq
het's - 5 freq
haeds - 6 freq
haed's - 1 freq
hoatch - 1 freq
haitts - 1 freq
hïts - 1 freq
hts - 4 freq
heathaze - 1 freq
heedge - 1 freq
hodge - 3 freq
'hid's - 2 freq
haets - 2 freq
hades - 4 freq
haads - 17 freq
hadds - 22 freq
'hit's - 5 freq
heat's - 1 freq
het-houss - 1 freq
head's - 2 freq
hethoose - 1 freq
hads - 2 freq
hi-tech - 2 freq
heds - 1 freq
hoids - 2 freq
hieds - 2 freq
'haddow's' - 1 freq
hie-tech - 1 freq
haddocks - 1 freq
hutch - 7 freq
hudds - 1 freq
hudge - 1 freq
heid-heich - 1 freq
heides - 1 freq
huddies - 1 freq
heywood's - 1 freq
hyde's - 1 freq
heedache - 2 freq
haddicks - 2 freq
hudduck - 2 freq
hitec - 1 freq
hotdog - 1 freq
huddock - 6 freq
heid’s - 1 freq
headache - 4 freq
'hates - 1 freq
hewitt's - 1 freq
hutchi - 1 freq
hutchie - 2 freq
hots - 1 freq
hid’s - 2 freq
hdq - 1 freq
heydays - 1 freq
MetaPhone code - HTHS
het-hoose - 1 freq
het-houss - 1 freq
HET-HOUSS
Time to execute Levenshtein function - 0.183840 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.339875 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028097 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036972 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000791 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.