A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hodge in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hodge (0) - 3 freq
fodge (1) - 6 freq
hodged (1) - 3 freq
dodge (1) - 12 freq
lodge (1) - 27 freq
hudge (1) - 1 freq
hedge (1) - 41 freq
wadge (2) - 5 freq
trodge (2) - 1 freq
hinge (2) - 5 freq
wode (2) - 4 freq
dodie (2) - 18 freq
songe (2) - 1 freq
midge (2) - 10 freq
ledge (2) - 15 freq
lodger (2) - 3 freq
hoggy (2) - 1 freq
code (2) - 38 freq
hogget (2) - 1 freq
hyde (2) - 16 freq
fudge (2) - 18 freq
hoose (2) - 2642 freq
house (2) - 122 freq
codgie (2) - 2 freq
sedge (2) - 1 freq
hodge (0) - 3 freq
hudge (1) - 1 freq
hedge (1) - 41 freq
fodge (2) - 6 freq
hedgie (2) - 3 freq
dodge (2) - 12 freq
heedge (2) - 1 freq
lodge (2) - 27 freq
hadg (2) - 1 freq
hodged (2) - 3 freq
hoog (3) - 1 freq
cadge (3) - 2 freq
yhdg (3) - 1 freq
huge (3) - 102 freq
hede (3) - 4 freq
ludge (3) - 22 freq
hide (3) - 189 freq
gadge (3) - 4 freq
hodgin (3) - 3 freq
nidge (3) - 2 freq
odige (3) - 1 freq
podgy (3) - 2 freq
hog (3) - 3 freq
rudge (3) - 1 freq
badge (3) - 14 freq
SoundEx code - H320
heids - 465 freq
hedge - 41 freq
hate-c - 2 freq
heid's - 41 freq
hedgie - 3 freq
heidache - 7 freq
hauds - 123 freq
hotch - 5 freq
hates - 36 freq
hitch - 3 freq
hits - 150 freq
heads - 39 freq
hotdogs - 3 freq
hides - 14 freq
'hoots - 2 freq
hoods - 6 freq
heeds - 29 freq
heid-gie - 1 freq
heidy's - 1 freq
huds - 6 freq
hauts - 1 freq
haddock - 14 freq
hadg - 1 freq
hideous - 7 freq
houts - 1 freq
hoodies - 5 freq
hid's - 202 freq
hids - 86 freq
hats - 46 freq
hatch - 12 freq
hat's - 3 freq
heed's - 6 freq
huts - 7 freq
haddies - 2 freq
heats - 3 freq
hoots - 11 freq
hts - 5 freq
howdie's - 1 freq
hideyoshi - 2 freq
hit's - 280 freq
heidie's - 5 freq
heidies - 4 freq
hood's - 1 freq
het-hoose - 1 freq
het's - 5 freq
haeds - 6 freq
haed's - 1 freq
hoatch - 1 freq
haitts - 1 freq
hïts - 1 freq
heathaze - 1 freq
heedge - 1 freq
hodge - 3 freq
'hid's - 2 freq
haets - 2 freq
hades - 4 freq
haads - 17 freq
hadds - 22 freq
'hit's - 5 freq
heat's - 1 freq
het-houss - 1 freq
head's - 2 freq
hethoose - 1 freq
hads - 2 freq
hi-tech - 2 freq
heds - 1 freq
hoids - 2 freq
hieds - 2 freq
'haddow's' - 1 freq
hie-tech - 1 freq
haddocks - 1 freq
hutch - 7 freq
hudds - 1 freq
hudge - 1 freq
heid-heich - 1 freq
heides - 1 freq
huddies - 1 freq
heywood's - 1 freq
hyde's - 1 freq
heedache - 2 freq
haddicks - 2 freq
hudduck - 2 freq
hitec - 1 freq
hotdog - 1 freq
huddock - 6 freq
heid’s - 1 freq
headache - 4 freq
'hates - 1 freq
hewitt's - 1 freq
hutchi - 1 freq
hutchie - 2 freq
hots - 1 freq
hid’s - 2 freq
hdq - 1 freq
heydays - 1 freq
MetaPhone code - HJ
huge - 102 freq
hedge - 41 freq
hedgie - 3 freq
hauge - 2 freq
heedge - 1 freq
hodge - 3 freq
hege - 2 freq
hudge - 1 freq
hagi - 1 freq
HODGE
Time to execute Levenshtein function - 0.173723 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.348739 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028003 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037724 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000908 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.