A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to haddock in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
haddock (0) - 14 freq
haddocks (1) - 1 freq
huddock (1) - 6 freq
handcock (2) - 1 freq
headlock (2) - 1 freq
haddicks (2) - 2 freq
padlock (2) - 5 freq
puddock (2) - 34 freq
hancock (2) - 3 freq
hudduck (2) - 2 freq
paddick (2) - 1 freq
hammock (2) - 4 freq
haddo (2) - 1 freq
wadlock (2) - 1 freq
lassock (3) - 2 freq
riddoch (3) - 5 freq
dock (3) - 32 freq
haddath (3) - 25 freq
zadok (3) - 4 freq
havoc (3) - 6 freq
hardback (3) - 2 freq
warlock (3) - 43 freq
hafolk (3) - 1 freq
marnock (3) - 2 freq
bannock (3) - 18 freq
haddock (0) - 14 freq
huddock (1) - 6 freq
hudduck (2) - 2 freq
haddocks (2) - 1 freq
puddock (3) - 34 freq
paddick (3) - 1 freq
haddicks (3) - 2 freq
headlock (3) - 1 freq
wadlock (4) - 1 freq
ruddick (4) - 1 freq
guddick (4) - 3 freq
haddo (4) - 1 freq
hancock (4) - 3 freq
padlock (4) - 5 freq
handcock (4) - 1 freq
hammock (4) - 4 freq
burdock (5) - 1 freq
handyhock (5) - 2 freq
haddit (5) - 1 freq
haddan (5) - 2 freq
haddie (5) - 9 freq
haimlock (5) - 7 freq
deadlock (5) - 1 freq
haddin (5) - 15 freq
hadd (5) - 70 freq
SoundEx code - H320
heids - 465 freq
hedge - 41 freq
hate-c - 2 freq
heid's - 41 freq
hedgie - 3 freq
heidache - 7 freq
hauds - 123 freq
hotch - 5 freq
hates - 36 freq
hitch - 3 freq
hits - 150 freq
heads - 39 freq
hotdogs - 3 freq
hides - 14 freq
'hoots - 2 freq
hoods - 6 freq
heeds - 29 freq
heid-gie - 1 freq
heidy's - 1 freq
huds - 6 freq
hauts - 1 freq
haddock - 14 freq
hadg - 1 freq
hideous - 7 freq
houts - 1 freq
hoodies - 5 freq
hid's - 202 freq
hids - 86 freq
hats - 46 freq
hatch - 12 freq
hat's - 3 freq
heed's - 6 freq
huts - 7 freq
haddies - 2 freq
heats - 3 freq
hoots - 11 freq
hts - 5 freq
howdie's - 1 freq
hideyoshi - 2 freq
hit's - 280 freq
heidie's - 5 freq
heidies - 4 freq
hood's - 1 freq
het-hoose - 1 freq
het's - 5 freq
haeds - 6 freq
haed's - 1 freq
hoatch - 1 freq
haitts - 1 freq
hïts - 1 freq
heathaze - 1 freq
heedge - 1 freq
hodge - 3 freq
'hid's - 2 freq
haets - 2 freq
hades - 4 freq
haads - 17 freq
hadds - 22 freq
'hit's - 5 freq
heat's - 1 freq
het-houss - 1 freq
head's - 2 freq
hethoose - 1 freq
hads - 2 freq
hi-tech - 2 freq
heds - 1 freq
hoids - 2 freq
hieds - 2 freq
'haddow's' - 1 freq
hie-tech - 1 freq
haddocks - 1 freq
hutch - 7 freq
hudds - 1 freq
hudge - 1 freq
heid-heich - 1 freq
heides - 1 freq
huddies - 1 freq
heywood's - 1 freq
hyde's - 1 freq
heedache - 2 freq
haddicks - 2 freq
hudduck - 2 freq
hitec - 1 freq
hotdog - 1 freq
huddock - 6 freq
heid’s - 1 freq
headache - 4 freq
'hates - 1 freq
hewitt's - 1 freq
hutchi - 1 freq
hutchie - 2 freq
hots - 1 freq
hid’s - 2 freq
hdq - 1 freq
heydays - 1 freq
MetaPhone code - HTK
hate-c - 2 freq
haddock - 14 freq
hadg - 1 freq
hudduck - 2 freq
hitec - 1 freq
huddock - 6 freq
HADDOCK
Time to execute Levenshtein function - 0.190182 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.335448 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027315 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036753 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000869 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.