A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to headache in Corpus

Levenshtein - Double Levenshtein - SoundEx - MetaPhone - Manually curated
Levenshtein
headache (0) - 4 freq
heidache (1) - 7 freq
heedache (1) - 2 freq
heidaches (2) - 2 freq
heartache (2) - 1 freq
hertache (2) - 1 freq
panache (3) - 2 freq
teache (3) - 1 freq
headlicht (3) - 2 freq
apache (3) - 1 freq
headan (3) - 3 freq
headwye (3) - 1 freq
heidrace (3) - 1 freq
waache (3) - 1 freq
bealach (3) - 1 freq
headie (3) - 1 freq
haach (3) - 1 freq
hearadh (3) - 1 freq
headline (3) - 10 freq
headone (3) - 1 freq
haddie (4) - 9 freq
manche (4) - 1 freq
'yaach (4) - 1 freq
benche (4) - 1 freq
arachne (4) - 26 freq
Double Levenshtein
headache (0) - 4 freq
heidache (1) - 7 freq
heedache (1) - 2 freq
heidaches (3) - 2 freq
hertache (4) - 1 freq
heartache (4) - 1 freq
haach (4) - 1 freq
hainch (5) - 2 freq
hatch (5) - 12 freq
hach (5) - 3 freq
hoach (5) - 1 freq
dyche (5) - 1 freq
heidrich (5) - 2 freq
heydrich (5) - 7 freq
heich (5) - 260 freq
hech (5) - 16 freq
heuch (5) - 6 freq
hauch (5) - 12 freq
headicraa (5) - 2 freq
dach (5) - 1 freq
bodach (5) - 12 freq
heech (5) - 7 freq
headan (5) - 3 freq
headwye (5) - 1 freq
heidrace (5) - 1 freq
SoundEx code - H320
heids - 461 freq
hedge - 40 freq
hate-c - 2 freq
heid's - 41 freq
hedgie - 3 freq
heidache - 7 freq
hauds - 118 freq
hotch - 5 freq
hates - 35 freq
hitch - 3 freq
hits - 145 freq
heads - 38 freq
hotdogs - 3 freq
hides - 14 freq
'hoots - 2 freq
hoods - 5 freq
heeds - 29 freq
heid-gie - 1 freq
heidy's - 1 freq
huds - 6 freq
hauts - 1 freq
haddock - 13 freq
hadg - 1 freq
hideous - 7 freq
houts - 1 freq
hoodies - 5 freq
hid's - 202 freq
hids - 86 freq
hats - 45 freq
hatch - 12 freq
hat's - 3 freq
heed's - 6 freq
huts - 6 freq
haddies - 2 freq
heats - 2 freq
howdie's - 1 freq
hideyoshi - 2 freq
hit's - 280 freq
heidie's - 5 freq
heidies - 4 freq
hood's - 1 freq
hoots - 9 freq
het-hoose - 1 freq
het's - 5 freq
haeds - 6 freq
haed's - 1 freq
hoatch - 1 freq
haitts - 1 freq
hïts - 1 freq
hts - 4 freq
heathaze - 1 freq
heedge - 1 freq
hodge - 3 freq
'hid's - 2 freq
haets - 2 freq
hades - 4 freq
haads - 17 freq
hadds - 22 freq
'hit's - 5 freq
heat's - 1 freq
het-houss - 1 freq
head's - 2 freq
hethoose - 1 freq
hads - 2 freq
hi-tech - 2 freq
heds - 1 freq
hoids - 2 freq
hieds - 2 freq
'haddow's' - 1 freq
hie-tech - 1 freq
haddocks - 1 freq
hutch - 7 freq
hudds - 1 freq
hudge - 1 freq
heid-heich - 1 freq
heides - 1 freq
huddies - 1 freq
heywood's - 1 freq
hyde's - 1 freq
heedache - 2 freq
haddicks - 2 freq
hudduck - 2 freq
hitec - 1 freq
hotdog - 1 freq
huddock - 6 freq
heid’s - 1 freq
headache - 4 freq
'hates - 1 freq
hewitt's - 1 freq
hutchi - 1 freq
hutchie - 2 freq
hots - 1 freq
hid’s - 2 freq
hdq - 1 freq
heydays - 1 freq
MetaPhone code - HTX
heidache - 7 freq
hi-tech - 2 freq
hie-tech - 1 freq
heedache - 2 freq
headache - 4 freq
Manually curated - HEADACHE
Time to execute Levenshtein function - 0.196597 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
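
As an illustration of that definition, here is a minimal Python sketch of the standard dynamic-programming approach. It is not necessarily how this site computes the distance, and the function name levenshtein is chosen only for this example.

```python
def levenshtein(a, b):
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("headache", "heidache"))   # one replacement: a -> i
print(levenshtein("headache", "heartache"))  # replace d -> r, insert t
```

The distances agree with the figures in brackets in the Levenshtein list above, for example heidache (1) and heartache (2).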
Time to execute Double Levenshtein function - 0.464501 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
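
A rough Python sketch of the idea follows. The helper names (levenshtein, strip_vowels, double_levenshtein) are made up for this example, and the vowel set is an assumption: the page does not say which letters count as vowels, so "aeiouy" is used here as a guess.

```python
def levenshtein(a, b):
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: which letters are treated as vowels is not stated on the page.
VOWELS = set("aeiouy")


def strip_vowels(word):
    """Return the word with every vowel removed, e.g. headache -> hdch."""
    return "".join(c for c in word if c.lower() not in VOWELS)


def double_levenshtein(a, b):
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("headache", "heidaches"))  # 2 (full) + 1 (hdch vs hdchs)
print(double_levenshtein("headache", "heartache"))  # 2 (full) + 2 (hdch vs hrtch)
```

Under these assumptions the sketch gives the same figures as the Double Levenshtein list above for heidaches (3) and heartache (4): a vowel swap costs one point, while a consonant change is counted in both passes.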
Time to execute SoundEx function - 0.027420 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
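
To show how such a code is built, here is a minimal Python sketch of classic American Soundex: keep the first letter, turn the remaining consonants into digits, skip repeated digits, and pad to four characters. It is an illustration only, not necessarily the routine this site calls, and the names SOUNDEX_CODES and soundex are chosen for this example.

```python
# Digit assigned to each consonant group; vowels, H, W and Y have no digit.
SOUNDEX_CODES = {}
for letters, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                       ("L", "4"), ("MN", "5"), ("R", "6")):
    for ch in letters:
        SOUNDEX_CODES[ch] = digit


def soundex(word):
    """Return the four-character Soundex code, e.g. headache -> H320."""
    # Only plain A-Z letters take part; apostrophes, hyphens and
    # accented characters are skipped.
    letters = [c for c in word.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    last = SOUNDEX_CODES.get(letters[0], "")
    for c in letters[1:]:
        if c in "HW":
            continue  # H and W do not separate two equal digits
        code = SOUNDEX_CODES.get(c, "")
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        last = code  # a vowel resets this, so a repeated digit counts again
    return result.ljust(4, "0")


for w in ("headache", "heids", "haddock", "heywood's"):
    print(w, soundex(w))
```

All four sample words come out as H320, which is why such different-looking words share the SoundEx list above.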
Time to execute MetaPhone function - 0.069805 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000821 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
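
A lookup of this kind can be pictured as a simple dictionary. The Python sketch below is hypothetical: the table name LEXICON, the function lemma_for and the sample entries are invented for illustration and are not taken from the site's actual lexicon.

```python
# Hypothetical hand-built lexicon mapping spellings to an upper-case lemma.
# The entries are illustrative only.
LEXICON = {
    "headache": "HEADACHE",
    "heidache": "HEADACHE",
    "heedache": "HEADACHE",
    "heidaches": "HEADACHE",
}


def lemma_for(word):
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("heidache"))  # found in the table
print(lemma_for("sonsie"))    # not covered in this toy table
```

A plain dictionary lookup involves no string comparison across the corpus, which fits with the manually curated function being the fastest of the five timings reported above.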