A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to hughes in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein - edit distance in brackets
hughes (0) - 5 freq
hugh's (1) - 4 freq
hughie's (2) - 6 freq
hughie (2) - 107 freq
hughesms (2) - 1 freq
huives (2) - 1 freq
higher (2) - 76 freq
hutches (2) - 1 freq
bushes (2) - 41 freq
tighes (2) - 1 freq
hushed (2) - 10 freq
hugs (2) - 25 freq
haughs (2) - 8 freq
haughed (2) - 1 freq
pushes (2) - 8 freq
heges (2) - 1 freq
hugh' (2) - 1 freq
highers (2) - 12 freq
huge (2) - 96 freq
hughesie (2) - 1 freq
heughs (2) - 7 freq
augher (2) - 1 freq
huggis (2) - 7 freq
hug's (2) - 1 freq
rushes (2) - 19 freq
Double Levenshtein - summed distance in brackets
hughes (0) - 5 freq
highs (2) - 1 freq
heughs (2) - 7 freq
haughs (2) - 8 freq
hugh's (2) - 4 freq
hughesie (2) - 1 freq
hugh' (3) - 1 freq
hug's (3) - 1 freq
hugh (3) - 87 freq
highe (3) - 1 freq
highest (3) - 20 freq
heges (3) - 1 freq
huggis (3) - 7 freq
highers (3) - 12 freq
higher (3) - 76 freq
hughie (3) - 107 freq
hughie's (3) - 6 freq
tighes (3) - 1 freq
hugs (3) - 25 freq
haughed (3) - 1 freq
hight (4) - 2 freq
thighs (4) - 14 freq
saughs (4) - 1 freq
heighest (4) - 2 freq
highog (4) - 1 freq
SoundEx code - H220
hooses - 251 freq
hich's - 1 freq
heuchs - 3 freq
hauchs - 4 freq
hakes - 1 freq
haggis - 76 freq
hkes - 1 freq
heizes - 7 freq
hoosies - 9 freq
hochs - 12 freq
heughs - 7 freq
hogus - 2 freq
houses - 25 freq
hughock - 8 freq
hughie's - 6 freq
heezes - 5 freq
hijack - 1 freq
hoose's - 5 freq
hce's - 6 freq
hizzy's - 2 freq
hooches - 1 freq
hizzies - 4 freq
hic-hoc - 1 freq
hisses - 3 freq
hikes - 1 freq
hawkes - 2 freq
hussies - 1 freq
heichs - 2 freq
haughs - 8 freq
hezekiah - 4 freq
highways - 3 freq
hugh's - 4 freq
higgie's - 8 freq
haggis's - 1 freq
hoosie's - 1 freq
hjook - 2 freq
hoosis - 1 freq
hush-hush - 1 freq
hoaxes - 1 freq
hooziss - 1 freq
hussy's - 1 freq
hcjac - 1 freq
hecky's - 4 freq
'heckys - 1 freq
hgis - 1 freq
heges - 1 freq
houssis - 3 freq
hoses - 1 freq
hughes - 5 freq
huzzas - 1 freq
hoswick - 1 freq
hjuks - 1 freq
hjuk - 1 freq
highs - 1 freq
huggis - 7 freq
hazy-eyes - 1 freq
haosaz - 1 freq
hcycu - 1 freq
hjcuq - 1 freq
hughesie - 1 freq
hjihyx - 1 freq
hqec - 1 freq
highog - 1 freq
hegwig - 1 freq
hoswick's - 1 freq
hqwzs - 1 freq
hoagies - 1 freq
hcoj - 1 freq
hokes - 1 freq
MetaPhone code - HFS
huifs - 7 freq
howfs - 6 freq
heavies - 5 freq
heaves - 2 freq
howffs - 5 freq
hooves - 21 freq
hughie's - 6 freq
huffs - 2 freq
huives - 1 freq
haufs - 5 freq
hoofs - 8 freq
hugh's - 4 freq
houffs - 1 freq
haaf's - 1 freq
hughes - 5 freq
highs - 1 freq
hughesie - 1 freq
heivies - 1 freq
hoffice - 1 freq
hives” - 2 freq
hives - 2 freq
Manually curated
HUGHES
Time to execute Levenshtein function - 0.183578 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
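As a rough illustration of the definition, here is a minimal Python sketch of the textbook dynamic-programming calculation. It assumes every replace, insert and delete costs 1; the function name is only illustrative and this is not necessarily the code the corpus site runs.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b, with unit cost per edit."""
    # previous[j] holds the distance between the first i-1 characters
    # of a and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletes
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("hughes", "hugh's"))  # one replacement: e -> '
print(levenshtein("hughes", "hughie"))
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.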
Time to execute Double Levenshtein function - 0.333847 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
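A minimal Python sketch of this idea follows. It assumes the vowels are a, e, i, o and u, that comparison is case-insensitive, and that each edit costs 1; the names `strip_vowels` and `double_levenshtein` are hypothetical, and the site's own code may differ in detail.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b, with unit cost per edit."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop a, e, i, o, u (an assumption about which letters count as vowels)."""
    return "".join(ch for ch in word.lower() if ch not in "aeiou")


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between the consonant skeletons."""
    full = levenshtein(a.lower(), b.lower())
    consonants_only = levenshtein(strip_vowels(a), strip_vowels(b))
    return full + consonants_only


print(double_levenshtein("hughes", "highs"))   # skeletons are both "hghs"
print(double_levenshtein("hughes", "hughie"))  # skeletons "hghs" vs "hgh"
```

Under these assumptions the sketch agrees with the listing for hugh's (2), highs (2) and hughie (3): a vowel change counts once, while a consonant change counts in both passes.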
Time to execute SoundEx function - 0.027430 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
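The encoding can be sketched in Python as below. This is a hedged sketch of the common American Soundex rules, not the site's own implementation. The `hw_transparent` flag is a hypothetical parameter added here because implementations disagree on whether h and w separate two letters with the same code.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
_CODES = {}
for digit, group in {"1": "bfpv", "2": "cgjkqsxz", "3": "dt",
                     "4": "l", "5": "mn", "6": "r"}.items():
    for letter in group:
        _CODES[letter] = digit


def soundex(word: str, hw_transparent: bool = True) -> str:
    """Return a 4-character code: first letter plus up to three digits."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        if hw_transparent and ch in "hw":
            continue  # h and w do not break a run of equal codes
        code = _CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # an uncoded letter resets the run, so a repeat is coded again
    return (result + "000")[:4]  # pad with zeros, keep four characters


print(soundex("hughes"))  # H220
print(soundex("haggis"))  # H220
```

With the default setting, haughs encodes as H200, whereas `soundex("haughs", hw_transparent=False)` gives H220. The listing above places haughs under H220, which suggests the corpus tool treats h as a separator, but that is an inference rather than something the page states.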
Time to execute MetaPhone function - 0.037134 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000837 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
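A lookup of this kind can be sketched in Python as a plain dictionary. The entries and the `lemma_for` helper below are hypothetical and stand in for the real lexicon, whose contents are not shown on this page.

```python
from typing import Optional

# Hypothetical sample entries: surface form -> lemma.
# The real hand-built lexicon is not reproduced here.
LEXICON = {
    "hughes": "HUGHES",
    "hooses": "HOOSE",
    "hoose's": "HOOSE",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("hughes"))  # HUGHES
print(lemma_for("ahint"))   # None, because this sample table has no entry
```

A dictionary lookup takes constant time on average, which fits with the manual method reporting the shortest execution time of the five.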