A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to hauchs in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
hauchs (0) - 4 freq
mauchs (1) - 1 freq
haughs (1) - 8 freq
heuchs (1) - 3 freq
sauchs (1) - 9 freq
hauch (1) - 12 freq
lauchs (1) - 21 freq
beuchs (2) - 4 freq
wauchts (2) - 3 freq
heuch' (2) - 1 freq
laucht (2) - 22 freq
haugh (2) - 43 freq
haufs (2) - 5 freq
lauchsm (2) - 1 freq
caucht (2) - 32 freq
hauchles (2) - 1 freq
aucht (2) - 33 freq
laughs (2) - 60 freq
hatcht (2) - 1 freq
hochs (2) - 12 freq
sachs (2) - 4 freq
laichs (2) - 1 freq
hatch (2) - 12 freq
haulds (2) - 2 freq
hauns (2) - 526 freq

Double Levenshtein (summed distance in brackets)
hauchs (0) - 4 freq
heuchs (1) - 3 freq
lauchs (2) - 21 freq
hauch (2) - 12 freq
hochs (2) - 12 freq
heichs (2) - 2 freq
sauchs (2) - 9 freq
mauchs (2) - 1 freq
haughs (2) - 8 freq
haach (3) - 1 freq
heuch (3) - 6 freq
hach (3) - 3 freq
huch (3) - 1 freq
luchs (3) - 1 freq
lauches (3) - 1 freq
hatches (3) - 4 freq
lachs (3) - 3 freq
hacks (3) - 3 freq
sheuchs (3) - 13 freq
faachs (3) - 1 freq
heughs (3) - 7 freq
souchs (3) - 25 freq
hauchty (3) - 1 freq
beuchs (3) - 4 freq
sachs (3) - 4 freq
SoundEx code - H220
hooses - 255 freq
hich's - 1 freq
heuchs - 3 freq
hauchs - 4 freq
hakes - 1 freq
haggis - 76 freq
hkes - 1 freq
heizes - 7 freq
hoosies - 9 freq
hochs - 12 freq
heughs - 7 freq
hogus - 2 freq
houses - 27 freq
hughock - 8 freq
hughie's - 6 freq
heezes - 5 freq
hijack - 1 freq
hoose's - 5 freq
hce's - 6 freq
hisses - 4 freq
hizzy's - 2 freq
hooches - 1 freq
hizzies - 4 freq
hic-hoc - 1 freq
hikes - 1 freq
hawkes - 2 freq
hussies - 1 freq
heichs - 2 freq
haughs - 8 freq
hezekiah - 4 freq
highways - 3 freq
hugh's - 4 freq
higgie's - 8 freq
haggis's - 1 freq
hoosie's - 1 freq
hjook - 2 freq
hoosis - 1 freq
hush-hush - 1 freq
hoaxes - 1 freq
hooziss - 1 freq
hussy's - 1 freq
hcjac - 1 freq
hecky's - 4 freq
'heckys - 1 freq
hgis - 1 freq
heges - 1 freq
houssis - 3 freq
hoses - 1 freq
hughes - 5 freq
huzzas - 1 freq
hoswick - 1 freq
hjuks - 1 freq
hjuk - 1 freq
highs - 1 freq
huggis - 7 freq
hazy-eyes - 1 freq
haosaz - 1 freq
hcycu - 1 freq
hjcuq - 1 freq
hughesie - 1 freq
hjihyx - 1 freq
hqec - 1 freq
highog - 1 freq
hegwig - 1 freq
hoswick's - 1 freq
hqwzs - 1 freq
hoagies - 1 freq
hcoj - 1 freq
hokes - 1 freq
MetaPhone code - HXS
hich's - 1 freq
heuchs - 3 freq
hauchs - 4 freq
hitches - 2 freq
hochs - 12 freq
hatches - 4 freq
hatche's - 1 freq
hooches - 1 freq
heichs - 2 freq
hutches - 1 freq
hutchies' - 1 freq
Manually curated - HAUCHS
Time to execute Levenshtein function - 0.180892 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
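
As a rough illustration of the definition above, here is a minimal Python sketch of the textbook dynamic-programming approach. It is not the corpus site's own code; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character insertions, deletions and replacements
    needed to turn a into b."""
    # prev[j] is the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        prev = curr
    return prev[-1]


# Distances from the query word to a few neighbours in the listing.
for word in ["hauchs", "haughs", "hauch", "hochs"]:
    print(word, levenshtein("hauchs", word))
```

For these four words the sketch gives 0, 1, 1 and 2, matching the bracketed figures in the Levenshtein list.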
Time to execute Double Levenshtein function - 0.355428 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
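
The double pass can be sketched in Python as follows. This is an illustration under assumptions, not the site's implementation: the vowel set `aeiou` and the helper names `strip_vowels` and `double_levenshtein` are guesses made for this example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


VOWELS = set("aeiou")  # assumption: the source does not say which letters count as vowels


def strip_vowels(word: str) -> str:
    """Keep only the consonant skeleton of a word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the vowel-less words."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["heuchs", "lauchs", "hochs", "haach"]:
    print(word, double_levenshtein("hauchs", word))
```

With this vowel set the sketch gives 1, 2, 2 and 3 for the four words, in line with the Double Levenshtein list: heuchs differs from hauchs only in a vowel, so the second pass adds nothing, while lauchs changes a consonant and is counted twice.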
Time to execute SoundEx function - 0.027844 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
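
A simplified Soundex encoder might look like the Python sketch below. It assumes a variant in which every uncoded letter (vowels, h, w, y) separates repeated codes, because that reproduces the H220 shown for hauchs; stricter variants that do not let h or w separate equal codes can give a different result. The name `soundex` and the table `CODES` are illustrative, not taken from the site.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in [("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")]:
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return a four-character code: first letter plus up to three digits."""
    letters = [ch for ch in word.lower() if ch.isascii() and ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")  # vowels, h, w and y have no digit
        if code and code != last:
            result += code
            if len(result) == 4:
                break
        # Assumption: any uncoded letter resets `last`, so an equal code
        # that follows it is emitted again.
        last = code
    return result.ljust(4, "0")  # pad short codes with zeros


for word in ["hauchs", "hooses", "haggis", "highways"]:
    print(word, soundex(word))
```

All four words come out as H220 under this variant, which is why such different-looking spellings share one SoundEx bucket in the listing.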
Time to execute MetaPhone function - 0.040007 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000944 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
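
A lookup of this kind could be sketched in Python as a plain dictionary. The entries below are hypothetical placeholders invented for illustration (the real lexicon is not shown here), as are the names `LEXICON`, `lookup_lemma` and `variants_of`.

```python
from typing import List, Optional

# Hypothetical hand-made entries: surface form -> lemma.
LEXICON = {
    "hauchs": "HAUCHS",
    "hooses": "HOOSE",
    "houses": "HOOSE",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


def variants_of(lemma: str) -> List[str]:
    """List every surface form that the lexicon maps to the given lemma."""
    return sorted(w for w, l in LEXICON.items() if l == lemma)


print(lookup_lemma("Hooses"))
print(lookup_lemma("sonsie"))
print(variants_of("HOOSE"))
```

Because it is a single dictionary access, a lookup like this does no string comparison at all, which would be consistent with it being the fastest of the five timings reported.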