A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to humbug in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
humbug (0) - 5 freq
humbugs (1) - 1 freq
humber (2) - 2 freq
hamburg (2) - 3 freq
humbie (2) - 1 freq
humour (2) - 69 freq
pxmbug (2) - 1 freq
humble (2) - 26 freq
hanbug (2) - 11 freq
humbly (2) - 6 freq
mug (3) - 45 freq
hume's (3) - 3 freq
lumbs (3) - 1 freq
mbu (3) - 1 freq
dumba (3) - 1 freq
humping (3) - 1 freq
humdrum (3) - 3 freq
hummit (3) - 1 freq
cuming (3) - 1 freq
humphs (3) - 2 freq
hanbag (3) - 1 freq
humans (3) - 53 freq
smug (3) - 17 freq
humps (3) - 3 freq
nimbus (3) - 1 freq
humbug (0) - 5 freq
humbugs (2) - 1 freq
hanbug (3) - 11 freq
humbly (3) - 6 freq
humble (3) - 26 freq
humbie (3) - 1 freq
humber (3) - 2 freq
hamburg (3) - 3 freq
lambeg (4) - 5 freq
haunbag (4) - 9 freq
hanbag (4) - 1 freq
pxmbug (4) - 1 freq
humour (4) - 69 freq
thumb (5) - 24 freq
humph (5) - 14 freq
sumg (5) - 1 freq
humpie (5) - 1 freq
mumbo (5) - 5 freq
jumbo (5) - 13 freq
hummy (5) - 1 freq
thumbin (5) - 3 freq
hummel (5) - 5 freq
humpit (5) - 1 freq
zumba (5) - 2 freq
bug (5) - 57 freq
SoundEx code - H512
haunbag - 9 freq
humps - 3 freq
haunfast - 2 freq
hembist - 1 freq
hanfoos - 1 freq
humphs - 2 freq
henna-busks - 1 freq
hanbug - 11 freq
hame-bakes - 1 freq
humbug - 5 freq
haunfus - 2 freq
hanfaes - 1 freq
hame-bakin - 1 freq
home-baked - 1 freq
hinny-bees - 1 freq
hampstead - 1 freq
han-pickit - 1 freq
hinny-piece - 1 freq
hymn-books - 1 freq
hempies- - 1 freq
haunbuik - 2 freq
hampshire - 1 freq
hempies - 2 freq
humfished - 1 freq
hanbags - 1 freq
hanbag - 1 freq
humbugs - 1 freq
henpeckit - 1 freq
homepage - 1 freq
hmfigpsmv - 1 freq
humpbacks - 1 freq
MetaPhone code - HMK
hammock - 4 freq
hummocky - 1 freq
humbug - 5 freq
HUMBUG
Time to execute Levenshtein function - 0.213386 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.373088 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027558 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037614 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000869 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.