A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to huntsman in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
huntsman (0) - 2 freq
huntan (2) - 4 freq
huntsman's (2) - 1 freq
huntlin (3) - 2 freq
scotsman (3) - 39 freq
hanuman (3) - 1 freq
hunsin (3) - 3 freq
hunts (3) - 12 freq
dunstan (3) - 1 freq
hangman (3) - 9 freq
suntan (3) - 2 freq
huntin (3) - 66 freq
hintan (3) - 2 freq
kinsman (3) - 5 freq
gundyman (3) - 3 freq
human (3) - 315 freq
usman (3) - 1 freq
handyman (3) - 2 freq
husban (3) - 4 freq
hauntan (3) - 1 freq
hatman (3) - 1 freq
hang-man (3) - 1 freq
hurtan (3) - 1 freq
linesman (3) - 5 freq
heidsman (3) - 2 freq
huntsman (0) - 2 freq
huntsman's (4) - 1 freq
huntan (4) - 4 freq
hatman (5) - 1 freq
hauntan (5) - 1 freq
kinsman (5) - 5 freq
handyman (5) - 2 freq
heidsman (5) - 2 freq
henchman (5) - 1 freq
henman (5) - 1 freq
hintan (5) - 2 freq
linesman (5) - 5 freq
hang-man (5) - 1 freq
hanuman (5) - 1 freq
hunsin (5) - 3 freq
scotsman (5) - 39 freq
huntlin (5) - 2 freq
huntin (5) - 66 freq
hangman (5) - 9 freq
hunts (5) - 12 freq
hertstane (6) - 1 freq
nutmon (6) - 1 freq
heidsmen (6) - 1 freq
hantin (6) - 6 freq
honeymin (6) - 1 freq
SoundEx code - H532
haunds - 234 freq
handwash - 1 freq
hands - 175 freq
handicap - 2 freq
handsome - 39 freq
hants - 1 freq
haunts - 13 freq
huntsman's - 1 freq
hunds - 3 freq
hunts - 12 freq
huntegowk - 2 freq
handsel - 7 freq
handselt - 2 freq
huntigowk - 3 freq
hounds - 3 freq
handkerchief - 4 freq
handkie - 1 freq
hindquarters - 1 freq
handce - 1 freq
handshake - 2 freq
hinds - 3 freq
huntsman - 2 freq
hand's - 2 freq
hoonds - 1 freq
hand-carved - 1 freq
handcuffed - 2 freq
hindus - 2 freq
handkerchiefs - 1 freq
handsomest - 1 freq
'hen-taes' - 1 freq
haundsome - 1 freq
haands - 43 freq
haands' - 1 freq
hints - 9 freq
hundiclokks - 1 freq
haands-free - 1 freq
huntiegowk - 2 freq
hen-taes - 1 freq
hamethochts - 1 freq
haand's - 2 freq
handsels - 1 freq
haundshake - 1 freq
haandshack - 1 freq
handsomer - 1 freq
haundsel - 3 freq
hinny-douce - 1 freq
hewmets - 1 freq
handis - 1 freq
hinduism - 2 freq
hindsight - 1 freq
haund-cupped - 1 freq
hindsicht - 1 freq
€œhands - 1 freq
handwashing - 1 freq
hands-o - 1 freq
handicapped - 1 freq
handsellin - 3 freq
hand-sellin - 1 freq
'haunds' - 1 freq
honds - 1 freq
handsomely - 1 freq
hundog - 1 freq
hinodegiri - 1 freq
handcairt - 1 freq
hindies - 1 freq
hmdhcjyw - 1 freq
handshakes - 1 freq
handsupfortrad - 49 freq
hmdhqaqt - 1 freq
handcart - 1 freq
hentixn - 1 freq
handcock - 1 freq
handies' - 1 freq
“honds” - 1 freq
handsupfortrad's - 1 freq
handyhock - 2 freq
handsum - 1 freq
hand-shake - 1 freq
MetaPhone code - HNTSMN
huntsman - 2 freq
HUNTSMAN
Time to execute Levenshtein function - 0.240584 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.447824 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028123 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038282 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000796 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.