A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to huntiegowk in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
huntiegowk (0) - 2 freq
huntigowk (1) - 3 freq
huntegowk (1) - 2 freq
moniefowk (4) - 1 freq
huntie (4) - 1 freq
horsegowk (4) - 1 freq
tigowk (4) - 1 freq
gentlefowk (4) - 2 freq
hoarsegowk (4) - 1 freq
hunting (4) - 10 freq
hustings (5) - 7 freq
gulliegaw (5) - 1 freq
hantie (5) - 4 freq
untied (5) - 7 freq
untimeous (5) - 1 freq
huntit (5) - 29 freq
dentie-lik (5) - 1 freq
huntin' (5) - 1 freq
wunting (5) - 1 freq
begowk (5) - 2 freq
tentie-lik (5) - 2 freq
huntin (5) - 66 freq
hunter's (5) - 10 freq
cuttie-lik (5) - 2 freq
auntie (5) - 157 freq
huntiegowk (0) - 2 freq
huntegowk (1) - 2 freq
huntigowk (1) - 3 freq
tigowk (6) - 1 freq
horsegowk (6) - 1 freq
hoarsegowk (6) - 1 freq
hungower (7) - 1 freq
hungowre (7) - 1 freq
huntie (7) - 1 freq
hunting (7) - 10 freq
gentlefowk (7) - 2 freq
moniefowk (7) - 1 freq
hunter (8) - 48 freq
hauntingly (8) - 1 freq
antiek (8) - 1 freq
haundiwork (8) - 1 freq
hunted (8) - 11 freq
santiago (8) - 2 freq
zntirgyw (8) - 1 freq
haunting (8) - 3 freq
hunters (8) - 16 freq
hangower (8) - 3 freq
misbegowk (8) - 1 freq
feardiegowk (8) - 1 freq
hunkie (8) - 20 freq
SoundEx code - H532
haunds - 234 freq
handwash - 1 freq
hands - 175 freq
handicap - 2 freq
handsome - 39 freq
hants - 1 freq
haunts - 13 freq
huntsman's - 1 freq
hunds - 3 freq
hunts - 12 freq
huntegowk - 2 freq
handsel - 7 freq
handselt - 2 freq
huntigowk - 3 freq
hounds - 3 freq
handkerchief - 4 freq
handkie - 1 freq
hindquarters - 1 freq
handce - 1 freq
handshake - 2 freq
hinds - 3 freq
huntsman - 2 freq
hand's - 2 freq
hoonds - 1 freq
hand-carved - 1 freq
handcuffed - 2 freq
hindus - 2 freq
handkerchiefs - 1 freq
handsomest - 1 freq
'hen-taes' - 1 freq
haundsome - 1 freq
haands - 43 freq
haands' - 1 freq
hints - 9 freq
hundiclokks - 1 freq
haands-free - 1 freq
huntiegowk - 2 freq
hen-taes - 1 freq
hamethochts - 1 freq
haand's - 2 freq
handsels - 1 freq
haundshake - 1 freq
haandshack - 1 freq
handsomer - 1 freq
haundsel - 3 freq
hinny-douce - 1 freq
hewmets - 1 freq
handis - 1 freq
hinduism - 2 freq
hindsight - 1 freq
haund-cupped - 1 freq
hindsicht - 1 freq
€œhands - 1 freq
handwashing - 1 freq
hands-o - 1 freq
handicapped - 1 freq
handsellin - 3 freq
hand-sellin - 1 freq
'haunds' - 1 freq
honds - 1 freq
handsomely - 1 freq
hundog - 1 freq
hinodegiri - 1 freq
handcairt - 1 freq
hindies - 1 freq
hmdhcjyw - 1 freq
handshakes - 1 freq
handsupfortrad - 49 freq
hmdhqaqt - 1 freq
handcart - 1 freq
hentixn - 1 freq
handcock - 1 freq
handies' - 1 freq
“honds” - 1 freq
handsupfortrad's - 1 freq
handyhock - 2 freq
handsum - 1 freq
hand-shake - 1 freq
MetaPhone code - HNTKK
huntegowk - 2 freq
huntigowk - 3 freq
huntiegowk - 2 freq
handcock - 1 freq
HUNTIEGOWK
Time to execute Levenshtein function - 0.500761 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.789795 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.077889 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037318 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000936 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.