A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to htm in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
htm (0) - 17 freq
hym (1) - 5 freq
hts (1) - 4 freq
hom (1) - 38 freq
mtm (1) - 1 freq
htp (1) - 1 freq
hem (1) - 27 freq
ham (1) - 49 freq
hmm (1) - 22 freq
'tm (1) - 1 freq
htt (1) - 1 freq
him (1) - 8386 freq
hte (1) - 1 freq
hum (1) - 44 freq
tm (1) - 7 freq
html (1) - 22 freq
ht (1) - 8 freq
hm (1) - 12 freq
atm (1) - 3 freq
utd (2) - 11 freq
bem (2) - 1 freq
hac (2) - 1 freq
stc (2) - 1 freq
btf (2) - 8 freq
hpw (2) - 1 freq
htm (0) - 17 freq
hum (2) - 44 freq
hte (2) - 1 freq
him (2) - 8386 freq
tm (2) - 7 freq
html (2) - 22 freq
atm (2) - 3 freq
hm (2) - 12 freq
htt (2) - 1 freq
ht (2) - 8 freq
hts (2) - 4 freq
hym (2) - 5 freq
'tm (2) - 1 freq
mtm (2) - 1 freq
hom (2) - 38 freq
ham (2) - 49 freq
hmm (2) - 22 freq
htp (2) - 1 freq
hem (2) - 27 freq
eh'm (3) - 5 freq
heth (3) - 25 freq
helm (3) - 15 freq
ahm (3) - 201 freq
huts (3) - 6 freq
'tam (3) - 14 freq
SoundEx code - H350
haudin - 374 freq
hadna - 113 freq
heatin - 16 freq
hauden - 48 freq
hidna - 103 freq
hidin - 77 freq
hidnae - 55 freq
hittin - 35 freq
hadnae - 186 freq
heidin - 58 freq
huddin - 23 freq
hauden-awa - 1 freq
hawdin - 8 freq
headin - 45 freq
haedna - 49 freq
hudden - 3 freq
hoddin - 3 freq
haitin - 6 freq
heid-doun - 3 freq
hidden - 64 freq
hootin - 4 freq
hudnae - 58 freq
hatton - 1 freq
hadean - 3 freq
hodden - 8 freq
'haithen - 1 freq
haithen - 5 freq
heatin' - 5 freq
hudin' - 2 freq
heeten - 1 freq
heeden - 5 freq
heedin - 11 freq
hidin' - 1 freq
haudin' - 1 freq
hoadin - 3 freq
hutten - 1 freq
hawddin - 1 freq
hidno - 2 freq
htm - 17 freq
heidyin - 14 freq
haednae - 8 freq
had'nae - 1 freq
heidin' - 2 freq
hidein' - 1 freq
heiden - 1 freq
heathen - 9 freq
hedna - 72 freq
haddan - 2 freq
hidan - 3 freq
haddin - 15 freq
hadden - 3 freq
houdin - 1 freq
heateen - 1 freq
heedan - 1 freq
hie-heid-yin - 1 freq
haaden - 2 freq
haadan - 2 freq
headan - 3 freq
houton - 1 freq
haedan - 3 freq
hatimo - 3 freq
hiddin - 1 freq
hattamoa - 2 freq
haudan - 4 freq
heid-doon - 1 freq
hoidin - 5 freq
hitin - 1 freq
haadin - 13 freq
hutton - 5 freq
heid-yin' - 1 freq
haedin - 1 freq
headone - 1 freq
heid-yin - 1 freq
hadn - 2 freq
huddin' - 1 freq
howdin - 1 freq
hauddin - 2 freq
haidna - 2 freq
hudna - 5 freq
hudny - 2 freq
'heidin' - 1 freq
hudin - 2 freq
houdini - 4 freq
hadni - 1 freq
howdien - 6 freq
hednae - 4 freq
hey-tyme - 1 freq
heid-on - 1 freq
hydin - 1 freq
hittn - 1 freq
heaton - 1 freq
hidn - 1 freq
hawtin - 1 freq
hatin - 2 freq
MetaPhone code - TM
time - 5878 freq
'time - 4 freq
teem - 46 freq
tam - 519 freq
tom - 134 freq
time- - 2 freq
toom - 35 freq
tame - 23 freq
'tam - 14 freq
domme - 1 freq
'tm - 1 freq
tom' - 1 freq
deem - 19 freq
doom - 34 freq
tame' - 3 freq
team - 292 freq
tam' - 1 freq
teeeeeam - 1 freq
teeeamy - 1 freq
tuim - 102 freq
t'm - 3 freq
tyme - 219 freq
tm - 7 freq
time' - 6 freq
team' - 2 freq
'dam - 1 freq
tomb - 31 freq
dumb - 38 freq
't'm - 3 freq
'tom - 1 freq
dummy - 27 freq
dam - 28 freq
dem - 722 freq
demi - 1 freq
tomboy - 1 freq
timé - 1 freq
'dame - 1 freq
dim - 50 freq
dome - 21 freq
demo - 3 freq
tme - 3 freq
dame - 16 freq
tammie - 24 freq
tommy - 100 freq
ttme - 1 freq
t'im - 1 freq
t'im' - 1 freq
tum - 17 freq
tim - 46 freq
htm - 17 freq
'tommy - 2 freq
dum - 14 freq
daem - 1 freq
tim' - 43 freq
'dumb - 2 freq
tome - 4 freq
dem-aa - 1 freq
dem- - 1 freq
dom - 3 freq
'team'' - 1 freq
'dom - 2 freq
tammy - 48 freq
töm - 2 freq
tume - 14 freq
tym - 2 freq
ti'm - 1 freq
teum - 8 freq
domm - 1 freq
dumbo - 2 freq
tummy - 4 freq
toam - 1 freq
doomy - 1 freq
duma - 1 freq
tiym - 6 freq
-dom - 2 freq
tøm - 1 freq
tumb - 1 freq
dima-' - 1 freq
tae'm - 2 freq
deemie - 3 freq
duim - 1 freq
dummie - 1 freq
tombaugh - 1 freq
€˜tam - 6 freq
'teem - 1 freq
€˜doom - 1 freq
diem - 2 freq
€˜dem - 4 freq
dumba - 1 freq
€œtime - 1 freq
damb - 1 freq
toamy - 12 freq
deemy - 1 freq
€œdumb - 1 freq
€œtommy - 1 freq
damo - 10 freq
taim - 1 freq
€™time - 1 freq
wdma - 1 freq
teamie - 1 freq
dm - 12 freq
timmy - 2 freq
tommie - 1 freq
timeÂ’ - 1 freq
dmi - 1 freq
tmi - 1 freq
tomb” - 1 freq
time” - 1 freq
ydm - 1 freq
'taim' - 1 freq
damoa - 7 freq
tomow - 1 freq
taeyma - 1 freq
HTM
Time to execute Levenshtein function - 0.295382 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.588814 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.032172 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.077073 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001045 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.