A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to hb in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hb (0) - 9 freq
hbu (1) - 1 freq
db (1) - 2 freq
qb (1) - 2 freq
hu (1) - 2 freq
hq (1) - 7 freq
'b (1) - 1 freq
hp (1) - 3 freq
b (1) - 745 freq
heb (1) - 1 freq
lb (1) - 4 freq
hl (1) - 1 freq
hob (1) - 10 freq
hj (1) - 2 freq
rb (1) - 8 freq
gb (1) - 2 freq
hw (1) - 1 freq
hbv (1) - 1 freq
hi (1) - 67 freq
xhb (1) - 1 freq
xb (1) - 3 freq
hbk (1) - 1 freq
zb (1) - 3 freq
hs (1) - 5 freq
hm (1) - 12 freq
hb (0) - 9 freq
hbo (1) - 1 freq
hub (1) - 31 freq
hbu (1) - 1 freq
heb (1) - 1 freq
hib (1) - 1 freq
hob (1) - 10 freq
hnb (2) - 1 freq
yb (2) - 3 freq
hg (2) - 5 freq
jb (2) - 2 freq
wb (2) - 4 freq
ha (2) - 180 freq
ghb (2) - 1 freq
hk (2) - 9 freq
ho (2) - 56 freq
ib (2) - 1 freq
hh (2) - 3 freq
phb (2) - 1 freq
hn (2) - 3 freq
sb (2) - 4 freq
hf (2) - 2 freq
hy (2) - 5 freq
hv (2) - 4 freq
ab (2) - 25 freq
SoundEx code - H100
have - 1198 freq
happy - 755 freq
huif - 12 freq
hope - 756 freq
hauf - 704 freq
heap - 46 freq
hip - 32 freq
hiv - 1171 freq
hav - 9 freq
hivvy - 22 freq
haev - 15 freq
howf - 18 freq
howpfu - 20 freq
howp - 160 freq
haufwey - 14 freq
hippo - 3 freq
huffy - 5 freq
huff - 34 freq
hoop - 34 freq
heave-ho - 3 freq
heavy - 192 freq
huv - 1162 freq
haippy - 2 freq
hape - 3 freq
hive - 29 freq
haip - 7 freq
hof - 1 freq
hap - 53 freq
hawf - 18 freq
'huv - 8 freq
'hope - 2 freq
hevvy - 1 freq
haaf - 34 freq
'haev - 4 freq
hove - 3 freq
hivvie - 18 freq
howif - 1 freq
hauf-wey - 4 freq
hippy- - 1 freq
houp - 3 freq
'hiv - 5 freq
howff - 56 freq
hi-fi - 3 freq
howpfae - 1 freq
heave - 13 freq
hop - 13 freq
hubby - 10 freq
hobby - 14 freq
hub - 31 freq
haeve - 5 freq
hif - 4 freq
haevy - 2 freq
haivy - 1 freq
hoof - 15 freq
houpee - 3 freq
hoap - 2 freq
haav - 1 freq
'hibee - 1 freq
hiva - 1 freq
huviae - 1 freq
hbo - 1 freq
hef - 1 freq
haufway - 3 freq
hauf-fu - 2 freq
hfe - 2 freq
h've - 1 freq
'howp - 2 freq
heapie - 1 freq
haep - 13 freq
'have - 4 freq
hubba - 1 freq
hb - 9 freq
hype - 6 freq
heebie - 3 freq
hippy - 4 freq
hobb - 1 freq
hoabby - 1 freq
'happy - 7 freq
hauf-fou - 1 freq
habbie - 9 freq
'hauf - 5 freq
hopp' - 1 freq
hob - 10 freq
haffie - 1 freq
hup - 5 freq
ho-bo - 1 freq
'hav - 1 freq
happie - 13 freq
hopp - 5 freq
hap- - 1 freq
hippie - 2 freq
höve - 1 freq
hev - 139 freq
hewvie - 1 freq
haf - 9 freq
'have' - 1 freq
hoove - 1 freq
heavie - 1 freq
habby - 2 freq
havy - 2 freq
hou've - 1 freq
hobbie - 1 freq
hubbie - 1 freq
hauf-way - 1 freq
'hup' - 1 freq
hauf-wye - 2 freq
happ - 4 freq
howpie - 1 freq
hauf- - 1 freq
houff - 3 freq
“hiv - 8 freq
howpfie - 1 freq
haff - 3 freq
haffi - 3 freq
‘hope - 2 freq
‘huv - 5 freq
‘happy - 2 freq
“huv - 1 freq
“have - 3 freq
habbie' - 1 freq
haap - 1 freq
‘have - 2 freq
“hauf - 1 freq
‘hup - 1 freq
hope- - 1 freq
“heavy - 1 freq
huf - 1 freq
hyv - 1 freq
hivy - 1 freq
’have - 1 freq
’happy - 1 freq
hiv' - 1 freq
hp - 3 freq
heb - 1 freq
hv - 4 freq
hoofba - 1 freq
‘hoof - 1 freq
heeve - 1 freq
hfi - 1 freq
hiv’i - 1 freq
hf - 2 freq
hbv - 1 freq
hbui - 1 freq
hib - 1 freq
hibee - 4 freq
hibby - 24 freq
heavey - 1 freq
hapoe - 1 freq
hbu - 1 freq
hvb - 2 freq
hubo - 1 freq
hvhh - 1 freq
haif - 1 freq
haiv - 1 freq
hpw - 1 freq
hwp - 1 freq
'heavy' - 1 freq
hppyu - 1 freq
hibbie - 2 freq
hwieb - 1 freq
hvw - 1 freq
heup - 1 freq
MetaPhone code - B
by - 4520 freq
be - 14795 freq
boy - 524 freq
bei - 55 freq
bi - 3248 freq
bay - 80 freq
buy - 378 freq
baa - 73 freq
bee' - 6 freq
boay - 199 freq
baw - 130 freq
bh - 13 freq
bi- - 5 freq
bou - 12 freq
bee - 51 freq
bow - 70 freq
bo - 18 freq
be' - 4 freq
bai - 10 freq
boa - 16 freq
bey - 24 freq
b- - 3 freq
'be - 13 freq
b - 745 freq
biow - 1 freq
'bi - 4 freq
ba - 139 freq
buoy - 3 freq
b' - 196 freq
ba' - 8 freq
booooo - 1 freq
boo - 58 freq
bb - 9 freq
beuy - 64 freq
bae - 855 freq
bia - 5 freq
buiy - 5 freq
baiy - 1 freq
booee - 1 freq
'by - 9 freq
bough - 5 freq
hbo - 1 freq
'b' - 2 freq
beh - 18 freq
'bae - 1 freq
bae' - 1 freq
bae- - 1 freq
hb - 9 freq
bi' - 3 freq
buh - 3 freq
bah - 4 freq
'b - 1 freq
bie - 15 freq
'boy - 1 freq
boy' - 3 freq
'boy' - 3 freq
'be' - 2 freq
bu - 13 freq
by' - 6 freq
'buy' - 1 freq
bue - 1 freq
bew - 7 freq
'beuy' - 1 freq
'buey' - 1 freq
buey - 10 freq
'buey - 4 freq
buey' - 1 freq
bæ - 1 freq
biy - 3 freq
by-e - 2 freq
by- - 3 freq
'by' - 2 freq
bî - 1 freq
be-e-ehh - 1 freq
b'aa - 1 freq
buyy - 1 freq
“by - 2 freq
‘bi - 1 freq
bí - 1 freq
béþ - 1 freq
…be - 1 freq
‘boa - 2 freq
‘b - 1 freq
‘by - 1 freq
‘beuy - 1 freq
“be - 6 freq
boiy - 4 freq
‘boo - 1 freq
“b - 1 freq
bio - 10 freq
bea - 2 freq
bba - 1 freq
wb - 4 freq
bw - 3 freq
boi - 3 freq
yb - 3 freq
hbui - 1 freq
baw' - 1 freq
buaai - 1 freq
“by - 1 freq
ybi - 1 freq
hbu - 1 freq
be’ - 1 freq
by’ - 1 freq
beee - 3 freq
biae - 1 freq
wbeo - 1 freq
HB
Time to execute Levenshtein function - 0.174697 milliseconds
The Levenshtein distance is the number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
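As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming computation. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb, or keep it
            ))
        previous = current
    return previous[-1]


print(levenshtein("hb", "hb"))   # 0
print(levenshtein("hb", "hob"))  # 1
```

The two printed values agree with the distances shown for hb and hob in the Levenshtein results above.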
Time to execute Double Levenshtein function - 0.337007 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
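The idea can be sketched in Python as follows. This is a reconstruction from the description, not the site's own code: the names `strip_vowels` and `double_levenshtein` are hypothetical, and treating only a, e, i, o, u as vowels (so y counts as a consonant) is an assumption.

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), computed row by row."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop vowels so only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("hb", "hub"))  # 1: the extra vowel costs only once
print(double_levenshtein("hb", "hnb"))  # 2: the extra consonant costs twice
```

Under these assumptions the sketch gives 1 for hub and 2 for hnb, which is how those two words are scored in the Double Levenshtein results above.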
Time to execute SoundEx function - 0.028502 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
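A minimal Python sketch of the commonly described American Soundex rules shows why such different spellings share one code. It is an illustration only; the implementation used on this site may differ in detail, for instance in how it handles apostrophes or accented letters, which this sketch simply ignores.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate two letters with the same digit
        code = CODES.get(ch, "")
        if code and code != last:
            result += code
        last = code  # a vowel resets this, so a repeated digit after it counts
    return (result + "000")[:4]  # pad with zeros, keep four characters


print(soundex("hb"))     # H100
print(soundex("have"))   # H100
print(soundex("happy"))  # H100
```

All three words reduce to H100, the code shown for hb in the SoundEx results above.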
Time to execute MetaPhone function - 0.040523 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.001104 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
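A lookup of this kind can be sketched in Python as a plain dictionary. The table contents and the names `LEMMA_TABLE` and `lookup_lemma` are hypothetical, invented for illustration; the site's actual lexicon and its format are not shown here.

```python
from typing import Optional

# Hypothetical entries: each spelling variant maps to its lemma.
LEMMA_TABLE = {
    "hiv": "have",
    "huv": "have",
    "haev": "have",
    "hauf": "half",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None when it is not covered."""
    return LEMMA_TABLE.get(word.lower())


print(lookup_lemma("huv"))  # have
print(lookup_lemma("hb"))   # None, because this sketch has no entry for it
```

A dictionary lookup does no computation on the word itself, which fits with this method having the shortest execution time of the five.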