A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hit (0) - 1229 freq
hib (1) - 1 freq
hin (1) - 36 freq
rit (1) - 13 freq
lit (1) - 283 freq
'hit (1) - 1 freq
'it (1) - 174 freq
vit (1) - 3 freq
fit (1) - 3811 freq
hip (1) - 34 freq
hid (1) - 3439 freq
hiy (1) - 1 freq
hmt (1) - 1 freq
zit (1) - 2 freq
het (1) - 262 freq
hii (1) - 1 freq
iit (1) - 1 freq
hyt (1) - 1 freq
bit (1) - 7597 freq
hft (1) - 1 freq
it (1) - 33301 freq
hiz (1) - 518 freq
hait (1) - 16 freq
mit (1) - 3 freq
sit (1) - 674 freq
hit (0) - 1229 freq
hut (1) - 85 freq
hyt (1) - 1 freq
hait (1) - 16 freq
het (1) - 262 freq
ht (1) - 8 freq
hot (1) - 206 freq
hat (1) - 177 freq
hixt (2) - 1 freq
eit (2) - 644 freq
htt (2) - 1 freq
kit (2) - 31 freq
chit (2) - 4 freq
hic (2) - 7 freq
jit (2) - 2 freq
git (2) - 1244 freq
hie (2) - 113 freq
cit (2) - 4 freq
hits (2) - 150 freq
yit (2) - 500 freq
shit (2) - 114 freq
hir (2) - 1279 freq
his (2) - 17253 freq
heat (2) - 163 freq
hyte (2) - 3 freq
SoundEx code - H300
heid - 3306 freq
had - 4994 freq
hot - 206 freq
haud - 920 freq
he'd - 974 freq
head - 260 freq
het - 262 freq
hit - 1229 freq
'haud - 40 freq
heed - 349 freq
hid - 3439 freq
haed - 1599 freq
hoat - 50 freq
heid'd - 2 freq
hate - 195 freq
hide - 189 freq
heiddae - 2 freq
hidtae - 4 freq
hat - 177 freq
howd - 1 freq
hyte - 3 freq
hoyed - 8 freq
hud - 1321 freq
heat - 163 freq
hut - 85 freq
huda - 1 freq
hood - 37 freq
hed - 1155 freq
hae't - 8 freq
hawd - 12 freq
hod - 12 freq
heidie - 70 freq
'hudd - 3 freq
hudd - 3 freq
hath - 3 freq
hait - 16 freq
haad - 71 freq
hyde - 16 freq
haet - 66 freq
heady - 6 freq
howdie - 10 freq
hied - 16 freq
haut - 3 freq
haddie' - 2 freq
hidie - 1 freq
hout - 1 freq
hoot - 28 freq
'hit - 1 freq
hett - 35 freq
heidy - 16 freq
haddie - 9 freq
hoodie - 17 freq
hoo'd - 4 freq
heth - 25 freq
haetae - 7 freq
haitd - 1 freq
heid' - 4 freq
heet - 2 freq
hyed - 1 freq
'hd - 1 freq
how'd - 8 freq
howed - 2 freq
hoody - 2 freq
heaith - 1 freq
heaied - 1 freq
heaithy - 1 freq
hythe - 2 freq
hot' - 1 freq
howdya - 1 freq
'he'd - 6 freq
hd - 4 freq
hei'd - 2 freq
heidwey - 4 freq
hid' - 1 freq
ht - 8 freq
hetty - 1 freq
hïd - 6 freq
'hoat - 1 freq
howdy - 1 freq
hae'd - 1 freq
het' - 1 freq
hi-doh - 6 freq
hadd - 70 freq
'hit' - 2 freq
hoid - 10 freq
heedtae - 1 freq
hyt - 1 freq
'het - 2 freq
'hat - 1 freq
hede - 4 freq
hatt - 1 freq
huid - 2 freq
hade - 1 freq
'hide - 1 freq
hedd - 117 freq
hidey - 2 freq
'head - 1 freq
hideawa - 1 freq
haid - 31 freq
heid-a - 1 freq
headie - 1 freq
heath - 2 freq
'hidie' - 1 freq
heid- - 1 freq
hie-heid - 3 freq
heywood - 1 freq
heute - 1 freq
huidie - 2 freq
hewitt - 3 freq
'had - 1 freq
hoidey - 1 freq
€™hud - 1 freq
€˜hued - 1 freq
'hood - 1 freq
headwye - 1 freq
€œhaud - 4 freq
€œhud - 1 freq
€œhit - 9 freq
€œheth - 1 freq
€˜hid - 2 freq
€˜hud - 1 freq
huttie - 1 freq
€œhid - 2 freq
huddie - 2 freq
hte - 1 freq
hyde' - 1 freq
€œhide - 1 freq
heedy - 3 freq
hiddae - 3 freq
howdoo - 1 freq
'heid - 2 freq
hattie - 1 freq
how-d - 1 freq
€œhout - 1 freq
€œhowt - 1 freq
€œhet - 1 freq
hyd - 7 freq
€™head - 1 freq
€œhadd - 3 freq
heyd - 1 freq
haaed - 1 freq
heidwie - 1 freq
€™hd - 1 freq
hi'd - 2 freq
hoad - 1 freq
'hud - 1 freq
heÂ’d - 3 freq
haute - 1 freq
hoodoo - 1 freq
haddo - 1 freq
hudduo - 1 freq
heidi - 1 freq
“had - 1 freq
hthy - 1 freq
howty - 1 freq
hudty - 1 freq
htew - 1 freq
hudtae - 1 freq
htt - 1 freq
heyday - 1 freq
'had' - 1 freq
MetaPhone code - HT
heid - 3306 freq
had - 4994 freq
hot - 206 freq
haud - 920 freq
he'd - 974 freq
head - 260 freq
het - 262 freq
hit - 1229 freq
'haud - 40 freq
heed - 349 freq
hid - 3439 freq
haed - 1599 freq
hoat - 50 freq
hate - 195 freq
hide - 189 freq
height - 45 freq
heiddae - 2 freq
hat - 177 freq
howd - 1 freq
hud - 1321 freq
heat - 163 freq
hut - 85 freq
huda - 1 freq
hood - 37 freq
hed - 1155 freq
hae't - 8 freq
hawd - 12 freq
hod - 12 freq
heidie - 70 freq
'hudd - 3 freq
hudd - 3 freq
hait - 16 freq
haad - 71 freq
haet - 66 freq
heady - 6 freq
howdie - 10 freq
hied - 16 freq
haut - 3 freq
haddie' - 2 freq
hidie - 1 freq
hout - 1 freq
hoot - 28 freq
'hit - 1 freq
hett - 35 freq
heidy - 16 freq
haddie - 9 freq
hoodie - 17 freq
hoo'd - 4 freq
haughheid - 1 freq
haughty - 3 freq
haetae - 7 freq
heid' - 4 freq
heet - 2 freq
how'd - 8 freq
hoody - 2 freq
heaied - 1 freq
hot' - 1 freq
'he'd - 6 freq
hei'd - 2 freq
hid' - 1 freq
hetty - 1 freq
haughed - 1 freq
'hoat - 1 freq
howdy - 1 freq
hae'd - 1 freq
het' - 1 freq
hi-doh - 6 freq
hadd - 70 freq
'hit' - 2 freq
hoid - 10 freq
'het - 2 freq
'hat - 1 freq
hede - 4 freq
hatt - 1 freq
huid - 2 freq
hade - 1 freq
'hide - 1 freq
hedd - 117 freq
hidey - 2 freq
'head - 1 freq
haid - 31 freq
heid-a - 1 freq
headie - 1 freq
'hidie' - 1 freq
heid- - 1 freq
heute - 1 freq
huidie - 2 freq
'had - 1 freq
hoidey - 1 freq
€™hud - 1 freq
€˜hued - 1 freq
'hood - 1 freq
€œhaud - 4 freq
€œhud - 1 freq
€œhit - 9 freq
€˜hid - 2 freq
€˜hud - 1 freq
huttie - 1 freq
€œhid - 2 freq
huddie - 2 freq
€œhide - 1 freq
heedy - 3 freq
hiddae - 3 freq
howdoo - 1 freq
'heid - 2 freq
hattie - 1 freq
how-d - 1 freq
€œhout - 1 freq
€œhowt - 1 freq
€œhet - 1 freq
€™head - 1 freq
€œhadd - 3 freq
heyd - 1 freq
haaed - 1 freq
hi'd - 2 freq
hoad - 1 freq
'hud - 1 freq
heÂ’d - 3 freq
haute - 1 freq
hoodoo - 1 freq
haddo - 1 freq
hudduo - 1 freq
heidi - 1 freq
“had - 1 freq
howty - 1 freq
heyday - 1 freq
'had' - 1 freq
HIT
hit - 1229 freq
hits - 150 freq
hittin - 39 freq
hitting - 4 freq
it - 33301 freq
it's - 5544 freq
its - 3335 freq
eet - 581 freq
eet's - 58 freq
hit - 1229 freq
Time to execute Levenshtein function - 0.209169 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.343384 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027644 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037897 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000971 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.