A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to hairth in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
hairth (0) - 26 freq
dairth (1) - 1 freq
nairth (1) - 1 freq
hairt (1) - 341 freq
hirth (1) - 1 freq
hailth (1) - 2 freq
hairths (1) - 1 freq
pairth (1) - 1 freq
bairth (1) - 1 freq
airth (1) - 11 freq
hairts (1) - 61 freq
hairty (1) - 3 freq
hairst (2) - 182 freq
faith (2) - 161 freq
wairmth (2) - 8 freq
girth (2) - 5 freq
wailth (2) - 1 freq
airty (2) - 1 freq
wirth (2) - 89 freq
hearth (2) - 30 freq
mairt (2) - 5 freq
hailt (2) - 6 freq
baith (2) - 1582 freq
haitd (2) - 1 freq
hærth (2) - 1 freq
hairth (0) - 26 freq
hirth (1) - 1 freq
hairts (2) - 61 freq
airth (2) - 11 freq
hairty (2) - 3 freq
herth (2) - 3 freq
bairth (2) - 1 freq
hearth (2) - 30 freq
dairth (2) - 1 freq
pairth (2) - 1 freq
hairt (2) - 341 freq
nairth (2) - 1 freq
hailth (2) - 2 freq
hairths (2) - 1 freq
earth (3) - 251 freq
hart (3) - 28 freq
halth (3) - 10 freq
harte (3) - 1 freq
hairtie (3) - 1 freq
hateth (3) - 1 freq
yirth (3) - 41 freq
hairtet (3) - 1 freq
darth (3) - 3 freq
hairtfu (3) - 1 freq
haerts (3) - 19 freq
SoundEx code - H630
heard - 1263 freq
hard - 782 freq
hairt - 341 freq
herd - 157 freq
hert - 762 freq
hertie - 11 freq
haurd - 150 freq
haerd - 26 freq
hurrit - 11 freq
hurt - 115 freq
haird - 124 freq
hearth - 30 freq
hurried - 57 freq
hearty - 9 freq
heart - 171 freq
'hard - 2 freq
'here't - 1 freq
here't - 2 freq
hired - 27 freq
hurrayed - 1 freq
hairty - 3 freq
haired - 5 freq
hearit - 2 freq
huird - 2 freq
hairdo - 3 freq
hair-do - 2 freq
herrit - 1 freq
heairt - 1 freq
harit - 1 freq
harried - 6 freq
hairth - 26 freq
howard - 4 freq
hardy - 22 freq
hear't - 33 freq
herty - 9 freq
haert - 43 freq
heared - 14 freq
hairriet - 2 freq
hird - 27 freq
hoard - 9 freq
heyrd - 3 freq
haard - 17 freq
heryd - 1 freq
horatio - 2 freq
hirth - 1 freq
hirada - 1 freq
heired - 3 freq
heerd - 93 freq
hair-dye - 4 freq
heirt - 4 freq
harriet - 1 freq
hurriet - 4 freq
hart - 28 freq
hardie - 26 freq
herod - 49 freq
hoord - 3 freq
hairtie - 1 freq
hurdie - 4 freq
he'ard - 1 freq
her'd - 1 freq
herriet - 4 freq
horreed - 1 freq
hrt - 1 freq
heered - 2 freq
hert- - 1 freq
horrid - 9 freq
heard- - 1 freq
hærth - 1 freq
horde - 1 freq
heir'd - 1 freq
hort - 5 freq
herth - 3 freq
hearadh - 1 freq
he'rt - 4 freq
harraed - 2 freq
hered - 2 freq
herried - 4 freq
€˜heart - 1 freq
€˜hert - 2 freq
harte - 1 freq
harrit - 1 freq
hairrit - 1 freq
h'ard - 1 freq
€œhard - 2 freq
hiraeth - 1 freq
harrot - 3 freq
hir't - 1 freq
hared - 1 freq
€˜hairt - 1 freq
‘hard - 2 freq
hurdy - 1 freq
hurd - 2 freq
heird - 1 freq
MetaPhone code - HR0
hearth - 30 freq
hairth - 26 freq
hirth - 1 freq
herth - 3 freq
hiraeth - 1 freq
HAIRTH
Time to execute Levenshtein function - 0.168229 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.316025 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027653 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036578 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000855 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.