A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to heighlit in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
heighlit (0) - 1 freq
heilit (2) - 1 freq
height (2) - 45 freq
heighlicht (2) - 3 freq
heighest (2) - 2 freq
heichlie (2) - 2 freq
heighly (2) - 1 freq
feightit (2) - 1 freq
lightit (3) - 1 freq
heichtie (3) - 1 freq
hechtit (3) - 5 freq
reight (3) - 6 freq
haiglin (3) - 3 freq
dightit (3) - 1 freq
weight (3) - 55 freq
heilin (3) - 1 freq
heisit (3) - 1 freq
reishlin (3) - 2 freq
eight (3) - 69 freq
hichtit (3) - 1 freq
meithit (3) - 1 freq
highest (3) - 21 freq
feight (3) - 3 freq
hechl't (3) - 2 freq
wheichit (3) - 1 freq
heighlit (0) - 1 freq
heighest (3) - 2 freq
heighly (3) - 1 freq
height (3) - 45 freq
hight (4) - 2 freq
highest (4) - 21 freq
heilit (4) - 1 freq
hechlet (4) - 1 freq
feightit (4) - 1 freq
heichlie (4) - 2 freq
highly (4) - 17 freq
heighlicht (4) - 3 freq
heichest (5) - 24 freq
heicht (5) - 53 freq
taiglit (5) - 2 freq
hirplit (5) - 5 freq
heights (5) - 3 freq
highlicht (5) - 3 freq
hochlet (5) - 1 freq
hagglet (5) - 2 freq
hushlet (5) - 1 freq
haggult (5) - 1 freq
haglet (5) - 1 freq
highlight (5) - 16 freq
heigh (5) - 7 freq
SoundEx code - H243
haglet - 1 freq
huckled - 12 freq
heckled - 1 freq
hauchled - 1 freq
higgeldy-piggeldy - 1 freq
hochlet - 1 freq
higgledy - 1 freq
hoosehold - 12 freq
huckelt - 2 freq
hochled - 1 freq
haggult - 1 freq
household - 10 freq
heckelt - 1 freq
heukelt - 2 freq
hoosehald - 1 freq
higgledy-piggledy - 1 freq
higgelty-piggelty - 1 freq
heighlit - 1 freq
heich-leid - 1 freq
hechl't - 2 freq
hochl't - 2 freq
hazeldean' - 1 freq
hickled - 1 freq
hacklet - 1 freq
hechlet - 1 freq
househald - 2 freq
high-heeled - 1 freq
hoosehauld - 2 freq
hushlet - 1 freq
hagglet - 2 freq
higgilty - 1 freq
hazelheed - 1 freq
hooseholds - 1 freq
MetaPhone code - HLT
hoolet - 106 freq
held - 509 freq
holiday - 152 freq
hoaliday - 16 freq
haaled - 21 freq
howled - 15 freq
hold - 65 freq
halt - 28 freq
heeld - 2 freq
hauled - 21 freq
houlet - 9 freq
heelt - 3 freq
haalt - 2 freq
hilda - 23 freq
healed - 10 freq
hauld - 17 freq
haliday - 1 freq
hilt - 2 freq
houlit - 3 freq
hell'd - 1 freq
haild - 2 freq
heilit - 1 freq
haul't - 1 freq
hault - 18 freq
hult - 1 freq
hailed - 14 freq
halled - 2 freq
holed - 7 freq
hailit - 1 freq
haeled - 2 freq
hoult - 7 freq
howlt - 1 freq
helt - 5 freq
haulit - 3 freq
hailt - 6 freq
healt - 11 freq
'holiday' - 1 freq
hallaed - 1 freq
haelt - 3 freq
howld - 17 freq
heiled - 1 freq
holiday' - 1 freq
'hoolet - 2 freq
heighlit - 1 freq
heeled - 1 freq
hallooed - 1 freq
heild - 48 freq
haled - 1 freq
holieday - 1 freq
hallowday - 1 freq
€œhold - 3 freq
halloday - 1 freq
holit - 1 freq
hollaeed - 1 freq
hoalidy - 2 freq
hooled - 1 freq
heuld - 1 freq
holy-day - 1 freq
halliday - 1 freq
holaeday - 2 freq
hoo-let - 1 freq
'hoolet' - 3 freq
holt - 1 freq
HEIGHLIT
Time to execute Levenshtein function - 0.206456 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.370425 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028191 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.038766 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000884 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.