A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to harl in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
harl (0) - 5 freq
hawl (1) - 1 freq
warl (1) - 164 freq
farl (1) - 5 freq
harly (1) - 13 freq
hare (1) - 199 freq
earl (1) - 31 freq
karl (1) - 6 freq
haal (1) - 42 freq
harr (1) - 2 freq
carl (1) - 13 freq
yarl (1) - 18 freq
hart (1) - 28 freq
hark (1) - 15 freq
harm (1) - 22 freq
harz (1) - 1 freq
harn (1) - 5 freq
hari (1) - 1 freq
hael (1) - 9 freq
hail (1) - 265 freq
marl (1) - 1 freq
hayl (1) - 2 freq
harp (1) - 9 freq
varl (1) - 1 freq
jarl (1) - 36 freq
harl (0) - 5 freq
harly (1) - 13 freq
hurl (1) - 64 freq
darl (2) - 1 freq
hkrl (2) - 1 freq
haul (2) - 48 freq
hard (2) - 799 freq
harld (2) - 1 freq
harp (2) - 9 freq
varl (2) - 1 freq
jarl (2) - 36 freq
har (2) - 8 freq
harlt (2) - 5 freq
hurly (2) - 4 freq
harley (2) - 7 freq
harlie (2) - 6 freq
haurly (2) - 16 freq
barl (2) - 1 freq
hayl (2) - 2 freq
hal (2) - 4 freq
hurla (2) - 1 freq
hall (2) - 190 freq
karl (2) - 6 freq
haal (2) - 42 freq
harr (2) - 2 freq
SoundEx code - H640
hare'll - 2 freq
haurlie - 4 freq
hurl - 64 freq
haurly - 16 freq
here'll - 1 freq
harley - 7 freq
harly - 13 freq
harl - 5 freq
hair-oil - 1 freq
hurly - 4 freq
hurley - 2 freq
harlie - 6 freq
hourle - 1 freq
hurlie - 1 freq
hurla - 1 freq
harlaw - 4 freq
hourly - 1 freq
MetaPhone code - HRL
hare'll - 2 freq
haurlie - 4 freq
hurl - 64 freq
haurly - 16 freq
here'll - 1 freq
harley - 7 freq
harly - 13 freq
harl - 5 freq
hair-oil - 1 freq
hurly - 4 freq
hurley - 2 freq
harlie - 6 freq
hourle - 1 freq
hurlie - 1 freq
hurla - 1 freq
harlaw - 4 freq
hourly - 1 freq
HARL
Time to execute Levenshtein function - 0.208226 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.348697 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.031220 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.041957 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000913 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.