A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to harrassed in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in parentheses)
harrassed (0) - 2 freq
harrased (1) - 1 freq
harassed (1) - 1 freq
harasses (2) - 1 freq
harraed (2) - 2 freq
harnessed (2) - 4 freq
adressed (3) - 2 freq
harras (3) - 1 freq
hurrayed (3) - 1 freq
narraed (3) - 1 freq
harvested (3) - 1 freq
grassed (3) - 5 freq
amassed (3) - 2 freq
harassit (3) - 1 freq
croassed (3) - 7 freq
carcasses (3) - 4 freq
arrested (3) - 15 freq
harrans (3) - 1 freq
embarrassed (3) - 39 freq
arrayed (3) - 3 freq
hairsted (3) - 1 freq
harass (3) - 2 freq
canvassed (3) - 2 freq
caressed (3) - 2 freq
harried (3) - 6 freq
Double Levenshtein (summed distance in parentheses)
harrassed (0) - 2 freq
harassed (2) - 1 freq
harrased (2) - 1 freq
harnessed (3) - 4 freq
harasses (4) - 1 freq
harraed (4) - 2 freq
hairsted (5) - 1 freq
embarrassed (5) - 39 freq
arrested (5) - 15 freq
harass (5) - 2 freq
surpassed (5) - 1 freq
croassed (5) - 7 freq
harried (5) - 6 freq
caressed (5) - 2 freq
harrans (5) - 1 freq
adressed (5) - 2 freq
hurrayed (5) - 1 freq
harras (5) - 1 freq
grassed (5) - 5 freq
harvested (5) - 1 freq
harassit (5) - 1 freq
dressed (6) - 118 freq
pressed (6) - 74 freq
horsed (6) - 3 freq
embairrassed (6) - 2 freq
SoundEx code - H623
hairst - 188 freq
horsed - 3 freq
harkit - 10 freq
hairstyle - 2 freq
haircut - 22 freq
herst - 6 freq
hairstin - 14 freq
hairsts - 7 freq
hirstlin - 2 freq
huirst - 1 freq
hirst - 2 freq
harrassed - 2 freq
hairsted - 1 freq
horse-tradin - 1 freq
harrased - 1 freq
hairstit - 4 freq
hairset - 3 freq
hairst-rig - 1 freq
horchata - 2 freq
horchatería - 1 freq
hirsty - 2 freq
horse-drawn - 1 freq
hærst - 1 freq
hærsts - 1 freq
hairst-blinks - 1 freq
hairst's - 1 freq
hairstless - 1 freq
haircuts - 2 freq
haarst - 1 freq
hairst-time - 1 freq
harestanes - 1 freq
hairst-taest - 1 freq
hairst-gowd - 1 freq
hairster - 1 freq
hairstpark - 1 freq
hairsters - 4 freq
hairst-moose - 1 freq
herkit - 1 freq
harassit - 1 freq
harrowgate - 1 freq
‘hairst - 1 freq
hirsute - 1 freq
hairstyles - 2 freq
harassed - 1 freq
heresdaibhi - 19 freq
hoorsaday - 1 freq
hoursdrive - 1 freq
hryqcdn - 1 freq
harighotra - 1 freq
harrisdistiller - 1 freq
MetaPhone code - HRST
hairst - 188 freq
horsed - 3 freq
herst - 6 freq
huirst - 1 freq
hirst - 2 freq
harrassed - 2 freq
harrased - 1 freq
hairset - 3 freq
hirsty - 2 freq
haarst - 1 freq
harassit - 1 freq
‘hairst - 1 freq
hirsute - 1 freq
harassed - 1 freq
hoorsaday - 1 freq
Manually curated
HARRASSED
Time to execute Levenshtein function - 0.623048 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
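As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. The function name `levenshtein` is chosen for this example, and this is not necessarily how the corpus site computes its figures.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character inserts, deletes and replacements
    needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning a[:i] into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


print(levenshtein("harrassed", "harassed"))   # 1
print(levenshtein("harrassed", "harnessed"))  # 2
```

Both results agree with the distances shown in the Levenshtein listing for harrassed.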
Time to execute Double Levenshtein function - 0.905721 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
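The idea can be sketched in Python as follows. This is an illustration under assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented here, and the vowel set is assumed to be a, e, i, o, u (whether the site treats y as a vowel is not stated).

```python
VOWELS = set("aeiou")  # assumption: y is treated as a consonant


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the vowels, keeping only the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("harrassed", "harassed"))   # 2
print(double_levenshtein("harrassed", "harnessed"))  # 3
print(double_levenshtein("harrassed", "dressed"))    # 6
```

Under these assumptions the sketch gives the same scores as the Double Levenshtein listing for harassed (2), harnessed (3) and dressed (6).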
Time to execute SoundEx function - 0.093472 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
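For readers who want to see how a code such as H623 comes about, here is a minimal Python sketch of the classic American Soundex rules. The site's own implementation is not shown, so details may differ, particularly for non-ASCII letters, which this sketch simply ignores.

```python
# Consonant groups and their Soundex digits.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code (letter plus three digits)."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")  # digit of the previous coded letter
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit:
            if digit != last:      # adjacent letters with the same digit collapse
                code += digit
            last = digit
        elif ch not in "hw":       # vowels (and y) separate repeated digits;
            last = ""              # h and w do not
        if len(code) == 4:
            break
    return code.ljust(4, "0")      # pad short codes with zeros


print(soundex("harrassed"))  # H623
print(soundex("hairst"))     # H623
```

Because only the first letter and the next three consonant classes are kept, harrassed and hairst fall into the same bucket, which explains why the SoundEx listing is so much broader than the Levenshtein ones.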
Time to execute MetaPhone function - 0.102500 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000928 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
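A lookup of this kind can be sketched in a few lines of Python. The entries below are made up for illustration and are not taken from the corpus's actual lexicon; the names `LEXICON` and `lemma_for` are likewise hypothetical.

```python
from typing import Optional

# Hypothetical entries for illustration only; not the corpus's real lexicon.
LEXICON = {
    "harrassed": "harass",
    "harrased": "harass",
    "harassit": "harass",
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


print(lemma_for("Harrassed"))  # harass
print(lemma_for("sonsie"))     # None
```

A plain dictionary lookup involves no string comparison against the rest of the vocabulary, which is consistent with this method reporting by far the shortest execution time.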