A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to hargey in Corpus

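Results are listed by method, in this order: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated. A number in brackets is the distance from hargey.

Levenshtein distance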
hargey (0) - 7 freq
harley (1) - 7 freq
harvey (1) - 5 freq
harken (2) - 7 freq
hurley (2) - 2 freq
barged (2) - 2 freq
hared (2) - 1 freq
charley (2) - 1 freq
targe (2) - 5 freq
argy (2) - 6 freq
harly (2) - 13 freq
harper (2) - 3 freq
harder (2) - 67 freq
sharger (2) - 1 freq
barley (2) - 31 freq
harry (2) - 186 freq
hanger (2) - 3 freq
sergey (2) - 1 freq
badgey (2) - 1 freq
charge' (2) - 1 freq
harte (2) - 1 freq
barge (2) - 15 freq
charge (2) - 61 freq
carey (2) - 2 freq
kargem (2) - 1 freq
Double Levenshtein distance
hargey (0) - 7 freq
harvey (2) - 5 freq
harley (2) - 7 freq
hare (3) - 199 freq
harray (3) - 3 freq
harem (3) - 1 freq
hares (3) - 14 freq
harte (3) - 1 freq
hardy (3) - 22 freq
charge (3) - 61 freq
marge (3) - 1 freq
harse (3) - 1 freq
hauge (3) - 2 freq
bargy (3) - 1 freq
harne (3) - 2 freq
sergey (3) - 1 freq
large (3) - 56 freq
barge (3) - 15 freq
targe (3) - 5 freq
argy (3) - 6 freq
harly (3) - 13 freq
hurley (3) - 2 freq
hared (3) - 1 freq
harry (3) - 186 freq
verge (4) - 9 freq
SoundEx code - H620
harsh - 20 freq
hairs - 35 freq
horse - 232 freq
hears - 46 freq
here's - 265 freq
hershaw - 15 freq
hair's - 18 freq
hoarse - 29 freq
hare's - 6 freq
hers - 84 freq
hers-she - 3 freq
her's - 3 freq
hoorish - 1 freq
hours - 114 freq
'here's - 16 freq
hairse - 14 freq
herr's - 1 freq
heroes - 39 freq
heroic - 8 freq
hero's - 3 freq
hark - 15 freq
heirs - 7 freq
hairies - 4 freq
hoors - 49 freq
heres - 8 freq
hurrays - 1 freq
harry's - 8 freq
hersh - 2 freq
heroes' - 2 freq
harris - 14 freq
hor's - 1 freq
herk - 3 freq
hooers - 2 freq
hayr's - 1 freq
hurries - 3 freq
herc - 1 freq
harz - 1 freq
hearsay - 2 freq
heros - 1 freq
horace - 23 freq
hoor's - 3 freq
hearse - 2 freq
hurs - 4 freq
hurs-she - 1 freq
hirs - 5 freq
hirs-shae - 1 freq
heresy - 3 freq
harass - 2 freq
hairy's - 3 freq
'horsey' - 1 freq
'horace - 1 freq
horace' - 1 freq
hurehoose - 1 freq
haerse - 4 freq
'hers' - 1 freq
horss - 22 freq
harsk - 3 freq
harks - 1 freq
hier's - 1 freq
horraquoy - 1 freq
hours' - 1 freq
herries - 4 freq
heyrick - 2 freq
hir's - 1 freq
hera's - 1 freq
herz - 1 freq
horse' - 1 freq
harse - 1 freq
houres - 8 freq
herks - 1 freq
hares - 14 freq
harras - 1 freq
horsie - 7 freq
'horse' - 1 freq
horus - 1 freq
haris - 1 freq
harrouis - 1 freq
hires - 1 freq
herrs - 1 freq
hrs - 13 freq
hures - 1 freq
hargey - 7 freq
heris - 1 freq
here’s - 22 freq
hrxq - 1 freq
horse” - 1 freq
heers - 2 freq
hrish - 1 freq
hhroego - 1 freq
harrys - 2 freq
MetaPhone code - HRJ
hargey - 7 freq
Manually curated - HARGEY
Time to execute Levenshtein function - 0.490870 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
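As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. The function names `levenshtein` and `nearest` and the short word list are assumptions made for this example; this is not the code the site itself runs.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] = distance between the processed prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(query, vocabulary, limit=5):
    """Return (word, distance) pairs, closest first, ties broken alphabetically."""
    scored = [(word, levenshtein(query, word)) for word in vocabulary]
    return sorted(scored, key=lambda pair: (pair[1], pair[0]))[:limit]


# A handful of words taken from the results list, used as a toy vocabulary.
sample = ["harley", "harvey", "harken", "hurley", "verge"]
for word, distance in nearest("hargey", sample):
    print(f"{word} ({distance})")
```

Keeping only two rows of the table at a time is a common space saving; the full matrix is only needed if you also want to recover the edit sequence.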
Time to execute Double Levenshtein function - 0.676528 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
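The description above can be sketched in Python as follows. The vowel set is an assumption: treating `y` as a vowel alongside `a e i o u` reproduces the scores listed for hargey (for example hare at 3 and verge at 4), but the site's actual vowel list is not stated. The names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiouy")  # assumption: y is counted as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete all cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, keeping the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Full-word distance plus the distance between consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ["harley", "hare", "verge"]:
    print(word, double_levenshtein("hargey", word))
```

Because a consonant change shows up in both passes while a vowel change shows up only in the first, words that differ only in their vowels rank closer than words that differ in consonants.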
Time to execute SoundEx function - 0.029020 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
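For readers who want to see how such a code is derived, the sketch below follows the commonly documented American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad to four characters. Stripping apostrophes and other non-letters before encoding is an assumption of this example, and the site's own implementation may differ in edge cases.

```python
# Consonant groups and their Soundex digits; vowels, y, h and w have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if word has no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate repeated codes
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
        previous = code  # a vowel resets this, so a repeat after it counts
        if len(result) == 4:
            break
    return result.ljust(4, "0")


for word in ["hargey", "horse", "here's", "hrxq"]:
    print(word, soundex(word))
```

Since vowels are discarded and g, s, k and x all share the digit 2, very different-looking words such as horse and hargey end up with the same code, which explains the length of the SoundEx list above.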
Time to execute MetaPhone function - 0.089326 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000849 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
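A lookup of this kind can be sketched as a plain dictionary. The table contents and the names `LEXICON` and `curated_lemma` are hypothetical; the single entry simply mirrors the result shown for hargey, and the real lexicon's format is not described here.

```python
from typing import Optional

# Hypothetical hand-built table: spelling variant -> lemma.
LEXICON = {
    "hargey": "HARGEY",  # the one pairing visible in the results for hargey
}


def curated_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma for word, or None if the word is not covered."""
    return lexicon.get(word.lower())


print(curated_lemma("hargey"))
print(curated_lemma("unlisted"))
```

A dictionary lookup does no string comparison against the rest of the corpus, which fits with this method having the shortest execution time of the five.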