A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dearth in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dearth (0) - 9 freq
derth (1) - 2 freq
hearth (1) - 30 freq
death (1) - 170 freq
darth (1) - 3 freq
earth (1) - 253 freq
dearbh (1) - 1 freq
foarth (2) - 1 freq
search (2) - 116 freq
heaith (2) - 1 freq
earthy (2) - 1 freq
dart (2) - 16 freq
feart' (2) - 1 freq
heart (2) - 184 freq
erth (2) - 3 freq
€œearth (2) - 1 freq
feart (2) - 540 freq
deary (2) - 4 freq
dear' (2) - 1 freq
dearie (2) - 37 freq
peart (2) - 1 freq
dwarts (2) - 1 freq
depart (2) - 7 freq
leart (2) - 4 freq
deat (2) - 1 freq
dearth (0) - 9 freq
derth (1) - 2 freq
darth (1) - 3 freq
earth (2) - 253 freq
dairth (2) - 1 freq
dearbh (2) - 1 freq
hearth (2) - 30 freq
death (2) - 170 freq
berth (3) - 21 freq
noarth (3) - 23 freq
daith (3) - 294 freq
drooth (3) - 20 freq
druith (3) - 1 freq
garth (3) - 3 freq
derts (3) - 2 freq
deepth (3) - 1 freq
warth (3) - 64 freq
depth (3) - 23 freq
drouth (3) - 49 freq
eirth (3) - 1 freq
werth (3) - 2 freq
earthy (3) - 1 freq
dart (3) - 16 freq
perth (3) - 41 freq
daeth (3) - 17 freq
SoundEx code - D630
drouthy - 17 freq
dirt - 70 freq
dirty - 107 freq
driet - 2 freq
dreid - 51 freq
draaed - 1 freq
dried - 74 freq
durty - 11 freq
dairt - 6 freq
drouth - 49 freq
dorty - 7 freq
dread - 34 freq
doort - 1 freq
drooth - 20 freq
door-ti - 1 freq
daured - 24 freq
draw'd - 1 freq
dirt' - 1 freq
dorothy - 45 freq
dared - 11 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
dird - 2 freq
dreed - 13 freq
dryed - 4 freq
dirtie - 1 freq
droid - 1 freq
dart - 16 freq
dorito - 1 freq
drawed - 1 freq
drite - 1 freq
daurt - 6 freq
draa'd - 3 freq
durt - 2 freq
drat - 2 freq
dry't - 2 freq
daurd - 1 freq
droothy - 4 freq
druith - 1 freq
dorty-wye - 1 freq
daared - 2 freq
durtie - 2 freq
dort - 2 freq
derth - 2 freq
drowt - 1 freq
dredd - 6 freq
draed - 1 freq
darth - 3 freq
derrida - 1 freq
derd - 1 freq
dortie - 3 freq
dheireadh - 1 freq
druid - 2 freq
'dreid' - 1 freq
daar't - 1 freq
€˜dread - 1 freq
dairth - 1 freq
dorado - 1 freq
dryte - 1 freq
€œdorothy - 1 freq
dee-haird - 1 freq
dae-or-dee - 1 freq
€˜dirty - 1 freq
doherty - 1 freq
droit - 1 freq
drowth - 1 freq
MetaPhone code - TR0
drouthy - 17 freq
truth - 294 freq
trowth - 26 freq
troth - 18 freq
drouth - 49 freq
truith - 62 freq
drooth - 20 freq
dorothy - 45 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
trooth - 2 freq
droothy - 4 freq
druith - 1 freq
'truth' - 1 freq
derth - 2 freq
trewth - 8 freq
darth - 3 freq
dairth - 1 freq
€˜truth - 1 freq
€œdorothy - 1 freq
treuth - 6 freq
trith - 1 freq
drowth - 1 freq
DEARTH
Time to execute Levenshtein function - 0.502682 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 1.101156 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.075557 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037761 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.049140 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.