A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to darth in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
darth (0) - 3 freq
dart (1) - 16 freq
dearth (1) - 9 freq
dairth (1) - 1 freq
derth (1) - 2 freq
darts (1) - 31 freq
warth (1) - 64 freq
earth (1) - 251 freq
garth (1) - 3 freq
dath (1) - 1 freq
daeth (1) - 17 freq
daith (1) - 286 freq
werth (2) - 2 freq
tart (2) - 12 freq
arts (2) - 34 freq
cath (2) - 4 freq
dairt (2) - 6 freq
math (2) - 6 freq
marsh (2) - 8 freq
deith (2) - 102 freq
wamth (2) - 1 freq
fart (2) - 27 freq
dairths (2) - 1 freq
march (2) - 138 freq
wirth (2) - 89 freq
darth (0) - 3 freq
dairth (1) - 1 freq
derth (1) - 2 freq
dearth (1) - 9 freq
dath (2) - 1 freq
daeth (2) - 17 freq
dart (2) - 16 freq
garth (2) - 3 freq
daith (2) - 286 freq
darts (2) - 31 freq
warth (2) - 64 freq
earth (2) - 251 freq
dearbh (3) - 1 freq
deeth (3) - 4 freq
wurth (3) - 7 freq
nairth (3) - 1 freq
worth (3) - 248 freq
herth (3) - 3 freq
bairth (3) - 1 freq
hirth (3) - 1 freq
depth (3) - 22 freq
foarth (3) - 1 freq
dartit (3) - 2 freq
dartin (3) - 5 freq
byrth (3) - 1 freq
SoundEx code - D630
drouthy - 17 freq
dirt - 69 freq
dirty - 103 freq
driet - 2 freq
dreid - 51 freq
draaed - 1 freq
dried - 71 freq
durty - 11 freq
dairt - 6 freq
drouth - 49 freq
dorty - 7 freq
dread - 30 freq
doort - 1 freq
drooth - 20 freq
door-ti - 1 freq
daured - 24 freq
draw'd - 1 freq
dirt' - 1 freq
dorothy - 45 freq
dared - 11 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
dird - 2 freq
dreed - 13 freq
dryed - 4 freq
dirtie - 1 freq
droid - 1 freq
dart - 16 freq
dorito - 1 freq
drawed - 1 freq
drite - 1 freq
daurt - 6 freq
draa'd - 3 freq
durt - 2 freq
drat - 2 freq
dry't - 2 freq
daurd - 1 freq
droothy - 4 freq
druith - 1 freq
dorty-wye - 1 freq
daared - 2 freq
durtie - 2 freq
dort - 2 freq
derth - 2 freq
drowt - 1 freq
dredd - 6 freq
draed - 1 freq
darth - 3 freq
derrida - 1 freq
derd - 1 freq
dortie - 3 freq
dheireadh - 1 freq
druid - 2 freq
'dreid' - 1 freq
daar't - 1 freq
€˜dread - 1 freq
dairth - 1 freq
dorado - 1 freq
dryte - 1 freq
€œdorothy - 1 freq
dee-haird - 1 freq
dae-or-dee - 1 freq
€˜dirty - 1 freq
doherty - 1 freq
droit - 1 freq
drowth - 1 freq
MetaPhone code - TR0
drouthy - 17 freq
truth - 287 freq
trowth - 26 freq
troth - 18 freq
drouth - 49 freq
truith - 61 freq
drooth - 20 freq
dorothy - 45 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
trooth - 2 freq
droothy - 4 freq
druith - 1 freq
'truth' - 1 freq
derth - 2 freq
trewth - 8 freq
darth - 3 freq
dairth - 1 freq
€˜truth - 1 freq
€œdorothy - 1 freq
treuth - 6 freq
trith - 1 freq
drowth - 1 freq
DARTH
Time to execute Levenshtein function - 0.229641 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.336304 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027061 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036335 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000823 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.