A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dorothy in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dorothy (0) - 45 freq
frothy (2) - 6 freq
droothy (2) - 4 freq
toothy (2) - 4 freq
drouthy (2) - 17 freq
€œdorothy (2) - 1 freq
dorty (2) - 7 freq
dorothy's (2) - 3 freq
worthy (2) - 21 freq
roothy (2) - 1 freq
drachy (3) - 1 freq
dort (3) - 2 freq
bothy (3) - 49 freq
dooted (3) - 4 freq
dbooth (3) - 1 freq
north (3) - 402 freq
dootit (3) - 2 freq
sooth (3) - 335 freq
drooths (3) - 1 freq
doot (3) - 573 freq
ooty (3) - 21 freq
doony (3) - 2 freq
mooths (3) - 76 freq
troth (3) - 18 freq
earthy (3) - 1 freq
dorothy (0) - 45 freq
drouthy (2) - 17 freq
droothy (2) - 4 freq
roothy (3) - 1 freq
drooth (3) - 20 freq
drouth (3) - 49 freq
derth (3) - 2 freq
worthy (3) - 21 freq
darth (3) - 3 freq
dorty (3) - 7 freq
frothy (3) - 6 freq
dortan (4) - 1 freq
dorito (4) - 1 freq
dirty (4) - 107 freq
wirthy (4) - 7 freq
toothy (4) - 4 freq
furthy (4) - 1 freq
roth (4) - 2 freq
dairth (4) - 1 freq
dorts (4) - 8 freq
froth (4) - 6 freq
ruthy (4) - 2 freq
druith (4) - 1 freq
forth (4) - 82 freq
drushy (4) - 1 freq
SoundEx code - D630
drouthy - 17 freq
dirt - 70 freq
dirty - 107 freq
driet - 2 freq
dreid - 51 freq
draaed - 1 freq
dried - 74 freq
durty - 11 freq
dairt - 6 freq
drouth - 49 freq
dorty - 7 freq
dread - 34 freq
doort - 1 freq
drooth - 20 freq
door-ti - 1 freq
daured - 24 freq
draw'd - 1 freq
dirt' - 1 freq
dorothy - 45 freq
dared - 11 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
dird - 2 freq
dreed - 13 freq
dryed - 4 freq
dirtie - 1 freq
droid - 1 freq
dart - 16 freq
dorito - 1 freq
drawed - 1 freq
drite - 1 freq
daurt - 6 freq
draa'd - 3 freq
durt - 2 freq
drat - 2 freq
dry't - 2 freq
daurd - 1 freq
droothy - 4 freq
druith - 1 freq
dorty-wye - 1 freq
daared - 2 freq
durtie - 2 freq
dort - 2 freq
derth - 2 freq
drowt - 1 freq
dredd - 6 freq
draed - 1 freq
darth - 3 freq
derrida - 1 freq
derd - 1 freq
dortie - 3 freq
dheireadh - 1 freq
druid - 2 freq
'dreid' - 1 freq
daar't - 1 freq
€˜dread - 1 freq
dairth - 1 freq
dorado - 1 freq
dryte - 1 freq
€œdorothy - 1 freq
dee-haird - 1 freq
dae-or-dee - 1 freq
€˜dirty - 1 freq
doherty - 1 freq
droit - 1 freq
drowth - 1 freq
MetaPhone code - TR0
drouthy - 17 freq
truth - 294 freq
trowth - 26 freq
troth - 18 freq
drouth - 49 freq
truith - 62 freq
drooth - 20 freq
dorothy - 45 freq
drouthie - 12 freq
droothie - 1 freq
dearth - 9 freq
trooth - 2 freq
droothy - 4 freq
druith - 1 freq
'truth' - 1 freq
derth - 2 freq
trewth - 8 freq
darth - 3 freq
dairth - 1 freq
€˜truth - 1 freq
€œdorothy - 1 freq
treuth - 6 freq
trith - 1 freq
drowth - 1 freq
DOROTHY
Time to execute Levenshtein function - 0.226267 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.386214 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.030771 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.036707 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000960 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.