A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to dursley in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
dursley (0) - 48 freq
dursleys (1) - 5 freq
purpley (2) - 1 freq
worsley (2) - 1 freq
darnley (2) - 59 freq
dunskey (2) - 5 freq
dusney (2) - 1 freq
daisley (2) - 2 freq
surley (2) - 3 freq
dudley (2) - 11 freq
hurley (2) - 2 freq
parsley (2) - 8 freq
darkley (2) - 7 freq
burley (2) - 1 freq
murmled (3) - 5 freq
birsle (3) - 1 freq
curses (3) - 8 freq
dusted (3) - 6 freq
purses (3) - 1 freq
hurly (3) - 4 freq
burgled (3) - 2 freq
pulley (3) - 9 freq
disney (3) - 19 freq
hurple (3) - 1 freq
firsled (3) - 1 freq
dursley (0) - 48 freq
dursleys (2) - 5 freq
parsley (3) - 8 freq
darkley (3) - 7 freq
daisley (3) - 2 freq
darnley (3) - 59 freq
worsley (3) - 1 freq
darkle (4) - 1 freq
darkly (4) - 3 freq
wursel (4) - 2 freq
durst (4) - 1 freq
girsle (4) - 2 freq
drily (4) - 1 freq
hursel (4) - 16 freq
dryly (4) - 1 freq
warsle (4) - 24 freq
derkly (4) - 1 freq
daurkly (4) - 1 freq
'ursel (4) - 1 freq
deesle (4) - 3 freq
deisle (4) - 1 freq
dorsal (4) - 1 freq
dourly (4) - 16 freq
dures (4) - 3 freq
dorset (4) - 2 freq
SoundEx code - D624
draiglins - 1 freq
droukelt - 2 freq
daurklins - 3 freq
derklins - 2 freq
dreglins - 1 freq
dreezle - 1 freq
draiglt - 1 freq
drizzle - 13 freq
daurklin - 2 freq
darklins - 1 freq
darkly - 3 freq
dracula - 4 freq
darkle - 1 freq
drochle - 5 freq
darjeeling - 2 freq
dark-sweelin - 1 freq
dreechle's - 1 freq
dairkly - 1 freq
darkley - 7 freq
direckly - 9 freq
dreichly - 3 freq
dreggled - 2 freq
draggelt - 2 freq
dreezlin - 1 freq
drochles - 1 freq
drookled - 3 freq
dry-cleaned - 1 freq
dorsal - 1 freq
direk-lik - 1 freq
dreich-lookin - 1 freq
dreichly-dressed - 1 freq
drug-lairds - 1 freq
derkly - 1 freq
draigelt - 2 freq
draigless - 1 freq
drizzil - 1 freq
draigglin - 1 freq
draiggled - 2 freq
draiglety - 2 freq
draggled - 1 freq
drookle - 2 freq
drookleen - 1 freq
drooklin - 1 freq
drochlin - 2 freq
draigled - 1 freq
drochlinest - 1 freq
draiglty - 1 freq
derklie - 1 freq
derklin - 1 freq
daurkly - 1 freq
direcklie - 1 freq
dreezil - 2 freq
draigle - 1 freq
darklin - 2 freq
dursley - 48 freq
dursleys - 5 freq
drizzled - 2 freq
draiglin - 1 freq
drowsily - 1 freq
MetaPhone code - TRSL
tressell - 2 freq
dreezle - 1 freq
drizzle - 13 freq
t'ersel - 7 freq
trees'll - 1 freq
dorsal - 1 freq
drizzil - 1 freq
dreezil - 2 freq
dursley - 48 freq
drowsily - 1 freq
traicily - 2 freq
DURSLEY
Time to execute Levenshtein function - 0.178456 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.461514 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.065073 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.052524 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001214 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.