A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint


Similar words to parsley in Corpus

Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein (distance in brackets)
parsley (0) - 8 freq
paisley (1) - 61 freq
patsley (1) - 1 freq
partly (2) - 10 freq
barnsley (2) - 1 freq
warsled (2) - 13 freq
eardley (2) - 1 freq
parly (2) - 2 freq
presley (2) - 1 freq
harley (2) - 7 freq
dursley (2) - 48 freq
partey (2) - 3 freq
sparkley (2) - 2 freq
parle (2) - 1 freq
bareley (2) - 1 freq
darnley (2) - 59 freq
paroled (2) - 1 freq
marley (2) - 4 freq
paisly (2) - 1 freq
darkley (2) - 7 freq
parlez (2) - 1 freq
parse (2) - 1 freq
arseley (2) - 1 freq
warsles (2) - 2 freq
warslet (2) - 6 freq
Double Levenshtein (distance in brackets)
parsley (0) - 8 freq
presley (2) - 1 freq
paisley (2) - 61 freq
patsley (2) - 1 freq
parse (3) - 1 freq
arseley (3) - 1 freq
presly (3) - 2 freq
purpley (3) - 1 freq
warsle (3) - 24 freq
parle (3) - 1 freq
worsley (3) - 1 freq
paisly (3) - 1 freq
parly (3) - 2 freq
dursley (3) - 48 freq
partly (3) - 10 freq
pars (4) - 9 freq
purely (4) - 16 freq
pursuer (4) - 2 freq
girsle (4) - 2 freq
parasol (4) - 3 freq
pursue (4) - 9 freq
persil (4) - 1 freq
parlie (4) - 7 freq
persew (4) - 1 freq
crosley (4) - 1 freq
SoundEx code - P624
parcel - 45 freq
parsley - 8 freq
paircel - 11 freq
paircels - 5 freq
parasols - 2 freq
proselytise - 1 freq
proclaim - 5 freq
proclaimed - 17 freq
proclamation - 9 freq
park'll - 1 freq
parochial - 11 freq
porcelain - 10 freq
proclaimit - 3 freq
pricklt - 1 freq
preclude - 2 freq
percolated - 1 freq
paircel' - 1 freq
prickles - 2 freq
priggle - 1 freq
parasol - 3 freq
prickly - 2 freq
priceless - 30 freq
proclaimin - 3 freq
proclivities - 1 freq
parucular - 1 freq
pirckles - 1 freq
porshalin - 1 freq
pre-scuil - 1 freq
proclamatioun - 2 freq
proclaimers - 5 freq
parcels - 4 freq
parochialism - 2 freq
paraglidin - 2 freq
preclare - 2 freq
pre-schuil - 4 freq
persil - 1 freq
preschuil - 4 freq
parklans - 1 freq
proclemis - 1 freq
precludes - 1 freq
proselytisin - 1 freq
proclaimer's - 1 freq
proclaims - 4 freq
presly - 2 freq
percolatin - 1 freq
prickle - 1 freq
pairkhill - 4 freq
proclamations - 2 freq
‘parcel - 1 freq
parcelsaweek - 1 freq
presley - 1 freq
pricklygerry - 1 freq
parsleyjane - 1 freq
peerieselkie - 1 freq
MetaPhone code - PRSL
parcel - 45 freq
parsley - 8 freq
paircel - 11 freq
paircel' - 1 freq
parasol - 3 freq
persil - 1 freq
presly - 2 freq
‘parcel - 1 freq
presley - 1 freq
Manually curated - PARSLEY
Time to execute Levenshtein function - 0.205256 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It's useful for detecting typos and alternative spellings.
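As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code behind this page; the function names `levenshtein` and `nearest` are made up for the example, and the word list is a small sample taken from the results above.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter word in the inner loop
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


def nearest(word: str, vocabulary: list[str], limit: int = 5) -> list[str]:
    """Hypothetical helper: rank vocabulary by distance, then alphabetically."""
    return sorted(vocabulary, key=lambda w: (levenshtein(word, w), w))[:limit]


if __name__ == "__main__":
    sample = ["paisley", "patsley", "partly", "presley", "harley"]
    for candidate in nearest("parsley", sample):
        print(candidate, levenshtein("parsley", candidate))
```

Only two rows of the table are kept in memory at a time, which is enough when just the distance (and not the edit sequence) is wanted.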
Time to execute Double Levenshtein function - 0.364955 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
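The idea can be sketched in Python as follows. This is a guess at the approach rather than the site's own code. In particular, the vowel set is an assumption: treating `y` as a vowel alongside `a, e, i, o, u` reproduces the scores listed above for the examples checked here (paisley 2, presley 2, parse 3, pars 4). The names `strip_vowels` and `double_levenshtein` are made up for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), two-row version."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: which letters count as vowels is not stated on the page.
VOWELS = set("aeiouy")


def strip_vowels(word: str) -> str:
    """Drop vowels so that only the consonant skeleton remains."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for other in ["paisley", "presley", "parse", "pars"]:
        print(other, double_levenshtein("parsley", other))
```

Because a consonant change shows up in both passes while a vowel change shows up only in the first, spelling variants that differ mainly in their vowels (presley, for instance) move up the ranking.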
Time to execute SoundEx function - 0.027344 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
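To show how a code such as P624 comes about, here is a short Python sketch of the classic American Soundex rules: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent letters with the same digit, and pad to four characters. It is an illustration only; the function name `soundex` and the handling of non-letters (they are simply ignored) are choices made for this example, and the page's own implementation may differ in edge cases.

```python
# Digit groups used by classic Soundex.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or "" if there are no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    result = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate letters that share a digit
        code = CODES.get(ch, "")
        if code and code != previous:
            result += code
            if len(result) == 4:
                break
        previous = code  # a vowel resets this, so a repeated digit can recur
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ["parsley", "parcel", "proclaim", "persil"]:
        print(word, soundex(word))
```

All four sample words share the consonant pattern r, s/c, l after the initial P, which is why they land in the same P624 group.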
Time to execute MetaPhone function - 0.037569 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000891 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
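A lookup of this kind can be sketched in Python as a plain dictionary. The table below is illustrative only: the parsley entry mirrors the result shown on this page, while the other entries, the name `LEXICON` and the function `lemma_for` are hypothetical and not taken from the real lexicon.

```python
from typing import Optional

# Hand-built table mapping spelling variants to a lemma.
# Only the "parsley" entry reflects this page; the rest are hypothetical.
LEXICON = {
    "parsley": "PARSLEY",
    "parcel": "PARCEL",    # hypothetical entry
    "paircel": "PARCEL",   # hypothetical entry
    "paircels": "PARCEL",  # hypothetical entry
}


def lemma_for(word: str) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    for word in ["parsley", "paircels", "ahint"]:
        print(word, lemma_for(word))
```

A dictionary lookup does no computation on the word itself, which fits the very short execution time reported above, at the cost of returning nothing for words nobody has added.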