A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to preclare in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein (distance in parentheses)
preclare (0) - 2 freq
rteclare (2) - 1 freq
preclude (2) - 2 freq
prepare (2) - 24 freq
declare (2) - 25 freq
peckage (3) - 2 freq
preparet (3) - 4 freq
recharge (3) - 1 freq
pectae (3) - 1 freq
eclaire (3) - 1 freq
re-clart (3) - 1 freq
recline (3) - 2 freq
prepared (3) - 44 freq
reglar (3) - 39 freq
procuire (3) - 1 freq
pre-war (3) - 2 freq
pressure (3) - 64 freq
pedlar (3) - 3 freq
prelates (3) - 2 freq
declares (3) - 5 freq
pre-made (3) - 1 freq
'preciate (3) - 1 freq
precludes (3) - 1 freq
pedlars (3) - 1 freq
prepart (3) - 1 freq
Double Levenshtein (combined distance in parentheses)
preclare (0) - 2 freq
preclude (3) - 2 freq
procuire (4) - 1 freq
procure (4) - 1 freq
parucular (4) - 1 freq
proclaim (4) - 5 freq
prepare (4) - 24 freq
declare (4) - 25 freq
rteclare (4) - 1 freq
preceese (5) - 5 freq
precise (5) - 14 freq
clare (5) - 12 freq
probulary (5) - 2 freq
reclee (5) - 1 freq
preshure (5) - 1 freq
reclaim (5) - 7 freq
prelude (5) - 3 freq
preemary (5) - 1 freq
pruchry (5) - 1 freq
marieclaire (5) - 1 freq
peculiar (5) - 7 freq
prepaar (5) - 1 freq
parteiclar (5) - 8 freq
declaire (5) - 1 freq
reglar (5) - 39 freq
SoundEx code - P624
parcel - 45 freq
parsley - 8 freq
paircel - 11 freq
paircels - 5 freq
parasols - 2 freq
proselytise - 1 freq
proclaim - 5 freq
proclaimed - 17 freq
proclamation - 9 freq
park'll - 1 freq
parochial - 11 freq
porcelain - 10 freq
proclaimit - 3 freq
pricklt - 1 freq
preclude - 2 freq
percolated - 1 freq
paircel' - 1 freq
prickles - 2 freq
priggle - 1 freq
parasol - 3 freq
prickly - 2 freq
priceless - 30 freq
proclaimin - 3 freq
proclivities - 1 freq
parucular - 1 freq
pirckles - 1 freq
porshalin - 1 freq
pre-scuil - 1 freq
proclamatioun - 2 freq
proclaimers - 5 freq
parcels - 4 freq
parochialism - 2 freq
paraglidin - 2 freq
preclare - 2 freq
pre-schuil - 4 freq
persil - 1 freq
preschuil - 4 freq
parklans - 1 freq
proclemis - 1 freq
precludes - 1 freq
proselytisin - 1 freq
proclaimer's - 1 freq
proclaims - 4 freq
presly - 2 freq
percolatin - 1 freq
prickle - 1 freq
pairkhill - 4 freq
proclamations - 2 freq
‘parcel - 1 freq
parcelsaweek - 1 freq
presley - 1 freq
pricklygerry - 1 freq
parsleyjane - 1 freq
peerieselkie - 1 freq
MetaPhone code - PRKLR
parucular - 1 freq
preclare - 2 freq
Manually curated - PRECLARE
Time to execute Levenshtein function - 0.207506 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
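As a rough illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the site's own code; the function name `levenshtein` is chosen for this example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character edits needed to turn a into b."""
    # previous[j] = distance between the prefix of a seen so far and b[:j]
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into the empty string takes i deletions
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # replace ch_a with ch_b (or keep it)
            ))
        previous = current
    return previous[-1]


for word in ("preclare", "preclude", "prepare", "declare"):
    print(word, levenshtein("preclare", word))
```

The distances printed for these four words agree with the figures in the Levenshtein list for preclare.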
Time to execute Double Levenshtein function - 0.375531 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with their vowels removed, and adds the two distances together, giving double weight to consonants.
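The description above could be sketched in Python as follows. This is an illustration under stated assumptions, not the site's implementation: the names `strip_vowels` and `double_levenshtein` are invented here, and treating y as a vowel alongside a, e, i, o, u is a guess, chosen because it reproduces scores in the list such as probulary (5).

```python
def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, replace each cost 1)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


# Assumption: y counts as a vowel; the source does not say which letters are dropped.
VOWEL_TABLE = str.maketrans("", "", "aeiouyAEIOUY")


def strip_vowels(word: str) -> str:
    """Remove the assumed vowel letters, leaving the consonant skeleton."""
    return word.translate(VOWEL_TABLE)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on their consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("preclude", "declare", "reglar", "probulary"):
    print(word, double_levenshtein("preclare", word))
```

Because consonant changes are counted in both passes and vowel changes in only one, a consonant substitution costs 2 while a vowel substitution costs 1.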
Time to execute SoundEx function - 0.027873 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
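To make the encoding concrete, here is a minimal Python sketch of the commonly described American Soundex rules: keep the first letter, map the remaining consonants to digits, collapse adjacent repeats, and pad or truncate to four characters. Whether the site follows exactly these rules is an assumption; the function name `soundex` and the choice to ignore characters outside a to z are for this example only.

```python
# Digit classes used by American Soundex; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or '' if the word has no letters."""
    # Assumption: apostrophes, hyphens and other non a-z characters are ignored.
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
        if ch not in "hw":  # h and w do not separate letters with the same digit
            previous = digit
    return (code + "000")[:4]


for word in ("preclare", "parcel", "proclaim", "porcelain"):
    print(word, soundex(word))
```

All four sample words come from the SoundEx list for preclare and reduce to the same code, P624, which is why they appear as matches despite their different spellings.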
Time to execute MetaPhone function - 0.037562 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000930 milliseconds
Manual Curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
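A lookup of this kind can be sketched as a plain dictionary. The entries below are hypothetical and exist only for illustration, since the real lexicon is not shown here; the names `LEXICON` and `lookup_lemma` are likewise assumptions.

```python
from typing import Optional

# Hypothetical entries for illustration; the real hand-built lexicon is not shown.
LEXICON = {
    "preclare": "PRECLARE",
    "preclair": "PRECLARE",  # invented spelling variant pointing at the same lemma
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("preclare"))
print(lookup_lemma("sonsie"))
```

A dictionary lookup takes constant time on average, which fits this method having the shortest execution time of the five, and returning None makes the "not all words are covered" case explicit.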