A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Words similar to "peanut" in the corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated
Levenshtein
peanut (0) - 11 freq
peanuts (1) - 10 freq
peyn't (2) - 6 freq
meenut (2) - 8 freq
pennit (2) - 1 freq
paut (2) - 1 freq
planet (2) - 210 freq
peint (2) - 5 freq
pedant (2) - 3 freq
granut (2) - 1 freq
beaut (2) - 5 freq
panet (2) - 1 freq
mean't (2) - 4 freq
weant (2) - 1 freq
pent (2) - 39 freq
peynt (2) - 1 freq
plankt (2) - 1 freq
peart (2) - 1 freq
leant (2) - 34 freq
peakit (2) - 1 freq
meant (2) - 454 freq
penud (2) - 1 freq
peent (2) - 1 freq
speant (2) - 1 freq
peans (2) - 1 freq
Double Levenshtein
peanut (0) - 11 freq
peynt (2) - 1 freq
pent (2) - 39 freq
pant (2) - 2 freq
peent (2) - 1 freq
peint (2) - 5 freq
panet (2) - 1 freq
peanuts (2) - 10 freq
pynt (3) - 276 freq
openit (3) - 4 freq
poynt (3) - 1 freq
pint (3) - 186 freq
peat (3) - 59 freq
panto (3) - 5 freq
plant (3) - 82 freq
pynit (3) - 3 freq
punt (3) - 9 freq
point (3) - 393 freq
apent (3) - 29 freq
panty (3) - 1 freq
pinot (3) - 4 freq
opent (3) - 85 freq
paint (3) - 60 freq
peans (3) - 1 freq
planet (3) - 210 freq
SoundEx code - P530
pynt - 276 freq
paint - 60 freq
point - 393 freq
pint - 186 freq
peened - 10 freq
pent - 39 freq
pund - 37 freq
pend - 3 freq
pyntie - 1 freq
pound - 127 freq
punt - 9 freq
phont - 3 freq
pyned - 1 freq
panned - 6 freq
penned - 14 freq
phoned - 73 freq
peend - 4 freq
pond - 50 freq
pownte - 1 freq
pinned - 22 freq
pynit - 3 freq
poond - 13 freq
peyn't - 6 freq
peynt - 1 freq
peint - 5 freq
pawned - 2 freq
peanut - 11 freq
punto - 4 freq
pointy - 6 freq
pmt - 3 freq
puntee - 1 freq
panda - 1 freq
pennit - 1 freq
pin't - 1 freq
phone't - 1 freq
pained - 1 freq
pant - 2 freq
pinot - 4 freq
pine-widd - 2 freq
pinnet - 1 freq
poynt - 1 freq
panto - 5 freq
pened - 1 freq
piynt - 2 freq
pinnied - 1 freq
pin-heid - 1 freq
pennied - 1 freq
punt' - 1 freq
powneed - 1 freq
pynty - 1 freq
pawnd - 1 freq
panet - 1 freq
-pund - 1 freq
peent - 1 freq
pand - 3 freq
pinewood - 1 freq
pandyyyy - 2 freq
panty - 1 freq
pnd - 1 freq
penud - 1 freq
punnet - 1 freq
pmd - 1 freq
MetaPhone code - PNT
pynt - 276 freq
paint - 60 freq
point - 393 freq
pint - 186 freq
peened - 10 freq
pent - 39 freq
pund - 37 freq
pend - 3 freq
pyntie - 1 freq
pound - 127 freq
punt - 9 freq
pyned - 1 freq
panned - 6 freq
penned - 14 freq
peend - 4 freq
pond - 50 freq
pownte - 1 freq
pinned - 22 freq
pynit - 3 freq
poond - 13 freq
peyn't - 6 freq
peynt - 1 freq
peint - 5 freq
pawned - 2 freq
peanut - 11 freq
punto - 4 freq
pointy - 6 freq
puntee - 1 freq
panda - 1 freq
pennit - 1 freq
pin't - 1 freq
pained - 1 freq
pant - 2 freq
pinot - 4 freq
pinnet - 1 freq
poynt - 1 freq
panto - 5 freq
pened - 1 freq
piynt - 2 freq
pinnied - 1 freq
pennied - 1 freq
punt' - 1 freq
powneed - 1 freq
pynty - 1 freq
pawnd - 1 freq
panet - 1 freq
-pund - 1 freq
peent - 1 freq
pand - 3 freq
pandyyyy - 2 freq
panty - 1 freq
penud - 1 freq
punnet - 1 freq
Manually curated - PEANUT
Time to execute Levenshtein function - 0.606323 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another. It is useful for detecting typos and alternative spellings.
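The definition above can be sketched in a few lines of Python. This is a minimal illustration of the standard dynamic-programming approach, not the code this site runs; the function name `levenshtein` is chosen here for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Count the single-character replacements, insertions and deletions
    needed to turn word a into word b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters of a into the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ["peanut", "peanuts", "planet", "meant"]:
        print(word, levenshtein("peanut", word))
```

Run against the query word, this gives the same distances as the listing for peanuts (1), planet (2) and meant (2).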
Time to execute Double Levenshtein function - 1.110450 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the words as written and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
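A rough sketch of that idea in Python follows. The names `double_levenshtein` and `strip_vowels` are made up for this example. The vowel set, which includes y, is an assumption, chosen because it gives the same scores as the listing (for instance pynt at 3).

```python
VOWELS = set("aeiouy")  # assumption: y is treated as a vowel


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance: replacements, insertions and deletions."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every vowel, leaving the consonant skeleton of the word."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ["peanuts", "pent", "pynt", "planet"]:
        print(word, double_levenshtein("peanut", word))
```

Because a consonant change shows up in both passes and a vowel change only in the first, pent stays at 2 while planet moves from 2 to 3.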
Time to execute SoundEx function - 0.094856 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
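For illustration, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, code the following consonants as digits, collapse repeats, and pad to four characters. It is an assumption that the site's own function behaves the same way in every edge case; skipping non-letters such as hyphens and apostrophes is a choice made for this example.

```python
# Digit assigned to each consonant group; vowels, h, w and y get no digit.
GROUPS = {"1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r"}
CODES = {ch: digit for digit, letters in GROUPS.items() for ch in letters}


def soundex(word: str) -> str:
    """Return a four-character Soundex code, or "" if the word has no letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    last = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same digit
        digit = CODES.get(ch, "")
        if digit and digit != last:
            code += digit
        last = digit  # a vowel resets this, so a repeated digit can follow
        if len(code) == 4:
            break
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ["peanut", "pynt", "pound", "pine-widd"]:
        print(word, soundex(word))
```

Under these rules peanut, pynt, pound and pine-widd all come out as P530, which is why they share one bucket in the listing.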
Time to execute MetaPhone function - 0.099676 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000992 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
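A lookup of this kind can be as simple as a dictionary. The sketch below is hypothetical: the name `lookup_lemma` and every table entry other than peanut are invented for illustration and are not taken from the site's lexicon.

```python
from typing import Optional

# Hypothetical hand-made lexicon mapping word forms to lemmas.
# Only the peanut entry mirrors the result shown above; the rest are made up.
LEXICON = {
    "peanut": "PEANUT",
    "peanuts": "PEANUT",   # illustrative plural form
    "peenut": "PEANUT",    # illustrative spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("Peanut"))
    print(lookup_lemma("sonsie"))  # not in this toy table, so None
```

A single dictionary access needs no string comparison against the rest of the corpus, which fits with this method having the shortest execution time reported above.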