A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to pentecost in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in parentheses)
pentecost (0) - 2 freq
pent-pot (3) - 1 freq
pertectit (3) - 1 freq
penters (3) - 2 freq
petticot (3) - 1 freq
perteckit (3) - 1 freq
penthoose (3) - 2 freq
fentest (3) - 1 freq
petticoat (3) - 3 freq
penter's (3) - 5 freq
peugeot (4) - 2 freq
peter's (4) - 4 freq
interest (4) - 190 freq
opencast (4) - 1 freq
perfect (4) - 181 freq
pentit (4) - 44 freq
pinters (4) - 1 freq
bestest (4) - 5 freq
fenceposts (4) - 1 freq
penneys (4) - 1 freq
mentalest (4) - 1 freq
bentos (4) - 1 freq
petticoats (4) - 3 freq
eftercast (4) - 5 freq
pelters (4) - 9 freq
Double Levenshtein (distance in parentheses)
pentecost (0) - 2 freq
fentest (5) - 1 freq
petticoat (5) - 3 freq
penthoose (5) - 2 freq
pantheist (5) - 4 freq
opencast (5) - 1 freq
perteckit (5) - 1 freq
pertectit (5) - 1 freq
pent-pot (5) - 1 freq
petticot (5) - 1 freq
penters (5) - 2 freq
pynters (6) - 1 freq
punters (6) - 25 freq
pinechest (6) - 1 freq
pences (6) - 3 freq
protect (6) - 39 freq
oncost (6) - 1 freq
entrust (6) - 1 freq
protectit (6) - 8 freq
lanercost (6) - 1 freq
neatest (6) - 1 freq
prentices (6) - 1 freq
contest (6) - 16 freq
pantet (6) - 1 freq
pentins (6) - 13 freq
SoundEx code - P532
pents - 5 freq
phonetic - 11 freq
poonds - 9 freq
points - 79 freq
pints - 80 freq
pants - 34 freq
phantasmagoria - 1 freq
paint-covered - 1 freq
pint-cans - 1 freq
pontiac - 1 freq
pounds - 28 freq
pynts - 79 freq
punts - 5 freq
pownd's - 1 freq
pynt's - 1 freq
peanuts - 11 freq
pandj - 2 freq
pantheism - 2 freq
ponds - 5 freq
painites - 1 freq
punds - 9 freq
pends - 2 freq
pontius - 2 freq
pondicherry - 3 freq
'pants - 1 freq
penthoose - 2 freq
pendicles - 1 freq
poynts - 2 freq
pentecost - 2 freq
pantheist - 4 freq
phantasies - 2 freq
phantasie - 2 freq
phonetically - 5 freq
paints - 2 freq
pound's - 1 freq
pundis - 2 freq
'phonetic' - 1 freq
'phantasmagoria' - 1 freq
‘phantasmagoria - 1 freq
phonetics - 1 freq
pendice - 2 freq
‘phonetic - 1 freq
phoneticisms - 1 freq
peendgin - 1 freq
paint-slaigert - 1 freq
panties - 2 freq
pint-stowp - 1 freq
pentagram - 3 freq
pendicle - 5 freq
pandjsport - 1 freq
pounds'll - 1 freq
pmwatsonobe - 2 freq
phoneticism - 1 freq
MetaPhone code - PNTKST
pentecost - 2 freq
Manually curated lemma - PENTECOST
Time to execute Levenshtein function - 0.210111 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
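As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs; the function name `levenshtein` is chosen for the example only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # previous[j] holds the distance between the processed prefix of a
    # and the first j characters of b.
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # turning i characters into "" takes i deletions
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,               # delete ca
                current[j - 1] + 1,            # insert cb
                previous[j - 1] + (ca != cb),  # replace, or keep if equal
            ))
        previous = current
    return previous[-1]


print(levenshtein("pentecost", "petticoat"))  # 3, matching the listing above
```

Only two rows of the table are kept at a time, so memory use grows with the length of the second word rather than with the product of both lengths.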
Time to execute Double Levenshtein function - 0.410358 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
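The idea can be sketched in Python as follows. This is a guess at the mechanics, not the site's own code: the names `strip_vowels` and `double_levenshtein` are made up for the example, and the vowel set `aeiou` is an assumption (the site may treat other letters as vowels too).

```python
VOWELS = set("aeiou")  # assumed vowel set; the site's own list is not stated


def levenshtein(a: str, b: str) -> int:
    """Standard edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,
                current[j - 1] + 1,
                previous[j - 1] + (ca != cb),
            ))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character in order."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


print(double_levenshtein("pentecost", "petticoat"))  # 3 + 2 = 5
```

Under these assumptions the result for petticoat agrees with the listing above: 3 on the full words plus 2 on `pntcst` versus `pttct`.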
Time to execute SoundEx function - 0.027475 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
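A minimal Python sketch of the common (American) Soundex rules is given below for illustration. It is an assumption that the site follows these rules; the function name `soundex` and the handling of non-letters (they are ignored) are choices made for this example.

```python
# Letter groups that share a Soundex digit; vowels, h, w and y have no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the first letter plus three digits, padded with zeros."""
    letters = [ch for ch in word.lower() if ch.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        if ch not in "hw":  # h and w do not separate repeated codes
            prev = code
        if len(result) == 4:
            break
    return result.ljust(4, "0")


print(soundex("pentecost"))  # P532
```

Because the code is cut off after three digits, short words such as pints and long ones such as pentecost can land in the same group, which explains the breadth of the SoundEx listing above.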
Time to execute MetaPhone function - 0.040327 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000922 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
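A lookup of this kind can be sketched in Python as a plain dictionary. The structure and the names `LEXICON` and `lookup_lemma` are hypothetical; the single entry is the one shown in the results above, and a real lexicon would hold many more.

```python
from typing import Optional

# Hypothetical hand-built lexicon: word form -> lemma.
# Only the pentecost entry is taken from the results on this page.
LEXICON = {
    "pentecost": "PENTECOST",
}


def lookup_lemma(word: str, lexicon: dict = LEXICON) -> Optional[str]:
    """Return the curated lemma, or None when the word is not covered."""
    return lexicon.get(word.lower())


print(lookup_lemma("pentecost"))  # PENTECOST
print(lookup_lemma("ablow"))      # None, because it is not in this toy lexicon
```

A dictionary lookup needs no distance calculation, which fits with the manually curated method having the shortest execution time of the five.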