A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie

Similar words to pents in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
Levenshtein
pents (0) - 5 freq
prents (1) - 10 freq
lents (1) - 1 freq
pests (1) - 2 freq
peats (1) - 49 freq
tents (1) - 22 freq
pets (1) - 12 freq
pynts (1) - 78 freq
rents (1) - 11 freq
pens (1) - 30 freq
pelts (1) - 3 freq
cents (1) - 3 freq
gents (1) - 7 freq
ments (1) - 1 freq
pent (1) - 39 freq
penis (1) - 1 freq
ents (1) - 1 freq
vents (1) - 1 freq
perts (1) - 6 freq
punts (1) - 5 freq
pends (1) - 2 freq
pints (1) - 78 freq
pants (1) - 32 freq
bents (1) - 11 freq
dunts (2) - 22 freq
Double Levenshtein
pents (0) - 5 freq
pynts (1) - 78 freq
punts (1) - 5 freq
pints (1) - 78 freq
pants (1) - 32 freq
perts (2) - 6 freq
vents (2) - 1 freq
ents (2) - 1 freq
points (2) - 74 freq
pends (2) - 2 freq
penis (2) - 1 freq
paints (2) - 2 freq
peanuts (2) - 10 freq
poynts (2) - 2 freq
bents (2) - 11 freq
tents (2) - 22 freq
prents (2) - 10 freq
pests (2) - 2 freq
pent (2) - 39 freq
peats (2) - 49 freq
pets (2) - 12 freq
lents (2) - 1 freq
ments (2) - 1 freq
rents (2) - 11 freq
cents (2) - 3 freq
SoundEx code - P532
pents - 5 freq
phonetic - 11 freq
poonds - 9 freq
points - 74 freq
pints - 78 freq
pants - 32 freq
phantasmagoria - 1 freq
paint-covered - 1 freq
pint-cans - 1 freq
pontiac - 1 freq
pounds - 28 freq
pynts - 78 freq
punts - 5 freq
pownd's - 1 freq
pynt's - 1 freq
peanuts - 10 freq
pantheism - 2 freq
ponds - 5 freq
painites - 1 freq
punds - 9 freq
pends - 2 freq
pontius - 2 freq
pondicherry - 3 freq
pandj - 1 freq
'pants - 1 freq
penthoose - 2 freq
pendicles - 1 freq
poynts - 2 freq
pentecost - 2 freq
pantheist - 4 freq
phantasies - 2 freq
phantasie - 2 freq
phonetically - 5 freq
paints - 2 freq
pound's - 1 freq
pundis - 2 freq
'phonetic' - 1 freq
'phantasmagoria' - 1 freq
‘phantasmagoria - 1 freq
phonetics - 1 freq
pendice - 2 freq
‘phonetic - 1 freq
phoneticisms - 1 freq
peendgin - 1 freq
paint-slaigert - 1 freq
panties - 2 freq
pint-stowp - 1 freq
pentagram - 3 freq
pendicle - 5 freq
pandjsport - 1 freq
pounds'll - 1 freq
pmwatsonobe - 2 freq
phoneticism - 1 freq
MetaPhone code - PNTS
pents - 5 freq
poonds - 9 freq
points - 74 freq
pints - 78 freq
pants - 32 freq
pounds - 28 freq
pynts - 78 freq
punts - 5 freq
pownd's - 1 freq
pynt's - 1 freq
hypnotise - 1 freq
peanuts - 10 freq
ponds - 5 freq
painites - 1 freq
punds - 9 freq
pends - 2 freq
pontius - 2 freq
'pants - 1 freq
poynts - 2 freq
paints - 2 freq
pound's - 1 freq
pundis - 2 freq
pendice - 2 freq
panties - 2 freq
Manually curated - PENTS
Time to execute Levenshtein function - 0.185527 milliseconds
The Levenshtein distance is the minimum number of single-character replacements, insertions or deletions needed to transform one word into another. It is useful for detecting typos and alternative spellings.
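As a rough illustration of that definition, here is a minimal Python sketch of the usual dynamic-programming calculation. It is not the code this site runs, and the function name `levenshtein` is chosen here purely for illustration.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] is the distance between the empty prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    for word in ("pents", "prents", "pynts", "dunts"):
        print(word, levenshtein("pents", word))
```

Under this sketch, pents and prents are 1 edit apart and pents and dunts are 2 apart, which agrees with the scores in the listing.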
Time to execute Double Levenshtein function - 0.401008 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
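The combined score can be sketched in Python as follows. This is an illustration under assumptions, not the site's own code: the names `strip_vowels` and `double_levenshtein` are invented here, and counting y as a vowel is a guess inferred from the listing (pynts scores 1, which only works if the y is dropped). The site's actual vowel set is not stated.

```python
VOWELS = set("aeiouy")  # assumption: y counted as a vowel, inferred from the listing


def levenshtein(a: str, b: str) -> int:
    """Plain Levenshtein distance (row-by-row dynamic programming)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Drop every character in VOWELS, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("pynts", "points", "prents", "pent"):
        print(word, double_levenshtein("pents", word))
```

With these assumptions, pynts scores 1 (only the vowel differs), while prents and pent score 2, because a consonant change is counted in both passes. That matches the Double Levenshtein scores shown for those words.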
Time to execute SoundEx function - 0.031553 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
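For illustration, the common American Soundex rules can be sketched in Python as below: keep the first letter, map the remaining consonants to digits, skip vowels, collapse adjacent letters with the same digit, and pad or truncate to four characters. This is a sketch of the textbook variant; the function name `soundex` and the handling of non-letters (they are simply ignored) are assumptions made here, and the site's own implementation may differ in edge cases.

```python
# Digit groups of the standard American Soundex coding.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for letter in letters:
        CODES[letter] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if word has no a-z letters."""
    letters = [ch for ch in word.lower() if "a" <= ch <= "z"]
    if not letters:
        return ""
    code = letters[0].upper()
    previous = CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = CODES.get(ch, "")
        if digit and digit != previous:
            code += digit
            if len(code) == 4:
                break
        # h and w do not separate two letters that share a digit; vowels do.
        if ch not in "hw":
            previous = digit
    return code.ljust(4, "0")


if __name__ == "__main__":
    for word in ("pents", "points", "phonetic", "pounds"):
        print(word, soundex(word))
```

Under these rules pents, points, phonetic and pounds all come out as P532, which is why such different-looking words share one SoundEx list.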
Time to execute MetaPhone function - 0.040976 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000887 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
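A lookup of this kind can be sketched in Python as a plain dictionary. Everything in the table below is hypothetical: the names `LEXICON`, `curated_lemma` and `curated_variants` are invented for illustration, the pents entry is only a reading of the result shown on this page, and the other entries are made up to show the shape of the data, not taken from the real lexicon.

```python
from typing import List, Optional

# Hypothetical hand-made table: surface form -> lemma.
LEXICON = {
    "pents": "PENTS",  # assumed from the result shown on this page
    "pynts": "PINT",   # invented example entry
    "pints": "PINT",   # invented example entry
}


def curated_lemma(word: str) -> Optional[str]:
    """Return the lemma for word, or None when the word is not covered."""
    return LEXICON.get(word.strip().lower())


def curated_variants(word: str) -> List[str]:
    """Return every listed form that shares a lemma with word."""
    lemma = curated_lemma(word)
    if lemma is None:
        return []
    return sorted(form for form, target in LEXICON.items() if target == lemma)


if __name__ == "__main__":
    print(curated_lemma("pents"))
    print(curated_variants("pynts"))
    print(curated_lemma("sonsie"))
```

Because the lookup is a single dictionary access, it is much cheaper than the distance-based methods, but a word missing from the table simply returns nothing.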