A Corpus of 21st Century Scots Texts

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

Similar words to plum in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in parentheses)
plum (0) - 15 freq
prum (1) - 1 freq
lum (1) - 139 freq
ploum (1) - 2 freq
plug (1) - 28 freq
paum (1) - 2 freq
pum (1) - 3 freq
plume (1) - 7 freq
glum (1) - 10 freq
plums (1) - 7 freq
slum (1) - 5 freq
plu (1) - 2 freq
plump (1) - 16 freq
plul (1) - 1 freq
plumb (1) - 1 freq
plus (1) - 69 freq
clue (2) - 85 freq
pup (2) - 37 freq
pl (2) - 14 freq
klux (2) - 1 freq
pouk (2) - 5 freq
baum (2) - 2 freq
pleb (2) - 4 freq
clump (2) - 18 freq
prim (2) - 2 freq

Double Levenshtein (combined distance in parentheses)
plum (0) - 15 freq
plume (1) - 7 freq
ploum (1) - 2 freq
plumb (2) - 1 freq
plul (2) - 1 freq
ploom (2) - 11 freq
prum (2) - 1 freq
paulm (2) - 2 freq
palm (2) - 33 freq
plu (2) - 2 freq
plus (2) - 69 freq
plump (2) - 16 freq
lum (2) - 139 freq
plug (2) - 28 freq
slum (2) - 5 freq
paum (2) - 2 freq
plums (2) - 7 freq
glum (2) - 10 freq
pum (2) - 3 freq
plew (3) - 1 freq
glim (3) - 8 freq
upcum (3) - 2 freq
glam (3) - 2 freq
pul (3) - 6 freq
i'lum (3) - 1 freq
SoundEx code - P450
plume - 7 freq
plain - 174 freq
playin - 338 freq
pilin - 7 freq
pullin - 104 freq
peelin - 5 freq
plan - 231 freq
palm - 33 freq
plane - 68 freq
ploomy - 1 freq
pleyin - 12 freq
pulein - 2 freq
pailin - 8 freq
plooin - 17 freq
ploom - 11 freq
plouin - 13 freq
playan - 15 freq
paulm - 2 freq
'plane - 1 freq
playin' - 6 freq
playen - 2 freq
pullen - 1 freq
plaan - 1 freq
plum - 15 freq
pleuan - 1 freq
paulin' - 1 freq
palma - 2 freq
pauline - 36 freq
plen - 109 freq
pallin - 2 freq
paloma - 1 freq
palin - 4 freq
pey-line - 1 freq
plaen - 1 freq
'palm - 1 freq
pillowin - 1 freq
plene - 1 freq
plenn - 4 freq
polonie - 1 freq
plooan - 1 freq
pillion - 1 freq
pullan - 6 freq
pewlin - 1 freq
pollen - 3 freq
plaine - 1 freq
plein - 1 freq
pallion - 1 freq
playeen - 1 freq
pulin - 1 freq
palmie - 1 freq
pollin - 10 freq
plewin - 1 freq
‘plain - 5 freq
pùllin - 1 freq
plaein - 1 freq
poolin - 2 freq
plyin - 1 freq
plummy - 2 freq
‘playin - 1 freq
ploum - 2 freq
pawlin - 1 freq
plena - 2 freq
puhlin - 1 freq
pleen - 1 freq
pollença - 1 freq
plan' - 1 freq
“plain - 1 freq
pvlne - 1 freq
MetaPhone code - PLM
plume - 7 freq
palm - 33 freq
ploomy - 1 freq
ploom - 11 freq
paulm - 2 freq
plum - 15 freq
palma - 2 freq
paloma - 1 freq
'palm - 1 freq
palmie - 1 freq
plummy - 2 freq
ploum - 2 freq
plumb - 1 freq
Manually curated - PLUM
Time to execute Levenshtein function - 0.174986 milliseconds
The Levenshtein distance is the minimum number of single characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
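
As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming approach. It is not the code this site runs (that is not shown here), and the function name `levenshtein` is just a label chosen for the example.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits that turn a into b."""
    # prev[j] = distance between the prefix of a processed so far
    # and the first j characters of b.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning i characters into an empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # replace ca with cb, or keep a match
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    for word in ("plum", "plume", "lum", "clump"):
        print(word, levenshtein("plum", word))
```

With this sketch, plume and lum come out at distance 1 from plum and clump at distance 2, which agrees with the scores listed in the results.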
Time to execute Double Levenshtein function - 0.348298 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
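
A rough Python sketch of that idea follows. It assumes the vowels are a, e, i, o and u, which the page does not state, and the names `double_levenshtein` and `strip_vowels` are invented for the example. The site's own implementation may differ in detail.

```python
def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (replace, insert, delete), row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


VOWELS = set("aeiou")  # assumption: the page does not say which letters count


def strip_vowels(word: str) -> str:
    """Drop the assumed vowels, keeping every other character."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


if __name__ == "__main__":
    for word in ("plume", "plumb", "ploom", "plew"):
        print(word, double_levenshtein("plum", word))
```

Under these assumptions the sketch gives plume 1, plumb 2, ploom 2 and plew 3 against plum, matching the Double Levenshtein scores in the results.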
Time to execute SoundEx function - 0.027526 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
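
To show how such a code is built, here is a small Python sketch of the commonly described American Soundex rules: keep the first letter, map the following consonants to digits, skip vowels, collapse repeated digits, and pad to four characters. It is an illustration only. The site's own SoundEx function may treat edge cases (non-ASCII letters, punctuation) differently, and here they are simply dropped.

```python
# Digit groups used by American Soundex.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Four-character Soundex code, or '' if the word has no ASCII letters."""
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "hw":
            continue  # h and w do not separate consonants with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated digit is allowed after it
        if len(result) == 4:
            break
    return result.ljust(4, "0")


if __name__ == "__main__":
    for word in ("plum", "plain", "playin", "pollen"):
        print(word, soundex(word))
```

All four example words come out as P450, the code shown for plum in the results, which is why forms like plain and playin share its SoundEx list.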
Time to execute MetaPhone function - 0.036887 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000881 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas, including obvious typos and spelling variations. Not all words are covered.
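
In code, a lookup of this kind can be as simple as a dictionary. The Python sketch below is illustrative only: apart from plum mapping to PLUM, which the results show, the entries are hypothetical, and the names `LEXICON` and `lookup_lemma` are invented for the example.

```python
from typing import Optional

# Hypothetical entries for illustration; the real hand-built table is not shown.
LEXICON = {
    "plum": "PLUM",
    "plums": "PLUM",   # assumed inflected form
    "ploom": "PLUM",   # assumed spelling variant
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if the word is not covered."""
    return LEXICON.get(word.lower())


if __name__ == "__main__":
    print(lookup_lemma("plum"))   # a covered word
    print(lookup_lemma("xyzzy"))  # not in the table, so None
```

A single dictionary lookup involves no string comparison across the corpus vocabulary, which is consistent with this method having the shortest execution time reported above.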