A Corpus of 21st Century Scots Texts


Levenshtein Distance

Enter a word to find nearest neighbouring words, for example sonsie


Similar words to plum in Corpus

Methods: Levenshtein, Double Levenshtein, SoundEx, MetaPhone, Manually curated

Levenshtein (distance in brackets)
plum (0) - 15 freq
lum (1) - 140 freq
plul (1) - 1 freq
prum (1) - 1 freq
glum (1) - 10 freq
pum (1) - 3 freq
ploum (1) - 2 freq
plume (1) - 7 freq
plums (1) - 7 freq
paum (1) - 2 freq
plumb (1) - 1 freq
plug (1) - 28 freq
plu (1) - 2 freq
plump (1) - 16 freq
plus (1) - 71 freq
slum (1) - 5 freq
slut (2) - 5 freq
poem (2) - 369 freq
pluke (2) - 1 freq
pleg (2) - 2 freq
clam (2) - 10 freq
ileum (2) - 1 freq
ploo (2) - 45 freq
slur (2) - 4 freq
clump (2) - 18 freq
Double Levenshtein (distance in brackets)
plum (0) - 15 freq
plume (1) - 7 freq
ploum (1) - 2 freq
plu (2) - 2 freq
plump (2) - 16 freq
slum (2) - 5 freq
palm (2) - 33 freq
paulm (2) - 2 freq
plug (2) - 28 freq
plus (2) - 71 freq
ploom (2) - 11 freq
lum (2) - 140 freq
prum (2) - 1 freq
plumb (2) - 1 freq
glum (2) - 10 freq
plul (2) - 1 freq
paum (2) - 2 freq
pum (2) - 3 freq
plums (2) - 7 freq
lm (3) - 5 freq
plod (3) - 1 freq
flam (3) - 3 freq
pul (3) - 6 freq
plout (3) - 1 freq
clem (3) - 4 freq
SoundEx code - P450
plume - 7 freq
plain - 178 freq
playin - 347 freq
pilin - 9 freq
pullin - 107 freq
peelin - 5 freq
plan - 236 freq
palm - 33 freq
plane - 68 freq
ploomy - 1 freq
pleyin - 12 freq
pulein - 2 freq
pailin - 8 freq
plooin - 17 freq
ploom - 11 freq
plouin - 13 freq
playan - 15 freq
paulm - 2 freq
'plane - 1 freq
playin' - 7 freq
playen - 2 freq
pullen - 1 freq
plaan - 1 freq
plum - 15 freq
pylon - 1 freq
pleuan - 1 freq
paulin' - 1 freq
palma - 2 freq
pauline - 36 freq
plen - 109 freq
pallin - 2 freq
paloma - 1 freq
palin - 4 freq
pey-line - 1 freq
plaen - 1 freq
'palm - 1 freq
pillowin - 1 freq
plene - 1 freq
plenn - 4 freq
polonie - 1 freq
plooan - 1 freq
pillion - 1 freq
pullan - 6 freq
pewlin - 1 freq
pollen - 3 freq
plaine - 1 freq
plein - 1 freq
pallion - 1 freq
playeen - 1 freq
pulin - 1 freq
palmie - 1 freq
pollin - 10 freq
plewin - 1 freq
‘plain - 5 freq
pùllin - 1 freq
plaein - 1 freq
poolin - 2 freq
plyin - 1 freq
plummy - 2 freq
‘playin - 1 freq
ploum - 2 freq
pawlin - 1 freq
plena - 2 freq
puhlin - 1 freq
pleen - 1 freq
pollença - 1 freq
plan' - 1 freq
“plain - 1 freq
pvlne - 1 freq
MetaPhone code - PLM
plume - 7 freq
palm - 33 freq
ploomy - 1 freq
ploom - 11 freq
paulm - 2 freq
plum - 15 freq
palma - 2 freq
paloma - 1 freq
'palm - 1 freq
palmie - 1 freq
plummy - 2 freq
ploum - 2 freq
plumb - 1 freq
Manually curated lemma - PLUM
Time to execute Levenshtein function - 0.179347 milliseconds
The Levenshtein distance is the minimum number of characters you have to replace, insert or delete to transform one word into another. It's useful for detecting typos and alternative spellings.
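As an illustration of the definition above, here is a minimal Python sketch of the standard dynamic-programming calculation. It is not the code this site runs; the function name `levenshtein` is chosen here for illustration only.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character edits turning a into b."""
    # Keep the shorter word in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]  # distance between a[:i] and the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # replace ca with cb (or keep it)
            ))
        previous = current
    return previous[-1]


for word in ("plum", "lum", "plume", "poem", "clump"):
    print(word, levenshtein("plum", word))
```

The distances printed for these words agree with the figures in brackets in the Levenshtein list above.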
Time to execute Double Levenshtein function - 0.341919 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once on the full words and once with the vowels removed, and adds the two distances together, giving double weight to consonants.
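The double-weighting idea can be sketched in a few lines of Python. This is a reconstruction from the description, not the site's own code. It assumes the vowels are a, e, i, o and u, and the helper names `strip_vowels` and `double_levenshtein` are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: the page does not say which letters count as vowels


def levenshtein(a: str, b: str) -> int:
    """Plain edit distance (insert, delete, replace)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,
                               current[j - 1] + 1,
                               previous[j - 1] + cost))
        previous = current
    return previous[-1]


def strip_vowels(word: str) -> str:
    """Remove the assumed vowels, leaving the consonant skeleton."""
    return "".join(ch for ch in word if ch.lower() not in VOWELS)


def double_levenshtein(a: str, b: str) -> int:
    """Distance on the full words plus distance on the consonant skeletons."""
    return levenshtein(a, b) + levenshtein(strip_vowels(a), strip_vowels(b))


for word in ("plume", "palm", "lum", "plod"):
    print(word, double_levenshtein("plum", word))
```

Under these assumptions the sketch reproduces the bracketed figures in the Double Levenshtein list: plume stays at 1 because only a vowel differs, while lum rises to 2 because the missing p is counted in both passes.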
Time to execute SoundEx function - 0.027557 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
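For readers who want to see how such a code is built, here is a small Python sketch of the classic (American) Soundex rules. It is an illustration, not the function this site calls. Skipping non-ASCII characters is an assumption made here so that accented spellings still receive a code.

```python
# Consonant groups and their Soundex digits; vowels, h, w and y carry no digit.
CODES = {}
for letters, digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                       ("l", "4"), ("mn", "5"), ("r", "6")):
    for ch in letters:
        CODES[ch] = digit


def soundex(word: str) -> str:
    """Return the four-character Soundex code, or '' if there are no letters."""
    # Assumption: ignore apostrophes, hyphens and non-ASCII characters.
    letters = [c for c in word.lower() if c.isascii() and c.isalpha()]
    if not letters:
        return ""
    result = letters[0].upper()
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        code = CODES.get(ch, "")
        # Adjacent letters with the same digit are coded only once.
        if code and code != prev:
            result += code
        # h and w do not separate repeated digits; vowels do.
        if ch not in "hw":
            prev = code
    return (result + "000")[:4]  # pad with zeros, truncate to four characters


for word in ("plum", "playin", "pillion", "pylon"):
    print(word, soundex(word))
```

All four words come out as P450, which is why they share the SoundEx list above even though they are far apart in edit distance.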
Time to execute MetaPhone function - 0.036854 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names that sound similar.
Time to execute Manually curated function - 0.000901 milliseconds
Manual curation uses a hand-built lookup table (lexicon) that links words to their lemmas and includes obvious typos and spelling variations. Not all words are covered.
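A lookup of this kind can be pictured as a plain dictionary. The sketch below is hypothetical: the `LEXICON` entries and the `lookup_lemma` name are invented for illustration, apart from the plum to PLUM pairing shown in the results above, and the real lexicon's contents and format are not given on this page.

```python
from typing import Optional

# Hypothetical hand-built lexicon: surface form -> lemma.
# Only "plum" -> "PLUM" is taken from the results above; the other
# entries are made-up examples of spelling variants.
LEXICON = {
    "plum": "PLUM",
    "plums": "PLUM",
    "ploom": "PLUM",
}


def lookup_lemma(word: str) -> Optional[str]:
    """Return the curated lemma for a word, or None if it is not covered."""
    return LEXICON.get(word.lower())


print(lookup_lemma("Plum"))    # covered word
print(lookup_lemma("sonsie"))  # not in this toy lexicon, so None
```

A dictionary lookup does no string comparison across the corpus, which fits with this method having the shortest execution time of the five reported here.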