A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to plume in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
plume (0) - 7 freq
lume (1) - 1 freq
pluke (1) - 1 freq
plumb (1) - 1 freq
blume (1) - 3 freq
plums (1) - 7 freq
plumes (1) - 3 freq
plump (1) - 16 freq
plum (1) - 15 freq
plumed (1) - 1 freq
luce (2) - 5 freq
flute (2) - 16 freq
lame (2) - 10 freq
plumin (2) - 1 freq
prune (2) - 5 freq
ploums (2) - 2 freq
plunge (2) - 6 freq
blame (2) - 155 freq
slums (2) - 8 freq
paule (2) - 1 freq
slum (2) - 5 freq
plane (2) - 68 freq
paum (2) - 2 freq
lube (2) - 1 freq
ume (2) - 2 freq
plume (0) - 7 freq
plum (1) - 15 freq
plump (2) - 16 freq
plumes (2) - 3 freq
ploum (2) - 2 freq
plumed (2) - 1 freq
lume (2) - 1 freq
plums (2) - 7 freq
pluke (2) - 1 freq
plumb (2) - 1 freq
blume (2) - 3 freq
palm (3) - 33 freq
pleumen (3) - 1 freq
lum (3) - 140 freq
pum (3) - 3 freq
prime (3) - 36 freq
pluto (3) - 2 freq
poame (3) - 1 freq
clime (3) - 7 freq
glum (3) - 10 freq
palma (3) - 2 freq
clame (3) - 1 freq
plate (3) - 183 freq
slime (3) - 6 freq
plus (3) - 71 freq
SoundEx code - P450
plume - 7 freq
plain - 178 freq
playin - 347 freq
pilin - 9 freq
pullin - 107 freq
peelin - 5 freq
plan - 236 freq
palm - 33 freq
plane - 68 freq
ploomy - 1 freq
pleyin - 12 freq
pulein - 2 freq
pailin - 8 freq
plooin - 17 freq
ploom - 11 freq
plouin - 13 freq
playan - 15 freq
paulm - 2 freq
'plane - 1 freq
playin' - 7 freq
playen - 2 freq
pullen - 1 freq
plaan - 1 freq
plum - 15 freq
pylon - 1 freq
pleuan - 1 freq
paulin' - 1 freq
palma - 2 freq
pauline - 36 freq
plen - 109 freq
pallin - 2 freq
paloma - 1 freq
palin - 4 freq
pey-line - 1 freq
plaen - 1 freq
'palm - 1 freq
pillowin - 1 freq
plene - 1 freq
plenn - 4 freq
polonie - 1 freq
plooan - 1 freq
pillion - 1 freq
pullan - 6 freq
pewlin - 1 freq
pollen - 3 freq
plaine - 1 freq
plein - 1 freq
pallion - 1 freq
playeen - 1 freq
pulin - 1 freq
palmie - 1 freq
pollin - 10 freq
plewin - 1 freq
€˜plain - 5 freq
pùllin - 1 freq
plaein - 1 freq
poolin - 2 freq
plyin - 1 freq
plummy - 2 freq
€˜playin - 1 freq
ploum - 2 freq
pawlin - 1 freq
plena - 2 freq
puhlin - 1 freq
pleen - 1 freq
pollença - 1 freq
plan' - 1 freq
“plain - 1 freq
pvlne - 1 freq
MetaPhone code - PLM
plume - 7 freq
palm - 33 freq
ploomy - 1 freq
ploom - 11 freq
paulm - 2 freq
plum - 15 freq
palma - 2 freq
paloma - 1 freq
'palm - 1 freq
palmie - 1 freq
plummy - 2 freq
ploum - 2 freq
plumb - 1 freq
PLUME
Time to execute Levenshtein function - 0.196197 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.345197 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027912 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037064 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000859 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.