A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ahint

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pilgrim in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pilgrim (0) - 14 freq
pilgim (1) - 1 freq
pilgrims (1) - 16 freq
pilgrim's (2) - 7 freq
pilgrimer (2) - 4 freq
pilrims (2) - 1 freq
widdrim (3) - 2 freq
pilgrimers (3) - 2 freq
grim (3) - 53 freq
pinglin (3) - 1 freq
pidgin (3) - 4 freq
megrim (3) - 4 freq
pilgremer (3) - 1 freq
program (3) - 25 freq
fillim (3) - 2 freq
pirm (3) - 1 freq
polaris (3) - 3 freq
pilit (3) - 2 freq
diagram (3) - 3 freq
piggin (3) - 10 freq
piggie (3) - 13 freq
pilin (3) - 9 freq
pirrie (3) - 1 freq
kingrip (3) - 1 freq
prim (3) - 2 freq
pilgrim (0) - 14 freq
pilgrims (2) - 16 freq
pilgim (2) - 1 freq
pilgrimer (3) - 4 freq
pilgremer (4) - 1 freq
program (4) - 25 freq
pilgrimage (4) - 12 freq
epigram (4) - 2 freq
pilrims (4) - 1 freq
pilgrim's (4) - 7 freq
hologram (5) - 1 freq
prim (5) - 2 freq
purim (5) - 1 freq
telegram (5) - 8 freq
polaris (5) - 3 freq
diagram (5) - 3 freq
pilgrimers (5) - 2 freq
megrim (5) - 4 freq
grim (5) - 53 freq
pirm (5) - 1 freq
filigree (6) - 2 freq
grimy (6) - 2 freq
progres (6) - 1 freq
prime (6) - 36 freq
paltry (6) - 1 freq
SoundEx code - P426
playgrund - 29 freq
pilgrim's - 7 freq
pleasure - 74 freq
pleisures - 4 freq
plaisure - 2 freq
pleesure - 23 freq
plooshares - 1 freq
pilgrimage - 12 freq
pleasures - 11 freq
pleisur - 33 freq
playgroup - 4 freq
pleisure - 60 freq
pleyscrievin - 2 freq
playgrun - 17 freq
playgruns - 1 freq
pilgrims - 16 freq
pilgrim - 14 freq
pleisurit - 1 freq
pleesher - 2 freq
playgroond - 11 freq
pleisour - 1 freq
play-gruns - 1 freq
playground - 6 freq
polygraph - 1 freq
pleesuir - 2 freq
pleesuirs - 2 freq
pleesures - 3 freq
pliesjir - 1 freq
plagiarist - 1 freq
plaisir - 3 freq
plaesur - 2 freq
ploushare - 2 freq
plooshare - 2 freq
pleasour - 1 freq
pilgrimer - 4 freq
pluscarden - 1 freq
pleisurable - 1 freq
pilgrimers - 2 freq
pleisir - 3 freq
plagiarisin - 1 freq
playgrunn - 2 freq
pleisur-snowker - 1 freq
plagiarism - 2 freq
plei-sured - 1 freq
pleesurin - 1 freq
pilgremer - 1 freq
pleygroup - 1 freq
pleisured - 1 freq
pleisurs - 1 freq
pleygroups - 1 freq
policework - 1 freq
placards - 2 freq
placard - 2 freq
pleygrund - 1 freq
pleygrun - 1 freq
playgroun - 2 freq
pylqzqr - 1 freq
pauljcorrigan - 1 freq
pilchard - 1 freq
polisher - 1 freq
paulgardinerdj - 1 freq
plucker - 1 freq
MetaPhone code - PLKRM
pilgrim - 14 freq
PILGRIM
Time to execute Levenshtein function - 0.207550 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.356753 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028377 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039576 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000896 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.