A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pilgrim in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pilgrim (0) - 14 freq
pilgim (1) - 1 freq
pilgrims (1) - 15 freq
pilrims (2) - 1 freq
pilgrim's (2) - 7 freq
pilgrimer (2) - 4 freq
eilrig (3) - 5 freq
piggie (3) - 13 freq
widdrim (3) - 2 freq
polaris (3) - 3 freq
pirm (3) - 1 freq
pilgrimage (3) - 12 freq
pierie (3) - 8 freq
pilgrimers (3) - 2 freq
pirrie (3) - 1 freq
pilin (3) - 7 freq
fillim (3) - 2 freq
kingrip (3) - 1 freq
ingrid (3) - 17 freq
pilit (3) - 2 freq
diagram (3) - 3 freq
eildrig (3) - 2 freq
piggin (3) - 10 freq
megrim (3) - 4 freq
pirie (3) - 5 freq
pilgrim (0) - 14 freq
pilgrims (2) - 15 freq
pilgim (2) - 1 freq
pilgrimer (3) - 4 freq
program (4) - 24 freq
pilgremer (4) - 1 freq
pilgrim's (4) - 7 freq
epigram (4) - 2 freq
pilrims (4) - 1 freq
pilgrimage (4) - 12 freq
diagram (5) - 3 freq
purim (5) - 1 freq
prim (5) - 2 freq
megrim (5) - 4 freq
pilgrimers (5) - 2 freq
polaris (5) - 3 freq
pirm (5) - 1 freq
hologram (5) - 1 freq
telegram (5) - 7 freq
grim (5) - 51 freq
filigree (6) - 2 freq
prom (6) - 8 freq
piggar (6) - 1 freq
plagiarism (6) - 2 freq
playgrun (6) - 17 freq
SoundEx code - P426
playgrund - 29 freq
pilgrim's - 7 freq
pleasure - 71 freq
pleisures - 4 freq
plaisure - 2 freq
pleesure - 23 freq
plooshares - 1 freq
pilgrimage - 12 freq
pleasures - 11 freq
pleisur - 32 freq
playgroup - 4 freq
pleisure - 60 freq
pleyscrievin - 2 freq
playgrun - 17 freq
playgruns - 1 freq
pilgrims - 15 freq
pilgrim - 14 freq
pleisurit - 1 freq
pleesher - 2 freq
pleisour - 1 freq
playgroond - 6 freq
play-gruns - 1 freq
playground - 6 freq
polygraph - 1 freq
pleesuir - 2 freq
pleesuirs - 2 freq
pleesures - 3 freq
pliesjir - 1 freq
plagiarist - 1 freq
plaisir - 3 freq
plaesur - 2 freq
ploushare - 2 freq
plooshare - 2 freq
pleasour - 1 freq
pilgrimer - 4 freq
pluscarden - 1 freq
pleisurable - 1 freq
pilgrimers - 2 freq
pleisir - 3 freq
plagiarisin - 1 freq
playgrunn - 2 freq
pleisur-snowker - 1 freq
plagiarism - 2 freq
plei-sured - 1 freq
pleesurin - 1 freq
pilgremer - 1 freq
pleygroup - 1 freq
pleisured - 1 freq
pleisurs - 1 freq
pleygroups - 1 freq
policework - 1 freq
placards - 2 freq
placard - 2 freq
pleygrund - 1 freq
pleygrun - 1 freq
playgroun - 2 freq
pylqzqr - 1 freq
pauljcorrigan - 1 freq
pilchard - 1 freq
polisher - 1 freq
paulgardinerdj - 1 freq
plucker - 1 freq
MetaPhone code - PLKRM
pilgrim - 14 freq
PILGRIM
Time to execute Levenshtein function - 0.175501 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.342594 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.028251 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.039153 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.001125 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.