A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to paragraph in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
paragraph (0) - 19 freq
paragraphs (1) - 6 freq
parrygraph (2) - 2 freq
polygraph (3) - 1 freq
i-digraph (4) - 1 freq
program (4) - 24 freq
paragons (4) - 1 freq
geograph (4) - 2 freq
telegraph (4) - 8 freq
padagnan (4) - 1 freq
paparrazi (4) - 1 freq
farrach (4) - 1 freq
pasaran (4) - 1 freq
graph (4) - 4 freq
vaagrant (4) - 1 freq
anagram (4) - 1 freq
programs (4) - 13 freq
autograph (4) - 1 freq
monograph (4) - 1 freq
fragrant (4) - 6 freq
digraph (4) - 3 freq
paraphrase (4) - 3 freq
parkash (4) - 1 freq
photograph (4) - 21 freq
parawd (5) - 1 freq
paragraph (0) - 19 freq
paragraphs (2) - 6 freq
parrygraph (3) - 2 freq
polygraph (4) - 1 freq
programs (6) - 13 freq
autograph (6) - 1 freq
photograph (6) - 21 freq
digraph (6) - 3 freq
monograph (6) - 1 freq
graph (6) - 4 freq
program (6) - 24 freq
i-digraph (6) - 1 freq
telegraph (6) - 8 freq
geograph (6) - 2 freq
biography (7) - 7 freq
topography (7) - 2 freq
seaography (7) - 6 freq
peregrine (7) - 1 freq
programme (7) - 102 freq
geography (7) - 37 freq
'geography (7) - 1 freq
portrush (7) - 1 freq
photography (7) - 6 freq
prugh (7) - 5 freq
paratrooper (7) - 1 freq
SoundEx code - P626
programme - 102 freq
preacher - 20 freq
pre-crust - 2 freq
parker - 19 freq
pressure - 64 freq
'programme' - 1 freq
preachers - 6 freq
preservin - 8 freq
preserve's - 1 freq
pressurised - 1 freq
progress - 95 freq
preserved - 7 freq
programmes - 54 freq
porkers - 2 freq
progressive - 24 freq
paragraph - 19 freq
proscribed - 1 freq
procure - 1 freq
preserve - 20 freq
prescribed - 3 freq
preshure - 1 freq
preesher - 5 freq
priservin - 1 freq
preservation - 12 freq
progresses - 1 freq
programmed - 3 freq
pressures - 8 freq
programmin - 2 freq
prescription - 8 freq
precarious - 3 freq
'progress' - 2 freq
pursuers - 1 freq
progression - 10 freq
pre-christian - 1 freq
program - 24 freq
programs - 13 freq
pruchry - 1 freq
preacher's - 1 freq
paragraphs - 6 freq
progremmes - 1 freq
parrygraph - 2 freq
presairve - 4 freq
precariously - 3 freq
procrastination - 3 freq
preservin' - 3 freq
programmin' - 3 freq
progressin - 2 freq
progressin' - 1 freq
progressed - 4 freq
proker - 1 freq
procuratour - 1 freq
pursuer - 2 freq
prescriptive - 4 freq
prescriptions - 4 freq
presairver - 1 freq
perqueerest - 1 freq
pressured - 2 freq
prizery - 1 freq
peregrine - 1 freq
preservit - 2 freq
progres - 1 freq
progressit - 1 freq
€˜prescriptive - 1 freq
precursor - 1 freq
proscribin - 1 freq
proscribit - 1 freq
presarve - 1 freq
powercaird - 1 freq
€˜preservin - 1 freq
prescribe - 1 freq
precairious - 1 freq
preser - 1 freq
perqueir - 1 freq
progressing - 1 freq
perjurors - 1 freq
procuire - 1 freq
perscription - 2 freq
presirvaeshin - 2 freq
pressjournal - 21 freq
perscriptions - 1 freq
procrastinate - 1 freq
pbrcrzj - 1 freq
perserverence - 1 freq
'progressive' - 1 freq
procrastinatin - 1 freq
parsreview - 2 freq
“prasher” - 1 freq
progressscots - 1 freq
progressives - 1 freq
programme' - 1 freq
progressscot - 1 freq
MetaPhone code - PRKRF
paragraph - 19 freq
parrygraph - 2 freq
PARAGRAPH
Time to execute Levenshtein function - 0.189972 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.361136 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027526 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037322 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000838 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.