A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

Levenshtein Distance

Enter a word to find nearest neighbouring words, for example ablow

- basic concord - pre-sorted concord - post-sorted concord - map and chronology - chronogrid - fine-grain concord -

Similar words to pronouns in Corpus

Levenshtein Double Levenshtein SoundEx MetaPhone Manually curated
pronouns (0) - 42 freq
pronoun (1) - 23 freq
pronoons (1) - 3 freq
pronounce (2) - 24 freq
profound (2) - 6 freq
pronoon (2) - 6 freq
profoun (2) - 2 freq
pysonous (3) - 2 freq
drouns (3) - 1 freq
prongs (3) - 2 freq
propont (3) - 1 freq
propos (3) - 1 freq
prognosis (3) - 2 freq
broons (3) - 28 freq
promotes (3) - 6 freq
croons (3) - 13 freq
droons (3) - 4 freq
pronoonce (3) - 11 freq
proteins (3) - 1 freq
provosts (3) - 1 freq
cronus (3) - 1 freq
crouns (3) - 4 freq
proof's (3) - 1 freq
presons (3) - 1 freq
proports (3) - 1 freq
pronouns (0) - 42 freq
pronoons (1) - 3 freq
pronoun (2) - 23 freq
pronoon (3) - 6 freq
pronounce (3) - 24 freq
propones (4) - 14 freq
presons (4) - 1 freq
proteins (4) - 1 freq
pronoonce (4) - 11 freq
prongs (4) - 2 freq
profound (4) - 6 freq
profoun (4) - 2 freq
prisons (4) - 1 freq
prannies (4) - 1 freq
prunks (5) - 1 freq
prentis (5) - 1 freq
preeens (5) - 1 freq
prunes (5) - 3 freq
ronnies (5) - 1 freq
prawns (5) - 6 freq
pranny (5) - 19 freq
prunin (5) - 2 freq
prences (5) - 2 freq
prinks (5) - 1 freq
preens (5) - 23 freq
SoundEx code - P655
pronunciation - 31 freq
pronunciations - 10 freq
permanent - 30 freq
pronounce - 24 freq
preenin - 7 freq
pronoonceable - 1 freq
perimenopause - 1 freq
promenade - 7 freq
premonition - 7 freq
promenader - 1 freq
pronooncit - 2 freq
permanently - 6 freq
prunin - 2 freq
premium - 2 freq
pronooncements - 1 freq
preanin - 1 freq
pronounce''t - 1 freq
prominent - 9 freq
pronounced - 29 freq
preenin' - 1 freq
pronunced - 3 freq
pronouns - 42 freq
pronoun - 23 freq
paranems - 1 freq
pronuncin - 2 freq
premonitions - 2 freq
permanant - 1 freq
pornounciation - 1 freq
pernyim - 1 freq
pre-eminent - 1 freq
pronominal - 1 freq
pronouncit - 2 freq
pronunciatioun - 1 freq
pronoonce - 11 freq
pronunciâtion - 1 freq
permament - 1 freq
pronuncement - 1 freq
pronoonciation - 15 freq
pronoon - 6 freq
pronoonced - 4 freq
promontory - 2 freq
pronouncin - 1 freq
pronunciaetion - 1 freq
pronunciaetions - 1 freq
primming - 1 freq
prominence - 1 freq
permency - 1 freq
permanence - 1 freq
pronooncin - 4 freq
perineum - 2 freq
pronoons - 3 freq
pronunseeaeshins - 1 freq
preeemium - 1 freq
paramountabdn - 1 freq
pronounciation - 1 freq
MetaPhone code - PRNNS
pronounce - 24 freq
pronouns - 42 freq
pronoonce - 11 freq
pronoons - 3 freq
PRONOUNS
Time to execute Levenshtein function - 0.191167 milliseconds
The Levenshtein distance is the number of characters you have to replace, insert or delete to transform one word into another, its useful for detecting typos and alternative spellings
Time to execute Double Levenshtein function - 0.355663 milliseconds
In a stroke of genius, this runs the Levenshtein function twice, once without vowels and adds the distance together, giving double weight to consonants.
Time to execute SoundEx function - 0.027561 milliseconds
Soundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for homophones to be encoded to the same representation so that they can be matched despite minor differences in spelling.
Time to execute MetaPhone function - 0.037601 milliseconds
Metaphone is a phonetic algorithm, published by Lawrence Philips in 1990, for indexing words by their English pronunciation.[1] It fundamentally improves on the Soundex algorithm by using information about variations and inconsistencies in English spelling and pronunciation to produce a more accurate encoding, which does a better job of matching words and names which sound similar.
Time to execute Manually curated function - 0.000852 milliseconds
Manual Curation uses a lookup table / lexicon which has been created by hand which links words to their lemmas, and includes obvious typos and spelling variations. Not all words are covered.