A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Love, Rowena M.

Basic Stats

Total words by this author in corpus - 268
Total unique words used by this author in corpus - 192
Ratio of total words to unique words - 1.396
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, a, wi, in, as, o, is, tae, ma,

List of texts in corpus

Lallans 82 - Simmer Strand
Lallans Magazine (2013-07 ) in Central dialect (LAL), categorised as poetry (96 words)
Lallans 82 - Hame tae Dunbar Harbour
Lallans Magazine (2013-07 ) in Central dialect (LAL), categorised as poetry (104 words)
Lallans 82 - Hamecomin
Lallans Magazine (2013-07 ) in Central dialect (LAL), categorised as poetry (68 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
wuid2 7,462.6912.362
hert2 7,462.699.657
oan3 11,194.037.207
as6 22,388.066.949
nor2 7,462.696.003
time3 11,194.035.980
hame2 7,462.695.333
wi6 22,388.065.263
heid2 7,462.694.659
bi2 7,462.694.090
awa2 7,462.693.991
auld2 7,462.693.621
sae2 7,462.693.106
is4 14,925.372.399
ma3 11,194.031.977
in6 22,388.060.715
an9 33,582.090.667
tae4 14,925.370.597
it2 7,462.690.505
the18 67,164.180.373
a7 26,119.400.098
o5 18,656.720.049
he2 7,462.690.049
that3 11,194.030.046
plet2 7,462.69nan