A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

BardOfMilebush

Basic Stats

Total words by this author in corpus - 422
Total unique words used by this author in corpus - 263
Ratio of total words to unique words - 1.605
Tagged as SUL (South Antrim (Between Sixmilewater and Belfast)) dialect.
Top ten most common words - and, the, i, yer, that, in, all, my, to, me,

List of texts in corpus

Untitled poem
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (281 words)

To A Merican Food
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (141 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
all6 14,218.0137.192
and17 40,284.3633.632
yer10 23,696.6833.476
scotia's2 4,739.3427.492
i14 33,175.3625.945
i'm4 9,478.6725.059
my6 14,218.0122.617
mice2 4,739.3422.317
thomson2 4,739.3419.836
talk3 7,109.0018.260
dinnae4 9,478.6718.209
without2 4,739.3417.071
worst2 4,739.3416.004
find3 7,109.0015.469
i'd2 4,739.3415.459
scotch2 4,739.3415.459
stand2 4,739.3415.358
food2 4,739.3414.318
to5 11,848.3413.865
o'3 7,109.0013.823
ulster2 4,739.3412.379
cannae3 7,109.0012.216
they're2 4,739.3411.901
an2 4,739.3411.156
land2 4,739.3410.745
about2 4,739.348.788
way2 4,739.348.210
words2 4,739.347.725
men2 4,739.346.742
when3 7,109.006.439
look2 4,739.346.255
me5 11,848.345.865
a5 11,848.345.841
just2 4,739.345.338
the15 35,545.024.433
gie2 4,739.344.057
you3 7,109.003.816
say2 4,739.343.527
frae3 7,109.003.458
o4 9,478.673.264
or4 9,478.673.242
so2 4,739.342.906
that8 18,957.352.803
but4 9,478.672.428
ken2 4,739.342.355
auld2 4,739.342.244
like3 7,109.002.243
of2 4,739.342.146
some2 4,739.342.103
if2 4,739.341.868
their2 4,739.341.543
ye4 9,478.671.082
wee2 4,739.341.038
it3 7,109.000.910
they3 7,109.000.908
no2 4,739.340.605
wi2 4,739.340.491
at2 4,739.340.205
scots2 4,739.340.196
on2 4,739.340.158
in7 16,587.680.026