A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

BardOfMilebush

Basic Stats

Total words by this author in corpus - 422
Total unique words used by this author in corpus - 263
Ratio of total words to unique words - 1.605
Tagged as SUL (South Antrim (Between Sixmilewater and Belfast)) dialect.
Top ten most common words - and, the, i, yer, that, in, all, my, to, me,

List of texts in corpus

Untitled poem
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (281 words)

To A Merican Food
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (141 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
all6 14,218.0137.280
yer10 23,696.6833.417
and17 40,284.3633.235
scotia's2 4,739.3427.442
i14 33,175.3625.507
i'm4 9,478.6724.914
my6 14,218.0122.465
mice2 4,739.3422.268
thomson2 4,739.3420.837
dinnae4 9,478.6718.233
talk3 7,109.0018.144
without2 4,739.3416.727
scotch2 4,739.3415.619
i'd2 4,739.3415.410
find3 7,109.0015.370
worst2 4,739.3415.310
stand2 4,739.3414.842
food2 4,739.3414.120
o'3 7,109.0013.895
to5 11,848.3413.561
ulster2 4,739.3412.823
cannae3 7,109.0011.998
they're2 4,739.3411.894
an2 4,739.3410.876
land2 4,739.3410.697
about2 4,739.348.688
way2 4,739.348.163
words2 4,739.347.705
men2 4,739.346.727
when3 7,109.006.188
look2 4,739.346.040
a5 11,848.345.804
me5 11,848.345.636
just2 4,739.345.166
the15 35,545.024.568
gie2 4,739.344.028
you3 7,109.003.630
frae3 7,109.003.530
say2 4,739.343.469
or4 9,478.673.342
o4 9,478.673.204
so2 4,739.342.878
that8 18,957.352.813
but4 9,478.672.392
ken2 4,739.342.375
auld2 4,739.342.213
like3 7,109.002.177
some2 4,739.342.090
of2 4,739.342.074
if2 4,739.341.825
their2 4,739.341.547
ye4 9,478.671.071
wee2 4,739.341.039
it3 7,109.000.941
they3 7,109.000.921
no2 4,739.340.574
wi2 4,739.340.480
at2 4,739.340.203
scots2 4,739.340.195
on2 4,739.340.146
in7 16,587.680.028