A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

BardOfMilebush

Basic Stats

Total words by this author in corpus - 422
Total unique words used by this author in corpus - 263
Ratio of total words to unique words - 1.605
Tagged as SUL (South Antrim (Between Sixmilewater and Belfast)) dialect.
Top ten most common words - and, the, i, yer, that, in, all, my, to, me,

List of texts in corpus

Untitled poem
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (281 words)

To A Merican Food
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (141 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
all6 14,218.0137.261
yer10 23,696.6833.427
and17 40,284.3633.193
scotia's2 4,739.3427.436
i14 33,175.3625.477
i'm4 9,478.6724.901
my6 14,218.0122.447
mice2 4,739.3422.262
thomson2 4,739.3420.830
dinnae4 9,478.6718.220
talk3 7,109.0018.134
without2 4,739.3416.721
scotch2 4,739.3415.613
i'd2 4,739.3415.404
find3 7,109.0015.361
worst2 4,739.3415.303
stand2 4,739.3414.836
food2 4,739.3414.114
o'3 7,109.0013.885
to5 11,848.3413.546
ulster2 4,739.3412.817
cannae3 7,109.0012.003
they're2 4,739.3411.887
an2 4,739.3410.891
land2 4,739.3410.691
about2 4,739.348.682
way2 4,739.348.157
words2 4,739.347.699
men2 4,739.346.721
when3 7,109.006.189
look2 4,739.346.059
a5 11,848.345.805
me5 11,848.345.632
just2 4,739.345.166
the15 35,545.024.563
gie2 4,739.344.023
you3 7,109.003.632
frae3 7,109.003.523
say2 4,739.343.471
or4 9,478.673.335
o4 9,478.673.219
so2 4,739.342.876
that8 18,957.352.818
but4 9,478.672.386
ken2 4,739.342.370
auld2 4,739.342.212
like3 7,109.002.175
some2 4,739.342.089
of2 4,739.342.070
if2 4,739.341.824
their2 4,739.341.547
ye4 9,478.671.080
wee2 4,739.341.040
it3 7,109.000.938
they3 7,109.000.931
no2 4,739.340.578
wi2 4,739.340.481
at2 4,739.340.202
scots2 4,739.340.194
on2 4,739.340.148
in7 16,587.680.027