A Corpus of 21st Century Scots Texts


BardOfMilebush

Basic Stats

Total words by this author in corpus - 422
Total unique words used by this author in corpus - 263
Ratio of total words to unique words - 1.605
Tagged as SUL (South Antrim (Between Sixmilewater and Belfast)) dialect.
Top ten most common words - and, the, i, yer, that, in, all, my, to, me
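
The figures above can be reproduced from raw text with a few lines of Python. This is a minimal sketch, not the corpus's own code: the function name `basic_stats` and the tokenisation rule (lowercase, letters plus internal or trailing apostrophes) are assumptions, and the corpus may split words differently.

```python
import re
from collections import Counter

def basic_stats(text, top_n=10):
    """Return (total words, unique words, ratio, most common words).

    Hypothetical helper: the tokenisation rule is an assumption,
    not the corpus's documented method.
    """
    # Lowercase, then keep runs of letters with optional apostrophes,
    # so forms such as "i'm", "o'" and "scotia's" stay whole.
    tokens = re.findall(r"[a-z]+(?:'[a-z]*)*", text.lower())
    counts = Counter(tokens)
    total = len(tokens)
    unique = len(counts)
    ratio = round(total / unique, 3) if unique else 0.0
    top = [word for word, _ in counts.most_common(top_n)]
    return total, unique, ratio, top

# The ratio reported above is simply total words over unique words.
ratio_from_page = round(422 / 263, 3)
print(ratio_from_page)  # 1.605
```

Calling `basic_stats` on the two texts concatenated would only give 422 and 263 if the tokeniser matched the one the corpus uses.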

List of texts in corpus

Untitled poem
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (281 words)

To A Merican Food
Reddit (28-06-2017) in Ulster dialect (SUL), categorised as poetry (141 words)

Author word Keyness frequencies

This table is intended to list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter, and (c) words specific to the regional dialect.
Word       Count   Normalised per million   Keyness
all            6                14,218.01    37.303
yer           10                23,696.68    33.719
and           17                40,284.36    33.410
scotia's       2                 4,739.34    27.434
i             14                33,175.36    25.782
i'm            4                 9,478.67    24.945
my             6                14,218.01    22.543
mice           2                 4,739.34    22.260
thomson        2                 4,739.34    20.829
dinnae         4                 9,478.67    18.376
talk           3                 7,109.00    18.175
without        2                 4,739.34    17.014
worst          2                 4,739.34    15.947
scotch         2                 4,739.34    15.720
i'd            2                 4,739.34    15.402
find           3                 7,109.00    15.385
stand          2                 4,739.34    15.301
food           2                 4,739.34    14.338
to             5                11,848.34    13.792
o'             3                 7,109.00    13.740
ulster         2                 4,739.34    12.815
cannae         3                 7,109.00    12.316
they're        2                 4,739.34    11.885
an             2                 4,739.34    11.067
land           2                 4,739.34    10.689
about          2                 4,739.34     8.769
way            2                 4,739.34     8.295
words          2                 4,739.34     7.779
men            2                 4,739.34     6.781
when           3                 7,109.00     6.393
look           2                 4,739.34     6.220
me             5                11,848.34     5.889
a              5                11,848.34     5.765
just           2                 4,739.34     5.307
the           15                35,545.02     4.605
gie            2                 4,739.34     4.069
you            3                 7,109.00     3.770
say            2                 4,739.34     3.570
frae           3                 7,109.00     3.477
or             4                 9,478.67     3.271
o              4                 9,478.67     3.241
so             2                 4,739.34     2.875
that           8                18,957.35     2.818
but            4                 9,478.67     2.429
ken            2                 4,739.34     2.344
like           3                 7,109.00     2.218
auld           2                 4,739.34     2.217
of             2                 4,739.34     2.123
some           2                 4,739.34     2.088
if             2                 4,739.34     1.876
their          2                 4,739.34     1.532
ye             4                 9,478.67     1.102
wee            2                 4,739.34     1.051
they           3                 7,109.00     0.916
it             3                 7,109.00     0.910
no             2                 4,739.34     0.593
wi             2                 4,739.34     0.497
at             2                 4,739.34     0.211
scots          2                 4,739.34     0.181
on             2                 4,739.34     0.154
in             7                16,587.68     0.027
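
The "Normalised per million" column is the word's count divided by the author's 422 total words, scaled to one million. The page does not say which keyness statistic it uses. The sketch below assumes the log-likelihood (G2) measure that is common in corpus linguistics, so it illustrates the idea rather than reproducing the scores in the table. The function names are hypothetical, and any reference-corpus counts passed in would have to come from the rest of the corpus, which is not given here.

```python
import math

def per_million(count, total_words):
    """Scale a raw count to occurrences per million words."""
    return count / total_words * 1_000_000

def log_likelihood(author_count, author_total, ref_count, ref_total):
    """Log-likelihood (G2) keyness of one word.

    Assumption: the corpus may use a different statistic. The ref_*
    arguments describe the other writers' texts and are not on this page.
    """
    combined = author_count + ref_count
    # Expected counts if the word were equally frequent in both samples.
    expected_author = author_total * combined / (author_total + ref_total)
    expected_ref = ref_total * combined / (author_total + ref_total)
    score = 0.0
    if author_count > 0:
        score += author_count * math.log(author_count / expected_author)
    if ref_count > 0:
        score += ref_count * math.log(ref_count / expected_ref)
    return 2 * score

# "all" occurs 6 times in this author's 422 words.
print(round(per_million(6, 422), 2))  # 14218.01
```

A word used at the same rate by the author and by the reference texts scores 0 under this measure, and the score grows as the two rates diverge, which matches the way common function words such as "in" and "on" sit at the bottom of the table.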