A Corpus of 21st Century Scots Texts
Intro
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
y
z
Texts
Writers
Statistics
Top200
Search
Compare
. .Previous author - Next author
- dialect comparison -
fine grain dialect comparison -
Venn diagrams -
punctuation analysis -
chronology -
Warwick, Matthew
Basic Stats
Total words by this author in corpus - 930
Total unique words used by this author in corpus - 397
Ratio of total words to unique words - 2.343
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - tha, a, an, ye, tae, o, wullie, they, at, feardie,
List of texts in corpus
Tha Feardie Geng hae tae bide at hame
Sensory Attachment Centre (13-05-2020) in Ulster (Ballymena) dialect, categorised as weans
(531 words)
Fergie an Freens oan tha fairm
Ulster-Scots Community Network (2011 ) in Ulster (Rural Mid-Antrim) dialect, categorised as weans
(399 words)
Author word usage frequencies