A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Warwick, Matthew

Basic Stats

Total words by this author in corpus - 930
Total unique words used by this author in corpus - 397
Ratio of total words to unique words - 2.343
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - tha, a, an, ye, tae, o, wullie, they, at, feardie,

List of texts in corpus

Tha Feardie Geng hae tae bide at hame
Sensory Attachment Centre (13-05-2020) in Ulster (Ballymena) dialect (BUL), categorised as weans (531 words)

Fergie an Freens oan tha fairm
Ulster-Scots Community Network (2011 ) in Ulster (Rural Mid-Antrim) dialect (BUL), categorised as weans (399 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
tha72 77,419.35516.807
feardie9 9,677.42119.964
geng9 9,677.4286.606
wullie12 12,903.2378.324
docter6 6,451.6175.296
fergie7 7,526.8868.090
sadie5 5,376.3461.864
bug6 6,451.6159.880
toul5 5,376.3445.706
saip4 4,301.0845.101
liz5 5,376.3442.546
g4 4,301.0832.670
wash4 4,301.0829.448
weans5 5,376.3429.039
axed4 4,301.0827.457
ir6 6,451.6126.302
thaim9 9,677.4225.740
the'2 2,150.5424.271
lake3 3,225.8123.579
hame8 8,602.1523.474
ye19 20,430.1123.130
thon's3 3,225.8123.112
loanen2 2,150.5422.546
wile3 3,225.8122.275
gye3 3,225.8121.207
smit2 2,150.5420.457
aisy2 2,150.5420.457
bide5 5,376.3420.061
oan9 9,677.4219.907
thon7 7,526.8819.744
jock4 4,301.0818.353
whut3 3,225.8117.840
smittal5 5,376.34nan
cannae5 5,376.3417.912
hauns4 4,301.0817.838
gether2 2,150.5417.673
hi2 2,150.5417.292
pepper2 2,150.5416.945
bae3 3,225.8115.918
guid7 7,526.8815.853
crack3 3,225.8115.540
frae9 9,677.4214.721
reddin2 2,150.5414.190
cantie2 2,150.5414.190
forbye3 3,225.8113.863
thrie2 2,150.5413.724
wee9 9,677.4213.594
boady2 2,150.5413.054
reek2 2,150.5412.589
why3 3,225.8111.903
luk2 2,150.5411.891
doag2 2,150.5411.891
risin2 2,150.5411.627
maks3 3,225.8111.448
gies3 3,225.8111.305
need4 4,301.0811.087
bes2 2,150.5411.072
freens2 2,150.5411.072
fur9 9,677.4210.822
dae6 6,451.6110.601
ach2 2,150.5410.160
their7 7,526.8810.006
they11 11,827.969.829
yer7 7,526.889.646
naw3 3,225.818.866
in5 5,376.348.452
til4 4,301.088.216
watter3 3,225.818.131
yin4 4,301.087.966
dinnae3 3,225.817.711
hale3 3,225.817.642
gien3 3,225.817.508
buik2 2,150.546.890
hae7 7,526.886.735
ava2 2,150.546.729
siz3 3,225.81nan
yersel2 2,150.545.950
it4 4,301.085.861
haein2 2,150.545.320
hoo2 2,150.544.649
gie3 3,225.814.233
comin2 2,150.544.100
it's3 3,225.813.954
that4 4,301.083.730
hoose3 3,225.813.487
best2 2,150.543.386
at11 11,827.963.378
fowk4 4,301.083.219
keep2 2,150.543.000
things2 2,150.542.936
then3 3,225.812.757
wur2 2,150.542.678
same2 2,150.542.514
ah2 2,150.542.496
o13 13,978.492.289
awa3 3,225.812.132
aa4 4,301.082.064
as3 3,225.811.561
aye3 3,225.811.458
tak2 2,150.541.394
wiz2 2,150.541.385
up6 6,451.611.364
sae3 3,225.811.249
noo3 3,225.811.175
heid2 2,150.540.996
oot6 6,451.610.968
haes2 2,150.540.960
bit4 4,301.080.901
lang2 2,150.540.890
tae16 17,204.300.860
she3 3,225.810.752
be3 3,225.810.700
wi5 5,376.340.624
s9 9,677.420.594
richt2 2,150.540.450
an27 29,032.260.414
gulders2 2,150.54nan
his4 4,301.080.368
said2 2,150.540.355
ower3 3,225.810.353
a30 32,258.060.293
aboot2 2,150.540.241
jist2 2,150.540.240
if2 2,150.540.233
totey2 2,150.54nan
by2 2,150.540.191
see2 2,150.540.160
him2 2,150.540.099
time2 2,150.540.068
da2 2,150.540.042
is6 6,451.610.012
caa'ed3 3,225.81nan
cud2 2,150.543.393