A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Warwick, Matthew

Basic Stats

Total words by this author in corpus - 930
Total unique words used by this author in corpus - 397
Ratio of total words to unique words - 2.343
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - tha, a, an, ye, tae, o, wullie, they, at, feardie,

List of texts in corpus

Tha Feardie Geng hae tae bide at hame
Sensory Attachment Centre (13-05-2020) in Ulster (Ballymena) dialect (BUL), categorised as weans (531 words)

Fergie an Freens oan tha fairm
Ulster-Scots Community Network (2011 ) in Ulster (Rural Mid-Antrim) dialect (BUL), categorised as weans (399 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
tha72 77,419.35516.876
feardie9 9,677.42119.973
geng9 9,677.4286.615
wullie12 12,903.2377.493
docter6 6,451.6175.302
fergie7 7,526.8868.096
sadie5 5,376.3461.869
bug6 6,451.6159.886
toul5 5,376.3445.711
saip4 4,301.0845.105
liz5 5,376.3442.551
g4 4,301.0832.674
wash4 4,301.0829.452
weans5 5,376.3429.129
axed4 4,301.0827.760
ir6 6,451.6126.548
thaim9 9,677.4225.599
the'2 2,150.5424.273
lake3 3,225.8123.582
hame8 8,602.1523.429
thon's3 3,225.8123.115
ye19 20,430.1122.952
loanen2 2,150.5422.548
wile3 3,225.8122.277
gye3 3,225.8121.210
smit2 2,150.5420.459
aisy2 2,150.5420.459
thon7 7,526.8820.229
bide5 5,376.3420.066
oan9 9,677.4219.505
jock4 4,301.0818.357
whut3 3,225.8117.842
cannae5 5,376.3417.398
smittal5 5,376.34nan
gether2 2,150.5417.675
hauns4 4,301.0817.360
bae3 3,225.8117.237
pepper2 2,150.5416.947
hi2 2,150.5416.628
guid7 7,526.8815.724
crack3 3,225.8115.543
frae9 9,677.4214.874
reddin2 2,150.5414.192
cantie2 2,150.5414.192
forbye3 3,225.8113.820
thrie2 2,150.5413.726
wee9 9,677.4213.523
boady2 2,150.5413.055
reek2 2,150.5412.591
luk2 2,150.5411.893
doag2 2,150.5411.893
why3 3,225.8111.781
risin2 2,150.5411.715
maks3 3,225.8111.451
gies3 3,225.8111.251
bes2 2,150.5411.074
freens2 2,150.5411.074
need4 4,301.0810.981
fur9 9,677.4210.615
dae6 6,451.6110.526
ach2 2,150.5410.339
their7 7,526.8810.073
they11 11,827.969.910
yer7 7,526.889.478
naw3 3,225.818.939
in5 5,376.348.442
til4 4,301.088.239
watter3 3,225.818.164
yin4 4,301.087.988
hale3 3,225.817.700
dinnae3 3,225.817.604
gien3 3,225.817.497
buik2 2,150.546.892
hae7 7,526.886.760
siz3 3,225.81nan
ava2 2,150.546.731
it4 4,301.085.961
yersel2 2,150.545.916
haein2 2,150.545.336
hoo2 2,150.544.650
gie3 3,225.814.169
comin2 2,150.544.092
it's3 3,225.813.962
that4 4,301.083.730
hoose3 3,225.813.577
at11 11,827.963.439
best2 2,150.543.357
fowk4 4,301.083.276
ah2 2,150.543.054
keep2 2,150.542.995
things2 2,150.542.976
then3 3,225.812.740
wur2 2,150.542.612
same2 2,150.542.515
o13 13,978.492.260
awa3 3,225.812.200
aa4 4,301.082.193
aye3 3,225.811.587
as3 3,225.811.516
wiz2 2,150.541.449
tak2 2,150.541.425
up6 6,451.611.390
sae3 3,225.811.262
noo3 3,225.811.156
haes2 2,150.541.022
heid2 2,150.540.971
oot6 6,451.610.963
bit4 4,301.080.905
lang2 2,150.540.903
tae16 17,204.300.833
she3 3,225.810.792
be3 3,225.810.697
wi5 5,376.340.597
s9 9,677.420.517
gulders2 2,150.54nan
an27 29,032.260.480
richt2 2,150.540.458
ower3 3,225.810.362
his4 4,301.080.346
a30 32,258.060.278
totey2 2,150.54nan
said2 2,150.540.362
jist2 2,150.540.264
aboot2 2,150.540.231
if2 2,150.540.210
by2 2,150.540.198
see2 2,150.540.158
him2 2,150.540.094
time2 2,150.540.062
da2 2,150.540.042
is6 6,451.610.012
caa'ed3 3,225.81nan
cud2 2,150.543.395