A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Warwick, Matthew

Basic Stats

Total words by this author in corpus - 930
Total unique words used by this author in corpus - 397
Ratio of total words to unique words - 2.343
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - tha, a, an, ye, tae, o, wullie, they, at, feardie,

List of texts in corpus

Tha Feardie Geng hae tae bide at hame
Sensory Attachment Centre (13-05-2020) in Ulster (Ballymena) dialect (BUL), categorised as weans (531 words)

Fergie an Freens oan tha fairm
Ulster-Scots Community Network (2011 ) in Ulster (Rural Mid-Antrim) dialect (BUL), categorised as weans (399 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
tha72 77,419.35517.107
feardie9 9,677.42120.002
geng9 9,677.4286.644
wullie12 12,903.2377.531
docter6 6,451.6175.321
fergie7 7,526.8868.119
sadie5 5,376.3461.885
bug6 6,451.6159.905
toul5 5,376.3445.727
saip4 4,301.0845.118
liz5 5,376.3442.567
g4 4,301.0832.687
wash4 4,301.0829.464
weans5 5,376.3428.727
axed4 4,301.0827.772
ir6 6,451.6126.567
thaim9 9,677.4225.626
the'2 2,150.5424.279
lake3 3,225.8123.592
hame8 8,602.1523.435
thon's3 3,225.8123.124
ye19 20,430.1122.877
loanen2 2,150.5422.555
wile3 3,225.8122.287
gye3 3,225.8121.219
smit2 2,150.5420.465
aisy2 2,150.5420.465
thon7 7,526.8820.216
bide5 5,376.3420.081
oan9 9,677.4219.330
jock4 4,301.0818.369
whut3 3,225.8117.852
cannae5 5,376.3417.389
smittal5 5,376.34nan
gether2 2,150.5417.681
hauns4 4,301.0817.372
bae3 3,225.8117.246
pepper2 2,150.5416.954
hi2 2,150.5416.634
guid7 7,526.8815.743
crack3 3,225.8115.552
frae9 9,677.4214.897
reddin2 2,150.5414.199
cantie2 2,150.5414.199
forbye3 3,225.8113.830
thrie2 2,150.5413.733
wee9 9,677.4213.514
boady2 2,150.5413.062
reek2 2,150.5412.598
luk2 2,150.5411.899
doag2 2,150.5411.899
why3 3,225.8111.790
risin2 2,150.5411.721
maks3 3,225.8111.460
gies3 3,225.8111.260
bes2 2,150.5411.081
freens2 2,150.5411.081
need4 4,301.0810.993
fur9 9,677.4210.587
dae6 6,451.6110.487
ach2 2,150.5410.346
their7 7,526.8810.072
they11 11,827.969.860
yer7 7,526.889.472
naw3 3,225.818.842
in5 5,376.348.429
til4 4,301.088.250
watter3 3,225.818.173
yin4 4,301.087.989
hale3 3,225.817.695
dinnae3 3,225.817.613
gien3 3,225.817.506
buik2 2,150.546.898
hae7 7,526.886.776
siz3 3,225.81nan
ava2 2,150.546.737
it4 4,301.085.971
yersel2 2,150.545.922
haein2 2,150.545.342
hoo2 2,150.544.656
gie3 3,225.814.177
comin2 2,150.544.087
it's3 3,225.813.970
that4 4,301.083.737
hoose3 3,225.813.584
at11 11,827.963.435
best2 2,150.543.362
fowk4 4,301.083.284
ah2 2,150.543.061
keep2 2,150.542.994
things2 2,150.542.981
then3 3,225.812.732
wur2 2,150.542.606
same2 2,150.542.520
o13 13,978.492.240
awa3 3,225.812.206
aa4 4,301.082.200
aye3 3,225.811.574
as3 3,225.811.525
tak2 2,150.541.429
wiz2 2,150.541.390
up6 6,451.611.386
sae3 3,225.811.267
noo3 3,225.811.160
haes2 2,150.541.025
heid2 2,150.540.971
oot6 6,451.610.960
bit4 4,301.080.909
lang2 2,150.540.906
tae16 17,204.300.831
she3 3,225.810.790
be3 3,225.810.698
wi5 5,376.340.595
s9 9,677.420.511
gulders2 2,150.54nan
an27 29,032.260.485
richt2 2,150.540.461
ower3 3,225.810.365
his4 4,301.080.349
said2 2,150.540.347
a30 32,258.060.279
jist2 2,150.540.262
aboot2 2,150.540.231
if2 2,150.540.211
totey2 2,150.54nan
by2 2,150.540.198
see2 2,150.540.159
him2 2,150.540.095
time2 2,150.540.063
da2 2,150.540.041
is6 6,451.610.012
caa'ed3 3,225.81nan
cud2 2,150.543.400