A Corpus of 21st Century Scots Texts

The Westender

Basic Stats

Total words by this author in corpus - 2,026
Total unique words used by this author in corpus - 656
Ratio of total words to unique words - 3.088
Tagged as SEA (South East (Borders)) dialect.
Top ten most common words - the, a, tae, and, o', was, in, that, they, for
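
The figures above can be recomputed from a token list. The following is a minimal sketch, not the corpus site's own code: the function name `basic_stats` is hypothetical, and it assumes the text has already been split into lowercase tokens (the site's tokenisation rules are not stated on this page).

```python
from collections import Counter

def basic_stats(tokens, top_n=10):
    """Summarise a token list (hypothetical helper, not the site's code)."""
    counts = Counter(tokens)
    total = len(tokens)      # total words
    unique = len(counts)     # unique words
    return {
        "total": total,
        "unique": unique,
        # ratio of total words to unique words, rounded as on the page
        "ratio": round(total / unique, 3) if unique else 0.0,
        # most frequent words first; ties keep first-seen order
        "top": [word for word, _ in counts.most_common(top_n)],
    }

# Check the ratio reported for this author: 2,026 total / 656 unique.
print(round(2026 / 656, 3))  # 3.088
```

Passing the author's four texts as one combined token list would give the totals for the whole author entry, provided the tokenisation matches the site's.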

List of texts in corpus

Pipin for the Young Yins
The Hawick Paper (11-01-2008) in Southern dialect (SEA), categorised as newspaper (368 words)

Mairchers did oor toon prood
The Hawick Paper (15-11-2007) in Southern dialect (SEA), categorised as newspaper (513 words)

Loupin Salmon
The Hawick Paper (13-12-2007) in Southern dialect (SEA), categorised as newspaper (571 words)

Ins and oots
The Hawick Paper (06-12-2007) in Southern dialect (SEA), categorised as newspaper (574 words)

Author word Keyness frequencies

This table should list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) words related to the specific subject matter, and (c) words specific to the regional dialect.
Word         Count  Normalised   Keyness
                    per million
o'           41     20,236.92    277.595
was          37     18,262.59    148.128
eet          13     6,416.58     109.786
es           13     6,416.58     104.619
se           10     4,935.83     94.617
hev          11     5,429.42     93.162
a'           14     6,910.17     91.384
folk         20     9,871.67     91.266
salmon       8      3,948.67     80.804
band         10     4,935.83     71.749
o            2      987.17       68.275
and          48     23,692.00    53.774
yins         9      4,442.25     50.821
sei          8      3,948.67     50.152
an           11     5,429.42     48.598
fish         9      4,442.25     47.297
oo           9      4,442.25     46.354
ee           11     5,429.42     45.347
slope        5      2,467.92     41.881
it's         15     7,403.75     40.920
share        7      3,455.08     40.768
a'm          9      4,442.25     40.633
pipe         6      2,961.50     38.399
curry        4      1,974.33     37.597
mei          6      2,961.50     37.204
hawick       6      2,961.50     36.120
there's      7      3,455.08     33.369
whae         8      3,948.67     31.852
hed          10     4,935.83     31.184
gauns        4      1,974.33     30.928
hei          7      3,455.08     30.765
italians     3      1,480.75     30.733
remembrance  3      1,480.75     30.733
thum         6      2,961.50     29.538
thone        3      1,480.75     27.911
police       4      1,974.33     27.380
hall         5      2,467.92     26.952
wasnae       4      1,974.33     26.206
they've      4      1,974.33     24.076
they         25     12,339.59    23.825
other        6      2,961.50     23.712
were         13     6,416.58     23.476
practice     4      1,974.33     22.750
juist        11     5,429.42     22.526
slitrig      2      987.17       21.154
settlers     2      987.17       21.154
thame        3      1,480.75     19.722
oov          8      3,948.67     nan
loupin'      2      987.17       21.154
what         8      3,948.67     19.637
shops        4      1,974.33     19.535
hednae       2      987.17       19.432
nowadays     2      987.17       19.432
soom         2      987.17       19.432
poppy        2      987.17       19.432
pipers       2      987.17       19.432
watchin'     2      987.17       19.432
conflict     2      987.17       19.432
there        18     8,884.50     19.263
foreign      3      1,480.75     19.195
bit          18     8,884.50     19.176
guid         11     5,429.42     18.912
mony         8      3,948.67     18.796
hink         4      1,974.33     18.575
how          7      3,455.08     18.297
raisins      2      987.17       18.251
bein'        2      987.17       18.251
high         5      2,467.92     18.070
still        9      4,442.25     17.486
sunday       5      2,467.92     17.469
owt          2      987.17       17.347
suppose      4      1,974.33     17.293
wi'          4      1,974.33     16.349
a            92     45,409.67    16.114
con          2      987.17       15.996
doobt        2      987.17       15.996
ca'd         2      987.17       15.996
restaurant   2      987.17       15.996
borders      3      1,480.75     15.850
bottom       3      1,480.75     15.575
popular      3      1,480.75     15.443
contribute   2      987.17       14.993
way          5      2,467.92     14.699
gei          2      987.17       14.574
indian       2      987.17       14.574
chip         2      987.17       14.574
schule       3      1,480.75     14.265
pin          2      987.17       14.195
witter       4      1,974.33     nan
lot          5      2,467.92     14.219
dander       2      987.17       14.195
yaised       4      1,974.33     14.037
vexed        2      987.17       13.850
hes          4      1,974.33     13.524
fair         6      2,961.50     12.578
his          2      987.17       12.258
local        4      1,974.33     12.182
grand        3      1,480.75     11.886
wars         2      987.17       11.835
wi           4      1,974.33     11.789
away         4      1,974.33     11.600
memorial     2      987.17       11.287
somebody     3      1,480.75     11.000
names        3      1,480.75     10.500
for          24     11,846.00    10.457
their        11     5,429.42     10.442
few          4      1,974.33     10.085
life         6      2,961.50     9.632
settled      2      987.17       9.545
huge         2      987.17       9.336
credit       2      987.17       9.336
cream        2      987.17       9.236
pupils       2      987.17       9.139
remember     2      987.17       9.044
world        3      1,480.75     8.995
awfi         4      1,974.33     nan
course       3      1,480.75     9.115
involved     2      987.17       8.951
along        2      987.17       8.861
suin         3      1,480.75     8.840
filled       2      987.17       8.773
they'll      2      987.17       8.603
community    3      1,480.75     8.543
cos          3      1,480.75     8.508
now          3      1,480.75     8.298
ontae        3      1,480.75     8.264
abody        2      987.17       8.210
is           4      1,974.33     7.935
an'          4      1,974.33     7.916
support      3      1,480.75     7.810
able         3      1,480.75     7.780
fact         3      1,480.75     7.749
duin         3      1,480.75     7.658
seemed       3      1,480.75     7.570
young        4      1,974.33     7.561
rose         2      987.17       7.532
settlin'     3      1,480.75     nan
div          3      1,480.75     7.688
lucky        2      987.17       7.532
club         2      987.17       7.471
ana          2      987.17       7.411
toon         3      1,480.75     7.397
it           12     5,923.00     7.242
eventually   2      987.17       7.237
twae         2      987.17       7.125
hear         4      1,974.33     7.051
every        3      1,480.75     7.017
when         7      3,455.08     6.982
year         6      2,961.50     6.781
cauld        4      1,974.33     6.747
stanes       2      987.17       6.711
find         3      1,480.75     6.618
watched      2      987.17       6.522
gaun         5      2,467.92     6.490
years        4      1,974.33     6.475
enough       3      1,480.75     6.409
river        2      987.17       6.342
where        3      1,480.75     6.297
yince        2      987.17       6.172
ana'         3      1,480.75     nan
service      2      987.17       6.299
this         2      987.17       6.077
buy          2      987.17       5.970
red          2      987.17       5.855
got          6      2,961.50     5.798
ice          2      987.17       5.566
different    3      1,480.75     5.349
that         31     15,301.09    5.271
mind         5      2,467.92     5.151
aulder       2      987.17       5.115
under        2      987.17       5.055
war          5      2,467.92     4.969
laddie       2      987.17       4.856
masel        3      1,480.75     4.759
park         2      987.17       4.695
did          5      2,467.92     4.682
tae          58     28,627.84    4.497
make         2      987.17       4.468
some         7      3,455.08     4.407
stood        2      987.17       4.396
paper        2      987.17       4.103
important    2      987.17       4.082
late         2      987.17       3.936
nane         2      987.17       3.916
if           7      3,455.08     3.785
thumsels     2      987.17       nan
heard        3      1,480.75     3.757
right        3      1,480.75     3.745
the          138    68,114.51    3.365
no           10     4,935.83     3.346
seen         4      1,974.33     3.325
nae          2      987.17       3.324
day          8      3,948.67     3.291
aboot        11     5,429.42     3.278
which        3      1,480.75     3.267
yin          4      1,974.33     3.243
chinese      2      987.17       3.182
less         2      987.17       3.137
story        2      987.17       3.137
ma           4      1,974.33     2.979
oan          6      2,961.50     2.870
likeet       2      987.17       nan
him          2      987.17       2.830
want         3      1,480.75     2.753
new          4      1,974.33     2.747
wunder       2      987.17       nan
went         3      1,480.75     2.592
fund         2      987.17       2.578
or           3      1,480.75     2.537
oor          6      2,961.50     2.520
sic          3      1,480.75     2.471
hard         2      987.17       2.363
left         3      1,480.75     2.354
could        4      1,974.33     2.354
same         3      1,480.75     2.216
thing        3      1,480.75     2.189
afore        5      2,467.92     2.149
bide         2      987.17       2.148
took         2      987.17       1.963
wull         2      987.17       1.860
much         2      987.17       1.851
git          2      987.17       1.835
though       2      987.17       1.746
came         2      987.17       1.730
oot          5      2,467.92     1.713
end          2      987.17       1.477
whae've      2      987.17       nan
wee          2      987.17       1.455
feel         2      987.17       1.450
said         5      2,467.92     1.447
of           5      2,467.92     1.437
time         6      2,961.50     1.335
be           14     6,910.17     1.269
pairt        2      987.17       1.212
up           11     5,429.42     1.177
even         3      1,480.75     1.166
here         4      1,974.33     1.132
side         2      987.17       1.130
wad          4      1,974.33     1.126
didnae       2      987.17       1.083
need         2      987.17       1.014
on           16     7,897.33     0.903
yer          2      987.17       0.824
back         6      2,961.50     0.717
muckle       3      1,480.75     0.704
intae        2      987.17       0.694
like         7      3,455.08     0.499
been         5      2,467.92     0.403
in           35     17,275.42    0.362
telt         2      987.17       0.354
then         3      1,480.75     0.345
made         2      987.17       0.199
mair         5      2,467.92     0.153
maist        2      987.17       0.124
are          2      987.17       0.118
hame         2      987.17       0.077
dae          2      987.17       0.069
weel         2      987.17       0.051
ower         5      2,467.92     0.044
language     2      987.17       0.040
doon         4      1,974.33     0.032
efter        2      987.17       0.028
standin'     2      987.17       nan
so           2      987.17       0.012
ain          2      987.17       0.007
frae         4      1,974.33     0.003
aye          3      1,480.75     0.000
at           13     6,416.58     0.000
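
The normalised column is consistent with each count divided by the author's 2,026 total words and scaled to one million. The page does not say how the keyness score is computed. The sketch below assumes a log-likelihood (G2) comparison against the rest of the corpus, which is one common choice for keyness; the function names are hypothetical and the result is not guaranteed to reproduce the keyness figures in the table.

```python
import math

def per_million(count, total_words):
    """Frequency normalised to occurrences per million words."""
    return count / total_words * 1_000_000

def log_likelihood(a, b, c, d):
    """Assumed keyness measure (G2), not confirmed by the source page.

    a: word count in the author's texts
    b: word count in the reference texts
    c: total words in the author's texts
    d: total words in the reference texts
    """
    # Expected counts if the word were used at the same rate in both.
    expected_author = c * (a + b) / (c + d)
    expected_reference = d * (a + b) / (c + d)
    score = 0.0
    if a > 0:
        score += a * math.log(a / expected_author)
    if b > 0:
        score += b * math.log(b / expected_reference)
    return 2 * score

# Normalised values for counts of 41 and 2 out of 2,026 words.
print(f"{per_million(41, 2026):,.2f}")  # 20,236.92
print(f"{per_million(2, 2026):,.2f}")   # 987.17
```

Computing a keyness score for a given word would also need that word's count in the rest of the corpus and the corpus's total size, neither of which appears on this page.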