A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

The Westender

Basic Stats

Total words by this author in corpus - 2,026
Total unique words used by this author in corpus - 656
Ratio of total words to unique words - 3.088
Tagged as SEA (South East (Borders)) dialect.
Top ten most common words - the, a, tae, and, o', was, in, that, they, for,

List of texts in corpus

Pipin for the Young Yins
The Hawick Paper (11-01-2008) in Southern dialect (SEA), categorised as newspaper (368 words)

Mairchers did oor toon prood
The Hawick Paper (15-11-2007) in Southern dialect (SEA), categorised as newspaper (513 words)

Loupin Salmon
The Hawick Paper (13-12-2007) in Southern dialect (SEA), categorised as newspaper (571 words)

Ins and oots
The Hawick Paper (06-12-2007) in Southern dialect (SEA), categorised as newspaper (574 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
o'41 20,236.92279.778
was37 18,262.59150.946
eet13 6,416.58109.799
es13 6,416.58105.223
se10 4,935.8394.627
hev11 5,429.4293.173
a'14 6,910.1792.493
folk20 9,871.6790.999
salmon8 3,948.6780.811
band10 4,935.8371.759
o2 987.1768.088
and48 23,692.0053.245
yins9 4,442.2550.830
sei8 3,948.6750.160
an11 5,429.4247.780
fish9 4,442.2547.167
oo9 4,442.2546.494
ee11 5,429.4245.508
slope5 2,467.9242.604
it's15 7,403.7540.966
a'm9 4,442.2540.641
share7 3,455.0840.226
pipe6 2,961.5038.405
curry4 1,974.3337.601
mei6 2,961.5037.209
hawick6 2,961.5036.126
there's7 3,455.0833.376
whae8 3,948.6731.860
hed10 4,935.8331.193
gauns4 1,974.3330.932
hei7 3,455.0830.772
italians3 1,480.7530.736
remembrance3 1,480.7530.736
thum6 2,961.5029.544
thone3 1,480.7527.914
police4 1,974.3327.384
hall5 2,467.9226.664
wasnae4 1,974.3325.942
they've4 1,974.3324.080
they25 12,339.5924.014
other6 2,961.5023.514
were13 6,416.5823.505
practice4 1,974.3322.930
oov8 3,948.67nan
juist11 5,429.4222.859
settlers2 987.1721.156
slitrig2 987.1721.156
loupin'2 987.1721.156
what8 3,948.6719.728
thame3 1,480.7519.725
watchin'2 987.1719.434
conflict2 987.1719.434
soom2 987.1719.434
pipers2 987.1719.434
hednae2 987.1719.434
poppy2 987.1719.434
nowadays2 987.1719.434
shops4 1,974.3319.426
bit18 8,884.5019.208
foreign3 1,480.7519.198
there18 8,884.5019.141
mony8 3,948.6719.068
guid11 5,429.4218.720
how7 3,455.0818.366
high5 2,467.9218.304
raisins2 987.1718.253
bein'2 987.1718.253
still9 4,442.2517.533
owt2 987.1717.349
sunday5 2,467.9217.317
suppose4 1,974.3317.213
hink4 1,974.3316.809
wi'4 1,974.3316.427
con2 987.1715.997
doobt2 987.1715.997
restaurant2 987.1715.997
ca'd2 987.1715.997
a92 45,409.6715.936
borders3 1,480.7515.714
bottom3 1,480.7515.578
popular3 1,480.7515.446
contribute2 987.1714.995
indian2 987.1714.575
gei2 987.1714.575
chip2 987.1714.575
way5 2,467.9214.368
schule3 1,480.7514.268
pin2 987.1714.197
dander2 987.1714.197
lot5 2,467.9213.976
vexed2 987.1713.852
hes4 1,974.3313.528
fair6 2,961.5012.561
local4 1,974.3312.105
his2 987.1712.099
grand3 1,480.7511.958
wars2 987.1711.645
wi4 1,974.3311.631
memorial2 987.1711.289
away4 1,974.3311.242
somebody3 1,480.7511.003
for24 11,846.0010.685
their11 5,429.4210.535
names3 1,480.7510.450
few4 1,974.339.858
life6 2,961.509.590
settled2 987.179.547
huge2 987.179.441
awfi4 1,974.33nan
witter4 1,974.33nan
yaised4 1,974.3314.257
cream2 987.179.238
credit2 987.179.238
course3 1,480.759.199
pupils2 987.179.141
along2 987.178.863
involved2 987.178.863
world3 1,480.758.843
suin3 1,480.758.843
remember2 987.178.775
filled2 987.178.689
they'll2 987.178.605
cos3 1,480.758.475
community3 1,480.758.475
ontae3 1,480.758.266
now3 1,480.758.266
abody2 987.178.212
an'4 1,974.338.022
is4 1,974.337.932
support3 1,480.757.844
fact3 1,480.757.782
div3 1,480.757.752
ana2 987.177.724
seemed3 1,480.757.691
able3 1,480.757.661
rose2 987.177.596
settlin'3 1,480.75nan
duin3 1,480.757.602
young4 1,974.337.565
lucky2 987.177.473
it12 5,923.007.413
club2 987.177.354
toon3 1,480.757.315
hear4 1,974.337.213
twae2 987.177.127
eventually2 987.177.072
every3 1,480.756.841
year6 2,961.506.764
stanes2 987.176.713
cauld4 1,974.336.685
watched2 987.176.617
find3 1,480.756.597
when7 3,455.086.593
years4 1,974.336.463
gaun5 2,467.926.375
where3 1,480.756.344
ana'3 1,480.75nan
river2 987.176.344
service2 987.176.300
yince2 987.176.173
enough3 1,480.756.126
this2 987.176.092
red2 987.175.856
buy2 987.175.856
got6 2,961.505.733
ice2 987.175.465
different3 1,480.755.351
that31 15,301.095.271
aulder2 987.175.239
mind5 2,467.925.127
war5 2,467.925.008
laddie2 987.174.858
under2 987.174.803
park2 987.174.644
did5 2,467.924.635
tae58 28,627.844.599
masel3 1,480.754.526
stood2 987.174.445
some7 3,455.084.411
make2 987.174.350
paper2 987.174.236
important2 987.174.062
late2 987.173.979
nane2 987.173.839
right3 1,480.753.782
heard3 1,480.753.724
if7 3,455.083.633
thumsels2 987.17nan
the138 68,114.513.450
ma4 1,974.333.429
seen4 1,974.333.357
aboot11 5,429.423.344
nae2 987.173.298
no10 4,935.833.267
yin4 1,974.333.260
day8 3,948.673.214
chinese2 987.173.199
less2 987.173.169
story2 987.173.154
him2 987.172.798
want3 1,480.752.730
new4 1,974.332.725
oan6 2,961.502.702
fund2 987.172.651
went3 1,480.752.586
wunder2 987.17nan
oor6 2,961.502.548
sic3 1,480.752.487
or3 1,480.752.448
could4 1,974.332.367
left3 1,480.752.363
hard2 987.172.322
afore5 2,467.922.247
same3 1,480.752.218
thing3 1,480.752.204
bide2 987.172.150
took2 987.172.027
wull2 987.171.878
git2 987.171.861
much2 987.171.853
came2 987.171.771
oot5 2,467.921.721
though2 987.171.708
end2 987.171.519
wee2 987.171.478
said5 2,467.921.471
feel2 987.171.374
of5 2,467.921.354
be14 6,910.171.278
up11 5,429.421.211
pairt2 987.171.197
whae've2 987.17nan
time6 2,961.501.294
likeet2 987.17nan
which3 1,480.753.289
even3 1,480.751.148
here4 1,974.331.140
side2 987.171.137
wad4 1,974.331.118
didnae2 987.170.996
need2 987.170.981
on16 7,897.330.939
yer2 987.170.875
intae2 987.170.735
muckle3 1,480.750.724
back6 2,961.500.721
like7 3,455.080.461
been5 2,467.920.391
in35 17,275.420.366
telt2 987.170.357
then3 1,480.750.337
made2 987.170.190
mair5 2,467.920.168
are2 987.170.147
maist2 987.170.128
dae2 987.170.075
hame2 987.170.075
ower5 2,467.920.049
language2 987.170.040
doon4 1,974.330.034
efter2 987.170.033
so2 987.170.012
ain2 987.170.007
aye3 1,480.750.007
frae4 1,974.330.005
at13 6,416.580.000
standin'2 987.17nan
weel2 987.170.050