A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

The Westender

Basic Stats

Total words by this author in corpus - 2,026
Total unique words used by this author in corpus - 656
Ratio of total words to unique words - 3.088
Tagged as SEA (South East (Borders)) dialect.
Top ten most common words - the, a, tae, and, o', was, in, that, they, for,

List of texts in corpus

Pipin for the Young Yins
The Hawick Paper (11-01-2008) in Southern dialect (SEA), categorised as newspaper (368 words)

Mairchers did oor toon prood
The Hawick Paper (15-11-2007) in Southern dialect (SEA), categorised as newspaper (513 words)

Loupin Salmon
The Hawick Paper (13-12-2007) in Southern dialect (SEA), categorised as newspaper (571 words)

Ins and oots
The Hawick Paper (06-12-2007) in Southern dialect (SEA), categorised as newspaper (574 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
o'41 20,236.92279.909
was37 18,262.59151.060
eet13 6,416.58109.841
es13 6,416.58105.265
se10 4,935.8394.659
hev11 5,429.4293.208
a'14 6,910.1792.538
folk20 9,871.6791.062
salmon8 3,948.6780.837
band10 4,935.8371.791
o2 987.1767.956
and48 23,692.0053.348
yins9 4,442.2550.859
sei8 3,948.6750.185
an11 5,429.4247.711
fish9 4,442.2547.196
oo9 4,442.2546.522
ee11 5,429.4245.542
slope5 2,467.9242.620
it's15 7,403.7541.010
a'm9 4,442.2540.669
share7 3,455.0840.249
pipe6 2,961.5038.424
curry4 1,974.3337.614
mei6 2,961.5037.229
hawick6 2,961.5036.145
there's7 3,455.0833.398
whae8 3,948.6731.884
hed10 4,935.8331.223
gauns4 1,974.3330.945
hei7 3,455.0830.794
italians3 1,480.7530.746
remembrance3 1,480.7530.746
thum6 2,961.5029.563
thone3 1,480.7527.923
police4 1,974.3327.397
hall5 2,467.9226.537
wasnae4 1,974.3325.955
they've4 1,974.3324.093
they25 12,339.5923.898
other6 2,961.5023.532
were13 6,416.5823.521
practice4 1,974.3322.942
oov8 3,948.67nan
juist11 5,429.4222.889
slitrig2 987.1721.163
settlers2 987.1721.163
loupin'2 987.1721.163
what8 3,948.6719.751
thame3 1,480.7519.734
poppy2 987.1719.441
nowadays2 987.1719.441
pipers2 987.1719.441
hednae2 987.1719.441
watchin'2 987.1719.441
soom2 987.1719.441
conflict2 987.1719.441
shops4 1,974.3319.438
bit18 8,884.5019.240
foreign3 1,480.7519.208
there18 8,884.5019.125
mony8 3,948.6719.091
guid11 5,429.4218.749
raisins2 987.1718.260
bein'2 987.1718.260
how7 3,455.0818.231
high5 2,467.9218.203
still9 4,442.2517.557
owt2 987.1717.355
sunday5 2,467.9217.332
suppose4 1,974.3317.226
hink4 1,974.3316.514
wi'4 1,974.3316.439
con2 987.1716.004
ca'd2 987.1716.004
restaurant2 987.1716.004
doobt2 987.1716.004
a92 45,409.6715.942
borders3 1,480.7515.723
bottom3 1,480.7515.588
popular3 1,480.7515.455
contribute2 987.1715.001
indian2 987.1714.582
chip2 987.1714.582
gei2 987.1714.582
witter4 1,974.33nan
way5 2,467.9214.383
schule3 1,480.7514.277
yaised4 1,974.3314.269
dander2 987.1714.203
pin2 987.1714.203
lot5 2,467.9213.991
vexed2 987.1713.858
hes4 1,974.3313.540
fair6 2,961.5012.578
his2 987.1712.120
local4 1,974.3312.117
grand3 1,480.7511.967
wars2 987.1711.652
wi4 1,974.3311.621
memorial2 987.1711.295
away4 1,974.3311.254
somebody3 1,480.7511.012
for24 11,846.0010.726
their11 5,429.4210.534
names3 1,480.7510.407
few4 1,974.339.869
settled2 987.179.553
huge2 987.179.344
awfi4 1,974.33nan
life6 2,961.509.606
credit2 987.179.244
course3 1,480.759.208
cream2 987.179.147
pupils2 987.179.147
along2 987.178.869
involved2 987.178.869
suin3 1,480.758.852
world3 1,480.758.814
remember2 987.178.781
filled2 987.178.695
they'll2 987.178.611
cos3 1,480.758.483
community3 1,480.758.483
now3 1,480.758.275
ontae3 1,480.758.275
abody2 987.178.218
an'4 1,974.338.033
is4 1,974.337.915
support3 1,480.757.853
fact3 1,480.757.791
div3 1,480.757.760
ana2 987.177.730
seemed3 1,480.757.700
settlin'3 1,480.75nan
able3 1,480.757.670
duin3 1,480.757.610
rose2 987.177.603
young4 1,974.337.575
lucky2 987.177.479
it12 5,923.007.430
club2 987.177.360
toon3 1,480.757.269
hear4 1,974.337.224
twae2 987.177.133
eventually2 987.177.078
every3 1,480.756.824
year6 2,961.506.778
stanes2 987.176.719
cauld4 1,974.336.696
find3 1,480.756.606
when7 3,455.086.592
watched2 987.176.576
years4 1,974.336.473
river2 987.176.350
service2 987.176.306
where3 1,480.756.286
yince2 987.176.179
ana'3 1,480.75nan
gaun5 2,467.926.388
this2 987.176.085
enough3 1,480.756.050
buy2 987.175.862
red2 987.175.862
got6 2,961.505.747
ice2 987.175.471
different3 1,480.755.359
that31 15,301.095.256
aulder2 987.175.245
mind5 2,467.925.129
war5 2,467.925.020
laddie2 987.174.863
under2 987.174.809
park2 987.174.650
did5 2,467.924.630
tae58 28,627.844.604
masel3 1,480.754.534
stood2 987.174.451
some7 3,455.084.413
make2 987.174.287
paper2 987.174.110
important2 987.174.068
late2 987.173.984
heard3 1,480.753.732
right3 1,480.753.732
if7 3,455.083.636
the138 68,114.513.440
ma4 1,974.333.424
seen4 1,974.333.351
thumsels2 987.17nan
nane2 987.173.844
aboot11 5,429.423.344
nae2 987.173.298
which3 1,480.753.296
yin4 1,974.333.262
no10 4,935.833.248
day8 3,948.673.219
chinese2 987.173.205
story2 987.173.129
him2 987.172.801
new4 1,974.332.733
want3 1,480.752.720
fund2 987.172.656
oan6 2,961.502.630
went3 1,480.752.570
wunder2 987.17nan
likeet2 987.17nan
less2 987.173.174
oor6 2,961.502.554
sic3 1,480.752.494
or3 1,480.752.438
could4 1,974.332.353
left3 1,480.752.348
hard2 987.172.327
afore5 2,467.922.251
same3 1,480.752.224
thing3 1,480.752.211
bide2 987.172.154
took2 987.172.023
wull2 987.171.882
much2 987.171.849
git2 987.171.849
came2 987.171.775
oot5 2,467.921.726
though2 987.171.713
end2 987.171.523
whae've2 987.17nan
wee2 987.171.481
said5 2,467.921.425
feel2 987.171.372
of5 2,467.921.361
time6 2,961.501.302
be14 6,910.171.275
up11 5,429.421.205
pairt2 987.171.200
even3 1,480.751.153
here4 1,974.331.143
side2 987.171.141
wad4 1,974.331.124
didnae2 987.170.994
need2 987.170.985
on16 7,897.330.950
yer2 987.170.876
intae2 987.170.739
muckle3 1,480.750.728
back6 2,961.500.713
like7 3,455.080.463
been5 2,467.920.391
in35 17,275.420.371
telt2 987.170.357
then3 1,480.750.333
made2 987.170.192
mair5 2,467.920.170
are2 987.170.145
maist2 987.170.129
dae2 987.170.078
hame2 987.170.075
ower5 2,467.920.051
language2 987.170.041
doon4 1,974.330.034
efter2 987.170.033
so2 987.170.012
ain2 987.170.007
aye3 1,480.750.006
frae4 1,974.330.006
at13 6,416.580.000
standin'2 987.17nan
weel2 987.170.050