A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

The Westender

Basic Stats

Total words by this author in corpus - 2,026
Total unique words used by this author in corpus - 656
Ratio of total words to unique words - 3.088
Tagged as SEA (South East (Borders)) dialect.
Top ten most common words - the, a, tae, and, o', was, in, that, they, for,

List of texts in corpus

Pipin for the Young Yins
The Hawick Paper (11-01-2008) in Southern dialect (SEA), categorised as newspaper (368 words)

Mairchers did oor toon prood
The Hawick Paper (15-11-2007) in Southern dialect (SEA), categorised as newspaper (513 words)

Loupin Salmon
The Hawick Paper (13-12-2007) in Southern dialect (SEA), categorised as newspaper (571 words)

Ins and oots
The Hawick Paper (06-12-2007) in Southern dialect (SEA), categorised as newspaper (574 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
o'41 20,236.92278.761
was37 18,262.59146.936
eet13 6,416.58110.159
es13 6,416.58104.991
se10 4,935.8394.904
hev11 5,429.4293.478
a'14 6,910.1791.782
folk20 9,871.6790.877
salmon8 3,948.6781.034
band10 4,935.8371.663
o2 987.1768.464
and48 23,692.0054.315
sei8 3,948.6750.379
yins9 4,442.2549.608
an11 5,429.4249.014
fish9 4,442.2547.411
oo9 4,442.2546.606
ee11 5,429.4245.649
slope5 2,467.9242.024
it's15 7,403.7541.307
a'm9 4,442.2540.882
share7 3,455.0840.781
pipe6 2,961.5038.320
curry4 1,974.3337.712
mei6 2,961.5037.374
hawick6 2,961.5036.290
there's7 3,455.0833.247
whae8 3,948.6732.071
hed10 4,935.8331.448
gauns4 1,974.3331.042
hei7 3,455.0830.958
italians3 1,480.7530.820
thum6 2,961.5029.705
remembrance3 1,480.7529.236
thone3 1,480.7527.997
police4 1,974.3327.494
hall5 2,467.9227.093
wasnae4 1,974.3326.052
they've4 1,974.3324.190
other6 2,961.5023.807
they25 12,339.5923.727
were13 6,416.5823.376
practice4 1,974.3322.691
slitrig2 987.1721.212
loupin'2 987.1721.212
oov8 3,948.67nan
juist11 5,429.4222.667
settlers2 987.1721.212
thame3 1,480.7519.807
what8 3,948.6719.727
shops4 1,974.3319.647
pipers2 987.1719.490
nowadays2 987.1719.490
poppy2 987.1719.490
soom2 987.1719.490
watchin'2 987.1719.490
conflict2 987.1719.490
hednae2 987.1719.490
foreign3 1,480.7519.280
bit18 8,884.5019.250
there18 8,884.5019.231
mony8 3,948.6718.893
guid11 5,429.4218.689
hink4 1,974.3318.686
how7 3,455.0818.477
bein'2 987.1718.309
high5 2,467.9218.092
sunday5 2,467.9217.551
owt2 987.1717.404
raisins2 987.1717.404
still9 4,442.2517.389
suppose4 1,974.3317.237
wi'4 1,974.3316.458
ca'd2 987.1716.053
con2 987.1716.053
doobt2 987.1716.053
borders3 1,480.7515.934
a92 45,409.6715.778
bottom3 1,480.7515.660
restaurant2 987.1715.519
popular3 1,480.7515.397
contribute2 987.1715.050
indian2 987.1714.631
gei2 987.1714.631
way5 2,467.9214.494
dander2 987.1714.252
pin2 987.1714.252
witter4 1,974.33nan
chip2 987.1714.252
yaised4 1,974.3314.145
schule3 1,480.7514.140
lot5 2,467.9214.137
hes4 1,974.3313.631
vexed2 987.1713.590
fair6 2,961.5012.637
his2 987.1712.246
local4 1,974.3311.970
grand3 1,480.7511.968
wars2 987.1711.891
wi4 1,974.3311.726
away4 1,974.3311.704
somebody3 1,480.7511.081
for24 11,846.0010.706
their11 5,429.4210.507
memorial2 987.1710.307
names3 1,480.7510.272
few4 1,974.3310.070
life6 2,961.509.632
settled2 987.179.600
huge2 987.179.392
awfi4 1,974.33nan
credit2 987.179.392
cream2 987.179.292
pupils2 987.179.194
course3 1,480.759.194
remember2 987.179.099
involved2 987.179.007
world3 1,480.758.996
suin3 1,480.758.919
along2 987.178.916
filled2 987.178.742
they'll2 987.178.658
cos3 1,480.758.586
community3 1,480.758.480
ontae3 1,480.758.308
now3 1,480.758.274
abody2 987.178.264
an'4 1,974.338.013
is4 1,974.337.924
support3 1,480.757.856
able3 1,480.757.826
div3 1,480.757.765
fact3 1,480.757.765
duin3 1,480.757.705
settlin'3 1,480.75nan
seemed3 1,480.757.588
young4 1,974.337.544
club2 987.177.465
ana2 987.177.465
rose2 987.177.406
lucky2 987.177.406
it12 5,923.007.239
eventually2 987.177.234
twae2 987.177.179
toon3 1,480.757.145
every3 1,480.757.092
when7 3,455.087.069
hear4 1,974.336.974
year6 2,961.506.835
cauld4 1,974.336.824
find3 1,480.756.693
stanes2 987.176.668
gaun5 2,467.926.575
watched2 987.176.529
years4 1,974.336.444
enough3 1,480.756.438
ana'3 1,480.75nan
river2 987.176.395
where3 1,480.756.371
service2 987.176.309
this2 987.176.127
yince2 987.176.102
buy2 987.175.945
red2 987.175.832
got6 2,961.505.670
ice2 987.175.617
different3 1,480.755.332
that31 15,301.095.228
mind5 2,467.925.179
aulder2 987.175.166
under2 987.175.077
war5 2,467.924.954
laddie2 987.174.906
masel3 1,480.754.783
park2 987.174.719
did5 2,467.924.705
tae58 28,627.844.464
some7 3,455.084.452
stood2 987.174.398
make2 987.174.398
important2 987.174.109
paper2 987.174.067
late2 987.173.925
nane2 987.173.905
heard3 1,480.753.798
right3 1,480.753.798
if7 3,455.083.763
the138 68,114.513.733
no10 4,935.833.413
seen4 1,974.333.378
day8 3,948.673.328
which3 1,480.753.320
nae2 987.173.314
aboot11 5,429.423.278
chinese2 987.173.228
less2 987.173.124
story2 987.173.080
oan6 2,961.502.946
ma4 1,974.332.936
likeet2 987.17nan
yin4 1,974.332.922
new4 1,974.332.794
want3 1,480.752.771
fund2 987.172.621
wunder2 987.17nan
or3 1,480.752.578
went3 1,480.752.535
sic3 1,480.752.483
could4 1,974.332.410
oor6 2,961.502.394
left3 1,480.752.368
hard2 987.172.363
same3 1,480.752.244
bide2 987.172.180
afore5 2,467.922.149
thing3 1,480.752.057
took2 987.172.003
much2 987.171.833
git2 987.171.817
though2 987.171.769
came2 987.171.753
oot5 2,467.921.703
wull2 987.171.692
end2 987.171.500
wee2 987.171.483
of5 2,467.921.473
said5 2,467.921.461
time6 2,961.501.345
be14 6,910.171.287
whae've2 987.17nan
feel2 987.171.467
pairt2 987.171.224
even3 1,480.751.182
up11 5,429.421.169
wad4 1,974.331.168
here4 1,974.331.128
side2 987.171.116
didnae2 987.171.056
need2 987.171.036
on16 7,897.330.881
yer2 987.170.866
back6 2,961.500.757
muckle3 1,480.750.734
intae2 987.170.681
like7 3,455.080.522
been5 2,467.920.427
telt2 987.170.369
in35 17,275.420.357
then3 1,480.750.318
made2 987.170.197
mair5 2,467.920.147
maist2 987.170.123
are2 987.170.112
thumsels2 987.17nan
him2 987.172.810
hame2 987.170.072
dae2 987.170.066
weel2 987.170.066
ower5 2,467.920.050
language2 987.170.048
doon4 1,974.330.034
efter2 987.170.031
standin'2 987.17nan
so2 987.170.009
ain2 987.170.007
frae4 1,974.330.002
aye3 1,480.750.000
at13 6,416.580.000