A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

White, George

Basic Stats

Total words by this author in corpus - 974
Total unique words used by this author in corpus - 439
Ratio of total words to unique words - 2.219
Tagged as LAL (General Central) dialect.
Top ten most common words - a, an, the, wis, tae, he, it, o, s, wi,

List of texts in corpus

A visited him this efternuin
Facebook (2024-07-27) in Central dialect (LAL), categorised as poetry (172 words)

Uncle Wull
Facebook (2024-06-30) in Central dialect (LAL), categorised as poetry (206 words)

When you're an auld man
Facebook (2024-07-28) in Central dialect (LAL), categorised as poetry (174 words)

But the heart is gallus
Facebook (2024-08-04) in Central dialect (LAL), categorised as prose (123 words)

And sae it gaes oan
Facebook (2024-08-05) in Central dialect (LAL), categorised as poetry (88 words)

Juist a wee while back
Facebook (2024-08-11) in Central dialect (LAL), categorised as poetry (211 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
peenie4 4,106.7839.894
thay7 7,186.8630.206
wan8 8,213.5528.584
the24 24,640.6624.858
hiv6 6,160.1621.983
thae6 6,160.1620.813
bingo2 2,053.3918.973
ach3 3,080.0817.559
laucht2 2,053.3917.544
nou5 5,133.4716.857
a52 53,388.0915.979
aw10 10,266.9414.103
nivir2 2,053.3914.063
nouadays2 2,053.3913.901
juist6 6,160.1613.641
ayeweys2 2,053.3912.926
gallus2 2,053.3912.687
life5 5,133.4712.681
lost3 3,080.0812.648
wans2 2,053.3912.355
eatin2 2,053.3911.764
freends2 2,053.3911.764
heart2 2,053.3911.674
coffee2 2,053.3911.174
hearin2 2,053.3911.021
turned3 3,080.0810.561
loss2 2,053.3910.399
daen2 2,053.3910.273
weel6 6,160.1610.198
something3 3,080.0810.123
things4 4,106.7810.047
awa6 6,160.169.928
faur3 3,080.089.386
thair5 5,133.478.856
leukin2 2,053.398.235
aye6 6,160.168.137
ma11 11,293.638.093
young3 3,080.087.958
so5 5,133.477.902
micht4 4,106.787.189
ve4 4,106.787.147
but10 10,266.947.050
folk3 3,080.087.043
wis19 19,507.196.877
he17 17,453.806.608
too2 2,053.396.249
auld5 5,133.476.204
oan5 5,133.476.050
less2 2,053.395.626
kirk2 2,053.395.294
man4 4,106.785.233
him7 7,186.865.140
siller2 2,053.395.133
thare2 2,053.395.062
wee6 6,160.165.049
when4 4,106.785.038
sae5 5,133.474.930
whit6 6,160.164.893
fine3 3,080.084.891
nae7 7,186.864.531
anither3 3,080.084.469
naw2 2,053.394.310
his11 11,293.634.124
niver2 2,053.393.848
did3 3,080.083.775
s13 13,347.023.770
cannae2 2,053.393.660
some4 4,106.783.419
say3 3,080.083.294
for10 10,266.943.037
in9 9,240.253.008
while2 2,053.392.978
be9 9,240.252.769
wi12 12,320.332.575
mony2 2,053.392.564
days2 2,053.392.558
place2 2,053.392.387
this7 7,186.862.283
as10 10,266.942.274
whan3 3,080.082.268
thing2 2,053.392.252
wir2 2,053.392.242
ither3 3,080.082.203
o14 14,373.722.135
been4 4,106.782.073
afore3 3,080.082.047
are3 3,080.081.882
an32 32,854.211.806
richt3 3,080.081.783
bein2 2,053.391.754
it16 16,427.101.640
back4 4,106.781.593
mind2 2,053.391.569
still2 2,053.391.533
cam2 2,053.391.403
mak2 2,053.391.373
is9 9,240.251.302
ower4 4,106.781.234
come2 2,053.391.223
thocht2 2,053.391.165
ye3 3,080.081.137
hoose2 2,053.391.132
pit2 2,053.391.105
and9 9,240.250.929
then2 2,053.390.730
wad2 2,053.390.655
m2 2,053.390.480
tae18 18,480.490.440
ken2 2,053.390.402
aff2 2,053.390.344
oot3 3,080.080.320
at5 5,133.470.271
we5 5,133.470.270
that11 11,293.630.191
fae2 2,053.390.178
no2 2,053.390.117
see2 2,053.390.116
bit2 2,053.390.107
like3 3,080.080.083
up4 4,106.780.016
or3 3,080.080.015
doon2 2,053.390.005
me3 3,080.080.003