A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

White, George

Basic Stats

Total words by this author in corpus - 974
Total unique words used by this author in corpus - 439
Ratio of total words to unique words - 2.219
Tagged as LAL (General Central) dialect.
Top ten most common words - a, an, the, wis, tae, he, it, o, s, wi,

List of texts in corpus

A visited him this efternuin
Facebook (2024-07-27) in Central dialect (LAL), categorised as poetry (172 words)

Uncle Wull
Facebook (2024-06-30) in Central dialect (LAL), categorised as poetry (206 words)

When you're an auld man
Facebook (2024-07-28) in Central dialect (LAL), categorised as poetry (174 words)

But the heart is gallus
Facebook (2024-08-04) in Central dialect (LAL), categorised as prose (123 words)

And sae it gaes oan
Facebook (2024-08-05) in Central dialect (LAL), categorised as poetry (88 words)

Juist a wee while back
Facebook (2024-08-11) in Central dialect (LAL), categorised as poetry (211 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
peenie4 4,106.7839.779
thay7 7,186.8630.013
wan8 8,213.5528.711
the24 24,640.6625.447
hiv6 6,160.1621.821
thae6 6,160.1620.701
bingo2 2,053.3918.915
ach3 3,080.0817.565
laucht2 2,053.3917.487
nou5 5,133.4716.817
a52 53,388.0916.220
nivir2 2,053.3914.006
aw10 10,266.9413.924
nouadays2 2,053.3913.844
juist6 6,160.1613.562
ayeweys2 2,053.3912.995
life5 5,133.4712.681
gallus2 2,053.3912.631
lost3 3,080.0812.603
wans2 2,053.3912.298
eatin2 2,053.3911.800
freends2 2,053.3911.708
heart2 2,053.3911.708
coffee2 2,053.3911.277
hearin2 2,053.3910.964
daen2 2,053.3910.746
loss2 2,053.3910.607
turned3 3,080.0810.557
weel6 6,160.1610.402
things4 4,106.7810.227
something3 3,080.0810.090
awa6 6,160.169.903
faur3 3,080.089.307
thair5 5,133.479.042
leukin2 2,053.398.181
aye6 6,160.168.119
ma11 11,293.638.026
young3 3,080.087.973
so5 5,133.477.824
micht4 4,106.787.154
ve4 4,106.787.095
folk3 3,080.087.093
but10 10,266.947.055
wis19 19,507.196.669
he17 17,453.806.601
too2 2,053.396.319
auld5 5,133.476.135
oan5 5,133.475.965
less2 2,053.395.642
kirk2 2,053.395.288
man4 4,106.785.244
thare2 2,053.395.125
him7 7,186.865.115
wee6 6,160.165.098
siller2 2,053.395.082
when4 4,106.784.985
sae5 5,133.474.931
fine3 3,080.084.836
whit6 6,160.164.819
nae7 7,186.864.519
anither3 3,080.084.475
naw2 2,053.394.327
his11 11,293.634.118
niver2 2,053.393.894
s13 13,347.023.772
did3 3,080.083.760
cannae2 2,053.393.718
some4 4,106.783.390
say3 3,080.083.351
while2 2,053.393.014
in9 9,240.252.998
for10 10,266.942.948
be9 9,240.252.750
days2 2,053.392.582
wi12 12,320.332.549
mony2 2,053.392.543
place2 2,053.392.444
thing2 2,053.392.355
this7 7,186.862.311
as10 10,266.942.269
whan3 3,080.082.265
ither3 3,080.082.229
wir2 2,053.392.206
o14 14,373.722.106
afore3 3,080.082.048
been4 4,106.782.032
an32 32,854.211.877
are3 3,080.081.863
bein2 2,053.391.761
richt3 3,080.081.756
it16 16,427.101.639
mind2 2,053.391.559
back4 4,106.781.550
still2 2,053.391.549
mak2 2,053.391.376
cam2 2,053.391.376
is9 9,240.251.299
ower4 4,106.781.213
come2 2,053.391.205
thocht2 2,053.391.165
hoose2 2,053.391.132
ye3 3,080.081.111
pit2 2,053.391.109
and9 9,240.250.889
then2 2,053.390.759
wad2 2,053.390.633
m2 2,053.390.489
tae18 18,480.490.433
ken2 2,053.390.396
aff2 2,053.390.343
oot3 3,080.080.323
we5 5,133.470.299
at5 5,133.470.281
that11 11,293.630.196
fae2 2,053.390.188
no2 2,053.390.124
see2 2,053.390.115
bit2 2,053.390.110
like3 3,080.080.077
up4 4,106.780.017
or3 3,080.080.013
doon2 2,053.390.005
me3 3,080.080.003