A Corpus of 21st Century Scots Texts

Forrest, Liam

Basic Stats

Total words by this author in corpus - 1,175
Total unique words used by this author in corpus - 462
Ratio of total words to unique words - 2.543
Tagged as SEC ((South) East Central) dialect.
Top ten most common words - the, eh, a, they, wurr, it, tae, in, he, like
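
The figures above can be reproduced from a tokenised text. The following is a minimal sketch only: the corpus's own tokenisation rules are not stated on this page, so the lowercase regex tokeniser is an assumption, and `basic_stats` is a hypothetical helper name.

```python
import re
from collections import Counter


def basic_stats(text, top_n=10):
    """Return total words, unique words, their ratio and the top_n words.

    Tokenisation is an assumption: lowercase runs of letters and apostrophes.
    """
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = len(tokens)
    unique = len(counts)
    # Ratio of total words to unique words, rounded as on the page.
    ratio = round(total / unique, 3) if unique else 0.0
    top = [word for word, _ in counts.most_common(top_n)]
    return total, unique, ratio, top


# Sanity check against the figures reported for this author.
print(round(1175 / 462, 3))  # 2.543
```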

List of texts in corpus

Hittin the Toon
Scots Hoose (2020) in Central (Dunfermline) dialect (SEC), categorised as prose (1,175 words)

Author word Keyness frequencies

This should list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
Word         Count  Normalised per million  Keyness
eh           37     31,489.36               279.338
wiz          19     16,170.21               77.962
awbdy        6      5,106.38                72.478
n            12     10,212.77               57.532
wurr         25     21,276.60               nan
pals         7      5,957.45                48.488
they         26     22,127.66               48.204
looked       9      7,659.57                46.515
like         21     17,872.34               45.267
queue        5      4,255.32                41.664
wit          6      5,106.38                37.987
hud          9      7,659.57                36.294
group        7      5,957.45                35.585
their        15     12,765.96               34.628
club         5      4,255.32                33.089
lassies      5      4,255.32                31.443
gon          3      2,553.19                31.176
dresses      3      2,553.19                31.176
um           5      4,255.32                29.571
nuhin        3      2,553.19                29.284
er           5      4,255.32                24.637
bunch        3      2,553.19                24.541
av           3      2,553.19                24.541
who          6      5,106.38                24.349
boys         4      3,404.26                23.128
spice        2      1,702.13                21.609
dum          2      1,702.13                20.427
pal          4      3,404.26                19.693
surprising   2      1,702.13                19.521
tanned       2      1,702.13                19.521
guards       2      1,702.13                18.786
id           2      1,702.13                18.166
gonny        2      1,702.13                17.631
got          7      5,957.45                14.403
stawndin     6      5,106.38                nan
skirts       2      1,702.13                17.631
to           8      6,808.51                14.396
way          4      3,404.26                14.119
hair         4      3,404.26                13.784
blawn        2      1,702.13                13.102
shop         3      2,553.19                12.934
security     2      1,702.13                12.799
nixt         3      2,553.19                12.084
wan          5      4,255.32                11.934
thurr        5      4,255.32                nan
front        4      3,404.26                13.965
obviously    2      1,702.13                11.892
aroond       3      2,553.19                11.802
git          4      3,404.26                11.787
waws         2      1,702.13                11.779
gits         2      1,702.13                11.456
line         3      2,553.19                11.069
wearin       2      1,702.13                10.973
surprised    2      1,702.13                10.544
he           22     18,723.40               10.162
oan          7      5,957.45                9.935
erse         2      1,702.13                9.615
ticht        2      1,702.13                9.429
throw        2      1,702.13                9.196
faces        2      1,702.13                9.140
went         4      3,404.26                8.626
them         7      5,957.45                8.306
aboot        10     8,510.64                8.021
pick         2      1,702.13                7.840
his          15     12,765.96               7.557
it           25     21,276.60               7.428
flair        2      1,702.13                7.104
morn         2      1,702.13                7.041
fae          9      7,659.57                6.948
since        2      1,702.13                6.003
which        3      2,553.19                5.827
cause        2      1,702.13                5.803
there        8      6,808.51                5.679
chance       2      1,702.13                5.635
oot          11     9,361.70                5.539
could        4      3,404.26                5.324
two          2      1,702.13                5.268
every        2      1,702.13                5.161
turned       2      1,702.13                4.874
aff          5      4,255.32                4.842
just         3      2,553.19                4.806
try          2      1,702.13                4.733
skil         3      2,553.19                nan
blue         2      1,702.13                5.268
where        2      1,702.13                4.672
tryin        2      1,702.13                4.612
tweedle      3      2,553.19                nan
sat          2      1,702.13                4.357
lookin       2      1,702.13                4.344
than         4      3,404.26                3.979
dee          2      1,702.13                3.953
when         4      3,404.26                3.901
so           4      3,404.26                3.806
well         2      1,702.13                3.716
body         2      1,702.13                3.569
came         2      1,702.13                3.353
is           3      2,553.19                3.225
men          2      1,702.13                3.197
rain         2      1,702.13                3.050
right        2      1,702.13                2.920
hame         3      2,553.19                2.821
folk         2      1,702.13                2.766
year         3      2,553.19                2.758
go           2      1,702.13                2.518
booncers     2      1,702.13                nan
while        2      1,702.13                2.439
fur          6      5,106.38                2.340
up           8      6,808.51                2.189
bouncer      2      1,702.13                nan
how          2      1,702.13                2.834
was          3      2,553.19                2.048
we           2      1,702.13                1.964
s            5      4,255.32                1.881
then         3      2,553.19                1.857
as           4      3,404.26                1.715
wid          3      2,553.19                1.667
here         3      2,553.19                1.634
re           2      1,702.13                1.494
were         3      2,553.19                1.488
aw           5      4,255.32                1.400
afore        3      2,553.19                1.396
been         4      3,404.26                1.265
in           23     19,574.47               1.125
get          3      2,553.19                1.124
but          7      5,957.45                1.122
even         2      1,702.13                1.075
nicht        2      1,702.13                0.943
if           3      2,553.19                0.721
muckle       2      1,702.13                0.720
be           4      3,404.26                0.708
and          10     8,510.64                0.572
at           6      5,106.38                0.353
ye           5      4,255.32                0.282
a            37     31,489.36               0.209
that         13     11,063.83               0.168
wi           10     8,510.64                0.155
us           2      1,702.13                0.154
auld         2      1,702.13                0.114
wee          2      1,702.13                0.087
doon         3      2,553.19                0.082
tae          24     20,425.53               0.061
aye          2      1,702.13                0.043
nae          3      2,553.19                0.038
by           2      1,702.13                0.016
see          2      1,702.13                0.008
the          68     57,872.34               0.005
no           3      2,553.19                0.001
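
The normalised figures are consistent with each word's count divided by the author's 1,175 total words and scaled to one million (for example, 37 / 1,175 x 1,000,000 is roughly 31,489.36). The page does not say which keyness statistic it uses. The sketch below assumes the log-likelihood measure that is common in corpus linguistics; the function names are hypothetical and the reference-corpus counts are invented for illustration, so it is not expected to reproduce the keyness values listed here.

```python
import math


def per_million(count, total_words):
    """Normalised frequency: occurrences per million words."""
    return count / total_words * 1_000_000


def log_likelihood(count_author, total_author, count_ref, total_ref):
    """Log-likelihood keyness (assumed measure, not confirmed by the corpus).

    Compares a word's count in the author's text with its count in a
    reference corpus. A zero count contributes nothing to the sum.
    """
    combined = count_author + count_ref
    grand_total = total_author + total_ref
    expected_author = total_author * combined / grand_total
    expected_ref = total_ref * combined / grand_total
    score = 0.0
    for observed, expected in ((count_author, expected_author),
                               (count_ref, expected_ref)):
        if observed > 0:
            score += observed * math.log(observed / expected)
    return 2 * score


print(round(per_million(37, 1175), 2))  # 31489.36 for "eh"
# Reference figures below are invented purely for illustration.
print(log_likelihood(37, 1175, 500, 1_000_000) > 0)
```

Guarding the zero-count case matters: an unguarded 0 x log(0) evaluates to nan in floating-point array arithmetic. Whether that is what produced the nan entries listed for this author is not something the page confirms.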