A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Forrest, Liam

Basic Stats

Total words by this author in corpus - 1,175
Total unique words used by this author in corpus - 462
Ratio of total words to unique words - 2.543
Tagged as SEC ((South) East Central) dialect.
Top ten most common words - the, eh, a, they, wurr, it, tae, in, he, like,

List of texts in corpus

Hittin the Toon
Scots Hoose (2020) in Central (Dunfermline) dialect (SEC), categorised as prose (1,175 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
eh37 31,489.36278.719
awbdy6 5,106.3872.503
n12 10,212.7757.644
pals7 5,957.4548.328
they26 22,127.6648.295
looked9 7,659.5746.783
like21 17,872.3444.882
wurr25 21,276.60nan
wiz19 16,170.2178.038
queue5 4,255.3240.232
wit6 5,106.3837.872
group7 5,957.4535.835
hud9 7,659.5735.366
their15 12,765.9634.786
club5 4,255.3232.789
dresses3 2,553.1931.189
gon3 2,553.1931.189
lassies5 4,255.3230.928
um5 4,255.3229.592
nuhin3 2,553.1929.297
er5 4,255.3225.001
av3 2,553.1924.554
bunch3 2,553.1924.202
who6 5,106.3824.001
boys4 3,404.2623.145
spice2 1,702.1321.618
dum2 1,702.1320.435
surprising2 1,702.1320.435
pal4 3,404.2619.709
tanned2 1,702.1319.529
id2 1,702.1318.175
skirts2 1,702.1317.169
stawndin6 5,106.38nan
gonny2 1,702.1317.640
got7 5,957.4514.328
to8 6,808.5114.057
front4 3,404.2614.042
way4 3,404.2613.860
hair4 3,404.2613.741
blawn2 1,702.1313.111
shop3 2,553.1912.850
security2 1,702.1312.664
nixt3 2,553.1912.222
obviously2 1,702.1311.901
git4 3,404.2611.825
waws2 1,702.1311.787
wan5 4,255.3211.662
aroond3 2,553.1911.659
gits2 1,702.1311.465
line3 2,553.1910.814
wearin2 1,702.1310.634
surprised2 1,702.1310.552
he22 18,723.4010.356
ticht2 1,702.139.623
erse2 1,702.139.623
oan7 5,957.459.532
throw2 1,702.139.204
faces2 1,702.139.204
went4 3,404.268.587
them7 5,957.458.328
aboot10 8,510.648.107
pick2 1,702.137.810
his15 12,765.967.676
it25 21,276.607.251
flair2 1,702.137.080
morn2 1,702.137.018
fae9 7,659.576.808
since2 1,702.135.899
which3 2,553.195.861
cause2 1,702.135.811
there8 6,808.515.625
chance2 1,702.135.582
oot11 9,361.705.517
could4 3,404.265.322
blue2 1,702.135.239
turned2 1,702.135.186
two2 1,702.135.047
every2 1,702.135.030
aff5 4,255.324.852
where2 1,702.134.664
skil3 2,553.19nan
try2 1,702.134.664
thurr5 4,255.32nan
guards2 1,702.1318.794
tryin2 1,702.134.634
tweedle3 2,553.19nan
just3 2,553.194.616
sat2 1,702.134.489
lookin2 1,702.134.338
dee2 1,702.133.984
than4 3,404.263.888
so4 3,404.263.813
when4 3,404.263.681
well2 1,702.133.617
body2 1,702.133.555
came2 1,702.133.407
is3 2,553.193.215
men2 1,702.133.151
rain2 1,702.133.057
right2 1,702.132.910
hame3 2,553.192.810
year3 2,553.192.757
folk2 1,702.132.750
while2 1,702.132.505
booncers2 1,702.13nan
go2 1,702.132.485
fur6 5,106.382.234
up8 6,808.512.220
bouncer2 1,702.13nan
how2 1,702.132.817
was3 2,553.192.202
s5 4,255.322.037
we2 1,702.131.928
then3 2,553.191.835
wid3 2,553.191.689
as4 3,404.261.673
here3 2,553.191.644
were3 2,553.191.495
afore3 2,553.191.459
re2 1,702.131.423
aw5 4,255.321.346
been4 3,404.261.248
in23 19,574.471.137
but7 5,957.451.083
get3 2,553.191.076
even2 1,702.131.065
nicht2 1,702.130.943
muckle2 1,702.130.739
be4 3,404.260.705
if3 2,553.190.675
and10 8,510.640.545
at6 5,106.380.336
ye5 4,255.320.306
a37 31,489.360.195
wi10 8,510.640.173
that13 11,063.830.166
us2 1,702.130.159
auld2 1,702.130.113
wee2 1,702.130.092
doon3 2,553.190.079
aye2 1,702.130.064
tae24 20,425.530.053
nae3 2,553.190.036
by2 1,702.130.019
see2 1,702.130.007
no3 2,553.190.003
the68 57,872.340.003