A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Forrest, Liam

Basic Stats

Total words by this author in corpus - 1,175
Total unique words used by this author in corpus - 462
Ratio of total words to unique words - 2.543
Tagged as SEC ((South) East Central) dialect.
Top ten most common words - the, eh, a, they, wurr, it, tae, in, he, like,

List of texts in corpus

Hittin the Toon
Scots Hoose (2020) in Central (Dunfermline) dialect (SEC), categorised as prose (1,175 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
eh37 31,489.36278.690
awbdy6 5,106.3872.414
n12 10,212.7757.535
they26 22,127.6648.417
pals7 5,957.4548.415
looked9 7,659.5746.809
like21 17,872.3444.825
wurr25 21,276.60nan
wiz19 16,170.2178.682
queue5 4,255.3240.159
wit6 5,106.3838.354
group7 5,957.4535.884
hud9 7,659.5735.240
their15 12,765.9635.056
club5 4,255.3232.875
gon3 2,553.1931.144
dresses3 2,553.1931.144
lassies5 4,255.3231.119
um5 4,255.3229.519
nuhin3 2,553.1929.252
er5 4,255.3225.432
av3 2,553.1924.510
bunch3 2,553.1924.158
who6 5,106.3823.999
boys4 3,404.2623.086
spice2 1,702.1321.588
surprising2 1,702.1320.406
dum2 1,702.1320.406
pal4 3,404.2619.856
tanned2 1,702.1319.500
guards2 1,702.1318.765
id2 1,702.1318.145
stawndin6 5,106.38nan
gonny2 1,702.1317.610
skirts2 1,702.1317.139
got7 5,957.4514.340
front4 3,404.2614.018
to8 6,808.5113.980
way4 3,404.2613.804
hair4 3,404.2613.745
blawn2 1,702.1313.082
shop3 2,553.1912.807
security2 1,702.1312.635
nixt3 2,553.1912.180
obviously2 1,702.1311.871
waws2 1,702.1311.871
git4 3,404.2611.816
wan5 4,255.3211.688
gits2 1,702.1311.436
line3 2,553.1910.805
wearin2 1,702.1310.606
surprised2 1,702.1310.524
he22 18,723.4010.291
ticht2 1,702.139.594
erse2 1,702.139.594
oan7 5,957.459.592
faces2 1,702.139.176
throw2 1,702.139.176
went4 3,404.268.576
them7 5,957.458.296
aboot10 8,510.648.134
pick2 1,702.137.782
his15 12,765.967.648
it25 21,276.607.207
flair2 1,702.137.084
morn2 1,702.136.990
fae9 7,659.576.985
since2 1,702.135.872
which3 2,553.195.870
cause2 1,702.135.784
there8 6,808.515.633
chance2 1,702.135.556
oot11 9,361.705.489
could4 3,404.265.327
blue2 1,702.135.231
turned2 1,702.135.177
two2 1,702.135.142
every2 1,702.135.021
aff5 4,255.324.851
where2 1,702.134.684
skil3 2,553.19nan
thurr5 4,255.32nan
aroond3 2,553.1911.772
tryin2 1,702.134.668
try2 1,702.134.638
just3 2,553.194.590
sat2 1,702.134.492
lookin2 1,702.134.339
dee2 1,702.134.007
so4 3,404.263.787
tweedle3 2,553.19nan
than4 3,404.263.887
when4 3,404.263.747
well2 1,702.133.614
body2 1,702.133.541
came2 1,702.133.393
is3 2,553.193.240
men2 1,702.133.145
rain2 1,702.133.033
right2 1,702.132.927
year3 2,553.192.794
hame3 2,553.192.785
folk2 1,702.132.749
go2 1,702.132.496
booncers2 1,702.13nan
while2 1,702.132.489
fur6 5,106.382.259
up8 6,808.512.207
bouncer2 1,702.13nan
how2 1,702.132.848
was3 2,553.192.174
s5 4,255.322.051
we2 1,702.131.909
then3 2,553.191.835
wid3 2,553.191.664
as4 3,404.261.640
here3 2,553.191.630
were3 2,553.191.563
afore3 2,553.191.443
re2 1,702.131.419
aw5 4,255.321.365
been4 3,404.261.249
in23 19,574.471.161
get3 2,553.191.078
but7 5,957.451.071
even2 1,702.131.051
nicht2 1,702.130.932
muckle2 1,702.130.764
be4 3,404.260.714
if3 2,553.190.669
and10 8,510.640.572
at6 5,106.380.326
ye5 4,255.320.308
a37 31,489.360.188
that13 11,063.830.179
wi10 8,510.640.172
us2 1,702.130.165
auld2 1,702.130.110
wee2 1,702.130.091
doon3 2,553.190.077
aye2 1,702.130.065
tae24 20,425.530.054
nae3 2,553.190.037
by2 1,702.130.024
see2 1,702.130.007
no3 2,553.190.003
the68 57,872.340.002