A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Forrest, Liam

Basic Stats

Total words by this author in corpus - 1,175
Total unique words used by this author in corpus - 462
Ratio of total words to unique words - 2.543
Tagged as SEC ((South) East Central) dialect.
Top ten most common words - the, eh, a, they, wurr, it, tae, in, he, like,

List of texts in corpus

Hittin the Toon
Scots Hoose (2020) in Central (Dunfermline) dialect (SEC), categorised as prose (1,175 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
eh37 31,489.36279.115
awbdy6 5,106.3872.484
n12 10,212.7757.669
pals7 5,957.4548.495
they26 22,127.6648.440
looked9 7,659.5746.911
like21 17,872.3444.863
wurr25 21,276.60nan
wiz19 16,170.2178.892
queue5 4,255.3240.216
wit6 5,106.3838.422
group7 5,957.4535.813
hud9 7,659.5735.338
their15 12,765.9634.788
club5 4,255.3232.773
gon3 2,553.1931.179
dresses3 2,553.1931.179
lassies5 4,255.3230.912
um5 4,255.3229.576
nuhin3 2,553.1929.287
er5 4,255.3225.488
av3 2,553.1924.544
bunch3 2,553.1924.193
who6 5,106.3824.064
boys4 3,404.2623.132
spice2 1,702.1321.611
surprising2 1,702.1320.429
dum2 1,702.1320.429
pal4 3,404.2619.901
tanned2 1,702.1319.523
guards2 1,702.1318.788
id2 1,702.1318.168
stawndin6 5,106.38nan
gonny2 1,702.1317.633
skirts2 1,702.1317.162
got7 5,957.4514.309
front4 3,404.2614.061
to8 6,808.5114.036
way4 3,404.2613.848
hair4 3,404.2613.729
blawn2 1,702.1313.104
shop3 2,553.1912.841
security2 1,702.1312.658
nixt3 2,553.1912.213
obviously2 1,702.1311.894
git4 3,404.2611.858
waws2 1,702.1311.780
wan5 4,255.3211.739
gits2 1,702.1311.458
line3 2,553.1910.805
wearin2 1,702.1310.628
surprised2 1,702.1310.546
he22 18,723.4010.373
oan7 5,957.459.654
erse2 1,702.139.617
ticht2 1,702.139.617
faces2 1,702.139.198
throw2 1,702.139.198
went4 3,404.268.616
them7 5,957.458.355
aboot10 8,510.648.107
pick2 1,702.137.804
his15 12,765.967.694
it25 21,276.607.268
flair2 1,702.137.106
morn2 1,702.137.012
fae9 7,659.576.829
since2 1,702.135.893
which3 2,553.195.852
cause2 1,702.135.805
there8 6,808.515.631
chance2 1,702.135.576
oot11 9,361.705.526
could4 3,404.265.341
blue2 1,702.135.252
turned2 1,702.135.198
two2 1,702.135.162
every2 1,702.135.041
aff5 4,255.324.871
where2 1,702.134.704
skil3 2,553.19nan
thurr5 4,255.32nan
aroond3 2,553.1911.650
tryin2 1,702.134.673
try2 1,702.134.658
just3 2,553.194.617
sat2 1,702.134.512
lookin2 1,702.134.359
dee2 1,702.133.979
so4 3,404.263.809
tweedle3 2,553.19nan
than4 3,404.263.884
when4 3,404.263.682
well2 1,702.133.633
body2 1,702.133.550
came2 1,702.133.402
is3 2,553.193.223
men2 1,702.133.146
rain2 1,702.133.052
right2 1,702.132.945
hame3 2,553.192.808
year3 2,553.192.750
folk2 1,702.132.745
go2 1,702.132.500
booncers2 1,702.13nan
while2 1,702.132.500
fur6 5,106.382.247
up8 6,808.512.226
bouncer2 1,702.13nan
how2 1,702.132.851
was3 2,553.192.196
s5 4,255.322.024
we2 1,702.131.929
then3 2,553.191.842
wid3 2,553.191.684
as4 3,404.261.662
here3 2,553.191.641
were3 2,553.191.492
afore3 2,553.191.456
re2 1,702.131.430
aw5 4,255.321.362
been4 3,404.261.248
in23 19,574.471.130
but7 5,957.451.078
get3 2,553.191.076
even2 1,702.131.061
nicht2 1,702.130.939
muckle2 1,702.130.736
be4 3,404.260.704
if3 2,553.190.674
and10 8,510.640.539
at6 5,106.380.335
ye5 4,255.320.299
a37 31,489.360.195
wi10 8,510.640.172
that13 11,063.830.168
us2 1,702.130.158
auld2 1,702.130.113
wee2 1,702.130.092
doon3 2,553.190.079
aye2 1,702.130.067
tae24 20,425.530.053
nae3 2,553.190.036
by2 1,702.130.019
see2 1,702.130.007
no3 2,553.190.003
the68 57,872.340.003