A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

gov.scot

Basic Stats

Total words by this author in corpus - 391
Total unique words used by this author in corpus - 175
Ratio of total words to unique words - 2.234
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, o, gaelic, for, tae, a, in, is, consultation,

List of texts in corpus

Biggin a strang future for Gaelic an Scots
Scottish Government (2022-08-24) in Central dialect (LAL), categorised as government (391 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
gaelic15 38,363.17102.161
consultation7 17,902.8165.952
gàidhealtachd4 10,230.1841.529
thit6 15,345.2733.333
heezin3 7,672.6330.786
education5 12,787.7229.449
spikkers3 7,672.6325.200
for13 33,248.0824.950
medium3 7,672.6324.811
term3 7,672.6324.811
uisit2 5,115.0923.925
launchit2 5,115.0923.925
misures2 5,115.0922.031
siccar3 7,672.6319.899
effective2 5,115.0919.014
profile2 5,115.0917.986
growth2 5,115.0917.476
heeze2 5,115.0916.253
makkin3 7,672.6315.344
gme2 5,115.0915.229
bòrd2 5,115.0914.417
athort2 5,115.0914.273
gàidhlig2 5,115.0914.273
wull3 7,672.6313.505
pairts2 5,115.0912.579
we're2 5,115.0911.305
wantin2 5,115.0911.305
scotland4 10,230.1811.010
support2 5,115.099.871
public2 5,115.099.695
an21 53,708.449.666
whit5 12,787.729.461
scots6 15,345.278.761
new3 7,672.638.492
mak3 7,672.638.417
is8 20,460.368.092
na2 5,115.097.679
help2 5,115.097.162
gien2 5,115.096.665
throu2 5,115.096.239
hit2 5,115.096.049
place2 5,115.095.530
there4 10,230.185.071
leid2 5,115.094.959
o15 38,363.174.909
mair3 7,672.633.549
haes2 5,115.093.468
this4 10,230.182.885
are2 5,115.092.552
tae13 33,248.082.252
the30 76,726.342.234
fowk2 5,115.092.111
noo2 5,115.091.877
been2 5,115.091.532
up3 7,672.631.173
in8 20,460.360.552
they2 5,115.090.147
on3 7,672.630.139
at2 5,115.090.110
a11 28,132.990.019
be2 5,115.090.001