A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Donati, Colin

Basic Stats

Total words by this author in corpus - 9,426
Total unique words used by this author in corpus - 2,390
Ratio of total words to unique words - 3.944
Tagged as LAL (General Central) dialect.
Top ten most common words - the, o, an, tae, a, in, scots, and, be, for,

List of texts in corpus

Universal Declaration o Human Richts
Scots Language Centre (01-01-2004) in Central dialect (LAL), categorised as government (1,827 words)

Scots A Statement o Principles
Scots Parliament (2003-03 ) in Central dialect (LAL), categorised as government (4,241 words)
The Student and the Pawnwife
Itchy Coo (2003-12-08 ) in Central dialect (LAL), categorised as prose (3,358 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
airticle45 4,774.03295.197
language77 8,168.89205.242
richts35 3,713.13205.240
awbody33 3,500.95189.640
hes35 3,713.13168.163
declaration25 2,652.24162.958
principles24 2,546.15152.673
scots123 13,049.01146.662
shuid30 3,182.69122.729
universal18 1,909.61121.547
linguistic27 2,864.42118.951
o347 36,813.07100.450
haill19 2,015.7089.061
freedoms10 1,060.9087.603
its54 5,728.8484.769
commonties11 1,166.9881.751
equal14 1,485.2579.850
entitelt9 954.8178.374
ful12 1,273.0770.264
international14 1,485.2568.162
wuman11 1,166.9867.507
commonty10 1,060.9066.742
aw71 7,532.3664.803
she5 530.4564.707
forsameikle7 742.6359.989
richt46 4,880.1256.304
uphaud14 1,485.2555.575
be105 11,139.4051.869
freedom11 1,166.9851.255
trades6 636.5450.846
statement10 1,060.9049.841
ma6 636.5447.654
nations9 954.8147.248
discrimination7 742.6346.717
human15 1,591.3444.487
communications6 636.5443.182
and120 12,730.7442.169
learning7 742.6342.140
kenning5 530.4541.752
languages16 1,697.4341.652
has23 2,440.0641.036
hud22 2,333.9738.399
shal26 2,758.33nan
cultural14 1,485.2540.873
loun8 848.7237.794
multilingual7 742.6337.331
scotland35 3,713.1337.238
society10 1,060.9036.290
they7 742.6335.765
ony28 2,970.5135.405
awthing7 742.6333.817
knowledge7 742.6333.313
for100 10,608.9533.228
law11 1,166.9833.042
documentation4 424.3632.726
ye16 1,697.4332.712
chairter8 848.7232.552
til25 2,652.2432.463
purposes6 636.5432.388
names11 1,166.9832.149
stair9 954.8131.988
necessar8 848.7231.891
terms9 954.8131.737
throu20 2,121.7931.691
ability6 636.5431.076
scowth6 636.5431.076
the-day5 530.4530.967
naebody14 1,485.2530.501
conter6 636.5430.474
in218 23,127.5230.437
ongaun7 742.6330.251
assemlie4 424.3630.110
barcelona4 424.3630.110
continuity4 424.3630.110
whitiver6 636.5429.358
essential6 636.5429.358
young17 1,803.5228.945
frae45 4,774.0328.441
appendix4 424.3628.206
haverin4 424.3628.206
unitit6 636.5427.870
fundamental5 530.4527.464
gar8 848.7227.018
i22 2,333.9726.960
valued4 424.3626.694
tongues6 636.5426.558
diversity7 742.6325.997
bit4 424.3625.957
educational6 636.5425.386
her18 1,909.6125.092
airticles6 636.5424.670
speakers10 1,060.9024.579
writers7 742.6324.437
responsibility5 530.4524.377
me8 848.7224.258
ither30 3,182.6923.874
dignity5 530.4523.865
boxroom3 318.2723.803
mairrage3 318.2723.803
wuman's3 318.2723.803
occupations3 318.2723.803
secont3 318.2723.803
role6 636.5423.677
social10 1,060.9023.571
beild5 530.4523.380
naitural5 530.4523.380
uise10 1,060.9022.626
means10 1,060.9022.512
this68 7,214.0922.449
grunds4 424.3621.826
extinction3 318.2721.590
domains3 318.2721.590
status7 742.6321.431
uised9 954.8121.237
sic18 1,909.6121.090
general8 848.7220.892
howf3 318.2720.020
aboot8 848.7219.864
particular6 636.5419.761
or58 6,153.1919.700
public10 1,060.9019.540
faimily7 742.6319.197
siccar8 848.7219.152
uiss7 742.6319.042
respect5 530.4518.973
folk15 1,591.3418.818
integral3 318.2718.795
teaching3 318.2718.795
media8 848.7218.662
pairlament9 954.8118.586
members4 424.3618.426
smeddum5 530.4518.411
tongue9 954.8118.388
people10 1,060.9018.051
distinction4 424.3617.985
tae264 28,007.6417.913
uisage3 318.2717.789
prejudice3 318.2717.789
faut4 424.3617.175
presentation3 318.2716.935
proclaimed3 318.2716.935
statements3 318.2716.935
action6 636.5416.771
professional5 530.4516.687
develop4 424.3616.107
states5 530.4516.042
heritage5 530.4516.042
available5 530.4516.042
awready6 636.5415.997
chaumer5 530.4515.838
pairt14 1,485.2515.778
political7 742.6315.632
pootch3 318.2715.537
personality3 318.2715.537
member4 424.3615.475
somewey4 424.3615.475
fends2 212.1815.055
reservit2 212.1815.055
twined2 212.1815.055
flegs2 212.1815.055
elementar2 212.1815.055
perceived2 212.1815.055
vennels2 212.1815.055
gawpit2 212.1815.055
intromission2 212.1815.055
periodic2 212.1815.055
branks2 212.1815.055
donati2 212.1815.055
disposal2 212.1815.055
apairtment2 212.1815.055
definin2 212.1815.055
follae-up2 212.1815.055
daily-day2 212.1815.055
accordit2 212.1815.055
affirmit2 212.1815.055
an301 31,932.9514.925
security4 424.3614.894
maun15 1,591.3414.837
pouer5 530.4514.532
cotton3 318.2714.419
discussion4 424.3614.357
pey6 636.5414.239
up16 1,697.4314.170
document4 424.3614.104
entitled3 318.2713.934
lane6 636.5413.876
image5 530.4513.869
their33 3,500.9513.737
it75 7,956.7213.694
day5 530.4513.678
sma7 742.6313.603
dreid3 318.2713.488
ootlined2 212.1813.347
miscawed2 212.1813.347
tochered2 212.1813.347
footer2 212.1813.347
warld-wide2 212.1813.347
haurlie2 212.1813.347
layoot2 212.1813.347
whummelt2 212.1813.347
guddled2 212.1813.347
inherent2 212.1813.347
plenishins2 212.1813.347
fent2 212.1813.347
institutions3 318.2713.076
adverts3 318.2713.076
yet10 1,060.9012.757
circumstances3 318.2712.692
intent3 318.2712.692
identity5 530.4512.688
needit5 530.4512.552
nor17 1,803.5212.353
currently3 318.2712.334
thae13 1,379.1612.232
dwams2 212.1812.180
drunks2 212.1812.180
includin5 530.4512.158
person5 530.4512.158
initiatives3 318.2711.998
exact3 318.2711.998
effective3 318.2711.998
active3 318.2711.998
nerves3 318.2711.998
ainly4 424.3611.980
even18 1,909.6111.813
situation5 530.4511.783
text5 530.4511.783
thole6 636.5411.686
countries3 318.2711.682
vital3 318.2711.682
beer4 424.3611.625
yer5 530.4511.572
sudden4 424.3611.455
includes3 318.2711.383
acts3 318.2711.383
coherent2 212.1811.290
socially2 212.1811.290
backgrun2 212.1811.290
commonly2 212.1811.290
shuld2 212.1811.290
purpose3 318.2711.101
conscience3 318.2711.101
national8 848.7211.092
yella4 424.3610.969
place14 1,485.2510.868
gart5 530.4510.867
jouk3 318.2710.833
man's3 318.2710.833
street9 954.8110.739
state7 742.6310.702
practice4 424.3610.664
minority6 636.5410.648
slavery2 212.1810.570
wemen2 212.1810.570
peoples2 212.1810.570
development4 424.3610.517
fifty4 424.3610.517
back7 742.6310.482
had3 318.2710.478
authority3 318.2710.334
speakin6 636.5410.330
himsel8 848.7210.292
forrit8 848.7210.292
whan2 212.1810.268
get3 318.2710.245
signpostin5 530.45nan
justice3 318.2710.102
regaird3 318.2710.102
peacefu2 212.189.967
historically2 212.189.967
hasna2 212.189.967
meisures2 212.189.967
tassie2 212.189.967
pairts5 530.459.862
raskolnikov5 530.45nan
amang8 848.729.812
ay4 424.369.786
his78 8,274.989.736
awfie3 318.279.666
we20 2,121.799.663
interest5 530.459.586
jalouse4 424.369.574
ii3 318.279.462
providin2 212.189.447
switherin2 212.189.447
mairower2 212.189.447
adoptit2 212.189.447
kirks2 212.189.447
duly2 212.189.447
haud9 954.819.426
than2 212.189.385
month6 636.549.376
niver10 1,060.909.276
presence3 318.279.265
linguists2 212.188.992
trial2 212.188.992
oral2 212.188.992
application2 212.188.992
historic2 212.188.992
visitors2 212.188.992
territor5 530.45nan
yinst2 212.188.992
terrible3 318.278.894
weirin3 318.278.894
specific3 318.278.894
furthset3 318.278.894
policy5 530.458.739
loan2 212.188.587
spheres2 212.188.587
lanlady5 530.45nan
religioun5 530.45nan
countrie5 530.45nan
group7 742.639.283
fundit2 212.188.587
tenement2 212.188.587
sowels2 212.188.587
trade3 318.278.385
global3 318.278.385
levels3 318.278.385
free7 742.638.374
proper4 424.368.337
culture7 742.638.274
pen3 318.278.226
whuther2 212.188.223
analysis2 212.188.223
toy2 212.188.223
duties2 212.188.223
brigs2 212.188.223
mcgugan2 212.188.223
expressed2 212.188.223
priority2 212.188.223
promote3 318.278.073
doon9 954.818.039
maist16 1,697.437.984
freely2 212.187.892
glent2 212.187.892
waled2 212.187.892
german4 424.367.852
recognition3 318.277.780
books3 318.277.780
transmission4 424.36nan
providit2 212.187.892
country5 530.457.779
government8 848.727.752
there12 1,273.077.675
skinklin2 212.187.589
bink2 212.187.589
penal4 424.36nan
lands2 212.187.589
grup2 212.187.589
literary3 318.277.504
shairly3 318.277.504
forms3 318.277.504
resources3 318.277.504
groups4 424.367.404
withoot5 530.457.381
foond3 318.277.372
hat3 318.277.372
wee9 954.817.333
precious2 212.187.310
clerk2 212.187.310
needna2 212.187.310
effecks2 212.187.310
humanity2 212.187.310
curriculum3 318.277.243
forby7 742.637.082
keech2 212.187.051
thirteen2 212.187.051
signs4 424.366.991
declaration'4 424.36nan
alane4 424.367.071
yon11 1,166.986.931
heat4 424.366.912
personal3 318.276.879
rouble7 742.63nan
nationality4 424.3632.726
cross4 424.366.834
realise3 318.276.763
foregaun4 424.36nan
is79 8,381.076.748
orra4 424.366.682
of5 530.456.627
suddent2 212.186.586
expression3 318.276.541
native3 318.276.541
then3 318.276.501
hoose2 212.186.474
gear4 424.366.462
etc3 318.276.434
a236 25,037.136.337
passin3 318.276.330
seein5 530.456.323
worth4 424.366.320
been8 848.726.308
siller7 742.636.250
enjoy3 318.276.228
medium3 318.276.228
dictionary3 318.276.228
study3 318.276.228
sets3 318.276.228
meet4 424.366.182
definition2 212.186.177
translation2 212.186.177
ae7 742.636.151
juist2 212.186.138
business4 424.366.114
sindry4 424.366.048
seek3 318.276.030
year2 212.186.017
unner6 636.545.998
december2 212.185.989
bilingual2 212.185.989
future4 424.365.982
research3 318.275.842
stevenson2 212.185.812
redd3 318.275.751
ower11 1,166.985.741
gaird2 212.185.644
standart2 212.185.644
tool2 212.185.644
common4 424.365.605
nicht2 212.185.553
prent2 212.185.484
guilty2 212.185.484
stepped2 212.185.484
ile2 212.185.484
see7 742.635.431
hecht2 212.185.332
association2 212.185.332
vizzied2 212.185.332
yours2 212.185.332
ain18 1,909.615.325
the602 63,865.905.307
kennin4 424.365.256
nummer5 530.455.255
regional3 318.275.243
mindit3 318.275.243
kythe2 212.185.186
pressure2 212.185.186
kinds2 212.185.186
upmak3 318.27nan
safegairdit3 318.27nan
lear2 212.185.186
recognised2 212.185.186
space4 424.365.145
literature3 318.275.087
britain3 318.275.087
awthegither2 212.185.047
respeck2 212.185.047
landin2 212.185.047
recognise2 212.185.047
keys2 212.185.047
stapped2 212.185.047
mainteen3 318.27nan
speech3 318.275.011
meenit4 424.364.931
regairdit2 212.184.914
copecks3 318.27nan
mak14 1,485.254.891
penalised3 318.27nan
micht2 212.184.865
information3 318.274.864
nane5 530.454.794
warks3 318.274.793
trauchle2 212.184.787
importance2 212.184.787
ayewis2 212.184.787
thon2 212.184.686
else5 530.454.676
skaith2 212.184.665
communication2 212.184.665
spite2 212.184.665
hauds3 318.274.653
man17 1,803.524.592
force3 318.274.586
lead3 318.274.586
langer3 318.274.586
belief2 212.184.547
colin2 212.184.547
express2 212.184.547
flair4 424.364.533
theirsels3 318.274.519
stertit5 530.454.448
material2 212.184.434
naither2 212.184.434
kenspeckle3 318.274.390
awa5 530.454.335
hauden2 212.184.325
patient2 212.184.325
opportunities2 212.184.325
wide3 318.274.265
examples2 212.184.220
by23 2,440.064.185
isles2 212.184.118
exist2 212.184.118
quate3 318.274.027
meanin3 318.274.027
resourcin3 318.27nan
interests2 212.185.812
fifteen2 212.184.020
awlinguistic3 318.27nan
position2 212.184.020
student2 212.184.020
bi5 530.453.989
coort2 212.183.925
scunner2 212.183.925
historical2 212.183.925
comatee3 318.273.914
scottish11 1,166.983.902
as73 7,744.543.896
steer2 212.183.834
www2 212.183.834
positive2 212.183.834
accurate3 318.27nan
mairch3 318.273.804
wes3 318.273.801
at46 4,880.123.748
lobby2 212.183.745
braes2 212.183.745
meetin2 212.183.745
education6 636.543.700
aye21 2,227.883.692
byordinar2 212.183.659
influence2 212.183.659
msp2 212.183.659
staundin2 212.183.659
pairty4 424.363.640
us6 636.543.625
efter5 530.453.614
no16 1,697.433.608
british3 318.273.596
sir3 318.273.545
fou3 318.273.496
office3 318.273.496
shame2 212.183.495
claes5 530.453.458
are19 2,015.703.418
some7 742.633.411
cam3 318.273.388
pass3 318.273.352
sair6 636.543.345
steel2 212.183.341
said7 742.633.302
wid5 530.453.296
cairryin2 212.183.267
decide2 212.183.195
ivaanovna3 318.27nan
drama2 212.183.267
sowel2 212.183.195
pitten3 318.273.170
choice2 212.183.125
new3 318.273.091
thirty2 212.183.057
reality2 212.183.057
intil6 636.543.020
kist2 212.182.991
dae6 636.542.972
livin3 318.272.957
mony9 954.812.952
that77 8,168.892.951
wi85 9,017.612.931
derk2 212.182.926
stour2 212.182.926
joy2 212.182.926
learn3 318.272.876
reason3 318.272.876
level2 212.182.863
forderance2 212.18nan
needs3 318.272.836
sang4 424.362.832
scotland's2 212.182.802
confidence2 212.182.802
howfs2 212.18nan
aften4 424.362.862
aff7 742.632.777
mair27 2,864.422.753
peace2 212.182.743
windaes2 212.182.684
square2 212.182.684
order3 318.272.683
taen7 742.632.668
oor8 848.722.593
lang5 530.452.585
were6 636.542.576
race2 212.182.572
mindin2 212.182.572
clattie2 212.18nan
tent4 424.362.519
loss2 212.182.518
shape2 212.182.518
register2 212.182.518
gien7 742.632.461
will4 424.362.458
body6 636.542.447
history4 424.362.440
thocht13 1,379.162.421
generations2 212.182.414
maitter3 318.272.400
door3 318.272.363
didna10 1,060.902.352
same9 954.812.350
if9 954.812.332
how7 742.632.329
ring2 212.182.314
process2 212.182.314
present2 212.182.314
students2 212.182.314
bein3 318.272.296
whether2 212.182.266
plain2 212.182.266
least4 424.362.238
bell2 212.182.219
spoken3 318.272.207
act3 318.272.207
naething3 318.272.176
thrang2 212.182.173
him18 1,909.612.171
ach2 212.182.128
needin2 212.182.128
hert5 530.452.118
thinkin4 424.362.096
sib2 212.182.084
hame4 424.362.071
wis82 8,699.342.070
oot31 3,288.782.058
maks4 424.362.050
she's2 212.182.041
cried5 530.452.023
watch3 318.272.000
lit2 212.181.999
haen2 212.181.999
tackie2 212.18nan
auld18 1,909.612.069
staunin3 318.271.972
faur5 530.451.968
steps2 212.181.958
gane2 212.181.958
-4 424.361.940
twa8 848.721.893
agin4 424.361.876
them10 1,060.901.829
words5 530.451.808
gettin5 530.451.808
accoont2 212.181.764
every4 424.361.734
address2 212.181.728
win7 742.631.697
key2 212.181.692
last8 848.721.688
thir7 742.631.671
past5 530.451.659
ocht2 212.181.657
fit6 636.541.640
edinburgh3 318.271.636
tho7 742.631.633
set2 212.181.624
whilk2 212.181.623
system2 212.181.623
wark6 636.541.621
haudin3 318.271.589
lassies2 212.181.556
need7 742.631.512
fitnotes2 212.18nan
richt-haun2 212.18nan
cawed2 212.181.492
pittin3 318.271.433
darg2 212.181.431
muckle5 530.451.429
neb2 212.181.401
sax2 212.181.401
strangbox2 212.18nan
guideen2 212.18nan
maunbe2 212.18nan
gied2 212.181.413
hendry2 212.18nan
land3 318.271.944
cast2 212.181.371
case3 318.271.350
that's5 530.451.346
thing8 848.721.334
wisna6 636.541.313
heid6 636.541.291
footers2 212.18nan
growin2 212.181.839
ettle2 212.181.260
fowrth2 212.18nan
lass2 212.181.233
kent4 424.361.220
poetry2 212.181.181
may2 212.181.181
intae12 1,273.071.177
hinnert2 212.18nan
since3 318.271.159
english10 1,060.901.149
stert3 318.271.141
chynge2 212.181.131
i'm3 318.271.123
daft2 212.181.107
bodies2 212.181.107
ettlin2 212.181.107
whit16 1,697.431.091
biggit2 212.181.083
road6 636.541.068
ben4 424.361.044
sort2 212.181.036
again5 530.450.995
example2 212.180.992
uggin2 212.18nan
speak4 424.361.261
alyona3 318.27nan
unique2 212.184.787
roon3 318.270.991
baith8 848.720.991
years2 212.180.976
won2 212.180.970
colour2 212.180.970
got6 636.540.950
room2 212.180.934
duin3 318.270.896
press2 212.180.887
ane12 1,273.070.869
cauld2 212.180.861
shair2 212.180.828
cry2 212.180.828
writin3 318.270.824
heard2 212.180.821
europe2 212.180.791
centre2 212.180.772
readin2 212.180.772
afore14 1,485.250.772
warld5 530.450.772
corner2 212.180.737
maum2 212.18nan
ken9 954.810.755
ablow2 212.180.720
uk2 212.180.720
better3 318.270.686
kitchen2 212.180.686
fair6 636.540.681
hae26 2,758.330.681
gets2 212.180.670
far3 318.270.654
een9 954.810.653
european2 212.180.638
leid4 424.360.630
misdoot2 212.18nan
kind3 318.270.644
born2 212.180.623
wantin2 212.180.623
hingin2 212.180.608
on52 5,516.660.607
men2 212.180.588
braid2 212.180.578
dark2 212.180.549
likely2 212.180.522
onyither2 212.18nan
help2 212.180.506
keep3 318.270.492
comin2 212.180.473
thochts2 212.180.470
easy2 212.180.470
dinna3 318.270.447
care2 212.180.422
orally2 212.18nan
atween3 318.270.431
understaunding2 212.18nan
alang3 318.270.415
compluterance2 212.18nan
wulsome2 212.18nan
referent2 212.18nan
our2 212.180.399
strang2 212.180.388
whiles3 318.270.385
gaelic6 636.540.370
doun5 530.450.352
it's9 954.810.339
weel9 954.810.333
wull2 212.180.326
stood2 212.180.316
while3 318.270.314
syne5 530.450.305
makkin3 318.270.300
first7 742.630.299
really2 212.180.287
love2 212.180.279
canna4 424.360.278
roond3 318.270.272
report2 212.180.271
reduction2 212.18nan
haein3 318.270.259
but34 3,607.040.245
thegither4 424.360.239
whaur8 848.720.233
beilding2 212.18nan
eh2 212.180.230
toun2 212.180.214
met2 212.180.214
deep2 212.180.214
manifestations2 212.18nan
next3 318.270.210
left4 424.360.201
somethin3 318.270.193
here9 954.810.191
could9 954.810.188
disnae2 212.180.186
leave2 212.180.186
hoo2 212.180.183
bide2 212.180.183
ilk2 212.180.179
disna2 212.180.166
sun2 212.180.165
let3 318.270.162
taks2 212.180.160
late2 212.180.154
real2 212.180.154
brocht3 318.270.152
feart2 212.180.148
done2 212.180.148
high2 212.180.148
am2 212.180.132
tap2 212.180.128
gaun5 530.450.123
gie8 848.720.121
opinioun2 212.18nan
sister2 212.180.120
come7 742.630.117
gey4 424.360.116
end4 424.360.115
support2 212.180.109
sae16 1,697.430.106
noo14 1,485.250.105
bairns4 424.360.103
jist15 1,591.340.097
gin8 848.720.094
different3 318.270.080
life5 530.450.077
made6 636.540.077
ee3 318.270.074
pit9 954.810.070
open2 212.180.070
gaed5 530.450.065
seen7 742.630.063
wulsomely2 212.18nan
why2 212.180.059
onsets2 212.18nan
turn2 212.180.050
loses2 212.18nan
nae28 2,970.510.057
great3 318.270.046
hauf2 212.180.045
like26 2,758.330.044
hair2 212.180.040
near4 424.360.038
itsel2 212.180.037
word2 212.180.035
commonties'2 212.18nan
yin7 742.630.043
yae2 212.180.035
few2 212.180.034
table2 212.180.032
wad11 1,166.980.030
ilka3 318.270.027
mean3 318.270.027
tak8 848.720.025
sat2 212.180.022
fell2 212.180.019
staund-alane2 212.18nan
deid2 212.180.019
gies2 212.180.017
took3 318.270.014
can14 1,485.250.012
scott2 212.180.011
anither6 636.540.010
ithers2 212.180.010
five2 212.180.006
guid10 1,060.900.005
wey9 954.810.005
governance2 212.18nan
fower2 212.180.005
less2 212.180.004
looked2 212.180.003
black2 212.180.003
times3 318.270.002
still7 742.630.002
mebbe2 212.180.001
mind7 742.630.001
time17 1,803.520.001
territours2 212.18nan
aince2 212.180.001
he82 8,699.340.000
look4 424.360.000
masel3 318.270.000