A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Donati, Colin

Basic Stats

Total words by this author in corpus - 9,426
Total unique words used by this author in corpus - 2,390
Ratio of total words to unique words - 3.944
Tagged as LAL (General Central) dialect.
Top ten most common words - the, o, an, tae, a, in, scots, and, be, for,

List of texts in corpus

Universal Declaration o Human Richts
Scots Language Centre (01-01-2004) in Central dialect (LAL), categorised as government (1,827 words)

Scots A Statement o Principles
Scots Parliament (2003-03 ) in Central dialect (LAL), categorised as government (4,241 words)
The Student and the Pawnwife
Itchy Coo (2003-12-08 ) in Central dialect (LAL), categorised as prose (3,358 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
airticle45 4,774.03293.752
language77 8,168.89206.926
richts35 3,713.13206.080
awbody33 3,500.95192.990
hes35 3,713.13168.991
declaration25 2,652.24163.562
principles24 2,546.15152.108
scots123 13,049.01146.791
shuid30 3,182.69124.816
universal18 1,909.61121.982
linguistic27 2,864.42119.585
o347 36,813.0798.510
freedoms10 1,060.9087.847
haill19 2,015.7087.659
its54 5,728.8483.567
commonties11 1,166.9882.018
equal14 1,485.2579.380
entitelt9 954.8178.593
ful12 1,273.0770.552
wuman11 1,166.9867.772
international14 1,485.2567.466
aw71 7,532.3667.106
commonty10 1,060.9066.983
she5 530.4562.935
forsameikle7 742.6360.159
richt46 4,880.1256.369
uphaud14 1,485.2555.900
be105 11,139.4052.043
trades6 636.5450.993
freedom11 1,166.9849.724
statement10 1,060.9048.995
nations9 954.8147.462
discrimination7 742.6346.886
human15 1,591.3444.290
ma6 636.5444.085
and120 12,730.7443.787
learning7 742.6342.309
kenning5 530.4541.874
signpostin5 530.4541.874
languages16 1,697.4341.575
has23 2,440.0641.172
cultural14 1,485.2541.011
hud22 2,333.9740.814
loun8 848.7237.983
scotland35 3,713.1337.851
shal26 2,758.33nan
multilingual7 742.6337.498
communications6 636.5436.686
they7 742.6336.057
society10 1,060.9034.524
ony28 2,970.5133.988
awthing7 742.6333.983
for100 10,608.9533.157
law11 1,166.9833.105
knowledge7 742.6332.994
nationality4 424.3632.824
chairter8 848.7232.738
purposes6 636.5432.531
ye16 1,697.4332.499
stair9 954.8131.943
necessar8 848.7231.757
names11 1,166.9831.709
til25 2,652.2431.677
scowth6 636.5431.219
terms9 954.8131.212
throu20 2,121.7931.115
the-day5 530.4531.087
conter6 636.5430.616
ability6 636.5430.616
naebody14 1,485.2530.473
ongaun7 742.6330.415
barcelona4 424.3630.207
continuity4 424.3630.207
assemlie4 424.3630.207
documentation4 424.3630.207
in218 23,127.5230.150
essential6 636.5430.044
whitiver6 636.5428.982
young17 1,803.5228.826
valued4 424.3628.303
appendix4 424.3628.303
haverin4 424.3628.303
fundamental5 530.4527.584
frae45 4,774.0327.564
unitit6 636.5427.556
gar8 848.7226.750
tongues6 636.5426.699
bit4 424.3625.941
i22 2,333.9725.690
diversity7 742.6325.615
educational6 636.5425.527
dignity5 530.4525.037
her18 1,909.6125.036
writers7 742.6324.596
speakers10 1,060.9024.535
airticles6 636.5424.468
responsibility5 530.4523.983
mairrage3 318.2723.876
boxroom3 318.2723.876
wuman's3 318.2723.876
occupations3 318.2723.876
resourcin3 318.2723.876
secont3 318.2723.876
naitural5 530.4523.498
beild5 530.4523.498
social10 1,060.9023.423
ither30 3,182.6923.142
role6 636.5422.907
me8 848.7222.845
means10 1,060.9022.610
this68 7,214.0922.218
grunds4 424.3621.922
domains3 318.2721.663
extinction3 318.2721.663
uise10 1,060.9021.627
status7 742.6321.588
sic18 1,909.6121.021
uised9 954.8120.949
general8 848.7220.782
aboot8 848.7220.129
howf3 318.2720.093
particular6 636.5419.681
faimily7 742.6319.508
public10 1,060.9019.380
respect5 530.4519.089
media8 848.7218.953
uiss7 742.6318.892
integral3 318.2718.868
teaching3 318.2718.868
siccar8 848.7218.832
folk15 1,591.3418.724
or58 6,153.1918.651
pairlament9 954.8118.574
smeddum5 530.4518.526
people10 1,060.9018.420
tongue9 954.8118.090
distinction4 424.3618.079
uisage3 318.2717.861
prejudice3 318.2717.861
tae264 28,007.6417.324
faut4 424.3617.268
presentation3 318.2717.007
proclaimed3 318.2717.007
action6 636.5416.904
professional5 530.4516.801
statements3 318.2716.265
develop4 424.3616.200
heritage5 530.4516.155
states5 530.4516.155
pairt14 1,485.2515.979
awready6 636.5415.835
chaumer5 530.4515.751
available5 530.4515.751
personality3 318.2715.609
pootch3 318.2715.609
somewey4 424.3615.567
political7 742.6315.558
members4 424.3615.271
member4 424.3615.271
security4 424.3615.271
fends2 212.1815.103
periodic2 212.1815.103
perceived2 212.1815.103
disposal2 212.1815.103
follae-up2 212.1815.103
reservit2 212.1815.103
intromission2 212.1815.103
donati2 212.1815.103
accordit2 212.1815.103
affirmit2 212.1815.103
apairtment2 212.1815.103
daily-day2 212.1815.103
gawpit2 212.1815.103
flegs2 212.1815.103
elementar2 212.1815.103
twined2 212.1815.103
branks2 212.1815.103
vennels2 212.1815.103
definin2 212.1815.103
maun15 1,591.3414.969
pouer5 530.4514.643
pey6 636.5414.491
cotton3 318.2714.490
entitled3 318.2714.490
discussion4 424.3614.449
up16 1,697.4314.396
document4 424.3614.195
rouble7 742.63nan
institutions3 318.2714.005
image5 530.4513.979
lane6 636.5413.886
their33 3,500.9513.679
sma7 742.6313.655
dreid3 318.2713.559
footer2 212.1813.395
tochered2 212.1813.395
plenishins2 212.1813.395
layoot2 212.1813.395
guddled2 212.1813.395
inherent2 212.1813.395
fent2 212.1813.395
whummelt2 212.1813.395
ootlined2 212.1813.395
warld-wide2 212.1813.395
miscawed2 212.1813.395
haurlie2 212.1813.395
day5 530.4513.328
identity5 530.4513.220
it75 7,956.7213.116
beer4 424.3613.049
an301 31,932.9513.039
yet10 1,060.9012.779
adverts3 318.2712.762
circumstances3 318.2712.762
needit5 530.4512.660
currently3 318.2712.404
active3 318.2712.404
intent3 318.2712.404
text5 530.4512.395
socially2 212.1812.228
dwams2 212.1812.228
drunks2 212.1812.228
person5 530.4512.138
thae13 1,379.1612.102
nerves3 318.2712.068
even18 1,909.6112.026
includin5 530.4512.013
situation5 530.4512.013
nor17 1,803.5211.926
thole6 636.5411.902
exact3 318.2711.751
vital3 318.2711.751
effective3 318.2711.751
countries3 318.2711.751
initiatives3 318.2711.751
yer5 530.4511.496
acts3 318.2711.453
sudden4 424.3611.377
yella4 424.3611.377
shuld2 212.1811.338
coherent2 212.1811.338
commonly2 212.1811.338
backgrun2 212.1811.338
ainly4 424.3611.215
includes3 318.2711.170
man's3 318.2711.170
conscience3 318.2711.170
national8 848.7211.124
had3 318.2710.939
purpose3 318.2710.901
jouk3 318.2710.901
fifty4 424.3610.752
state7 742.6310.703
street9 954.8110.702
gart5 530.4510.654
slavery2 212.1810.618
oral2 212.1810.618
peoples2 212.1810.618
practice4 424.3610.461
development4 424.3610.461
whan2 212.1810.380
we20 2,121.7910.375
himsel8 848.7210.330
forrit8 848.7210.276
place14 1,485.2510.268
back7 742.6310.193
speakin6 636.5410.143
hasna2 212.1810.014
peacefu2 212.1810.014
meisures2 212.1810.014
tassie2 212.1810.014
wemen2 212.1810.014
historically2 212.1810.014
minority6 636.549.993
regaird3 318.279.947
raskolnikov5 530.45nan
pairts5 530.459.964
authority3 318.279.947
justice3 318.279.947
get3 318.279.896
ay4 424.369.648
month6 636.549.563
amang8 848.729.551
jalouse4 424.369.536
ii3 318.279.529
switherin2 212.189.494
duly2 212.189.494
adoptit2 212.189.494
mairower2 212.189.494
his78 8,274.989.417
haud9 954.819.408
interest5 530.459.336
lanlady5 530.45nan
kirks2 212.189.494
presence3 318.279.333
group7 742.639.189
awfie3 318.279.143
than2 212.189.065
visitors2 212.189.039
trial2 212.189.039
linguists2 212.189.039
countrie5 530.45nan
territor5 530.45nan
sowels2 212.189.039
providin2 212.189.039
historic2 212.189.039
application2 212.189.039
niver10 1,060.908.989
weirin3 318.278.961
terrible3 318.278.961
furthset3 318.278.961
specific3 318.278.961
levels3 318.278.785
culture7 742.638.759
fundit2 212.188.634
religioun5 530.45nan
tenement2 212.188.634
toy2 212.188.634
spheres2 212.188.634
loan2 212.188.634
proper4 424.368.628
global3 318.278.615
policy5 530.458.527
trade3 318.278.451
books3 318.278.292
duties2 212.188.269
brigs2 212.188.269
analysis2 212.188.269
mcgugan2 212.188.269
free7 742.638.203
promote3 318.278.139
pen3 318.278.139
german4 424.368.124
doon9 954.818.030
expressed2 212.187.938
glent2 212.187.938
providit2 212.187.938
transmission4 424.36nan
waled2 212.187.938
freely2 212.187.938
maist16 1,697.437.872
recognition3 318.277.845
country5 530.457.808
government8 848.727.688
yinst2 212.187.635
grup2 212.187.635
skinklin2 212.187.635
penal4 424.36nan
bink2 212.187.635
there12 1,273.077.574
shairly3 318.277.569
forms3 318.277.569
groups4 424.367.485
hat3 318.277.437
literary3 318.277.437
lands2 212.187.356
needna2 212.187.356
precious2 212.187.356
thirteen2 212.187.356
whuther2 212.187.356
humanity2 212.187.356
clerk2 212.187.356
effecks2 212.187.356
wee9 954.817.343
curriculum3 318.277.308
declaration'4 424.36nan
personal3 318.277.183
withoot5 530.457.162
forby7 742.637.160
signs4 424.367.151
realise3 318.277.061
yon11 1,166.986.986
foond3 318.276.942
cross4 424.366.914
heat4 424.366.837
resources3 318.276.827
alane4 424.366.761
orra4 424.366.761
hoose2 212.186.743
is79 8,381.076.726
keech2 212.186.631
foregaun4 424.36nan
suddent2 212.186.631
then3 318.276.620
worth4 424.366.613
expression3 318.276.604
a236 25,037.136.539
passin3 318.276.497
gear4 424.366.469
siller7 742.636.403
native3 318.276.392
enjoy3 318.276.392
dictionary3 318.276.290
juist2 212.186.275
of5 530.456.222
definition2 212.186.222
translation2 212.186.222
sets3 318.276.190
seein5 530.456.152
sindry4 424.366.125
the602 63,865.906.091
been8 848.726.053
priority2 212.186.034
seek3 318.275.997
medium3 318.275.997
etc3 318.275.997
meet4 424.365.994
future4 424.365.994
year2 212.185.944
research3 318.275.904
ae7 742.635.889
business4 424.365.866
stevenson2 212.185.857
december2 212.185.857
bilingual2 212.185.857
study3 318.275.812
ower11 1,166.985.761
redd3 318.275.723
tool2 212.185.688
interests2 212.185.688
gaird2 212.185.688
standart2 212.185.688
guilty2 212.185.688
nicht2 212.185.630
unner6 636.545.619
ile2 212.185.528
prent2 212.185.528
keys2 212.185.528
common4 424.365.444
see7 742.635.397
kennin4 424.365.387
vizzied2 212.185.375
stepped2 212.185.375
association2 212.185.375
thon2 212.185.345
ain18 1,909.615.304
mindit3 318.275.303
regional3 318.275.303
lear2 212.185.230
pressure2 212.185.230
speech3 318.275.224
britain3 318.275.224
upmak3 318.27nan
kinds2 212.185.230
space4 424.365.110
recognise2 212.185.091
hecht2 212.185.091
awthegither2 212.185.091
respeck2 212.185.091
yours2 212.185.091
literature3 318.274.996
recognised2 212.184.958
regairdit2 212.184.958
mainteen3 318.27nan
unique2 212.184.958
meenit4 424.364.952
micht2 212.184.923
nane5 530.454.919
information3 318.274.851
warks3 318.274.851
landin2 212.184.830
stapped2 212.184.830
nummer5 530.454.721
hauds3 318.274.712
copecks3 318.27nan
trauchle2 212.184.830
importance2 212.184.707
spite2 212.184.707
skaith2 212.184.707
penalised3 318.27nan
kythe2 212.185.091
safegairdit3 318.27nan
man17 1,803.524.761
communication2 212.184.707
flair4 424.364.652
mak14 1,485.254.600
belief2 212.184.589
express2 212.184.589
else5 530.454.568
awa5 530.454.550
force3 318.274.511
kenspeckle3 318.274.511
lead3 318.274.511
naither2 212.184.476
ayewis2 212.184.476
colin2 212.184.476
stertit5 530.454.456
theirsels3 318.274.384
patient2 212.184.367
opportunities2 212.184.367
scottish11 1,166.984.296
hauden2 212.184.261
examples2 212.184.261
langer3 318.274.200
wide3 318.274.200
material2 212.184.160
fifteen2 212.184.160
exist2 212.184.160
meanin3 318.274.083
quate3 318.274.083
position2 212.184.061
student2 212.184.061
awlinguistic3 318.27nan
wes3 318.273.981
comatee3 318.273.969
isles2 212.183.967
positive2 212.183.967
historical2 212.183.967
coort2 212.183.967
bi5 530.453.952
by23 2,440.063.949
us6 636.543.918
steer2 212.183.875
accurate3 318.27nan
www2 212.183.875
are19 2,015.703.841
pairty4 424.363.823
at46 4,880.123.801
scunner2 212.183.786
braes2 212.183.786
office3 318.273.753
as73 7,744.543.713
mairch3 318.273.701
msp2 212.183.700
staundin2 212.183.700
influence2 212.183.700
education6 636.543.682
sir3 318.273.650
british3 318.273.600
efter5 530.453.573
lobby2 212.183.535
byordinar2 212.183.535
fou3 318.273.501
shame2 212.183.457
claes5 530.453.418
pass3 318.273.405
drama2 212.183.380
meetin2 212.183.380
sair6 636.543.357
some7 742.633.354
no16 1,697.433.308
cairryin2 212.183.306
decide2 212.183.306
alyona3 318.27nan
steel2 212.183.306
choice2 212.183.234
sowel2 212.183.234
ivaanovna3 318.27nan
cam3 318.273.291
pitten3 318.273.223
wid5 530.453.215
aye21 2,227.883.211
said7 742.633.206
livin3 318.273.135
reality2 212.183.096
thirty2 212.183.096
that77 8,168.892.991
new3 318.272.990
reason3 318.272.967
stour2 212.182.965
derk2 212.182.965
kist2 212.182.965
aften4 424.362.923
sang4 424.362.893
forderance2 212.18nan
oor8 848.722.884
learn3 318.272.848
howfs2 212.18nan
intil6 636.542.841
scotland's2 212.182.840
level2 212.182.840
mony9 954.812.836
dae6 636.542.820
wi85 9,017.612.794
aff7 742.632.786
confidence2 212.182.780
joy2 212.182.780
needs3 318.272.733
peace2 212.182.722
windaes2 212.182.722
taen7 742.632.700
square2 212.182.665
were6 636.542.642
lang5 530.452.628
order3 318.272.623
mindin2 212.182.609
register2 212.182.555
clattie2 212.18nan
race2 212.182.609
mair27 2,864.422.547
history4 424.362.525
body6 636.542.522
maitter3 318.272.517
thocht13 1,379.162.508
ring2 212.182.502
gien7 742.632.493
tent4 424.362.472
how7 742.632.460
generations2 212.182.450
shape2 212.182.400
least4 424.362.394
same9 954.812.391
didna10 1,060.902.378
door3 318.272.364
will4 424.362.343
present2 212.182.302
plain2 212.182.302
thrang2 212.182.302
process2 212.182.302
spoken3 318.272.285
act3 318.272.285
loss2 212.182.255
bell2 212.182.255
students2 212.182.255
bein3 318.272.250
hert5 530.452.221
naething3 318.272.192
him18 1,909.612.191
auld18 1,909.612.189
if9 954.812.163
cried5 530.452.124
needin2 212.182.119
hame4 424.362.102
maks4 424.362.082
whether2 212.182.076
sib2 212.182.076
she's2 212.182.076
thinkin4 424.362.038
tackie2 212.18nan
twa8 848.722.022
watch3 318.272.017
staunin3 318.272.017
faur5 530.452.011
oot31 3,288.782.003
haen2 212.181.992
ach2 212.181.992
land3 318.271.989
them10 1,060.901.964
gettin5 530.451.957
every4 424.361.950
-4 424.361.950
lit2 212.181.911
growin2 212.181.911
wis82 8,699.341.838
words5 530.451.834
fit6 636.541.812
gane2 212.181.797
accoont2 212.181.797
steps2 212.181.797
agin4 424.361.785
thir7 742.631.730
wark6 636.541.700
address2 212.181.689
edinburgh3 318.271.678
system2 212.181.655
uggin2 212.18nan
need7 742.631.642
hendry2 212.18nan
lassies2 212.181.655
tho7 742.631.642
past5 530.451.637
haudin3 318.271.631
fitnotes2 212.18nan
whilk2 212.181.621
key2 212.181.555
win7 742.631.487
ocht2 212.181.461
darg2 212.181.461
cawed2 212.181.431
guideen2 212.18nan
muckle5 530.451.414
pittin3 318.271.410
richt-haun2 212.18nan
sax2 212.181.402
neb2 212.181.373
that's5 530.451.372
cast2 212.181.344
gied2 212.181.342
case3 318.271.329
ettle2 212.181.262
kent4 424.361.255
wisna6 636.541.232
heid6 636.541.231
english10 1,060.901.224
since3 318.271.214
poetry2 212.181.210
lass2 212.181.210
i'm3 318.271.177
stert3 318.271.177
may2 212.181.159
fowrth2 212.18nan
hinnert2 212.18nan
ettlin2 212.181.135
chynge2 212.181.135
maunbe2 212.18nan
set2 212.181.683
strangbox2 212.18nan
last8 848.721.696
roon3 318.271.125
thing8 848.721.113
daft2 212.181.111
bodies2 212.181.111
biggit2 212.181.087
sort2 212.181.064
ben4 424.361.033
won2 212.181.019
room2 212.181.017
intae12 1,273.071.016
got6 636.541.001
example2 212.180.997
colour2 212.180.997
years2 212.180.992
road6 636.540.991
footers2 212.18nan
speak4 424.361.059
leid4 424.360.987
duin3 318.270.945
press2 212.180.933
ane12 1,273.070.930
whit16 1,697.430.919
again5 530.450.898
baith8 848.720.893
europe2 212.180.853
centre2 212.180.834
readin2 212.180.815
maum2 212.18nan
writin3 318.270.815
warld5 530.450.814
cauld2 212.180.799
cry2 212.180.797
ken9 954.810.793
heard2 212.180.775
far3 318.270.762
shair2 212.180.761
on52 5,516.660.727
gets2 212.180.710
kitchen2 212.180.693
misdoot2 212.18nan
fair6 636.540.703
corner2 212.180.693
born2 212.180.677
uk2 212.180.677
european2 212.180.661
afore14 1,485.250.659
ablow2 212.180.645
een9 954.810.631
better3 318.270.628
wantin2 212.180.615
hingin2 212.180.585
men2 212.180.578
hae26 2,758.330.565
braid2 212.180.557
likely2 212.180.530
thochts2 212.180.503
dark2 212.180.490
easy2 212.180.490
wull2 212.180.471
onyither2 212.18nan
kind3 318.270.510
keep3 318.270.464
comin2 212.180.452
help2 212.180.445
alang3 318.270.442
our2 212.180.441
referent2 212.18nan
dinna3 318.270.437
strang2 212.180.407
orally2 212.18nan
weel9 954.810.420
wulsome2 212.18nan
roond3 318.270.408
while3 318.270.390
whiles3 318.270.383
it's9 954.810.375
doun5 530.450.370
love2 212.180.353
syne5 530.450.322
canna4 424.360.302
stood2 212.180.296
understaunding2 212.18nan
care2 212.180.530
compluterance2 212.18nan
atween3 318.270.515
really2 212.180.282
makkin3 318.270.279
report2 212.180.278
whaur8 848.720.265
first7 742.630.256
next3 318.270.252
haein3 318.270.252
deep2 212.180.237
hoo2 212.180.226
leave2 212.180.221
beilding2 212.18nan
toun2 212.180.221
thegither4 424.360.220
gaelic6 636.540.213
eh2 212.180.203
but34 3,607.040.202
manifestations2 212.18nan
here9 954.810.203
reduction2 212.18nan
could9 954.810.219
met2 212.180.193
left4 424.360.192
somethin3 318.270.188
real2 212.180.186
ilk2 212.180.179
bide2 212.180.171
disna2 212.180.166
sun2 212.180.163
done2 212.180.160
sister2 212.180.160
tap2 212.180.158
taks2 212.180.148
feart2 212.180.142
let3 318.270.141
gie8 848.720.138
high2 212.180.136
late2 212.180.136
disnae2 212.180.136
noo14 1,485.250.125
am2 212.180.119
brocht3 318.270.118
gin8 848.720.107
come7 742.630.105
end4 424.360.103
gey4 424.360.094
opinioun2 212.18nan
support2 212.180.110
sae16 1,697.430.092
like26 2,758.330.089
bairns4 424.360.087
gaun5 530.450.086
ee3 318.270.078
why2 212.180.075
life5 530.450.073
made6 636.540.071
seen7 742.630.070
gaed5 530.450.056
pit9 954.810.055
onsets2 212.18nan
nae28 2,970.510.051
jist15 1,591.340.048
wad11 1,166.980.045
turn2 212.180.042
wulsomely2 212.18nan
yae2 212.180.041
word2 212.180.041
mean3 318.270.038
sat2 212.180.038
hair2 212.180.033
commonties'2 212.18nan
ilka3 318.270.033
loses2 212.18nan
different3 318.270.075
great3 318.270.028
gies2 212.180.026
open2 212.180.025
few2 212.180.020
hauf2 212.180.020
fower2 212.180.020
deid2 212.180.018
itsel2 212.180.018
near4 424.360.018
tak8 848.720.018
fell2 212.180.017
staund-alane2 212.18nan
anither6 636.540.017
look4 424.360.014
took3 318.270.012
scott2 212.180.012
table2 212.180.009
masel3 318.270.007
governance2 212.18nan
five2 212.180.006
can14 1,485.250.005
time17 1,803.520.005
wey9 954.810.004
he82 8,699.340.004
guid10 1,060.900.004
looked2 212.180.003
mind7 742.630.003
aince2 212.180.002
less2 212.180.002
yin7 742.630.001
territours2 212.18nan
ithers2 212.180.001
times3 318.270.000
still7 742.630.000
black2 212.180.000
mebbe2 212.180.000