A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Donati, Colin

Basic Stats

Total words by this author in corpus - 9,426
Total unique words used by this author in corpus - 2,390
Ratio of total words to unique words - 3.944
Tagged as LAL (General Central) dialect.
Top ten most common words - the, o, an, tae, a, in, scots, and, be, for,

List of texts in corpus

Universal Declaration o Human Richts
Scots Language Centre (01-01-2004) in Central dialect (LAL), categorised as government (1,827 words)

Scots A Statement o Principles
Scots Parliament (2003-03 ) in Central dialect (LAL), categorised as government (4,241 words)
The Student and the Pawnwife
Itchy Coo (2003-12-08 ) in Central dialect (LAL), categorised as prose (3,358 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
airticle45 4,774.03295.011
language77 8,168.89205.101
richts35 3,713.13205.096
awbody33 3,500.95192.939
hes35 3,713.13168.020
declaration25 2,652.24162.854
principles24 2,546.15151.430
scots123 13,049.01144.292
shuid30 3,182.69123.996
universal18 1,909.61121.472
linguistic27 2,864.42118.843
o347 36,813.0799.228
freedoms10 1,060.9087.562
haill19 2,015.7087.135
its54 5,728.8483.490
commonties11 1,166.9881.705
equal14 1,485.2579.793
entitelt9 954.8178.337
ful12 1,273.0770.214
wuman11 1,166.9867.461
international14 1,485.2567.078
commonty10 1,060.9066.700
aw71 7,532.3665.981
she5 530.4563.785
forsameikle7 742.6359.959
richt46 4,880.1255.851
uphaud14 1,485.2555.520
be105 11,139.4051.780
trades6 636.5450.821
freedom11 1,166.9849.854
statement10 1,060.9049.800
nations9 954.8147.211
discrimination7 742.6346.688
ma6 636.5444.405
human15 1,591.3443.899
and120 12,730.7442.887
learning7 742.6342.111
signpostin5 530.4541.732
kenning5 530.4541.732
languages16 1,697.4341.168
has23 2,440.0640.956
cultural14 1,485.2540.647
loun8 848.7237.761
shal26 2,758.33nan
hud22 2,333.9740.292
multilingual7 742.6337.302
scotland35 3,713.1337.134
communications6 636.5436.517
they7 742.6335.889
society10 1,060.9035.731
awthing7 742.6333.788
ony28 2,970.5133.647
law11 1,166.9833.000
knowledge7 742.6332.800
nationality4 424.3632.710
chairter8 848.7232.520
names11 1,166.9832.459
purposes6 636.5432.363
til25 2,652.2432.273
for100 10,608.9532.235
ye16 1,697.4332.090
stair9 954.8131.953
necessar8 848.7231.859
throu20 2,121.7931.415
scowth6 636.5431.052
terms9 954.8130.972
the-day5 530.4530.946
ability6 636.5430.449
conter6 636.5430.449
naebody14 1,485.2530.343
in218 23,127.5230.267
ongaun7 742.6330.223
barcelona4 424.3630.094
assemlie4 424.3630.094
continuity4 424.3630.094
documentation4 424.3630.094
essential6 636.5429.878
young17 1,803.5228.887
whitiver6 636.5428.815
valued4 424.3628.189
haverin4 424.3628.189
appendix4 424.3628.189
unitit6 636.5427.845
frae45 4,774.0327.792
fundamental5 530.4527.444
gar8 848.7226.760
tongues6 636.5426.534
i22 2,333.9726.159
bit4 424.3626.057
her18 1,909.6125.699
diversity7 742.6325.427
educational6 636.5425.362
dignity5 530.4524.898
airticles6 636.5424.646
speakers10 1,060.9024.412
writers7 742.6324.409
responsibility5 530.4523.845
occupations3 318.2723.791
wuman's3 318.2723.791
secont3 318.2723.791
mairrage3 318.2723.791
boxroom3 318.2723.791
resourcin3 318.2723.791
role6 636.5423.653
ither30 3,182.6923.396
beild5 530.4523.360
naitural5 530.4523.360
social10 1,060.9023.173
me8 848.7222.699
this68 7,214.0922.490
means10 1,060.9022.475
grunds4 424.3621.810
domains3 318.2721.577
extinction3 318.2721.577
status7 742.6321.404
uise10 1,060.9021.381
general8 848.7221.007
uised9 954.8120.962
sic18 1,909.6120.936
aboot8 848.7220.129
howf3 318.2720.008
particular6 636.5419.737
faimily7 742.6319.649
public10 1,060.9019.140
or58 6,153.1918.956
respect5 530.4518.954
folk15 1,591.3418.931
teaching3 318.2718.783
integral3 318.2718.783
media8 848.7218.753
uiss7 742.6318.712
siccar8 848.7218.633
tongue9 954.8118.554
smeddum5 530.4518.391
pairlament9 954.8118.356
people10 1,060.9018.267
distinction4 424.3617.968
uisage3 318.2717.777
prejudice3 318.2717.777
tae264 28,007.6417.461
faut4 424.3617.159
proclaimed3 318.2716.923
presentation3 318.2716.923
members4 424.3616.785
action6 636.5416.749
professional5 530.4516.668
statements3 318.2716.181
develop4 424.3616.091
states5 530.4516.023
heritage5 530.4516.023
awready6 636.5415.975
pairt14 1,485.2515.877
chaumer5 530.4515.619
available5 530.4515.619
political7 742.6315.607
personality3 318.2715.525
pootch3 318.2715.525
somewey4 424.3615.459
security4 424.3615.162
member4 424.3615.162
branks2 212.1815.046
affirmit2 212.1815.046
daily-day2 212.1815.046
elementar2 212.1815.046
flegs2 212.1815.046
periodic2 212.1815.046
follae-up2 212.1815.046
disposal2 212.1815.046
donati2 212.1815.046
intromission2 212.1815.046
reservit2 212.1815.046
fends2 212.1815.046
apairtment2 212.1815.046
gawpit2 212.1815.046
accordit2 212.1815.046
perceived2 212.1815.046
twined2 212.1815.046
definin2 212.1815.046
vennels2 212.1815.046
maun15 1,591.3414.834
pouer5 530.4514.513
pey6 636.5414.466
cotton3 318.2714.407
entitled3 318.2714.407
up16 1,697.4314.346
discussion4 424.3614.341
document4 424.3614.088
institutions3 318.2713.922
image5 530.4513.850
an301 31,932.9513.622
rouble7 742.63nan
lane6 636.5413.854
their33 3,500.9513.533
sma7 742.6313.487
dreid3 318.2713.476
day5 530.4513.445
miscawed2 212.1813.338
fent2 212.1813.338
plenishins2 212.1813.338
layoot2 212.1813.338
guddled2 212.1813.338
haurlie2 212.1813.338
warld-wide2 212.1813.338
ootlined2 212.1813.338
inherent2 212.1813.338
footer2 212.1813.338
tochered2 212.1813.338
whummelt2 212.1813.338
it75 7,956.7213.123
identity5 530.4513.092
beer4 424.3612.943
adverts3 318.2712.680
circumstances3 318.2712.680
needit5 530.4512.670
yet10 1,060.9012.618
currently3 318.2712.322
active3 318.2712.322
intent3 318.2712.322
text5 530.4512.269
socially2 212.1812.171
dwams2 212.1812.171
drunks2 212.1812.171
ainly4 424.3612.149
includin5 530.4512.140
nor17 1,803.5212.096
person5 530.4512.013
nerves3 318.2711.986
effective3 318.2711.986
thae13 1,379.1611.924
even18 1,909.6111.907
situation5 530.4511.888
thole6 636.5411.851
vital3 318.2711.670
countries3 318.2711.670
exact3 318.2711.670
initiatives3 318.2711.670
sudden4 424.3611.439
conscience3 318.2711.372
includes3 318.2711.372
acts3 318.2711.372
backgrun2 212.1811.281
shuld2 212.1811.281
commonly2 212.1811.281
coherent2 212.1811.281
yella4 424.3611.274
yer5 530.4511.191
man's3 318.2711.089
national8 848.7211.007
street9 954.8110.864
jouk3 318.2710.821
purpose3 318.2710.821
state7 742.6310.814
fifty4 424.3610.800
gart5 530.4510.636
place14 1,485.2510.605
oral2 212.1810.562
wemen2 212.1810.562
slavery2 212.1810.562
peoples2 212.1810.562
had3 318.2710.532
practice4 424.3610.502
back7 742.6310.453
whan2 212.1810.392
development4 424.3610.359
forrit8 848.7210.267
himsel8 848.7210.158
speakin6 636.5410.079
peacefu2 212.189.958
hasna2 212.189.958
tassie2 212.189.958
meisures2 212.189.958
historically2 212.189.958
get3 318.279.943
we20 2,121.799.897
justice3 318.279.868
authority3 318.279.868
regaird3 318.279.868
minority6 636.549.855
pairts5 530.459.844
ay4 424.369.828
amang8 848.729.684
interest5 530.459.568
ii3 318.279.450
duly2 212.189.439
raskolnikov5 530.45nan
kirks2 212.189.439
mairower2 212.189.439
adoptit2 212.189.439
lanlady5 530.45nan
switherin2 212.189.439
jalouse4 424.369.436
month6 636.549.426
his78 8,274.989.389
haud9 954.819.269
presence3 318.279.254
niver10 1,060.909.171
than2 212.189.130
group7 742.639.093
awfie3 318.279.065
trial2 212.188.984
historic2 212.188.984
countrie5 530.45nan
sowels2 212.188.984
application2 212.188.984
linguists2 212.188.984
visitors2 212.188.984
providin2 212.188.984
weirin3 318.278.883
specific3 318.278.883
territor5 530.45nan
religioun5 530.45nan
terrible3 318.278.883
furthset3 318.278.883
levels3 318.278.707
culture7 742.638.661
fundit2 212.188.579
yinst2 212.188.579
tenement2 212.188.579
toy2 212.188.579
loan2 212.188.579
spheres2 212.188.579
global3 318.278.538
proper4 424.368.530
policy5 530.458.411
trade3 318.278.374
books3 318.278.215
expressed2 212.188.215
brigs2 212.188.215
duties2 212.188.215
mcgugan2 212.188.215
whuther2 212.188.215
analysis2 212.188.215
free7 742.638.105
promote3 318.278.062
pen3 318.278.062
german4 424.368.027
doon9 954.817.976
transmission4 424.36nan
country5 530.457.971
providit2 212.187.884
glent2 212.187.884
freely2 212.187.884
waled2 212.187.884
maist16 1,697.437.881
recognition3 318.277.769
precious2 212.187.581
skinklin2 212.187.581
grup2 212.187.581
lands2 212.187.581
penal4 424.36nan
bink2 212.187.581
there12 1,273.077.544
government8 848.727.530
shairly3 318.277.493
forms3 318.277.493
literary3 318.277.493
groups4 424.367.390
foond3 318.277.361
hat3 318.277.361
needna2 212.187.302
effecks2 212.187.302
clerk2 212.187.302
humanity2 212.187.302
declaration'4 424.36nan
thirteen2 212.187.302
curriculum3 318.277.232
wee9 954.817.211
withoot5 530.457.174
personal3 318.277.108
forby7 742.637.062
signs4 424.367.057
alane4 424.367.057
realise3 318.276.986
cross4 424.366.898
yon11 1,166.986.823
heat4 424.366.821
resources3 318.276.752
hoose2 212.186.745
is79 8,381.076.699
orra4 424.366.668
worth4 424.366.594
foregaun4 424.36nan
suddent2 212.186.578
keech2 212.186.578
expression3 318.276.530
native3 318.276.530
passin3 318.276.530
gear4 424.366.521
enjoy3 318.276.423
then3 318.276.417
juist2 212.186.364
of5 530.456.349
siller7 742.636.267
seein5 530.456.254
been8 848.726.221
sets3 318.276.217
dictionary3 318.276.217
translation2 212.186.169
definition2 212.186.169
a236 25,037.136.129
ae7 742.636.107
meet4 424.366.101
future4 424.366.035
sindry4 424.366.035
year2 212.186.015
priority2 212.185.982
december2 212.185.982
bilingual2 212.185.982
etc3 318.275.925
medium3 318.275.925
study3 318.275.925
seek3 318.275.925
ower11 1,166.985.878
business4 424.365.840
research3 318.275.832
redd3 318.275.832
interests2 212.185.805
stevenson2 212.185.805
unner6 636.545.696
tool2 212.185.636
gaird2 212.185.636
standart2 212.185.636
guilty2 212.185.636
nicht2 212.185.550
ile2 212.185.477
keys2 212.185.477
prent2 212.185.477
common4 424.365.473
see7 742.635.407
yours2 212.185.324
vizzied2 212.185.324
stepped2 212.185.324
association2 212.185.324
hecht2 212.185.324
kennin4 424.365.299
ain18 1,909.615.291
mindit3 318.275.233
regional3 318.275.233
kinds2 212.185.179
upmak3 318.27nan
recognised2 212.185.179
kythe2 212.185.179
lear2 212.185.179
stapped2 212.185.179
pressure2 212.185.179
britain3 318.275.154
speech3 318.275.154
the602 63,865.905.108
thon2 212.185.053
awthegither2 212.185.040
recognise2 212.185.040
respeck2 212.185.040
mainteen3 318.27nan
space4 424.365.024
copecks3 318.27nan
micht2 212.184.976
nane5 530.454.942
unique2 212.184.907
regairdit2 212.184.907
landin2 212.184.907
man17 1,803.524.786
penalised3 318.27nan
meenit4 424.364.867
safegairdit3 318.27nan
literature3 318.274.927
information3 318.274.782
warks3 318.274.782
trauchle2 212.184.780
else5 530.454.700
nummer5 530.454.661
importance2 212.184.657
skaith2 212.184.657
communication2 212.184.657
spite2 212.184.657
hauds3 318.274.643
mak14 1,485.254.617
awa5 530.454.583
flair4 424.364.568
express2 212.184.540
belief2 212.184.540
colin2 212.184.540
force3 318.274.509
lead3 318.274.509
kenspeckle3 318.274.444
naither2 212.184.426
ayewis2 212.184.426
stertit5 530.454.397
material2 212.184.317
patient2 212.184.317
opportunities2 212.184.317
theirsels3 318.274.317
hauden2 212.184.212
examples2 212.184.212
langer3 318.274.194
wide3 318.274.194
scottish11 1,166.984.138
exist2 212.184.111
fifteen2 212.184.111
by23 2,440.064.076
meanin3 318.274.017
quate3 318.274.017
bi5 530.454.014
student2 212.184.013
position2 212.184.013
awlinguistic3 318.27nan
isles2 212.184.013
historical2 212.183.918
positive2 212.183.918
coort2 212.183.918
at46 4,880.123.914
comatee3 318.273.904
www2 212.183.827
steer2 212.183.827
accurate3 318.27nan
wes3 318.273.821
mairch3 318.273.795
are19 2,015.703.766
pairty4 424.363.745
braes2 212.183.738
scunner2 212.183.738
as73 7,744.543.696
office3 318.273.689
us6 636.543.677
byordinar2 212.183.652
msp2 212.183.652
meetin2 212.183.652
influence2 212.183.652
staundin2 212.183.652
education6 636.543.606
sir3 318.273.586
british3 318.273.536
efter5 530.453.515
lobby2 212.183.488
shame2 212.183.488
fou3 318.273.438
no16 1,697.433.428
some7 742.633.419
claes5 530.453.416
cam3 318.273.385
wid5 530.453.365
pass3 318.273.343
steel2 212.183.334
alyona3 318.27nan
drama2 212.183.334
cairryin2 212.183.260
decide2 212.183.260
sair6 636.543.259
sowel2 212.183.188
choice2 212.183.188
ivaanovna3 318.27nan
said7 742.633.244
aye21 2,227.883.181
pitten3 318.273.161
reality2 212.183.118
livin3 318.273.074
new3 318.273.066
thirty2 212.183.050
kist2 212.182.984
that77 8,168.892.928
derk2 212.182.920
stour2 212.182.920
sang4 424.362.912
reason3 318.272.908
aften4 424.362.881
forderance2 212.18nan
learn3 318.272.867
dae6 636.542.851
joy2 212.182.796
confidence2 212.182.796
scotland's2 212.182.796
level2 212.182.796
aff7 742.632.792
needs3 318.272.789
mony9 954.812.779
intil6 636.542.749
peace2 212.182.736
wi85 9,017.612.712
history4 424.362.704
windaes2 212.182.678
lang5 530.452.656
taen7 742.632.654
oor8 848.722.653
howfs2 212.18nan
square2 212.182.678
order3 318.272.638
clattie2 212.18nan
mindin2 212.182.621
mair27 2,864.422.600
were6 636.542.597
race2 212.182.566
register2 212.182.512
thocht13 1,379.162.509
body6 636.542.471
gien7 742.632.463
maitter3 318.272.460
ring2 212.182.459
generations2 212.182.459
shape2 212.182.459
door3 318.272.457
tent4 424.362.456
loss2 212.182.408
will4 424.362.403
how7 742.632.364
same9 954.812.334
least4 424.362.327
thrang2 212.182.308
present2 212.182.308
act3 318.272.262
didna10 1,060.902.261
process2 212.182.260
plain2 212.182.260
bein3 318.272.233
him18 1,909.612.232
spoken3 318.272.230
students2 212.182.213
bell2 212.182.213
whether2 212.182.167
hert5 530.452.146
wis82 8,699.342.141
naething3 318.272.138
if9 954.812.135
needin2 212.182.122
auld18 1,909.612.086
watch3 318.272.079
cried5 530.452.069
hame4 424.362.052
she's2 212.182.035
tackie2 212.18nan
maks4 424.362.041
sib2 212.182.035
oot31 3,288.782.027
thinkin4 424.362.019
haen2 212.181.993
ach2 212.181.993
staunin3 318.271.964
twa8 848.721.958
lit2 212.181.952
faur5 530.451.939
gettin5 530.451.939
land3 318.271.936
words5 530.451.903
every4 424.361.888
-4 424.361.888
growin2 212.181.872
gane2 212.181.872
fit6 636.541.861
them10 1,060.901.850
agin4 424.361.846
accoont2 212.181.758
steps2 212.181.758
ocht2 212.181.722
set2 212.181.693
last8 848.721.687
lassies2 212.181.686
wark6 636.541.680
system2 212.181.651
past5 530.451.633
hendry2 212.18nan
thir7 742.631.672
edinburgh3 318.271.629
strangbox2 212.18nan
tho7 742.631.621
need7 742.631.584
whilk2 212.181.584
haudin3 318.271.582
key2 212.181.518
muckle5 530.451.492
darg2 212.181.456
cawed2 212.181.456
maunbe2 212.18nan
uggin2 212.18nan
guideen2 212.18nan
fitnotes2 212.18nan
address2 212.181.651
win7 742.631.408
pittin3 318.271.405
sax2 212.181.396
neb2 212.181.396
case3 318.271.384
gied2 212.181.375
cast2 212.181.366
that's5 530.451.310
kent4 424.361.303
thing8 848.721.302
since3 318.271.245
richt-haun2 212.18nan
ettle2 212.181.228
heid6 636.541.216
lass2 212.181.202
wisna6 636.541.166
english10 1,060.901.152
fowrth2 212.18nan
poetry2 212.181.176
hinnert2 212.18nan
may2 212.181.176
stert3 318.271.134
i'm3 318.271.134
ettlin2 212.181.126
chynge2 212.181.102
daft2 212.181.102
biggit2 212.181.078
bodies2 212.181.078
roon3 318.271.074
sort2 212.181.055
intae12 1,273.071.053
leid4 424.361.049
speak4 424.361.037
footers2 212.18nan
room2 212.181.034
road6 636.541.008
whit16 1,697.431.000
won2 212.180.987
colour2 212.180.987
ben4 424.360.984
years2 212.180.975
example2 212.180.965
again5 530.450.921
duin3 318.270.921
got6 636.540.917
press2 212.180.902
baith8 848.720.886
ane12 1,273.070.874
cauld2 212.180.836
shair2 212.180.824
centre2 212.180.824
europe2 212.180.824
ken9 954.810.816
maum2 212.18nan
writin3 318.270.805
heard2 212.180.804
warld5 530.450.803
readin2 212.180.786
cry2 212.180.786
far3 318.270.766
kitchen2 212.180.716
corner2 212.180.716
gets2 212.180.699
on52 5,516.660.686
born2 212.180.682
fair6 636.540.682
afore14 1,485.250.660
uk2 212.180.650
hae26 2,758.330.648
european2 212.180.634
better3 318.270.623
ablow2 212.180.619
een9 954.810.598
braid2 212.180.560
misdoot2 212.18nan
wantin2 212.180.589
hingin2 212.180.560
men2 212.180.552
likely2 212.180.518
onyither2 212.18nan
atween3 318.270.515
thochts2 212.180.505
care2 212.180.505
dark2 212.180.492
keep3 318.270.486
kind3 318.270.480
easy2 212.180.479
dinna3 318.270.469
comin2 212.180.464
alang3 318.270.436
wulsome2 212.18nan
help2 212.180.458
compluterance2 212.18nan
orally2 212.18nan
referent2 212.18nan
our2 212.180.454
strang2 212.180.430
whiles3 318.270.398
while3 318.270.363
love2 212.180.353
syne5 530.450.346
wull2 212.180.341
weel9 954.810.338
doun5 530.450.335
it's9 954.810.328
stood2 212.180.295
really2 212.180.291
first7 742.630.289
makkin3 318.270.282
whaur8 848.720.282
canna4 424.360.274
report2 212.180.268
haein3 318.270.249
deep2 212.180.243
reduction2 212.18nan
next3 318.270.243
thegither4 424.360.220
eh2 212.180.219
toun2 212.180.205
met2 212.180.205
leave2 212.180.205
but34 3,607.040.200
here9 954.810.200
could9 954.810.188
manifestations2 212.18nan
beilding2 212.18nan
left4 424.360.198
hoo2 212.180.186
bide2 212.180.186
gaelic6 636.540.183
understaunding2 212.18nan
roond3 318.270.389
somethin3 318.270.179
real2 212.180.177
ilk2 212.180.170
sun2 212.180.164
done2 212.180.151
disna2 212.180.151
let3 318.270.148
taks2 212.180.146
feart2 212.180.146
sister2 212.180.146
gie8 848.720.145
disnae2 212.180.140
late2 212.180.140
tap2 212.180.139
high2 212.180.134
brocht3 318.270.131
come7 742.630.119
opinioun2 212.18nan
gey4 424.360.114
bairns4 424.360.106
am2 212.180.104
support2 212.180.103
gaun5 530.450.102
noo14 1,485.250.093
end4 424.360.093
sae16 1,697.430.092
different3 318.270.078
life5 530.450.073
gaed5 530.450.073
why2 212.180.072
like26 2,758.330.070
onsets2 212.18nan
gin8 848.720.093
made6 636.540.068
ee3 318.270.066
jist15 1,591.340.060
pit9 954.810.057
loses2 212.18nan
seen7 742.630.057
wulsomely2 212.18nan
nae28 2,970.510.048
turn2 212.180.044
sat2 212.180.042
yin7 742.630.039
hair2 212.180.037
ilka3 318.270.037
commonties'2 212.18nan
word2 212.180.036
yae2 212.180.034
open2 212.180.033
near4 424.360.031
wad11 1,166.980.030
mean3 318.270.029
great3 318.270.027
fell2 212.180.023
hauf2 212.180.023
gies2 212.180.020
few2 212.180.019
deid2 212.180.019
itsel2 212.180.016
staund-alane2 212.18nan
anither6 636.540.017
tak8 848.720.014
table2 212.180.012
guid10 1,060.900.010
scott2 212.180.010
look4 424.360.010
fower2 212.180.009
can14 1,485.250.008
looked2 212.180.006
masel3 318.270.006
he82 8,699.340.005
governance2 212.18nan
took3 318.270.007
ithers2 212.180.005
five2 212.180.004
time17 1,803.520.004
wey9 954.810.002
less2 212.180.002
times3 318.270.002
territours2 212.18nan
mind7 742.630.002
mebbe2 212.180.001
aince2 212.180.001
still7 742.630.001
black2 212.180.000