A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Donati, Colin

Basic Stats

Total words by this author in corpus - 9,426
Total unique words used by this author in corpus - 2,390
Ratio of total words to unique words - 3.944
Tagged as LAL (General Central) dialect.
Top ten most common words - the, o, an, tae, a, in, scots, and, be, for,

List of texts in corpus

Universal Declaration o Human Richts
Scots Language Centre (01-01-2004) in Central dialect (LAL), categorised as government (1,827 words)

Scots A Statement o Principles
Scots Parliament (2003-03 ) in Central dialect (LAL), categorised as government (4,241 words)
The Student and the Pawnwife
Itchy Coo (2003-12-08 ) in Central dialect (LAL), categorised as prose (3,358 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
airticle45 4,774.03295.054
richts35 3,713.13205.129
language77 8,168.89205.020
awbody33 3,500.95189.535
hes35 3,713.13168.053
declaration25 2,652.24162.878
principles24 2,546.15152.596
scots123 13,049.01146.369
shuid30 3,182.69122.637
universal18 1,909.61121.489
linguistic27 2,864.42118.868
o347 36,813.0799.944
haill19 2,015.7089.002
freedoms10 1,060.9087.571
its54 5,728.8484.630
commonties11 1,166.9881.716
equal14 1,485.2579.806
entitelt9 954.8178.345
ful12 1,273.0770.226
international14 1,485.2568.118
wuman11 1,166.9867.472
commonty10 1,060.9066.710
aw71 7,532.3665.143
she5 530.4564.768
forsameikle7 742.6359.966
richt46 4,880.1256.193
uphaud14 1,485.2555.533
be105 11,139.4051.902
freedom11 1,166.9851.221
trades6 636.5450.827
statement10 1,060.9049.809
ma6 636.5447.691
nations9 954.8147.219
discrimination7 742.6346.695
human15 1,591.3444.443
communications6 636.5443.163
learning7 742.6342.118
and120 12,730.7441.998
kenning5 530.4541.736
languages16 1,697.4341.606
has23 2,440.0640.975
cultural14 1,485.2540.832
hud22 2,333.9738.341
multilingual7 742.6337.309
scotland35 3,713.1337.158
society10 1,060.9036.260
shal26 2,758.33nan
loun8 848.7237.769
they7 742.6335.567
ony28 2,970.5135.391
awthing7 742.6333.795
knowledge7 742.6333.292
for100 10,608.9533.076
law11 1,166.9833.010
documentation4 424.3632.714
nationality4 424.3632.714
chairter8 848.7232.528
ye16 1,697.4332.527
til25 2,652.2432.402
purposes6 636.5432.369
names11 1,166.9832.292
stair9 954.8131.961
necessar8 848.7231.866
terms9 954.8131.710
throu20 2,121.7931.640
scowth6 636.5431.057
ability6 636.5431.057
the-day5 530.4530.951
naebody14 1,485.2530.462
conter6 636.5430.455
in218 23,127.5230.337
ongaun7 742.6330.229
barcelona4 424.3630.098
assemlie4 424.3630.098
continuity4 424.3630.098
whitiver6 636.5429.339
essential6 636.5429.339
young17 1,803.5228.901
frae45 4,774.0328.353
appendix4 424.3628.193
haverin4 424.3628.193
unitit6 636.5427.851
fundamental5 530.4527.449
i22 2,333.9727.049
gar8 848.7226.994
valued4 424.3626.681
tongues6 636.5426.539
bit4 424.3626.007
diversity7 742.6325.975
educational6 636.5425.368
her18 1,909.6125.156
airticles6 636.5424.651
speakers10 1,060.9024.550
writers7 742.6324.416
responsibility5 530.4524.361
me8 848.7224.279
dignity5 530.4523.849
ither30 3,182.6923.811
boxroom3 318.2723.794
secont3 318.2723.794
occupations3 318.2723.794
wuman's3 318.2723.794
mairrage3 318.2723.794
role6 636.5423.659
social10 1,060.9023.543
naitural5 530.4523.365
beild5 530.4523.365
uise10 1,060.9022.598
means10 1,060.9022.484
this68 7,214.0922.409
grunds4 424.3621.814
extinction3 318.2721.580
domains3 318.2721.580
status7 742.6321.410
uised9 954.8121.212
sic18 1,909.6121.047
general8 848.7220.869
howf3 318.2720.011
aboot8 848.7219.864
particular6 636.5419.743
or58 6,153.1919.627
public10 1,060.9019.513
faimily7 742.6319.177
siccar8 848.7219.129
uiss7 742.6319.022
respect5 530.4518.958
integral3 318.2718.786
teaching3 318.2718.786
folk15 1,591.3418.782
media8 848.7218.639
pairlament9 954.8118.562
members4 424.3618.413
smeddum5 530.4518.396
tongue9 954.8118.363
people10 1,060.9018.107
distinction4 424.3617.972
tae264 28,007.6417.889
uisage3 318.2717.780
prejudice3 318.2717.780
faut4 424.3617.162
statements3 318.2716.925
presentation3 318.2716.925
proclaimed3 318.2716.925
action6 636.5416.754
professional5 530.4516.672
develop4 424.3616.095
states5 530.4516.028
heritage5 530.4516.028
available5 530.4516.028
awready6 636.5415.980
chaumer5 530.4515.823
pairt14 1,485.2515.745
political7 742.6315.613
pootch3 318.2715.528
personality3 318.2715.528
member4 424.3615.462
somewey4 424.3615.462
daily-day2 212.1815.048
apairtment2 212.1815.048
branks2 212.1815.048
gawpit2 212.1815.048
perceived2 212.1815.048
flegs2 212.1815.048
follae-up2 212.1815.048
affirmit2 212.1815.048
reservit2 212.1815.048
definin2 212.1815.048
disposal2 212.1815.048
twined2 212.1815.048
fends2 212.1815.048
intromission2 212.1815.048
vennels2 212.1815.048
accordit2 212.1815.048
periodic2 212.1815.048
donati2 212.1815.048
elementar2 212.1815.048
security4 424.3614.882
an301 31,932.9514.821
maun15 1,591.3414.803
pouer5 530.4514.517
cotton3 318.2714.410
discussion4 424.3614.345
pey6 636.5414.222
up16 1,697.4314.135
document4 424.3614.092
entitled3 318.2713.925
lane6 636.5413.859
their33 3,500.9513.739
rouble7 742.63nan
image5 530.4513.855
day5 530.4513.695
it75 7,956.7213.640
sma7 742.6313.584
dreid3 318.2713.479
plenishins2 212.1813.340
miscawed2 212.1813.340
guddled2 212.1813.340
whummelt2 212.1813.340
inherent2 212.1813.340
tochered2 212.1813.340
fent2 212.1813.340
footer2 212.1813.340
ootlined2 212.1813.340
warld-wide2 212.1813.340
haurlie2 212.1813.340
layoot2 212.1813.340
adverts3 318.2713.066
institutions3 318.2713.066
yet10 1,060.9012.733
intent3 318.2712.683
circumstances3 318.2712.683
identity5 530.4512.674
needit5 530.4512.538
currently3 318.2712.325
nor17 1,803.5212.319
thae13 1,379.1612.204
drunks2 212.1812.173
dwams2 212.1812.173
person5 530.4512.144
includin5 530.4512.144
active3 318.2711.989
initiatives3 318.2711.989
nerves3 318.2711.989
exact3 318.2711.989
effective3 318.2711.989
ainly4 424.3611.968
situation5 530.4511.892
even18 1,909.6111.778
text5 530.4511.769
countries3 318.2711.673
vital3 318.2711.673
thole6 636.5411.669
beer4 424.3611.613
yer5 530.4511.560
sudden4 424.3611.443
acts3 318.2711.374
includes3 318.2711.374
coherent2 212.1811.283
shuld2 212.1811.283
backgrun2 212.1811.283
socially2 212.1811.283
commonly2 212.1811.283
purpose3 318.2711.092
conscience3 318.2711.092
national8 848.7211.072
yella4 424.3610.957
gart5 530.4510.853
place14 1,485.2510.839
jouk3 318.2710.824
man's3 318.2710.824
street9 954.8110.819
state7 742.6310.684
practice4 424.3610.653
minority6 636.5410.632
wemen2 212.1810.564
slavery2 212.1810.564
peoples2 212.1810.564
had3 318.2710.509
fifty4 424.3610.506
development4 424.3610.506
back7 742.6310.429
himsel8 848.7210.327
authority3 318.2710.325
speakin6 636.5410.314
whan2 212.1810.296
signpostin5 530.45nan
forrit8 848.7210.272
get3 318.2710.248
regaird3 318.2710.093
justice3 318.2710.093
hasna2 212.189.960
peacefu2 212.189.960
historically2 212.189.960
tassie2 212.189.960
pairts5 530.459.848
ay4 424.369.818
amang8 848.729.793
his78 8,274.989.788
we20 2,121.799.669
awfie3 318.279.658
raskolnikov5 530.45nan
meisures2 212.189.960
interest5 530.459.572
jalouse4 424.369.563
ii3 318.279.453
providin2 212.189.441
duly2 212.189.441
mairower2 212.189.441
lanlady5 530.45nan
visitors2 212.189.441
adoptit2 212.189.441
kirks2 212.189.441
switherin2 212.189.441
haud9 954.819.405
than2 212.189.396
month6 636.549.361
group7 742.639.266
presence3 318.279.257
niver10 1,060.909.254
linguists2 212.188.986
application2 212.188.986
countrie5 530.45nan
trial2 212.188.986
oral2 212.188.986
historic2 212.188.986
yinst2 212.188.986
weirin3 318.278.885
furthset3 318.278.885
specific3 318.278.885
terrible3 318.278.885
religioun5 530.45nan
policy5 530.458.726
tenement2 212.188.581
sowels2 212.188.581
loan2 212.188.581
fundit2 212.188.581
spheres2 212.188.581
free7 742.638.408
trade3 318.278.376
global3 318.278.376
levels3 318.278.376
proper4 424.368.326
pen3 318.278.218
territor5 530.45nan
culture7 742.638.258
toy2 212.188.216
whuther2 212.188.216
expressed2 212.188.216
priority2 212.188.216
mcgugan2 212.188.216
analysis2 212.188.216
brigs2 212.188.216
duties2 212.188.216
promote3 318.278.064
doon9 954.818.045
maist16 1,697.437.956
waled2 212.187.885
transmission4 424.36nan
glent2 212.187.885
freely2 212.187.885
providit2 212.187.885
german4 424.367.841
books3 318.277.771
recognition3 318.277.771
country5 530.457.766
government8 848.727.734
there12 1,273.077.660
lands2 212.187.583
grup2 212.187.583
keech2 212.187.583
bink2 212.187.583
skinklin2 212.187.583
literary3 318.277.495
forms3 318.277.495
penal4 424.36nan
hat3 318.277.631
resources3 318.277.495
shairly3 318.277.495
groups4 424.367.394
withoot5 530.457.368
foond3 318.277.363
wee9 954.817.318
effecks2 212.187.304
clerk2 212.187.304
humanity2 212.187.304
declaration'4 424.36nan
needna2 212.187.304
precious2 212.187.304
curriculum3 318.277.235
forby7 742.637.066
alane4 424.367.061
thirteen2 212.187.045
signs4 424.366.980
yon11 1,166.986.910
heat4 424.366.902
personal3 318.276.870
cross4 424.366.824
realise3 318.276.755
is79 8,381.076.707
orra4 424.366.672
of5 530.456.654
foregaun4 424.36nan
suddent2 212.186.580
native3 318.276.533
expression3 318.276.533
hoose2 212.186.494
then3 318.276.473
gear4 424.366.451
etc3 318.276.426
a236 25,037.136.344
passin3 318.276.321
seein5 530.456.311
worth4 424.366.310
been8 848.726.309
siller7 742.636.234
sets3 318.276.219
enjoy3 318.276.219
study3 318.276.219
medium3 318.276.219
dictionary3 318.276.219
meet4 424.366.172
definition2 212.186.171
translation2 212.186.171
juist2 212.186.156
business4 424.366.104
sindry4 424.366.038
year2 212.186.036
seek3 318.276.022
unner6 636.545.985
bilingual2 212.185.984
december2 212.185.984
future4 424.365.972
ae7 742.635.909
research3 318.275.834
stevenson2 212.185.806
interests2 212.185.806
ower11 1,166.985.774
redd3 318.275.743
standart2 212.185.638
gaird2 212.185.638
tool2 212.185.638
common4 424.365.595
nicht2 212.185.571
ile2 212.185.478
guilty2 212.185.478
prent2 212.185.478
stepped2 212.185.478
see7 742.635.438
the602 63,865.905.334
hecht2 212.185.326
vizzied2 212.185.326
kinds2 212.185.326
yours2 212.185.326
association2 212.185.326
ain18 1,909.615.299
kennin4 424.365.246
nummer5 530.455.243
regional3 318.275.235
mindit3 318.275.235
upmak3 318.27nan
recognised2 212.185.181
pressure2 212.185.181
lear2 212.185.181
kythe2 212.185.181
space4 424.365.135
literature3 318.275.079
britain3 318.275.079
respeck2 212.185.042
keys2 212.185.042
landin2 212.185.042
stapped2 212.185.042
awthegither2 212.185.042
speech3 318.275.003
meenit4 424.364.922
micht2 212.184.881
information3 318.274.856
warks3 318.274.785
nane5 530.454.783
safegairdit3 318.27nan
unique2 212.184.781
penalised3 318.27nan
trauchle2 212.184.781
importance2 212.184.781
ayewis2 212.184.781
man17 1,803.524.680
thon2 212.184.676
copecks3 318.27nan
recognise2 212.185.042
mainteen3 318.27nan
regairdit2 212.184.909
resourcin3 318.27nan
mak14 1,485.254.869
else5 530.454.665
spite2 212.184.659
skaith2 212.184.659
communication2 212.184.659
hauds3 318.274.646
force3 318.274.578
langer3 318.274.578
lead3 318.274.578
flair4 424.364.571
express2 212.184.541
colin2 212.184.541
belief2 212.184.541
theirsels3 318.274.511
stertit5 530.454.437
naither2 212.184.428
material2 212.184.428
kenspeckle3 318.274.382
awa5 530.454.355
opportunities2 212.184.319
patient2 212.184.319
hauden2 212.184.319
wide3 318.274.257
examples2 212.184.214
by23 2,440.064.189
isles2 212.184.113
exist2 212.184.113
meanin3 318.274.019
quate3 318.274.019
student2 212.184.015
fifteen2 212.184.015
awlinguistic3 318.27nan
position2 212.184.015
bi5 530.454.008
as73 7,744.543.954
scunner2 212.183.920
historical2 212.183.920
coort2 212.183.920
comatee3 318.273.906
steer2 212.183.828
positive2 212.183.828
www2 212.183.828
wes3 318.273.817
mairch3 318.273.797
accurate3 318.27nan
scottish11 1,166.983.885
aye21 2,227.883.750
braes2 212.183.740
lobby2 212.183.740
meetin2 212.183.740
at46 4,880.123.739
education6 636.543.688
influence2 212.183.654
byordinar2 212.183.654
staundin2 212.183.654
msp2 212.183.654
pairty4 424.363.632
us6 636.543.626
efter5 530.453.613
british3 318.273.588
no16 1,697.433.572
shame2 212.183.571
sir3 318.273.538
fou3 318.273.489
office3 318.273.489
claes5 530.453.448
some7 742.633.413
cam3 318.273.402
are19 2,015.703.395
pass3 318.273.392
steel2 212.183.336
sair6 636.543.334
alyona3 318.27nan
wid5 530.453.313
cairryin2 212.183.262
decide2 212.183.190
ivaanovna3 318.27nan
drama2 212.183.262
sowel2 212.183.190
said7 742.633.181
pitten3 318.273.163
choice2 212.183.120
new3 318.273.104
thirty2 212.183.052
reality2 212.183.052
intil6 636.543.009
kist2 212.182.986
livin3 318.272.950
mony9 954.812.938
that77 8,168.892.928
joy2 212.182.921
derk2 212.182.921
stour2 212.182.921
wi85 9,017.612.917
learn3 318.272.869
reason3 318.272.869
howfs2 212.18nan
forderance2 212.18nan
dae6 636.542.930
level2 212.182.858
aften4 424.362.854
needs3 318.272.830
sang4 424.362.824
confidence2 212.182.797
scotland's2 212.182.797
aff7 742.632.748
peace2 212.182.738
mair27 2,864.422.728
windaes2 212.182.679
square2 212.182.679
order3 318.272.677
taen7 742.632.657
oor8 848.722.603
lang5 530.452.600
were6 636.542.584
mindin2 212.182.567
clattie2 212.18nan
race2 212.182.567
register2 212.182.513
loss2 212.182.513
shape2 212.182.513
tent4 424.362.512
will4 424.362.471
gien7 742.632.450
body6 636.542.437
history4 424.362.432
generations2 212.182.409
thocht13 1,379.162.406
how7 742.632.400
maitter3 318.272.394
door3 318.272.365
didna10 1,060.902.339
same9 954.812.338
if9 954.812.336
ring2 212.182.310
present2 212.182.310
process2 212.182.310
students2 212.182.310
bein3 318.272.307
bell2 212.182.261
plain2 212.182.261
whether2 212.182.261
least4 424.362.230
spoken3 318.272.201
act3 318.272.201
naething3 318.272.170
needin2 212.182.169
thrang2 212.182.169
him18 1,909.612.164
ach2 212.182.124
wis82 8,699.342.115
hert5 530.452.110
sib2 212.182.080
hame4 424.362.074
cried5 530.452.071
auld18 1,909.612.068
tackie2 212.18nan
thinkin4 424.362.089
oot31 3,288.782.047
lit2 212.182.037
she's2 212.182.037
haen2 212.181.994
watch3 318.271.994
faur5 530.451.978
staunin3 318.271.966
steps2 212.181.953
gane2 212.181.953
land3 318.271.938
-4 424.361.933
twa8 848.721.894
agin4 424.361.869
growin2 212.181.835
them10 1,060.901.804
gettin5 530.451.800
words5 530.451.800
accoont2 212.181.760
every4 424.361.747
address2 212.181.723
last8 848.721.701
key2 212.181.688
win7 742.631.687
thir7 742.631.662
edinburgh3 318.271.655
ocht2 212.181.653
fit6 636.541.652
past5 530.451.651
set2 212.181.632
tho7 742.631.624
whilk2 212.181.618
system2 212.181.618
uggin2 212.18nan
fitnotes2 212.18nan
wark6 636.541.613
haudin3 318.271.584
lassies2 212.181.552
cawed2 212.181.520
need7 742.631.503
muckle5 530.451.440
darg2 212.181.427
maunbe2 212.18nan
guideen2 212.18nan
strangbox2 212.18nan
pittin3 318.271.428
neb2 212.181.427
gied2 212.181.420
sax2 212.181.397
cast2 212.181.367
case3 318.271.345
that's5 530.451.339
thing8 848.721.324
wisna6 636.541.305
richt-haun2 212.18nan
heid6 636.541.288
ettle2 212.181.256
speak4 424.361.255
lass2 212.181.229
kent4 424.361.228
poetry2 212.181.177
hinnert2 212.18nan
may2 212.181.177
fowrth2 212.18nan
intae12 1,273.071.167
since3 318.271.154
english10 1,060.901.139
stert3 318.271.136
chynge2 212.181.127
i'm3 318.271.118
ettlin2 212.181.103
daft2 212.181.103
bodies2 212.181.103
whit16 1,697.431.082
biggit2 212.181.079
road6 636.541.072
ben4 424.361.039
footers2 212.18nan
sort2 212.181.033
colour2 212.181.010
baith8 848.720.991
example2 212.180.988
again5 530.450.985
years2 212.180.982
hendry2 212.18nan
maks4 424.362.043
roon3 318.270.968
won2 212.180.966
got6 636.540.959
room2 212.180.932
duin3 318.270.892
press2 212.180.883
cauld2 212.180.866
ane12 1,273.070.859
cry2 212.180.844
heard2 212.180.826
shair2 212.180.825
maum2 212.18nan
writin3 318.270.820
europe2 212.180.787
centre2 212.180.769
readin2 212.180.769
afore14 1,485.250.767
warld5 530.450.766
ken9 954.810.764
corner2 212.180.734
ablow2 212.180.717
uk2 212.180.717
better3 318.270.685
kitchen2 212.180.683
gets2 212.180.683
fair6 636.540.676
hae26 2,758.330.668
far3 318.270.660
kind3 318.270.641
misdoot2 212.18nan
een9 954.810.646
leid4 424.360.635
european2 212.180.635
on52 5,516.660.626
wantin2 212.180.620
born2 212.180.620
hingin2 212.180.605
braid2 212.180.575
dark2 212.180.547
likely2 212.180.519
onyither2 212.18nan
men2 212.180.593
help2 212.180.503
keep3 318.270.491
comin2 212.180.470
easy2 212.180.467
referent2 212.18nan
dinna3 318.270.451
atween3 318.270.435
compluterance2 212.18nan
orally2 212.18nan
thochts2 212.180.467
care2 212.180.419
understaunding2 212.18nan
wulsome2 212.18nan
alang3 318.270.413
our2 212.180.397
strang2 212.180.386
whiles3 318.270.382
gaelic6 636.540.365
doun5 530.450.348
weel9 954.810.336
it's9 954.810.333
wull2 212.180.329
while3 318.270.317
stood2 212.180.314
syne5 530.450.310
first7 742.630.304
makkin3 318.270.297
really2 212.180.279
love2 212.180.277
canna4 424.360.275
roond3 318.270.269
report2 212.180.269
reduction2 212.18nan
haein3 318.270.256
but34 3,607.040.252
beilding2 212.18nan
thegither4 424.360.237
whaur8 848.720.228
eh2 212.180.223
deep2 212.180.220
next3 318.270.219
toun2 212.180.213
manifestations2 212.18nan
met2 212.180.213
somethin3 318.270.196
could9 954.810.195
left4 424.360.194
here9 954.810.193
disnae2 212.180.191
hoo2 212.180.185
bide2 212.180.185
leave2 212.180.184
ilk2 212.180.177
sun2 212.180.167
disna2 212.180.164
taks2 212.180.158
high2 212.180.158
real2 212.180.158
let3 318.270.154
late2 212.180.152
brocht3 318.270.150
feart2 212.180.146
done2 212.180.146
am2 212.180.134
tap2 212.180.130
gaun5 530.450.126
come7 742.630.120
opinioun2 212.18nan
gey4 424.360.119
gie8 848.720.118
sister2 212.180.118
support2 212.180.108
loses2 212.18nan
end4 424.360.112
noo14 1,485.250.107
bairns4 424.360.103
sae16 1,697.430.102
jist15 1,591.340.101
gin8 848.720.091
made6 636.540.079
life5 530.450.079
different3 318.270.079
ee3 318.270.072
pit9 954.810.067
gaed5 530.450.067
open2 212.180.065
seen7 742.630.065
why2 212.180.058
nae28 2,970.510.057
great3 318.270.045
wulsomely2 212.18nan
turn2 212.180.051
hauf2 212.180.044
like26 2,758.330.043
yin7 742.630.043
hair2 212.180.041
commonties'2 212.18nan
near4 424.360.037
itsel2 212.180.037
few2 212.180.035
yae2 212.180.034
onsets2 212.18nan
word2 212.180.034
table2 212.180.034
ilka3 318.270.028
wad11 1,166.980.028
mean3 318.270.026
tak8 848.720.024
sat2 212.180.019
fell2 212.180.019
staund-alane2 212.18nan
deid2 212.180.019
gies2 212.180.017
took3 318.270.015
can14 1,485.250.013
anither6 636.540.011
ithers2 212.180.009
five2 212.180.008
guid10 1,060.900.005
governance2 212.18nan
scott2 212.180.010
wey9 954.810.004
fower2 212.180.004
less2 212.180.004
black2 212.180.003
times3 318.270.002
looked2 212.180.002
mebbe2 212.180.001
mind7 742.630.001
time17 1,803.520.001
aince2 212.180.001
territours2 212.18nan
still7 742.630.001
look4 424.360.000
he82 8,699.340.000
masel3 318.270.000