A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Smirnov, Kuzma

Basic Stats

Total words by this author in corpus - 522
Total unique words used by this author in corpus - 231
Ratio of total words to unique words - 2.26
Tagged as LAL (General Central) dialect.
Top ten most common words - an, the, staundard, roushie, in, a, o, leids, scots, haes,

List of texts in corpus


Facebook (2015-05-10) in Central dialect (LAL), categorised as prose (522 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
staundard14 26,819.92205.791
roushie13 24,904.21182.449
byleids5 9,578.5467.660
ukraine4 7,662.8453.183
thare's4 7,662.8451.261
i'4 7,662.8451.261
inglis6 11,494.2542.021
melt4 7,662.8441.763
leids7 13,409.9641.276
b4 7,662.8436.218
u4 7,662.8435.735
pronoonciation3 5,747.1330.608
an36 68,965.5227.535
iveryday2 3,831.4226.584
offeecial3 5,747.1326.130
haes7 13,409.9624.105
dominatin2 3,831.4223.675
belarusian6 11,494.25nan
sib3 5,747.1321.451
treatit2 3,831.4221.410
grammar3 5,747.1320.570
baith5 9,578.5418.682
braid3 5,747.1317.569
nou4 7,662.8416.566
anely3 5,747.1315.817
forms2 3,831.4214.661
ukrainian4 7,662.84nan
distinction2 3,831.4217.416
orthography2 3,831.4214.559
thair5 9,578.5414.538
speak3 5,747.1313.663
leid4 7,662.8412.853
different3 5,747.1312.700
especially2 3,831.4212.306
cawed2 3,831.4211.825
thay3 5,747.1311.673
daes2 3,831.4211.488
juist4 7,662.8410.700
thaim4 7,662.849.697
uise2 3,831.429.411
three3 5,747.139.189
scots7 13,409.968.771
belarus3 5,747.13nan
nae6 11,494.257.781
til3 5,747.137.701
thare2 3,831.427.444
aw5 9,578.546.360
masel2 3,831.426.296
canna2 3,831.426.262
tae4 7,662.846.222
comes2 3,831.426.112
been4 7,662.845.351
while2 3,831.425.186
mony2 3,831.424.644
sic2 3,831.424.624
the20 38,314.184.264
twa3 5,747.134.181
for7 13,409.964.082
life2 3,831.424.036
some3 5,747.133.986
by3 5,747.133.513
whaur2 3,831.423.477
sae3 5,747.133.459
fae4 7,662.843.035
surzhik2 3,831.42nan
will2 3,831.422.915
say2 3,831.422.795
wis2 3,831.422.724
that2 3,831.422.533
a10 19,157.092.145
ower3 5,747.132.014
as6 11,494.252.003
hae3 5,747.131.848
are2 3,831.421.729
be5 9,578.541.701
in12 22,988.511.630
auld2 3,831.421.626
o7 13,409.961.515
like3 5,747.131.425
wi2 3,831.421.124
is5 9,578.540.846
trasianka2 3,831.42nan
whit2 3,831.420.536
on2 3,831.420.528
but3 5,747.130.390
at4 7,662.840.122
or2 3,831.420.053
fouaniver2 3,831.42nan
thing2 3,831.424.372