A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Smirnov, Kuzma

Basic Stats

Total words by this author in corpus - 522
Total unique words used by this author in corpus - 231
Ratio of total words to unique words - 2.26
Tagged as LAL (General Central) dialect.
Top ten most common words - an, the, staundard, roushie, in, a, o, leids, scots, haes,

List of texts in corpus


Facebook (2015-05-10) in Central dialect (LAL), categorised as prose (522 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
staundard14 26,819.92205.836
roushie13 24,904.21182.491
byleids5 9,578.5467.676
ukraine4 7,662.8453.196
thare's4 7,662.8451.274
i'4 7,662.8451.274
inglis6 11,494.2542.040
melt4 7,662.8441.776
leids7 13,409.9641.298
b4 7,662.8436.231
u4 7,662.8435.748
pronoonciation3 5,747.1330.618
an36 68,965.5227.573
iveryday2 3,831.4226.590
offeecial3 5,747.1326.140
haes7 13,409.9624.126
dominatin2 3,831.4223.681
belarusian6 11,494.25nan
sib3 5,747.1321.460
treatit2 3,831.4221.417
grammar3 5,747.1320.579
baith5 9,578.5418.681
braid3 5,747.1317.579
nou4 7,662.8416.579
anely3 5,747.1315.826
forms2 3,831.4214.668
ukrainian4 7,662.84nan
distinction2 3,831.4217.423
orthography2 3,831.4214.565
thair5 9,578.5414.553
speak3 5,747.1313.673
leid4 7,662.8412.864
different3 5,747.1312.709
especially2 3,831.4212.312
cawed2 3,831.4211.781
thay3 5,747.1311.682
daes2 3,831.4211.494
juist4 7,662.8410.711
thaim4 7,662.849.708
uise2 3,831.429.418
three3 5,747.139.198
scots7 13,409.968.788
belarus3 5,747.13nan
nae6 11,494.257.781
til3 5,747.137.710
thare2 3,831.427.450
aw5 9,578.546.333
masel2 3,831.426.302
canna2 3,831.426.268
tae4 7,662.846.219
comes2 3,831.426.118
been4 7,662.845.352
while2 3,831.425.192
mony2 3,831.424.650
sic2 3,831.424.630
the20 38,314.184.269
twa3 5,747.134.182
for7 13,409.964.095
life2 3,831.424.041
some3 5,747.133.988
by3 5,747.133.512
whaur2 3,831.423.483
sae3 5,747.133.466
fae4 7,662.843.025
surzhik2 3,831.42nan
will2 3,831.422.920
say2 3,831.422.793
wis2 3,831.422.713
that2 3,831.422.537
a10 19,157.092.144
ower3 5,747.132.020
as6 11,494.251.993
hae3 5,747.131.854
are2 3,831.421.733
be5 9,578.541.699
in12 22,988.511.635
auld2 3,831.421.626
o7 13,409.961.503
like3 5,747.131.426
wi2 3,831.421.122
is5 9,578.540.850
trasianka2 3,831.42nan
whit2 3,831.420.534
on2 3,831.420.524
but3 5,747.130.392
at4 7,662.840.122
or2 3,831.420.054
fouaniver2 3,831.42nan
thing2 3,831.424.378