A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Curt_Kenobi

Basic Stats

Total words by this author in corpus - 525
Total unique words used by this author in corpus - 283
Ratio of total words to unique words - 1.855
Tagged as LAL (General Central) dialect.
Top ten most common words - ah, tae, a, the, me, it, ehs, fir, ay, sae,

List of texts in corpus

Mates
archiveofourown.org (2016-09-27) in Central dialect (LAL), categorised as prose (525 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
ehs8 15,238.10115.237
ah'm6 11,428.5742.151
ah20 38,095.2440.753
sick4 7,619.0532.414
fir7 13,333.3330.038
n6 11,428.5729.989
all5 9,523.8127.027
natural3 5,714.2923.270
compliment2 3,809.5221.985
blond2 3,809.5220.828
shite3 5,714.2920.106
eh's6 11,428.57nan
fucker2 3,809.5219.204
eh4 7,619.0518.461
ay7 13,333.3318.401
theory2 3,809.5218.056
kissed2 3,809.5217.586
long3 5,714.2916.159
fuckin4 7,619.0515.952
it's5 9,523.8115.859
me9 17,142.8615.288
chest2 3,809.5214.830
wonder2 3,809.5214.721
oan6 11,428.5714.721
almost2 3,809.5214.616
jist6 11,428.5714.288
makes2 3,809.5214.035
sae6 11,428.5713.561
ihm5 9,523.81nan
forward2 3,809.5215.297
ah've2 3,809.5213.083
when5 9,523.8112.976
birds2 3,809.5212.435
that's3 5,714.2912.032
the14 26,666.6711.796
colour2 3,809.5210.886
dinnae3 5,714.2910.748
go3 5,714.299.959
mooth2 3,809.529.879
fuck2 3,809.529.288
boy2 3,809.529.288
an4 7,619.059.229
now2 3,809.529.012
since2 3,809.528.894
across2 3,809.528.580
eyes2 3,809.528.495
about2 3,809.527.844
do2 3,809.527.810
cunt2 3,809.527.724
hair2 3,809.527.279
am2 3,809.526.858
in2 3,809.526.764
git2 3,809.526.309
masel2 3,809.526.253
boy's2 3,809.52nan
though2 3,809.526.144
face2 3,809.524.591
ken3 5,714.294.415
are3 5,714.294.358
life2 3,809.524.035
still2 3,809.523.366
gie2 3,809.523.324
think2 3,809.523.217
but5 9,523.813.003
aboot4 7,619.052.687
us2 3,809.521.755
had2 3,809.521.630
no3 5,714.291.420
outta2 3,809.52nan
like3 5,714.291.403
tae15 28,571.431.170
wi2 3,809.521.146
his5 9,523.811.125
it9 17,142.861.100
ma4 7,619.051.033
or3 5,714.290.809
and2 3,809.520.755
mair2 3,809.520.642
back2 3,809.520.626
we3 5,714.290.370
wis4 7,619.050.364
him2 3,809.520.233
a14 26,666.670.135
be3 5,714.290.045
that5 9,523.810.005