A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Curt_Kenobi

Basic Stats

Total words by this author in corpus - 525
Total unique words used by this author in corpus - 283
Ratio of total words to unique words - 1.855
Tagged as LAL (General Central) dialect.
Top ten most common words - ah, tae, a, the, me, it, ehs, fir, ay, sae,

List of texts in corpus

Mates
archiveofourown.org (2016-09-27) in Central dialect (LAL), categorised as prose (525 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
ehs8 15,238.10115.321
ah20 38,095.2443.751
ah'm6 11,428.5741.686
sick4 7,619.0534.001
n6 11,428.5729.988
fir7 13,333.3329.960
all5 9,523.8127.118
natural3 5,714.2923.172
compliment2 3,809.5222.006
blond2 3,809.5220.849
eh's6 11,428.57nan
shite3 5,714.2920.699
fucker2 3,809.5219.226
eh4 7,619.0518.528
ay7 13,333.3318.467
theory2 3,809.5218.077
kissed2 3,809.5217.835
ihm5 9,523.81nan
long3 5,714.2916.426
fuckin4 7,619.0516.010
it's5 9,523.8115.885
me9 17,142.8615.816
chest2 3,809.5215.709
wonder2 3,809.5215.078
oan6 11,428.5715.058
forward2 3,809.5214.851
almost2 3,809.5214.851
makes2 3,809.5214.147
jist6 11,428.5714.083
sae6 11,428.5713.546
when5 9,523.8113.225
ah've2 3,809.5212.898
birds2 3,809.5212.456
that's3 5,714.2912.007
the14 26,666.6711.890
dinnae3 5,714.2910.879
colour2 3,809.5210.868
mooth2 3,809.5210.055
go3 5,714.2910.001
an4 7,619.059.379
boy2 3,809.529.361
fuck2 3,809.529.308
now2 3,809.529.057
since2 3,809.529.033
eyes2 3,809.528.799
across2 3,809.528.558
about2 3,809.527.935
do2 3,809.527.935
cunt2 3,809.527.744
hair2 3,809.527.299
am2 3,809.526.971
in2 3,809.526.810
masel2 3,809.526.455
git2 3,809.526.295
boy's2 3,809.52nan
though2 3,809.526.174
face2 3,809.524.656
are3 5,714.294.462
ken3 5,714.294.369
life2 3,809.524.031
still2 3,809.523.367
gie2 3,809.523.345
think2 3,809.523.218
but5 9,523.813.065
aboot4 7,619.052.645
us2 3,809.521.726
no3 5,714.291.451
like3 5,714.291.442
ma4 7,619.051.187
wi2 3,809.521.173
it9 17,142.861.155
tae15 28,571.431.146
his5 9,523.811.103
outta2 3,809.52nan
had2 3,809.521.602
or3 5,714.290.776
and2 3,809.520.755
back2 3,809.520.630
mair2 3,809.520.623
wis4 7,619.050.370
we3 5,714.290.351
him2 3,809.520.229
a14 26,666.670.124
be3 5,714.290.046
that5 9,523.810.006