A Corpus of 21st Century Scots Texts


Curt_Kenobi

Basic Stats

Total words by this author in corpus - 525
Total unique words used by this author in corpus - 283
Ratio of total words to unique words - 1.855
Tagged as LAL (General Central) dialect.
Top ten most common words - ah, tae, a, the, me, it, ehs, fir, ay, sae
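
As a rough check on the figures above, the ratio of total words to unique words is the total count divided by the unique count. The sketch below assumes simple whitespace tokenisation, which may differ from how the corpus tokenises its texts; the function names are illustrative and not taken from the corpus tooling.

```python
def total_to_unique_ratio(total_words, unique_words):
    """Ratio of total words to unique words, as reported under Basic Stats."""
    return total_words / unique_words


def stats_from_tokens(tokens):
    """Return (total, unique, ratio) for a list of word tokens.

    Assumption: tokens are already lower-cased and split on whitespace;
    the corpus itself may tokenise differently.
    """
    total = len(tokens)
    unique = len(set(tokens))
    return total, unique, total_to_unique_ratio(total, unique)


# Figures reported for this author: 525 total words, 283 unique words.
print(round(total_to_unique_ratio(525, 283), 3))
```

With the reported counts, the result rounds to the 1.855 listed above.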

List of texts in corpus

Mates
archiveofourown.org (2016-09-27) in Central dialect (LAL), categorised as prose (525 words)

Author word Keyness frequencies

This section should list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter, and (c) words specific to the regional dialect.
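
The page does not say which keyness statistic it uses. One common choice in corpus linguistics is the log-likelihood (G2) measure, which compares a word's frequency in the author's texts with its frequency in a reference corpus. The sketch below is an illustration under that assumption only; the function name and the reference-corpus counts in the example call are hypothetical.

```python
import math


def log_likelihood_keyness(count_author, total_author, count_ref, total_ref):
    """Log-likelihood (G2) keyness for one word.

    Assumption: this is a generic G2 calculation, not necessarily the
    statistic used to produce the Keyness column on this page.
    """
    combined = count_author + count_ref
    total = total_author + total_ref
    # Expected counts if the word were used at the same rate in both corpora.
    expected_author = total_author * combined / total
    expected_ref = total_ref * combined / total

    g2 = 0.0
    for observed, expected in (
        (count_author, expected_author),
        (count_ref, expected_ref),
    ):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2.0 * g2


# Hypothetical example: a word used 8 times in 525 words by the author
# and 10 times in a 1,000,000-word reference corpus.
print(log_likelihood_keyness(8, 525, 10, 1_000_000))
```

A word used at the same rate in both corpora scores 0, and the score grows as the two rates diverge.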
Word        Count  Normalised per million   Keyness
ehs             8               15,238.10   115.329
ah'm            6               11,428.57    41.692
ah             20               38,095.24    40.911
sick            4                7,619.05    32.460
n               6               11,428.57    30.056
fir             7               13,333.33    29.966
all             5                9,523.81    27.083
natural         3                5,714.29    23.305
compliment      2                3,809.52    22.008
blond           2                3,809.52    20.851
shite           3                5,714.29    20.141
eh's            6               11,428.57       nan
fucker          2                3,809.52    19.227
eh              4                7,619.05    18.506
ay              7               13,333.33    18.473
theory          2                3,809.52    18.079
kissed          2                3,809.52    17.609
long            3                5,714.29    16.193
fuckin          4                7,619.05    15.996
it's            5                9,523.81    15.901
me              9               17,142.86    15.300
chest           2                3,809.52    14.853
oan             6               11,428.57    14.782
wonder          2                3,809.52    14.744
almost          2                3,809.52    14.639
jist            6               11,428.57    14.286
makes           2                3,809.52    14.058
sae             6               11,428.57    13.589
ihm             5                9,523.81       nan
forward         2                3,809.52    15.320
ah've           2                3,809.52    12.900
when            5                9,523.81    12.870
birds           2                3,809.52    12.458
that's          3                5,714.29    12.046
the            14               26,666.67    11.819
colour          2                3,809.52    10.909
dinnae          3                5,714.29    10.766
go              3                5,714.29     9.966
mooth           2                3,809.52     9.901
fuck            2                3,809.52     9.310
boy             2                3,809.52     9.284
an              4                7,619.05     9.191
now             2                3,809.52     9.035
since           2                3,809.52     8.916
eyes            2                3,809.52     8.517
across          2                3,809.52     8.517
about           2                3,809.52     7.849
do              2                3,809.52     7.831
cunt            2                3,809.52     7.746
hair            2                3,809.52     7.271
am              2                3,809.52     6.866
in              2                3,809.52     6.804
git             2                3,809.52     6.330
masel           2                3,809.52     6.274
boy's           2                3,809.52       nan
though          2                3,809.52     6.123
face            2                3,809.52     4.605
ken             3                5,714.29     4.410
are             3                5,714.29     4.339
life            2                3,809.52     4.017
still           2                3,809.52     3.377
gie             2                3,809.52     3.301
think           2                3,809.52     3.228
but             5                9,523.81     3.011
aboot           4                7,619.05     2.677
us              2                3,809.52     1.738
had             2                3,809.52     1.605
no              3                5,714.29     1.424
outta           2                3,809.52       nan
like            3                5,714.29     1.406
tae            15               28,571.43     1.172
wi              2                3,809.52     1.145
his             5                9,523.81     1.136
it              9               17,142.86     1.115
ma              4                7,619.05     1.020
or              3                5,714.29     0.807
and             2                3,809.52     0.778
mair            2                3,809.52     0.639
back            2                3,809.52     0.632
wis             4                7,619.05     0.367
we              3                5,714.29     0.363
him             2                3,809.52     0.235
a              14               26,666.67     0.131
be              3                5,714.29     0.047
that            5                9,523.81     0.006
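
The normalised figures in the table are consistent with dividing each word's count by the author's total of 525 words and scaling to one million. The sketch below reproduces that arithmetic for a few rows; the function name is illustrative and not taken from the corpus tooling.

```python
def per_million(count, total_words):
    """Scale a raw word count to a frequency per million words."""
    return count / total_words * 1_000_000


TOTAL_WORDS = 525  # total words by this author, from Basic Stats

# A few (word, count) pairs taken from the table.
for word, count in [("ehs", 8), ("ah", 20), ("the", 14)]:
    print(f"{word:<5}{count:>4}{per_million(count, TOTAL_WORDS):>12,.2f}")
```

Because the author's sample is only 525 words, a single occurrence corresponds to roughly 1,905 per million, so the normalised values move in large steps.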