A Corpus of 21st Century Scots Texts


solonggaybowser

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 133
Ratio of total words to unique words - 1.504
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, as, a, she, his, eggsy, on, in, he
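
The figures above can be reproduced from a tokenised text with a few lines of Python. This is a minimal sketch, not the corpus site's own code: the function name `basic_stats` and the assumption that the text is already split into lowercase tokens are illustrative only.

```python
from collections import Counter


def basic_stats(tokens, top_n=10):
    """Summarise a list of word tokens (hypothetical helper, not from the corpus site)."""
    counts = Counter(tokens)
    total = len(tokens)    # total words, repeats included
    unique = len(counts)   # number of distinct word forms
    # Ratio of total words to unique words, rounded to three decimals as on the page.
    ratio = round(total / unique, 3) if unique else 0.0
    # Most frequent words first; ties keep first-seen order.
    top = [word for word, _ in counts.most_common(top_n)]
    return {"total": total, "unique": unique, "ratio": ratio, "top": top}
```

For this author, 200 total words over 133 unique words gives 200 / 133, which rounds to the 1.504 shown above.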

List of texts in corpus

Merlin's Bairns
archiveofourown.org (2020-08-18) in Central dialect (LAL), categorised as prose (200 words)

Author word Keyness frequencies

This table should list the words that the author uses disproportionately more often than other writers in the corpus. These may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
Word      Count  Normalised per million  Keyness
crannie       3               15,000.00   43.041
merlin        2               10,000.00   30.410
laucht        2               10,000.00   24.696
ye'r          2               10,000.00   19.151
eggsy         4               20,000.00      nan
oh            3               15,000.00   17.772
phone         2               10,000.00   12.912
cat           2               10,000.00   12.386
again         3               15,000.00   11.919
lassie        2               10,000.00   10.933
kin           2               10,000.00    9.150
she           5               25,000.00    7.865
his           5               25,000.00    7.139
as            5               25,000.00    6.667
heid          2               10,000.00    5.698
him           3               15,000.00    5.485
aff           2               10,000.00    4.658
on            4               20,000.00    3.947
they          3               15,000.00    3.718
fur           2               10,000.00    2.459
me            2               10,000.00    1.773
tae           2               10,000.00    1.543
he            3               15,000.00    0.755
ye            2               10,000.00    0.630
that          3               15,000.00    0.474
the          10               50,000.00    0.256
an            6               30,000.00    0.160
a             5               25,000.00    0.139
in            3               15,000.00    0.004
merlin's      2               10,000.00      nan
it            2               10,000.00    0.062
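
The normalised column is each count scaled to a rate per million words; with 200 words by this author, a count of 3 becomes 15,000. The page does not say how keyness is calculated, so the sketch below assumes the log-likelihood statistic commonly used for keyword analysis in corpus linguistics. The function names and the reference-corpus arguments are hypothetical.

```python
import math


def per_million(count, total_words):
    """Scale a raw count to a rate per million words."""
    return count * 1_000_000 / total_words


def log_likelihood(a, b, c, d):
    """Log-likelihood keyness (an assumed formula, not confirmed by the source).

    a: count of the word in the author's texts
    b: count of the word in the reference texts (other writers)
    c: total words by the author
    d: total words in the reference texts
    """
    # Expected counts if the word were equally frequent in both samples.
    e1 = c * (a + b) / (c + d)
    e2 = d * (a + b) / (c + d)
    ll = 0.0
    # Treat 0 * ln(0) as 0 so that a zero count does not produce nan.
    if a > 0:
        ll += a * math.log(a / e1)
    if b > 0:
        ll += b * math.log(b / e2)
    return 2 * ll
```

A version without the zero-count guards would return nan for a word that never occurs in the reference texts. Whether that is what produces the nan entries in the table is not stated in the source.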