A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- dialect comparison - fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

solonggaybowser

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 133
Ratio of total words to unique words - 1.504
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, as, a, she, his, eggsy, on, in, he,

List of texts in corpus

Merlin's Bairns
archiveofourown.org (2020-08-18) in Central dialect (LAL), categorised as prose (200 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
crannie3 15,000.0043.076
merlin2 10,000.0030.433
laucht2 10,000.0024.720
ye'r2 10,000.0019.174
eggsy4 20,000.00nan
oh3 15,000.0017.806
phone2 10,000.0012.884
cat2 10,000.0012.409
again3 15,000.0011.945
lassie2 10,000.0010.956
kin2 10,000.009.172
she5 25,000.007.904
his5 25,000.007.160
as5 25,000.006.641
heid2 10,000.005.709
him3 15,000.005.494
aff2 10,000.004.668
on4 20,000.003.941
they3 15,000.003.720
fur2 10,000.002.453
me2 10,000.001.775
tae2 10,000.001.541
he3 15,000.000.764
ye2 10,000.000.637
that3 15,000.000.467
the10 50,000.000.258
an6 30,000.000.164
a5 25,000.000.136
in3 15,000.000.005
merlin's2 10,000.00nan
it2 10,000.000.060