A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

solonggaybowser

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 133
Ratio of total words to unique words - 1.504
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, as, a, she, his, eggsy, on, in, he,

List of texts in corpus

Merlin's Bairns
archiveofourown.org (2020-08-18) in Central dialect (LAL), categorised as prose (200 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
crannie3 15,000.0043.073
merlin2 10,000.0030.431
laucht2 10,000.0023.822
ye'r2 10,000.0019.172
eggsy4 20,000.00nan
oh3 15,000.0017.803
phone2 10,000.0013.337
cat2 10,000.0012.429
again3 15,000.0012.018
lassie2 10,000.0011.016
kin2 10,000.009.179
she5 25,000.007.994
his5 25,000.007.096
as5 25,000.006.578
heid2 10,000.005.750
him3 15,000.005.470
aff2 10,000.004.653
on4 20,000.003.913
they3 15,000.003.696
fur2 10,000.002.500
me2 10,000.001.871
tae2 10,000.001.557
he3 15,000.000.741
ye2 10,000.000.648
that3 15,000.000.467
the10 50,000.000.265
an6 30,000.000.146
a5 25,000.000.132
in3 15,000.000.005
merlin's2 10,000.00nan
it2 10,000.000.055