A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

solonggaybowser

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 133
Ratio of total words to unique words - 1.504
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, as, a, she, his, eggsy, on, in, he,

List of texts in corpus

Merlin's Bairns
archiveofourown.org (2020-08-18) in Central dialect (LAL), categorised as prose (200 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
crannie3 15,000.0043.159
merlin2 10,000.0030.488
laucht2 10,000.0023.880
ye'r2 10,000.0019.229
eggsy4 20,000.00nan
oh3 15,000.0017.849
phone2 10,000.0013.365
cat2 10,000.0012.441
again3 15,000.0012.045
lassie2 10,000.0011.056
kin2 10,000.009.225
she5 25,000.008.073
his5 25,000.007.101
as5 25,000.006.582
heid2 10,000.005.741
him3 15,000.005.484
aff2 10,000.004.655
on4 20,000.003.895
they3 15,000.003.683
fur2 10,000.002.445
me2 10,000.001.862
tae2 10,000.001.563
he3 15,000.000.743
ye2 10,000.000.637
that3 15,000.000.463
the10 50,000.000.236
a5 25,000.000.141
an6 30,000.000.137
it2 10,000.000.055
merlin's2 10,000.00nan
in3 15,000.000.005