A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

solonggaybowser

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 133
Ratio of total words to unique words - 1.504
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, as, a, she, his, eggsy, on, in, he,

List of texts in corpus

Merlin's Bairns
archiveofourown.org (2020-08-18) in Central dialect (LAL), categorised as prose (200 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
crannie3 15,000.0043.086
merlin2 10,000.0030.439
laucht2 10,000.0024.726
ye'r2 10,000.0019.180
eggsy4 20,000.00nan
oh3 15,000.0017.777
phone2 10,000.0012.890
cat2 10,000.0012.415
again3 15,000.0011.934
lassie2 10,000.0010.962
kin2 10,000.009.178
she5 25,000.007.910
his5 25,000.007.152
as5 25,000.006.627
heid2 10,000.005.707
him3 15,000.005.491
aff2 10,000.004.658
on4 20,000.003.950
they3 15,000.003.705
fur2 10,000.002.447
me2 10,000.001.776
tae2 10,000.001.541
he3 15,000.000.762
ye2 10,000.000.632
that3 15,000.000.465
the10 50,000.000.259
an6 30,000.000.165
a5 25,000.000.136
in3 15,000.000.004
merlin's2 10,000.00nan
it2 10,000.000.061