A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Mainland-Sinclair, Marina Jane

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 122
Ratio of total words to unique words - 1.639
Tagged as SHD (Shetland) dialect.
Top ten most common words - be, will, an', it, we, aa, you, a, da, as,

List of texts in corpus


Facebook (September 8,2017) in Shetland dialect (SHD), categorised as social (200 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
an'7 35,000.0052.534
will7 35,000.0038.690
fok3 15,000.0034.184
aa6 30,000.0022.360
you6 30,000.0022.290
group3 15,000.0020.757
be8 40,000.0019.430
different3 15,000.0018.322
spell2 10,000.0017.648
wirds3 15,000.0016.321
we6 30,000.0014.043
dialect2 10,000.0012.865
ony3 15,000.0012.307
foo2 10,000.0011.150
o'2 10,000.0010.584
da4 20,000.009.783
your2 10,000.009.010
ta2 10,000.008.833
dis2 10,000.007.805
it7 35,000.006.023
can2 10,000.004.142
as4 20,000.003.985
nae2 10,000.002.180
für2 10,000.00nan
for3 15,000.002.165
i3 15,000.001.950
is3 15,000.001.818
tae2 10,000.001.541
in2 10,000.000.464
a5 25,000.000.136