A Corpus of 21st Century Scots Texts


Mainland-Sinclair, Marina Jane

Basic Stats

Total words by this author in corpus - 200
Total unique words used by this author in corpus - 122
Ratio of total words to unique words - 1.639
Tagged as SHD (Shetland) dialect.
Top ten most common words - be, will, an', it, we, aa, you, a, da, as
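
The figures above can be reproduced from a tokenised text with a few lines of Python. This is a minimal sketch, not the corpus site's own code: the function name `basic_stats` and the sample token list are hypothetical, and it assumes the text has already been split into lower-cased word tokens (the tokenisation rules used by the corpus are not stated here).

```python
from collections import Counter


def basic_stats(tokens, top_n=10):
    """Return (total words, unique words, total/unique ratio, top words)."""
    counts = Counter(tokens)
    total = sum(counts.values())   # every token, repeats included
    unique = len(counts)           # distinct word forms
    ratio = round(total / unique, 3) if unique else 0.0
    top = [word for word, _ in counts.most_common(top_n)]
    return total, unique, ratio, top


# Hypothetical, already-tokenised sample (not taken from the corpus text)
sample = ["da", "fok", "will", "be", "da", "be", "be"]
print(basic_stats(sample, top_n=3))

# The ratio reported for this author follows from the two totals given above
print(round(200 / 122, 3))
```

The ratio is total words divided by unique words, so a higher value means more repetition; 200 words over 122 unique forms gives the 1.639 listed above.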

List of texts in corpus


Facebook (September 8, 2017) in Shetland dialect (SHD), categorised as social (200 words)

Author word Keyness frequencies

This should list the words that the author uses disproportionately more often than other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
Word       Count  Normalised per million  Keyness
an'            7               35,000.00   52.491
will           7               35,000.00   38.889
fok            3               15,000.00   34.258
you            6               30,000.00   22.760
aa             6               30,000.00   21.862
group          3               15,000.00   20.703
be             8               40,000.00   19.451
different      3               15,000.00   18.290
spell          2               10,000.00   17.782
wirds          3               15,000.00   16.393
we             6               30,000.00   13.832
dialect        2               10,000.00   12.913
ony            3               15,000.00   12.114
foo            2               10,000.00   11.135
o'             2               10,000.00   10.536
da             4               20,000.00    9.813
your           2               10,000.00    9.178
ta             2               10,000.00    8.863
dis            2               10,000.00    7.832
it             7               35,000.00    6.098
can            2               10,000.00    4.173
as             4               20,000.00    3.951
nae            2               10,000.00    2.174
für            2               10,000.00      nan
for            3               15,000.00    2.162
i              3               15,000.00    2.019
is             3               15,000.00    1.816
tae            2               10,000.00    1.563
in             2               10,000.00    0.468
a              5               25,000.00    0.141
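
The two derived columns can be illustrated with a short Python sketch. The normalised figure is simply the count scaled to a rate per million words, which matches the table (7 occurrences in 200 words gives 35,000.00). The keyness formula is an assumption: this page does not say which statistic the corpus uses, so the sketch shows the log-likelihood measure commonly used for keyness in corpus linguistics, and its results are not expected to match the Keyness column. The function names and the reference-corpus figures in the example are hypothetical.

```python
import math


def per_million(count, total_words):
    """Scale a raw count to occurrences per million words."""
    return count * 1_000_000 / total_words


def log_likelihood_keyness(a, c, b, d):
    """Log-likelihood keyness (assumed measure, not confirmed by the source).

    a: word count in the author's texts, c: total words by the author
    b: word count in the reference texts, d: total words in the reference
    """
    expected_author = c * (a + b) / (c + d)
    expected_ref = d * (a + b) / (c + d)
    # A zero count contributes nothing; skipping it avoids log(0)
    term_author = a * math.log(a / expected_author) if a > 0 else 0.0
    term_ref = b * math.log(b / expected_ref) if b > 0 else 0.0
    return 2 * (term_author + term_ref)


# Count and total taken from the table above: an' occurs 7 times in 200 words
print(f"{per_million(7, 200):,.2f}")

# Hypothetical reference figures, chosen only for illustration
print(round(log_likelihood_keyness(a=7, c=200, b=50, d=100_000), 3))
```

Guarding the zero-count case keeps the result finite when a word never appears in the reference texts. Whether an unguarded calculation is the cause of the nan shown for für is not stated by the source.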