A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

NewtonLass

Basic Stats

Total words by this author in corpus - 229
Total unique words used by this author in corpus - 130
Ratio of total words to unique words - 1.762
Tagged as PUL (Peninsular Ulster (Ards)) dialect.
Top ten most common words - tha, s, an, a, it, tay, ye, tae, fur, in,

List of texts in corpus

Tay
newtonlass.blogspot.com (14-June-2011) in Ulster (Newtownards) dialect (PUL), categorised as poetry (229 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
tha13 56,768.5685.126
tay7 30,567.6971.269
brew2 8,733.6225.340
spoon2 8,733.6224.719
wairm2 8,733.6224.182
larned2 8,733.6223.709
s10 43,668.1218.844
pot2 8,733.6217.554
ir3 13,100.4417.477
hot2 8,733.6217.460
daen2 8,733.6216.628
watter3 13,100.4416.138
ocht2 8,733.6215.302
ivery2 8,733.6214.944
the3 13,100.4412.208
thon3 13,100.4411.776
it9 39,301.319.201
hale2 8,733.628.831
cannae2 8,733.628.781
fur4 17,467.258.501
keep2 8,733.627.968
ye5 21,834.066.519
yin2 8,733.626.519
maist2 8,733.626.147
pit2 8,733.625.636
say2 8,733.625.581
no3 13,100.444.857
dae2 8,733.624.518
me3 13,100.443.828
yer2 8,733.623.171
day2 8,733.622.907
an10 43,668.122.566
hae2 8,733.622.360
fae2 8,733.621.859
ma2 8,733.620.764
a9 39,301.310.731
in3 13,100.440.097
tae5 21,834.060.002