A Corpus of 21st Century Scots Texts
Intro
a
b
c
d
e
f
g
h
i
j
k
l
m
n
o
p
q
r
s
t
u
v
w
x
y
z
Texts
Writers
Statistics
Top200
Search
Compare
. .Previous author - Next author
- dialect comparison -
fine grain dialect comparison -
Venn diagrams -
punctuation analysis -
chronology -
Love, Rowena M.
Basic Stats
Total words by this author in corpus - 268
Total unique words used by this author in corpus - 192
Ratio of total words to unique words - 1.396
Tagged as LAL (General Central) dialect.
Top ten most common words - the, an, a, wi, in, as, o, is, tae, ma,
List of texts in corpus
Lallans 82 - Simmer Strand
Lallans Magazine (2013-07 ) in Central dialect (LAL), categorised as poetry
(96 words)
Lallans 82 - Hame tae Dunbar Harbour
Lallans Magazine (2013-07 ) in Central dialect (LAL), categorised as poetry
(104 words)
Lallans 82 - Hamecomin
Lallans Magazine (2013-07 ) in Central dialect (LAL), categorised as poetry
(68 words)
Author word Keyness frequencies
This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
Word | Count |
Normalised per million |
Keyness |
wuid | 2 |
7,462.69 | 12.355 |
hert | 2 |
7,462.69 | 9.651 |
oan | 3 |
11,194.03 | 7.267 |
as | 6 |
22,388.06 | 6.965 |
nor | 2 |
7,462.69 | 5.997 |
time | 3 |
11,194.03 | 5.972 |
hame | 2 |
7,462.69 | 5.332 |
wi | 6 |
22,388.06 | 5.259 |
heid | 2 |
7,462.69 | 4.660 |
bi | 2 |
7,462.69 | 4.085 |
awa | 2 |
7,462.69 | 3.986 |
auld | 2 |
7,462.69 | 3.621 |
sae | 2 |
7,462.69 | 3.101 |
is | 4 |
14,925.37 | 2.394 |
ma | 3 |
11,194.03 | 1.975 |
in | 6 |
22,388.06 | 0.713 |
an | 9 |
33,582.09 | 0.664 |
tae | 4 |
14,925.37 | 0.598 |
it | 2 |
7,462.69 | 0.503 |
the | 18 |
67,164.18 | 0.374 |
a | 7 |
26,119.40 | 0.098 |
o | 5 |
18,656.72 | 0.051 |
he | 2 |
7,462.69 | 0.048 |
that | 3 |
11,194.03 | 0.047 |
plet | 2 |
7,462.69 | nan |