A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Harvey, Lewis

Basic Stats

Total words by this author in corpus - 777
Total unique words used by this author in corpus - 289
Ratio of total words to unique words - 2.689
Tagged as MNB (Mid Northern B) dialect.
Top ten most common words - a, the, wis, and, her, tae, she, ma, me, hid,

List of texts in corpus

Crash
Scots Hoose (2020) in Doric (Forres) dialect (MNB), categorised as prose (777 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
a67 86,229.0958.907
wifie7 9,009.0142.585
skweel5 6,435.0141.433
her22 28,314.0339.626
usual5 6,435.0136.410
and24 30,888.0336.326
detention3 3,861.0034.903
fit10 12,870.0132.258
hid11 14,157.0131.915
ok4 5,148.0131.266
me15 19,305.0228.470
wid9 11,583.0126.426
she18 23,166.0226.296
wis26 33,462.0326.255
telt7 9,009.0124.406
bawlin2 2,574.0022.083
jist9 11,583.0121.655
ma15 19,305.0221.418
heidie2 2,574.0021.176
well5 6,435.0121.148
happened4 5,148.0120.302
lookit3 3,861.0017.896
bobbies2 2,574.0016.515
could6 7,722.0116.269
heided2 2,574.0015.249
fan4 5,148.0114.926
excuse2 2,574.0013.643
go4 5,148.0112.525
to6 7,722.0111.776
arrived2 2,574.0011.291
pulled2 2,574.0011.163
an7 9,009.0111.153
next3 3,861.0011.105
should3 3,861.0010.364
then5 6,435.019.890
office2 2,574.009.845
gaen2 2,574.009.587
ca2 2,574.008.804
stick2 2,574.008.576
hiv3 3,861.008.431
road3 3,861.008.221
wait2 2,574.008.131
walk2 2,574.008.021
they9 11,583.017.850
thocht4 5,148.017.625
there7 9,009.017.569
car2 2,574.007.447
done2 2,574.007.401
seemed2 2,574.007.138
wint2 2,574.006.797
thing3 3,861.006.602
tell3 3,861.006.024
roond2 2,574.006.014
turn2 2,574.005.766
door3 3,861.005.418
ony3 3,861.004.995
get4 5,148.014.982
git2 2,574.004.917
niver2 2,574.004.689
help2 2,574.004.669
let2 2,574.004.117
them4 5,148.013.932
was3 3,861.003.931
the33 42,471.043.861
heid3 3,861.003.769
fair2 2,574.003.704
need2 2,574.003.704
so3 3,861.003.390
through2 2,574.003.382
weel3 3,861.003.056
fir2 2,574.002.710
o10 12,870.012.639
aff3 3,861.002.524
it5 6,435.012.318
kent2 2,574.002.282
if3 3,861.001.944
i2 2,574.001.905
see3 3,861.001.772
s3 3,861.001.721
my2 2,574.001.713
for2 2,574.001.615
at8 10,296.011.562
aboot4 5,148.011.026
back3 3,861.000.985
as7 9,009.010.958
awa2 2,574.000.935
chik2 2,574.00nan
ower3 3,861.000.760
auld2 2,574.000.710
said2 2,574.000.665
tae20 25,740.030.660
wi4 5,148.010.623
in10 12,870.010.397
noo2 2,574.000.374
up4 5,148.010.321
d2 2,574.000.309
intae2 2,574.000.293
be5 6,435.010.280
but4 5,148.010.273
been2 2,574.000.202
day2 2,574.000.142
or2 2,574.000.116
that7 9,009.010.059
doon2 2,574.000.058
oot3 3,861.000.023
angrier2 2,574.00nan
nae2 2,574.000.020