A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Harvey, Lewis

Basic Stats

Total words by this author in corpus - 777
Total unique words used by this author in corpus - 289
Ratio of total words to unique words - 2.689
Tagged as MNB (Mid Northern B) dialect.
Top ten most common words - a, the, wis, and, her, tae, she, ma, me, hid,

List of texts in corpus

Crash
Scots Hoose (2020) in Doric (Forres) dialect (MNB), categorised as prose (777 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
a67 86,229.0958.914
wifie7 9,009.0142.607
skweel5 6,435.0141.449
her22 28,314.0339.665
usual5 6,435.0136.426
and24 30,888.0336.381
detention3 3,861.0034.913
fit10 12,870.0132.288
hid11 14,157.0131.736
ok4 5,148.0131.082
me15 19,305.0228.481
wid9 11,583.0126.453
she18 23,166.0226.315
wis26 33,462.0326.314
telt7 9,009.0124.407
bawlin2 2,574.0022.089
jist9 11,583.0121.629
ma15 19,305.0221.430
heidie2 2,574.0021.182
well5 6,435.0121.103
happened4 5,148.0120.315
lookit3 3,861.0017.906
bobbies2 2,574.0016.522
could6 7,722.0116.236
heided2 2,574.0015.256
fan4 5,148.0114.938
excuse2 2,574.0013.649
go4 5,148.0112.488
to6 7,722.0111.792
arrived2 2,574.0011.233
an7 9,009.0111.132
next3 3,861.0011.069
pulled2 2,574.0010.925
should3 3,861.0010.373
then5 6,435.019.872
office2 2,574.009.852
gaen2 2,574.009.594
ca2 2,574.008.810
stick2 2,574.008.583
hiv3 3,861.008.440
road3 3,861.008.217
wait2 2,574.008.137
walk2 2,574.008.027
they9 11,583.017.809
thocht4 5,148.017.635
there7 9,009.017.563
car2 2,574.007.453
done2 2,574.007.407
seemed2 2,574.007.144
wint2 2,574.006.803
thing3 3,861.006.611
tell3 3,861.006.032
roond2 2,574.006.020
turn2 2,574.005.772
door3 3,861.005.419
ony3 3,861.004.996
get4 5,148.014.983
git2 2,574.004.901
niver2 2,574.004.694
help2 2,574.004.664
let2 2,574.004.098
was3 3,861.003.938
them4 5,148.013.917
the33 42,471.043.867
heid3 3,861.003.767
fair2 2,574.003.709
need2 2,574.003.709
so3 3,861.003.393
through2 2,574.003.375
weel3 3,861.003.059
fir2 2,574.002.715
o10 12,870.012.619
aff3 3,861.002.513
it5 6,435.012.325
kent2 2,574.002.287
if3 3,861.001.945
i2 2,574.001.898
see3 3,861.001.774
s3 3,861.001.731
my2 2,574.001.717
for2 2,574.001.608
at8 10,296.011.560
aboot4 5,148.011.026
back3 3,861.000.979
as7 9,009.010.950
awa2 2,574.000.938
chik2 2,574.00nan
ower3 3,861.000.764
auld2 2,574.000.710
tae20 25,740.030.662
said2 2,574.000.646
wi4 5,148.010.622
in10 12,870.010.394
noo2 2,574.000.375
up4 5,148.010.320
d2 2,574.000.311
intae2 2,574.000.292
be5 6,435.010.279
but4 5,148.010.275
been2 2,574.000.202
day2 2,574.000.143
or2 2,574.000.115
that7 9,009.010.060
doon2 2,574.000.058
oot3 3,861.000.023
angrier2 2,574.00nan
nae2 2,574.000.020