A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z List of texts Statistics Top200 Search Compare

Holton, Brian

Punctuation analysis - Comparative word frequency analysis

Stats

Total words by this author in corpus - 1,561
Total unique words used by this author in corpus - 626
Ratio of total words to unique words - 2.494
Tagged as SEA (South East (Borders)) dialect.
Top ten most common words - a, the, an, in, ti, wis, o, and, at, ma,

List of texts in corpus

An Owresetter’s Tale
Bella Caledonia (2015-Nov-26 ) in Lallans (Galashiels) dialect, categorised as blog (1,561 words)

Author word usage frequencies

WordCount Normalised
per 100,000
a915829.6
the865509.29
an593779.63
in513267.14
ti422690.58
wis372370.28
o281793.72
and241537.48
at221409.35
ma191217.17
wi171089.05
he15960.92
s14896.86
hed13832.8
for12768.74
on12768.74
frae11704.68
scots11704.68
his10640.61
it10640.61
we9576.55
chinese9576.55
that9576.55
d9576.55
my8512.49
but8512.49
english8512.49
eftir7448.43
gaed7448.43
as7448.43
aa7448.43
no7448.43
read6384.37
cud6384.37
hou6384.37
fursten6384.37
or5320.31
poetry5320.31
uis5320.31
t5320.31
latin5320.31
5320.31
under5320.31
hausa5320.31
never5320.31
ye5320.31
whaur5320.31
french5320.31
mither5320.31
schuil4256.25
sae4256.25
us4256.25
stert4256.25
year4256.25
tho4256.25
me4256.25
fowk4256.25
harvey4256.25
readin4256.25
fair4256.25
up4256.25
our4256.25
of4256.25
university4256.25
stertit4256.25
this4256.25
leid4256.25
greek3192.18
some3192.18
tae3192.18
norman3192.18
whan3192.18
brither3192.18
mcdiarmid3192.18
been3192.18
library3192.18
wee3192.18
dae3192.18
waley3192.18
john3192.18
gaun3192.18
was3192.18
poems3192.18
masel3192.18
scott3192.18
place3192.18
scotland3192.18
back3192.18
inti3192.18
out3192.18
donald3192.18
dominies3192.18
then3192.18
cam3192.18
hous3192.18
twin3192.18
dad3192.18
swahili3192.18
galashiels3192.18
faither3192.18
born3192.18
young3192.18
to3192.18
i3192.18
big2128.12
thon2128.12
tongue2128.12
saxt2128.12
language2128.12
ken2128.12
yin2128.12
flittit2128.12
owreset2128.12
whit2128.12
says2128.12
be2128.12
feinisht2128.12
smith2128.12
edinburgh2128.12
spak2128.12
hedna2128.12
alec2128.12
arthur2128.12
nou2128.12
nigeria2128.12
high2128.12
spent2128.12
mind2128.12
her2128.12
memory2128.12
like2128.12
juist2128.12
hugh2128.12
kent2128.12
baith2128.12
later2128.12
wir2128.12
uncle2128.12
war2128.12
sit2128.12
2128.12
academy2128.12
grandfaither2128.12
tak2128.12
morn2128.12
2128.12
games2128.12
wad2128.12
dolby2128.12
throu2128.12
literature2128.12
what2128.12
mcinnes2128.12
time2128.12
gotten2128.12
bill2128.12
see2128.12
original2128.12
gala2128.12
lagos2128.12
west2128.12
doun2128.12
beukie2128.12
apply2128.12
shelf2128.12
pidgin2128.12
write2128.12
yon2128.12
saicont2128.12
ain2128.12
hame2128.12
leids2128.12
onie2128.12
furst2128.12
-2128.12
tale2128.12
leirit2128.12
spied2128.12
fund2128.12
by2128.12
day2128.12
awfu2128.12
can2128.12
lessons2128.12
mair2128.12
kennin2128.12
monie2128.12
border2128.12
beuk2128.12
pittin2128.12
tellt2128.12
come2128.12