A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z List of texts Statistics Top200 Search Compare

Douglas, Sheila

Punctuation analysis - Comparative word frequency analysis

Stats

Total words by this author in corpus - 1,502
Total unique words used by this author in corpus - 538
Ratio of total words to unique words - 2.792
Tagged as WCE (West Central) dialect.
Top ten most common words - the, a, o, an, tae, fowk, that, in, it, hae,

List of texts in corpus

Lallans 59 - Rants and Foys
Lallans Magazine (2001) in Lallans dialect, categorised as newspaper (1,502 words)

Author word usage frequencies

WordCount Normalised
per 100,000
the895925.43
a493262.32
o463062.58
an432862.85
tae422796.27
fowk261731.03
that251664.45
in241597.87
it221464.71
hae181198.4
or171131.82
is171131.82
ye15998.67
but15998.67
are14932.09
muisic13865.51
be13865.51
club12798.93
sang12798.93
i11732.36
aa11732.36
clubs10665.78
on10665.78
sangs10665.78
wi9599.2
there9599.2
aboot9599.2
this9599.2
their8532.62
festivals8532.62
as8532.62
fae8532.62
hear8532.62
wis8532.62
and8532.62
some7466.05
haes7466.05
whit7466.05
for7466.05
they7466.05
been7466.05
singin7466.05
singers6399.47
kin6399.47
artists6399.47
weel6399.47
by6399.47
at6399.47
ithers5332.89
mak5332.89
maist5332.89
scots5332.89
jist5332.89
oot5332.89
tmsa5332.89
ony5332.89
ither5332.89
them5332.89
folk5332.89
stertit5332.89
wha5332.89
tradition5332.89
yer5332.89
mair5332.89
tunes4266.31
like4266.31
oor4266.31
pairt4266.31
aabody4266.31
nae4266.31
micht4266.31
ain4266.31
whyles4266.31
rin4266.31
gie4266.31
were4266.31
cam4266.31
prentit4266.31
mony4266.31
up4266.31
say4266.31
athoot4266.31
s4266.31
ilka4266.31
fir3199.73
muckle3199.73
sin3199.73
fack3199.73
guest3199.73
twa-three3199.73
ettle3199.73
yins3199.73
me3199.73
isnae3199.73
wey3199.73
tradeetion3199.73
guid3199.73
festival3199.73
year3199.73
its3199.73
tak3199.73
noo3199.73
gey3199.73
than3199.73
come3199.73
scottish3199.73
auld3199.73
set3199.73
directory3199.73
can3199.73
whaur3199.73
haill3199.73
bein3199.73
days3199.73
nicht3199.73
kintrae3199.73
sing3199.73
aye3199.73
get2133.16
instruments2133.16
muisicians2133.16
owre2133.16
fyowe2133.16
groups2133.16
acause2133.16
til2133.16
rinnin2133.16
whan2133.16
uphaud2133.16
screivin2133.16
leid2133.16
unalike2133.16
buikie2133.16
national2133.16
will2133.16
fauts2133.16
niver2133.16
lang2133.16
hoo2133.16
heid2133.16
aathin2133.16
revival2133.16
gae2133.16
jyne2133.16
yin2133.16
may2133.16
stream2133.16
time2133.16
younkers2133.16
new2133.16
aiberdeen2133.16
club's2133.16
reglar2133.16
played2133.16
rants2133.16
inverness2133.16
spots2133.16
then2133.16
twa2133.16
modren2133.16
hearken2133.16
cuid2133.16
thocht2133.16
scene2133.16
haed2133.16
becam2133.16
braw2133.16
gaun2133.16
foondit2133.16
begoud2133.16
tent2133.16
nichts2133.16
willie2133.16
fassoun2133.16
sung2133.16
foys2133.16
gin2133.16
associe2133.16
langsyne2133.16
fiddler2133.16
maitter2133.16
ploy2133.16
apen2133.16
hert2133.16
we2133.16
these2133.16
itsel2133.16
coorse2133.16
pynt2133.16
ae2133.16
dune2133.16
-2133.16
naethin2133.16
thae2133.16
ye'll2133.16
concerts2133.16
ceilidhs2133.16