A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z List of texts Statistics Top200 Search Compare

Docherty, John F.

Punctuation analysis - Comparative word frequency analysis

Stats

Total words by this author in corpus - 2,122
Total unique words used by this author in corpus - 681
Ratio of total words to unique words - 3.116
Tagged as EDN (Edinburgh) dialect.
Top ten most common words - an, the, wis, he, a, tae, in, they, it, ma,

List of texts in corpus

Lallans 81 - Lang Summer Days
Lallans Magazine (2012) in Lallans dialect, categorised as prose (2,122 words)

Author word usage frequencies

WordCount Normalised
per 100,000
an1165466.54
the1064995.29
wis582733.27
he522450.52
a462167.77
tae411932.14
in391837.89
they371743.64
it291366.64
ma281319.51
o261225.26
we241131.01
wid231083.88
wir21989.63
had20942.51
ah19895.38
up17801.13
but17801.13
that16754.01
larry15706.88
wi15706.88
oot14659.75
thir14659.75
fur14659.75
back14659.75
ye14659.75
jist13612.63
whin13612.63
at13612.63
'13612.63
said12565.5
or12565.5
thim11518.38
got10471.25
sometimes10471.25
uncle10471.25
cam10471.25
used9424.13
how9424.13
on9424.13
she9424.13
deid9424.13
wan9424.13
her9424.13
aw8377
hes8377
know8377
doon8377
wee8377
hoose8377
him8377
mither7329.88
here7329.88
mammy7329.88
git7329.88
men7329.88
drunk7329.88
sidney7329.88
thomas7329.88
wint7329.88
road6282.75
this6282.75
war6282.75
says6282.75
if6282.75
night6282.75
say6282.75
great6282.75
us5235.63
as5235.63
see5235.63
taur5235.63
uncles5235.63
then5235.63
brithers5235.63
wullyum5235.63
niver5235.63
even5235.63
oan5235.63
because5235.63
before5235.63
peter5235.63
think4188.5
jings4188.5
ower4188.5
big4188.5
aboot4188.5
muscles4188.5
dae4188.5
went4188.5
setterday4188.5
intae4188.5
efter4188.5
fae4188.5
walk4188.5
kinna4188.5
me4188.5
stey4188.5
germans4188.5
gave4188.5
didnae4188.5
two4188.5
day4188.5
cuidnae4188.5
come4188.5
whit4188.5
whir3141.38
so3141.38
feet3141.38
lik3141.38
wummin3141.38
hair3141.38
away3141.38
something3141.38
atlas3141.38
faither3141.38
work3141.38
sayin3141.38
been3141.38
time3141.38
did3141.38
sent3141.38
no3141.38
hid3141.38
warnock3141.38
place3141.38
pit3141.38
watter3141.38
boots3141.38
don't3141.38
tell3141.38
-3141.38
charles3141.38
oor3141.38
tommy3141.38
friday3141.38
'ah3141.38
go3141.38
days3141.38
coorse3141.38
thing3141.38
came3141.38
were3141.38
awa3141.38
lalor3141.38
dunkirk3141.38
ah'll3141.38
like3141.38
mibbe3141.38
is3141.38
heypen294.25
comin294.25
played294.25
smile294.25
weerin294.25
survived294.25
bit294.25
auntie294.25
pavement294.25
his294.25
stuck294.25
funny294.25
et294.25
letters294.25
gordon294.25
teeth294.25
til294.25
ur294.25
months294.25
especially294.25
laugh294.25
lannot294.25
chair294.25
highlanders294.25
anither294.25
date294.25
daddy294.25
play294.25
door294.25
three294.25
feart294.25
long294.25
hoat294.25
shore294.25
ella294.25
hard294.25
street294.25
black294.25
sidney's294.25
family294.25
widnae294.25
lifted294.25
michael294.25
loat294.25
talk294.25
left294.25
heard294.25
suit294.25
it's294.25
awright294.25
durin294.25
much294.25
alang294.25
liked294.25
somebody294.25
be294.25
luik294.25
angry294.25
wurds294.25
spit294.25
nineteen294.25
driver294.25
brushed294.25
mind294.25
thirty294.25
cause294.25
sat294.25
goad294.25
telegram294.25
very294.25
brocht294.25
sojers294.25
spread294.25
weel294.25
paris294.25
curly294.25
made294.25
forget294.25
smell294.25
never294.25
noo294.25
bi294.25
stanes294.25
gaein294.25
wheels294.25
look294.25
still294.25
handsome294.25
talking294.25
smart294.25
hope294.25
hame294.25
white294.25
roon294.25
strong294.25
quiet294.25
hauns294.25
alive294.25
suits294.25
write294.25
cuid294.25