A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z List of texts Statistics Top200 Search Compare

Pacitti, Stephen

Punctuation analysis - Comparative word frequency analysis

Stats

Total words by this author in corpus - 3,048
Total unique words used by this author in corpus - 964
Ratio of total words to unique words - 3.162
Top ten most common words - the, a, an, ', he, in, o, wis, ye, his,

List of texts in corpus

Lallans 81 - Pyntin the Been
Lallans Magazine (2012) in Doric (Aberdeen) dialect, categorised as newspaper (3,048 words)

Author word usage frequencies

WordCount Normalised
per 100,000
the1444724.41
a1123674.54
an882887.14
'752460.63
he531738.85
in491607.61
o471541.99
wis461509.19
ye441443.57
his441443.57
ti421377.95
that341115.49
wollaston30984.25
nae26853.02
ah26853.02
be26853.02
it25820.21
said24787.4
oot23754.59
wi23754.59
for23754.59
on22721.78
airchie22721.78
aa19623.36
but19623.36
yer18590.55
him18590.55
fit18590.55
-18590.55
they16524.93
or16524.93
day15492.13
fae15492.13
pit14459.32
at13426.51
doon13426.51
mair13426.51
tae12393.7
been12393.7
man12393.7
me12393.7
is12393.7
aal12393.7
this12393.7
richt11360.89
you11360.89
there11360.89
got11360.89
mi10328.08
gun10328.08
lik10328.08
hoose10328.08
nummer9295.28
weel9295.28
fan9295.28
had9295.28
na9295.28
noo9295.28
up9295.28
ower9295.28
their8262.47
gied8262.47
see8262.47
nivver8262.47
were8262.47
'na8262.47
nor8262.47
jist8262.47
hae8262.47
aboot7229.66
cooncil7229.66
myne7229.66
year7229.66
mak7229.66
chairlie7229.66
brocht7229.66
fowk6196.85
reid6196.85
maisie6196.85
them6196.85
aff6196.85
new6196.85
inti6196.85
twa6196.85
wollaston's6196.85
dae6196.85
ah'm5164.04
mr5164.04
ane5164.04
couldna5164.04
toon5164.04
can5164.04
fa5164.04
are5164.04
we5164.04
afore5164.04
think5164.04
'an5164.04
efter5164.04
ye've5164.04
far5164.04
awa5164.04
he'd5164.04
road5164.04
lang5164.04
some5164.04
thing5164.04
has5164.04
bob5164.04
alf5164.04
dinna5164.04
ira5164.04
ay5164.04
business5164.04
turnt4131.23
body4131.23
did4131.23
time4131.23
month4131.23
fergus4131.23
pick4131.23
kent4131.23
bleed4131.23
as4131.23
bit4131.23
spak4131.23
ivvry4131.23
her4131.23
wey4131.23
shakin4131.23
like4131.23
it's4131.23
div4131.23
wad4131.23
daein4131.23
and4131.23
gweed4131.23
fair4131.23
tak4131.23
wee4131.23
look4131.23
three4131.23
syne4131.23
come4131.23
wid4131.23
'the4131.23
ken4131.23
of4131.23
cam4131.23
pynted4131.23
park4131.23
butterworth4131.23
laist4131.23
he's4131.23
get4131.23
'fit4131.23
heid398.43
cried398.43
pouch398.43
big398.43
name398.43
to398.43
blin398.43
hissel398.43
half398.43
cheer398.43
ony398.43
again398.43
could398.43
still398.43
fifteenth398.43
naebody398.43
siller398.43
ayewis398.43
drink398.43
'noo398.43
'wollaston398.43
o't398.43
eneuch398.43
seen398.43
birthday398.43
gin398.43
til398.43
able398.43
word398.43
haggart398.43
peyed398.43
says398.43
than398.43
kitchen398.43
surely398.43
maybe398.43
maks398.43
letter398.43
winner398.43
ain398.43
moo398.43
if398.43
hear398.43
damn398.43
till398.43
wint398.43
'ay398.43
here398.43
ma398.43
life398.43
need398.43
men398.43
she398.43
daen398.43
they're398.43
hale398.43
ahin398.43
near398.43
eh265.62
'come265.62
onythin265.62
mairrit265.62
haan265.62
likes265.62
een265.62
corner265.62
was265.62
fusper265.62
canna265.62
loon265.62
echty265.62
bade265.62
born265.62
deceesions265.62
really265.62
anely265.62
that's265.62
ten265.62
'twa265.62
grunt265.62
meenit265.62
'pick265.62
tell265.62
chiel265.62
bottom265.62
though265.62
forgotten265.62
line265.62
atween265.62
six265.62
face265.62
words265.62
stood265.62
quick265.62
vyce265.62
'aa265.62
into265.62
gaed265.62
yersel265.62
picked265.62
mither265.62
guilty265.62
anither265.62
sittin265.62
dother265.62
thegither265.62
wik265.62
back265.62
ooer265.62
'ah265.62
pairty265.62
fine265.62
lump265.62
tried265.62
reality265.62
turned265.62
gings265.62
wisna265.62
shouders265.62
pittin265.62
drummond265.62
bailiff265.62
finger265.62
thin265.62
mean265.62
pulld265.62
somethin265.62
coorse265.62
breath265.62
walked265.62
wi't265.62
order265.62
sent265.62
lass265.62
dee265.62
caird265.62
'this265.62
names265.62
till't265.62
second265.62
swallyin265.62
then265.62
dad265.62
doot265.62
hunner265.62
beuk265.62
affa265.62
kinna265.62
hiv265.62
intae265.62
micht265.62
suppose265.62
jamieson265.62
pollisman265.62
ah've265.62
laach265.62
same265.62
ee265.62
'och265.62
bawbee265.62
widna265.62
cos265.62
gie265.62
i265.62
justice265.62
airchie's265.62
thocht265.62
gairden265.62
compensation265.62
giein265.62
bi265.62
its265.62
ticht265.62
here's265.62
'weel265.62
stairt265.62
freen265.62
richt-thinkin265.62
'please265.62
haans265.62