A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z List of texts Statistics Top200 Search Compare

Malgrati, Paul

Punctuation analysis - Comparative word frequency analysis

Stats

Total words by this author in corpus - 1,990
Total unique words used by this author in corpus - 764
Ratio of total words to unique words - 2.605
Top ten most common words - the, scots, an, o, tae, a, that, is, in, it,

List of texts in corpus

Ayont the Fake and the Leal: Let's Free the Leid
Bella Caledonia (02-12-2018 ) in Lallans dialect, categorised as blog (1,990 words)

Author word usage frequencies

WordCount Normalised
per 100,000
the884422.11
scots663316.58
an603015.08
o562814.07
tae492462.31
a492462.31
that391959.8
is341708.54
in281407.04
it261306.53
or231155.78
-201005.03
as19954.77
fae19954.77
fir18904.52
we18904.52
wi17854.27
leid17854.27
but14703.52
aye11552.76
be11552.76
nae11552.76
us10502.51
this10502.51
ither9452.26
bi9452.26
ye9452.26
aa9452.26
there8402.01
wye8402.01
e'en8402.01
fowks8402.01
maun8402.01
they8402.01
ilka7351.76
fake7351.76
muckle7351.76
his7351.76
wha7351.76
oor7351.76
isna7351.76
wis7351.76
eh7351.76
linguistic7351.76
its7351.76
national7351.76
mak7351.76
ony6301.51
aboot6301.51
tak6301.51
yin6301.51
like6301.51
syne6301.51
sic6301.51
on6301.51
new6301.51
can6301.51
braid5251.26
mony5251.26
wirds5251.26
whit5251.26
alistair's5251.26
hiv5251.26
yon5251.26
maist5251.26
genuine5251.26
nor5251.26
makkars5251.26
hail5251.26
gin5251.26
at5251.26
proper5251.26
whin4201.01
ower4201.01
time4201.01
said4201.01
their4201.01
ainly4201.01
scottish4201.01
door4201.01
ithers4201.01
sae4201.01
warld4201.01
lallans4201.01
and4201.01
authenticity4201.01
doric4201.01
ettle4201.01
bein4201.01
ken4201.01
ayont4201.01
aareadies4201.01
staunnart4201.01
thegither4201.01
strength4201.01
baith4201.01
labbyists4201.01
coorse4201.01
people4201.01
ivver4201.01
hae4201.01
onybody4201.01
canna3150.75
state3150.75
want3150.75
whaur3150.75
mair3150.75
leal3150.75
real3150.75
medium3150.75
here3150.75
me3150.75
fin'3150.75
need3150.75
true3150.75
it's3150.75
poo'er3150.75
wad3150.75
speik3150.75
cultural3150.75
artifice3150.75
modren3150.75
auld3150.75
staun3150.75
gie3150.75
were3150.75
clip3150.75
to3150.75
are3150.75
agin3150.75
braa3150.75
video3150.75
that's3150.75
tongue3150.75
he3150.75
them3150.75
up3150.75
jyst3150.75
thae3150.75
you3150.75
whit's2100.5
art2100.5
amang2100.5
weel2100.5
synthetic2100.5
kent2100.5
demotic2100.5
isnae2100.5
speak2100.5
either2100.5
means2100.5
pit2100.5
doon2100.5
ain2100.5
kind2100.5
rule2100.5
north-east2100.5
artificial2100.5
pairt-takkers2100.5
lang2100.5
contemporar2100.5
tawk2100.5
quirk2100.5
intae2100.5
dinnae2100.5
noo2100.5
day2100.5
faker2100.5
whaurever2100.5
offeecial2100.5
johnny2100.5
jeannie2100.5
hunert2100.5
fu2100.5
platform2100.5
twa2100.5
houivver2100.5
kinrick2100.5
aff2100.5
yaised2100.5
speaks2100.5
gushetneuk2100.5
freedom2100.5
awaa2100.5
learn2100.5
see2100.5
dae2100.5
bbc2100.5
na'hin2100.5
shuid2100.5
european2100.5
happenin2100.5
orra2100.5
problem2100.5
comments2100.5
micht2100.5
may2100.5
thing2100.5
happens2100.5
dictionar2100.5
oxford2100.5
yet2100.5
propriety2100.5
foot2100.5
some2100.5
ainership2100.5
kintra2100.5
itsel2100.5
scrievers2100.5
wird2100.5
tongues2100.5
ayebidan2100.5
sound2100.5
say2100.5
poets2100.5
ilk2100.5
whitivver2100.5
people's2100.5
form2100.5
uphaudit2100.5
yer2100.5
nummer2100.5
trust2100.5
mind2100.5
oot2100.5
siccar2100.5
makkit2100.5
million2100.5
moyen2100.5
best2100.5
tent2100.5
vyce2100.5
atween2100.5
fowk2100.5
aroon2100.5
harry2100.5
ahint2100.5
wull2100.5
pure2100.5
phrase2100.5
line2100.5
thocht2100.5
giles2100.5
still2100.5
hid2100.5
yaise2100.5
meddlin2100.5
scotland2100.5
politics2100.5