A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z List of texts Statistics Top200 Search Compare

Ulster Scots Agency

Punctuation analysis - Comparative word frequency analysis

Stats

Total words by this author in corpus - 6,356
Total unique words used by this author in corpus - 928
Ratio of total words to unique words - 6.849
Tagged as SYN (Synthetic (no region)) dialect.
Top ten most common words - tha, or, in, o, tae, ye, fur, a, an, oan,

List of texts in corpus

Ulster Scots Census
census.gov.uk (2021-04-03 ) in Ulster dialect, categorised as government (6,356 words)

Author word usage frequencies

WordCount Normalised
per 100,000
tha3425380.74
or2153382.63
in1522391.44
o1402202.64
tae1342108.24
ye1312061.04
fur1261982.38
a1201887.98
an1161825.05
oan1021604.78
bodie911431.72
yer851337.32
speirins731148.52
oot671054.12
h651022.66
heid-coont62975.46
foarm59928.26
hoosehaud57896.79
name55865.32
speirin52818.12
this51802.39
be47739.46
bes46723.73
whut45707.99
address44692.26
gin43676.53
ither42660.79
mair41645.06
fowk39613.59
thair34534.93
page34534.93
€˜at33519.19
uk33519.19
wi32503.46
at32503.46
€™s31487.73
doon30471.99
scrieve29456.26
fill29456.26
ordnar29456.26
yin28440.53
€™l25393.33
the25393.33
sing25393.33
faimlie25393.33
like24377.6
v24377.6
wark24377.6
nae24377.6
hae24377.6
€™24377.6
mairk23361.86
by22346.13
sib21330.4
aa21330.4
maun21330.4
pairtner21330.4
wye20314.66
dae20314.66
hame20314.66
til20314.66
as20314.66
tuk19298.93
mairch19298.93
can19298.93
oanline18283.2
frae18283.2
furst18283.2
ceevil18283.2
onie17267.46
aye17267.46
steyin17267.46
wha17267.46
here17267.46
pairt17267.46
bides16251.73
no16251.73
-16251.73
pit15236
airlann15236
gie15236
foremaist15236
hoo15236
throch15236
tak15236
takkin15236
repones15236
it15236
fillin15236
paper14220.26
wud14220.26
yins14220.26
hoose14220.26
bidin14220.26
lander13204.53
effeirs13204.53
dinnae13204.53
ga13204.53
day13204.53
awa13204.53
nor13204.53
jist12188.8
nicht12188.8
ootbye12188.8
naw12188.8
dochter12188.8
place12188.8
st12188.8
sister11173.06
tack11173.06
caa11173.06
bis11173.06
ax11173.06
lear11173.06
buik11173.06
shud11173.06
ilka11173.06
richt11173.06
uise11173.06
census11173.06
whar11173.06
landers10157.33
by-whyles10157.33
box10157.33
norlin10157.33
us10157.33
get10157.33
leid10157.33
aboot10157.33
yeir10157.33
help10157.33
haes10157.33
guidman10157.33
sin10157.33
step-wean10157.33
brither10157.33
guidwife10157.33
that10157.33
need10157.33
sets9141.6
onlie9141.6
dwellin-place9141.6
helpline9141.6
ain9141.6
bak9141.6
heid9141.6
warkin9141.6
we9141.6
laa8125.87
wur8125.87
naethin8125.87
sae8125.87
kintrie8125.87
taak8125.87
ma8125.87
lines8125.87
scrievin8125.87
answer8125.87
weans8125.87
oor8125.87
mak8125.87
accoont8125.87
gov8125.87
www8125.87
maist8125.87
weel8125.87
ingang8125.87
ni8125.87
wrocht8125.87
monds8125.87
aff8125.87
kirk8125.87
sex8125.87
same8125.87
code7110.13
bae7110.13
collegianer7110.13
sennicht7110.13
€™at7110.13
cud7110.13
pey7110.13
ulstèr-scotch7110.13
wus7110.13
aucht7110.13
bide7110.13
feck7110.13
hale7110.13
fillt7110.13
s7110.13
poastcode694.4
date694.4
alow694.4
isnae694.4
ulster-scotch694.4
pairtnerie694.4
freen694.4
airisch694.4
aabodie694.4
smith694.4
soart694.4
nane694.4
halth694.4
consarn694.4
lake694.4
faither694.4
mither694.4
boarn694.4
memmers694.4
oniebodie694.4
but694.4
yersel694.4
haun694.4
anither694.4
owre578.67
oors578.67
parteeklar578.67
cry578.67
level578.67
affen578.67
schuil578.67
€™t578.67
nvq578.67
foster578.67
step-brither578.67
step-sister578.67
siclike578.67
monie578.67
babbies578.67
step-mither578.67
step-faither578.67
haitin578.67
hiddlie578.67
houl578.67
granpaurent578.67
granwean578.67
palls578.67
conteenyed578.67
giein578.67
efter578.67
daen578.67
sent578.67
eik578.67
gif578.67
gien578.67
sen578.67
fit578.67
pye578.67
poast578.67
wittins578.67
stairt578.67
awns578.67
inglisch462.93
mond462.93
yit462.93
bodies462.93
afore462.93
man-bodie462.93
wummin-bodie462.93
naebodie462.93
micht462.93
adae462.93
envelope462.93
uisein462.93
landèr462.93
€™r462.93
sawbith462.93
collegianers462.93
lear-time462.93
luk-ower462.93
i462.93
offys462.93
pages462.93
t462.93
and462.93
lea462.93
these462.93
foarms462.93
heicht462.93
forbye462.93
nivver462.93
ludgers462.93
mine462.93
leeve462.93
some462.93
owresettin462.93
impediment462.93
lenth462.93
wull462.93
declaration462.93
hoosehauds462.93
saiconn462.93
hoaliday462.93
repone462.93
twalmond462.93
follaein462.93
€™d462.93
gies462.93
oul462.93
€™ll462.93
letter462.93
yin-soorce462.93
raed462.93
shane462.93
job462.93
lairnin462.93
up462.93
is462.93
siccar462.93
ir462.93
unnèrstaun462.93
seiven347.2
axed347.2
e347.2
then347.2
athin347.2
time347.2
athoot347.2
to347.2
sarvices347.2
anent347.2
guidin347.2
vans347.2
hoosin347.2
motors347.2
thon347.2
replenishin347.2
oots347.2
nurse347.2
darg347.2
ins347.2
wheelchyre347.2
laid347.2
abune347.2
en347.2
ulster-scots347.2
wun347.2
thïngs347.2
irnae347.2
baith347.2
coontit347.2
paurents347.2
freens347.2
daein347.2
twa347.2
days347.2
gcses347.2
parteeklars347.2
kin347.2
levels347.2
airtin347.2
gethert347.2
releegion347.2
uised347.2
lippent347.2
blak347.2
pull347.2
why347.2
thar347.2
shair347.2
throu347.2
€“347.2
restrickit347.2
ruim347.2
cum347.2
iverie347.2
warkers347.2
leeves347.2
condeetions347.2
age347.2
merriet347.2
ayther347.2
daily231.47
columns231.47
gate231.47
daeins231.47
ootfit231.47
lectric231.47
matthew231.47
she231.47
wuid231.47
axes231.47
kennin231.47
methody231.47
poust231.47
seek231.47
degree231.47
richt-oot231.47
€™s-erran231.47
ether231.47
ilka-day231.47
chloe231.47
pall231.47
retail231.47
lowse231.47
gates231.47
whan231.47
mental231.47
houlin231.47
lang-tholed231.47
daes231.47
gettin231.47
merchant231.47
syndrome231.47
£231.47
dïd231.47
haein231.47
held231.47
haetae231.47
fu231.47
pictèr231.47
gas231.47
roman231.47
cses231.47
noo231.47
gangin231.47
lowsed231.47
gan231.47
soartins231.47
syne231.47
cums231.47
haed231.47
btec231.47
biggin231.47
structur231.47
sindèrt231.47
on231.47
main231.47
sophie231.47
general231.47
plenish231.47
craft231.47
o-levels231.47
city231.47
convoy231.47
guilds231.47
airlannn231.47
yinst231.47
unnèr231.47
twinit231.47
convenerie231.47
releegious231.47
stannin231.47
caithlick231.47
heerin231.47
prisbetarian231.47
sinn-pooert231.47
flatches231.47
day-231.47
trystit231.47
foondin231.47
rairin231.47
raa231.47
ivver231.47
gye231.47
airtit231.47
leuk231.47
settin231.47
guid231.47
sennichts231.47
sicht231.47
furst-heicht231.47
memmer231.47
previous231.47
coontin231.47
van231.47
james231.47
scotch231.47
whaniver231.47
ware231.47
blank231.47
apairt231.47
uisin231.47
cannae231.47
whyles231.47
luk231.47
goverment231.47
locyal231.47
forces231.47
airmed231.47
motor231.47
tinints231.47
gang231.47
nixt231.47
less231.47
bin231.47
plaise231.47
freepost231.47
screengin231.47
thenks231.47
raisons231.47
genèral231.47
skaith231.47
ken231.47
nummerin231.47
makkin231.47
apen231.47
their231.47
want231.47
keepin231.47
six231.47
fects231.47
alane231.47
stey231.47
leevin231.47
billies231.47
fin231.47
ootlays231.47
soartin231.47
pyes231.47
primary231.47
mary231.47
fower231.47
next231.47
skip231.47
content231.47
letters231.47
ospittle231.47
insense231.47
guide231.47
'at231.47
pick231.47
sarvice231.47