A Corpus of 21st Century Scots Texts

Ultach

Basic Stats

Total words by this author in corpus - 298
Total unique words used by this author in corpus - 183
Ratio of total words to unique words - 1.628
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - the, an, o, a, frae, his, ye, it, aye, are
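
As a rough illustration of how figures like these can be derived, the sketch below counts total words, unique words, their ratio, and the most common words for a piece of text. The function name `basic_stats` and the tokenisation rule (lowercase, runs of letters only, so apostrophes split a word) are assumptions made for the example; the corpus's own tokeniser is not described on this page and may differ.

```python
import re
from collections import Counter


def basic_stats(text, top_n=10):
    """Return word totals, the total/unique ratio and the most common words.

    Hypothetical helper: the tokenisation here is an assumption,
    not the corpus's documented method.
    """
    # Lowercase the text and keep runs of letters as tokens.
    tokens = re.findall(r"[a-z]+", text.lower())
    counts = Counter(tokens)
    total = len(tokens)
    unique = len(counts)
    # Ratio of total words to unique words, rounded to three decimals.
    ratio = round(total / unique, 3) if unique else 0.0
    top = [word for word, _ in counts.most_common(top_n)]
    return {"total": total, "unique": unique, "ratio": ratio, "top": top}


# The ratio reported above follows directly from the two totals.
print(round(298 / 183, 3))  # 1.628
```

Called on the full text of an author's works, `basic_stats(text)["ratio"]` would correspond to the "Ratio of total words to unique words" line, provided the tokenisation matches.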

List of texts in corpus

Midgates
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (122 words)

The Gra They Hae in Americae
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (99 words)

Author word Keyness frequencies

This table is intended to list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) words related to the specific subject matter, and (c) words specific to the regional dialect.
Word      Count   Normalised per million   Keyness
yiz           2                 6,711.41    28.887
heth          2                 6,711.41    27.161
tide          2                 6,711.41    17.392
boady         2                 6,711.41    17.392
frae          6                20,134.23    17.383
are           5                16,778.52    16.882
aye           5                16,778.52    15.352
sicna         2                 6,711.41    14.756
taen          3                10,067.11    14.088
whit          5                16,778.52    11.957
nor           3                10,067.11    10.574
maks          2                 6,711.41    10.475
sae           4                13,422.82    10.157
whiles        2                 6,711.41     9.888
dinna         2                 6,711.41     7.052
til           2                 6,711.41     5.620
yer           3                10,067.11     5.467
ye            5                16,778.52     4.626
his           5                16,778.52     4.197
aw            3                10,067.11     4.135
fowk          2                 6,711.41     2.884
tae           3                10,067.11     2.297
but           3                10,067.11     2.040
is            4                13,422.82     1.914
ower          2                 6,711.41     1.743
an           11                36,912.75     1.308
o             9                30,201.34     1.173
a             6                20,134.23     0.999
he            4                13,422.82     0.646
puzhin        2                 6,711.41       nan
the          21                70,469.80     0.819
it            5                16,778.52     0.575
at            3                10,067.11     0.530
they          2                 6,711.41     0.504
s             3                10,067.11     0.257
be            2                 6,711.41     0.153
in            4                13,422.82     0.097
for           2                 6,711.41     0.061
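
The "Normalised per million" figures are consistent with dividing each word's count by the author's total of 298 words and scaling to one million. The sketch below shows that arithmetic; the function name `per_million` is a hypothetical choice for the example. The keyness score is not reproduced here, because this page does not state which formula the corpus uses.

```python
def per_million(count, total_words):
    """Scale a raw word count to occurrences per million words."""
    return round(count / total_words * 1_000_000, 2)


AUTHOR_TOTAL = 298  # total words by this author, from Basic Stats

# Counts taken from the table: "yiz" (2), "frae" (6), "the" (21).
for word, count in [("yiz", 2), ("frae", 6), ("the", 21)]:
    print(word, f"{per_million(count, AUTHOR_TOTAL):,.2f}")
```

With so small a total, each single occurrence contributes roughly 3,356 per million, which is why the normalised values fall on only a handful of distinct steps.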