A Corpus of 21st Century Scots Texts


Ultach

Basic Stats

Total words by this author in corpus - 298
Total unique words used by this author in corpus - 183
Ratio of total words to unique words - 1.628
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - the, an, o, a, frae, his, ye, it, aye, are
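As a rough illustration of how the figures above relate to one another, the sketch below derives the total, the unique-word count, their ratio and the most common words from a list of tokens. The function name `basic_stats` is hypothetical, and the corpus's own tokenisation rules are not stated on this page, so the input is assumed to be already split into lower-cased words.

```python
from collections import Counter


def basic_stats(tokens, top_n=10):
    """Return (total words, unique words, total/unique ratio, most common words).

    `tokens` is assumed to be a list of already-tokenised, lower-cased words;
    how the corpus itself tokenises text is not specified here.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    unique = len(counts)
    ratio = round(total / unique, 3) if unique else float("nan")
    top = [word for word, _ in counts.most_common(top_n)]
    return total, unique, ratio, top


# The published ratio follows directly from the published totals.
print(round(298 / 183, 3))  # 1.628
```

Ties in the most-common list (for example, several words with a count of 5) are returned in first-seen order by `Counter.most_common`, which may differ from the ordering the site uses.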

List of texts in corpus

Midgates
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (122 words)

The Gra They Hae in Americae
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (99 words)

Author word Keyness frequencies

This table is intended to list the words that the author uses disproportionately more often than other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter, and (c) words specific to the regional dialect.
Word      Count   Normalised per million   Keyness
heth          2                 6,711.41    27.104
boady         2                 6,711.41    17.576
tide          2                 6,711.41    17.576
frae          6                20,134.23    17.431
are           5                16,778.52    16.834
aye           5                16,778.52    15.334
sicna         2                 6,711.41    14.699
taen          3                10,067.11    14.049
whit          5                16,778.52    11.874
nor           3                10,067.11    10.619
maks          2                 6,711.41    10.439
sae           4                13,422.82    10.159
whiles        2                 6,711.41     9.918
dinna         2                 6,711.41     7.006
til           2                 6,711.41     5.676
yer           3                10,067.11     5.532
ye            5                16,778.52     4.667
his           5                16,778.52     4.193
aw            3                10,067.11     4.081
fowk          2                 6,711.41     2.890
tae           3                10,067.11     2.290
but           3                10,067.11     2.041
is            4                13,422.82     1.911
ower          2                 6,711.41     1.728
yiz           2                 6,711.41       nan
an           11                36,912.75     1.342
o             9                30,201.34     1.186
the          21                70,469.80     0.752
puzhin        2                 6,711.41       nan
a             6                20,134.23     0.971
he            4                13,422.82     0.644
it            5                16,778.52     0.575
at            3                10,067.11     0.522
they          2                 6,711.41     0.509
s             3                10,067.11     0.257
be            2                 6,711.41     0.150
in            4                13,422.82     0.096
for           2                 6,711.41     0.054
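
The "Normalised per million" figures are consistent with dividing each word's count by this author's 298-word total and scaling to one million. A minimal sketch of that arithmetic follows; the helper name `per_million` is hypothetical, and the keyness calculation is not reproduced because its formula is not given on this page.

```python
def per_million(count, total_words):
    """Scale a raw word count to a frequency per one million words."""
    return round(count / total_words * 1_000_000, 2)


TOTAL_WORDS = 298  # total words by this author in the corpus

# A few (word, count) pairs taken from the keyness table.
for word, count in [("heth", 2), ("frae", 6), ("the", 21)]:
    print(f"{word:<6}{count:>3}{per_million(count, TOTAL_WORDS):>12,.2f}")
```

With so small a sample, a single extra occurrence shifts the normalised figure by roughly 3,356 per million, so the values are best read as relative rankings within this author's texts.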