A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- fine grain dialect comparison - Venn diagrams - punctuation analysis - chronology -

Ultach

Basic Stats

Total words by this author in corpus - 298
Total unique words used by this author in corpus - 183
Ratio of total words to unique words - 1.628
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - the, an, o, a, frae, his, ye, it, aye, are,

List of texts in corpus

Midgates
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (122 words)

The Gra They Hae in Americae
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (99 words)

Author word Keyness frequencies

This should list the words that the author uses in a disproportionate manner more often than other writers in the corpus. This may include (a) proper nouns (character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
WordCount Normalised
per million
Keyness
heth2 6,711.4127.106
boady2 6,711.4117.578
tide2 6,711.4117.578
frae6 20,134.2317.546
are5 16,778.5216.591
aye5 16,778.5215.688
sicna2 6,711.4114.701
taen3 10,067.1114.052
whit5 16,778.5211.793
nor3 10,067.1110.678
maks2 6,711.4110.441
sae4 13,422.8210.188
whiles2 6,711.419.885
dinna2 6,711.417.031
til2 6,711.415.688
yer3 10,067.115.454
ye5 16,778.524.623
his5 16,778.524.248
aw3 10,067.114.041
fowk2 6,711.412.924
tae3 10,067.112.266
but3 10,067.112.007
is4 13,422.821.912
ower2 6,711.411.741
yiz2 6,711.41nan
an11 36,912.751.410
o9 30,201.341.200
the21 70,469.800.768
puzhin2 6,711.41nan
a6 20,134.230.986
he4 13,422.820.669
it5 16,778.520.553
at3 10,067.110.535
they2 6,711.410.518
s3 10,067.110.228
be2 6,711.410.152
in4 13,422.820.095
for2 6,711.410.060