A Corpus of 21st Century Scots Texts

Ultach

Basic Stats

Total words by this author in corpus - 298
Total unique words used by this author in corpus - 183
Ratio of total words to unique words - 1.628
Tagged as BUL (Ballymena Ulster (Mid Antrim)) dialect.
Top ten most common words - the, an, o, a, frae, his, ye, it, aye, are
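
As a rough illustration of how figures like these can be derived, the sketch below counts total words, unique words, their ratio and the most common words for a piece of text. The function name `basic_stats` and the tokenisation rule (lowercased runs of letters, so an apostrophe splits a word in two) are assumptions made for this example; the corpus's own tokeniser is not described on this page.

```python
import re
from collections import Counter


def basic_stats(text, top_n=10):
    """Return (total words, unique words, total/unique ratio, top words)."""
    # Assumed tokenisation: lowercase, then take runs of letters only.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    counts = Counter(tokens)
    total = len(tokens)
    unique = len(counts)
    ratio = round(total / unique, 3) if unique else float("nan")
    top = [word for word, _ in counts.most_common(top_n)]
    return total, unique, ratio, top


# Made-up sample line, not taken from the corpus.
print(basic_stats("the cat an the dog an the coo"))
# -> (8, 5, 1.6, ['the', 'an', 'cat', 'dog', 'coo'])

# The ratio reported above follows from the two totals: 298 / 183.
print(round(298 / 183, 3))  # -> 1.628
```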

List of texts in corpus

Midgates
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (122 words)

The Gra They Hae in Americae
Reddit (01-11-2020) in Ulster dialect (BUL), categorised as poetry (99 words)

Author word Keyness frequencies

This table should list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter, and (c) words specific to the regional dialect.
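
This page does not say which keyness statistic is used. One common choice is the log-likelihood (G2) score, and the sketch below assumes it purely for illustration; the function name `log_likelihood` and the reference-corpus figures in the usage lines are hypothetical. The score compares a word's observed count in the author's texts and in the rest of the corpus with the counts expected if the word were used at the same rate in both.

```python
import math


def log_likelihood(a, b, c, d):
    """G2 score for a word seen `a` times in a target text of `c` words
    and `b` times in a reference corpus of `d` words."""
    # Expected counts if the word were equally frequent in both.
    e1 = c * (a + b) / (c + d)
    e2 = d * (a + b) / (c + d)

    def term(observed, expected):
        # 0 * log(0) is taken as 0, so a word that is absent from one
        # side still gets a finite score.
        return observed * math.log(observed / expected) if observed > 0 else 0.0

    return 2 * (term(a, e1) + term(b, e2))


# Hypothetical reference figures: a word used 2 times in 298 words by the
# author and 5 times in 1,000,000 words by everyone else.
print(log_likelihood(2, 5, 298, 1_000_000))

# A word used at the same rate in both scores zero.
print(log_likelihood(10, 10, 100, 100))  # -> 0.0
```

A higher score means a larger gap between the two rates. The score on its own does not say in which direction the gap runs, so over-use and under-use need to be told apart by comparing the rates directly.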
Word    Count  Normalised per million  Keyness
heth        2                6,711.41   27.083
boady       2                6,711.41   17.555
tide        2                6,711.41   17.555
frae        6               20,134.23   17.494
are         5               16,778.52   16.629
aye         5               16,778.52   15.672
sicna       2                6,711.41   14.678
taen        3               10,067.11   14.094
whit        5               16,778.52   11.835
nor         3               10,067.11   10.654
maks        2                6,711.41   10.418
sae         4               13,422.82   10.169
whiles      2                6,711.41    9.915
dinna       2                6,711.41    7.017
til         2                6,711.41    5.673
yer         3               10,067.11    5.447
ye          5               16,778.52    4.601
his         5               16,778.52    4.229
aw          3               10,067.11    4.044
fowk        2                6,711.41    2.963
tae         3               10,067.11    2.268
but         3               10,067.11    2.002
is          4               13,422.82    1.904
ower        2                6,711.41    1.736
yiz         2                6,711.41      nan
an         11               36,912.75    1.396
o           9               30,201.34    1.224
the        21               70,469.80    0.773
puzhin      2                6,711.41      nan
a           6               20,134.23    0.993
he          4               13,422.82    0.660
it          5               16,778.52    0.545
at          3               10,067.11    0.542
they        2                6,711.41    0.517
s           3               10,067.11    0.223
be          2                6,711.41    0.149
in          4               13,422.82    0.091
for         2                6,711.41    0.059
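
The normalised figures are consistent with each word's count divided by the author's total of 298 words and scaled to one million. A minimal sketch of that arithmetic follows; the function name `per_million` is an assumption for this example.

```python
def per_million(count, total_words):
    """Scale a raw word count to a rate per million words."""
    return count / total_words * 1_000_000


# Counts from the table, with the author's total of 298 words.
print(f"{per_million(2, 298):,.2f}")   # -> 6,711.41
print(f"{per_million(6, 298):,.2f}")   # -> 20,134.23
print(f"{per_million(21, 298):,.2f}")  # -> 70,469.80
```

With so few words in total, a single occurrence moves the rate by roughly 3,356 per million, so the normalised values for this author are coarse.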