A Corpus of 21st Century Scots Texts


Dee Har

Basic Stats

Total words by this author in corpus - 109
Total unique words used by this author in corpus - 76
Ratio of total words to unique words - 1.434
Tagged as GUL (General Ulster) dialect.
Top ten most common words - tha, as, a, tae, the, in, an, troot, he, fer
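
The figures above can be reproduced from a tokenised text with a few lines of Python. This is only a minimal sketch: the function name `basic_stats` and the toy token list are illustrative assumptions, and the corpus site's own tokenisation rules (case folding, punctuation handling) are not stated on this page.

```python
from collections import Counter


def basic_stats(tokens):
    """Return (total words, unique words, total/unique ratio, top ten words).

    `tokens` is assumed to be an already-tokenised list of words; how the
    corpus itself splits text into words is not documented here.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    unique = len(counts)
    ratio = total / unique if unique else 0.0
    return total, unique, ratio, counts.most_common(10)


# Hypothetical toy input, not taken from the corpus text.
total, unique, ratio, top = basic_stats(["a", "b", "a", "c"])
print(total, unique, round(ratio, 3))  # 4 3 1.333

# Ratio check using the totals reported on this page (109 words, 76 unique).
print(round(109 / 76, 3))  # 1.434
```

The last line confirms that the reported ratio of 1.434 is simply total words divided by unique words.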

List of texts in corpus

Tha Broon Troot
Facebook (2020-03-07) in Ulster dialect (GUL), categorised as poetry (109 words)

Author word Keyness frequencies

This section should list the words that the author uses disproportionately often compared with other writers in the corpus. These may include (a) proper nouns (such as character names in the author's stories), (b) other words related to the specific subject matter and (c) words specific to the regional dialect.
Word    Count   Normalised per million   Keyness
tha     8       73,394.50                56.511
troot   3       27,522.94                35.573
flea    2       18,348.62                29.967
fer     2       18,348.62                13.030
as      5       45,871.56                11.708
get     2       18,348.62                 6.866
oan     2       18,348.62                 6.647
doon    2       18,348.62                 5.055
him     2       18,348.62                 4.346
the     4       36,697.25                 1.053
tae     4       36,697.25                 1.007
he      2       18,348.62                 0.898
in      3       27,522.94                 0.826
a       4       36,697.25                 0.195
an      3       27,522.94                 0.020
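
The normalised column follows directly from the counts and the author's 109-word total. The page does not say which keyness statistic is used, so the sketch below assumes a log-likelihood (G2) measure, one common choice in corpus linguistics. The function names and the reference-corpus numbers passed to `log_likelihood` are hypothetical, so its output is not expected to match the Keyness column.

```python
import math

AUTHOR_TOTAL = 109  # total words by this author, as reported on this page


def per_million(count, total_words):
    """Scale a raw count to occurrences per million words."""
    return count / total_words * 1_000_000


def log_likelihood(a, b, c, d):
    """Log-likelihood (G2) keyness for one word. Assumed measure, see above.

    a: count in the author's texts    c: total words in the author's texts
    b: count in the reference corpus  d: total words in the reference corpus
    """
    expected_author = c * (a + b) / (c + d)
    expected_reference = d * (a + b) / (c + d)
    g2 = 0.0
    if a > 0:
        g2 += a * math.log(a / expected_author)
    if b > 0:
        g2 += b * math.log(b / expected_reference)
    return 2 * g2


# Reproduces the normalised values for "tha" (8) and "troot" (3).
print(f"{per_million(8, AUTHOR_TOTAL):,.2f}")  # 73,394.50
print(f"{per_million(3, AUTHOR_TOTAL):,.2f}")  # 27,522.94

# Hypothetical reference-corpus figures, for illustration only.
print(log_likelihood(8, 500, AUTHOR_TOTAL, 1_000_000) > 0)  # True
```

Under this measure, a word whose relative frequency is the same in both samples scores 0, and the score grows as the author's usage departs from the rest of the corpus.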