A Corpus of 21st Century Scots Texts


Ulster-Scots Language Society


Total words by this author in corpus - 44,666
Total unique words used by this author in corpus - 3,183
Ratio of total words to unique words - 14.033
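The ratio above is consistent with dividing the total word count by the unique word count and rounding to three decimal places. A minimal sketch of that arithmetic follows; the function name is hypothetical and is not taken from the corpus site.

```python
def words_per_unique_word(total_words: int, unique_words: int) -> float:
    """Return total words divided by unique words, rounded to 3 places.

    Hypothetical helper for illustration; the corpus site's own code is not shown.
    """
    if unique_words <= 0:
        raise ValueError("unique_words must be positive")
    return round(total_words / unique_words, 3)


# Figures reported for this author on the page.
ratio = words_per_unique_word(44_666, 3_183)
print(ratio)  # 14.033
```

A higher ratio means each distinct word is reused more often on average, so the figure depends heavily on how much text an author has in the corpus.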

Lexicon overlap between author and dialects

This is how much the author's 200 most frequently used words overlap with each major dialect's top 200 words.
An overlap of more than 50% is a pretty good match. Because of the nature of the top 200 words, almost no one has an overlap of more than 70%; to reach that, they would have to be writing about a really broad range of things, such as body parts, working, playing, thinking, governance, and so on.
On average, Scots writers overlap with English by about 27%, so this figure is perhaps an indication of where a writer's lexicon lies on the spectrum between Scots and Scottish English.
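One way such an overlap could be computed is sketched below, assuming the figure is the number of words shared between two top-200 lists, divided by 200. That assumption fits the reported percentages, which are all multiples of 0.5%, but the site's actual method, tokenisation, and tie-breaking are not documented here. The function names and the toy word lists are hypothetical.

```python
from collections import Counter


def top_words(tokens, n=200):
    """Return the set of the n most frequent tokens (lowercased).

    Assumption: ties are broken by first appearance, which is how
    Counter.most_common orders equal counts.
    """
    counts = Counter(token.lower() for token in tokens)
    return {word for word, _ in counts.most_common(n)}


def overlap_percent(author_top, dialect_top, n=200):
    """Percentage of the top-n list shared by the two word sets."""
    return 100.0 * len(author_top & dialect_top) / n


# Toy example with n=4 instead of 200, using made-up token lists.
author_tokens = "the the the wee wee hoose hoose ken ken aye".split()
dialect_tokens = "the the wee wee bairn bairn ken ken loch".split()

author_top = top_words(author_tokens, n=4)
dialect_top = top_words(dialect_tokens, n=4)
print(overlap_percent(author_top, dialect_top, n=4))  # 75.0
```

Under this assumption, an overlap of 52.0% would correspond to 104 shared words out of 200.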

Central overlap: 43.5%
Doric overlap: 39.5%
Ulster overlap: 52.0%
Shetland overlap: 33.5%
Orkney overlap: 33.0%
Southern overlap: 39.0%
English overlap: 17.0%