A Corpus of 21st Century Scots Texts

Intro a b c d e f g h i j k l m n o p q r s t u v w x y z Texts Writers Statistics Top200 Search Compare

. .Previous author - Next author
- basic details - dialect comparison - Venn diagrams - punctuation analysis - chronology -

Elphinstone Institute

Lexicon overlap between author and dialects

This is how much the author's top 200 most frequently used words overlaps with each major dialects top 200 words
An overlap of more than 50% is a pretty good match. Because of the nature of the top 200 words, almost no-one has an overlap of more than 70%, they'd have to be writing about a really broad range of things, like body parts, working, playing, thinking, governance, and so on.
On average Scots writers overlap with English by about 27%, so this is perhaps an indication on where on the Scots - Scottish English spectrum a writer's lexicon lies.

Dialect groupoverlap
Central27.5%
Doric27.0%
Ulster24.0%
Shetland21.0%
Orkney23.0%
Southern23.5%
English19.0%

Fine-grain dialect

DialectoverlapDescription
ORK23.0%Orkney
SHD21.0%Shetland
TON0.0%Tonge
NNB17.0%North Northern B (Caithness)
NNA7.0%North Northern A (Black Isle)
MNA26.5%Mid Northern A
MNB21.5%Mid Northern B
SNO24.0%South Northern
ABN29.5%Aberdeen
DOR22.5%General Northern
NEC29.5%North East Central
SEC27.5%(South) East Central
WCE24.0%West Central
DUN27.5%Dundee
EDN25.0%Edinburgh
GLA20.0%Glasgow
AYR23.0%Ayrshire
LAL29.5%General Central
SEA20.0%South East (Borders)
SWE28.0%South West (Galloway)
SOU0.0%General Southern
DUL1.5%Donegal (East Donegal)
WUL20.0%West Ulster (Letterkenny / L'Derry)
CUL19.5%Coleraine Ulster (North Antrim)
BUL19.5%Ballymena Ulster (Mid Antrim)
SUL12.5%South Antrim (Between Sixmilewater and Belfast)
BEL0.0%Eastern Ulster (Belfast)
PUL23.5%Peninsular Ulster (Ards)
EUL22.0%East Antrim (Larne)
GUL25.0%General Ulster
SYN19.0%Synthetic (no region)