A Corpus of 21st Century Scots Texts


Top 200 Words

Sortable List of Top 200 most frequently occurring words in Corpus

Below is a list of the 200 most frequently used words in the Corpus of 21st Century Scots Texts. For each word it gives the number of occurrences normalised per million words, the number of different authors who have used the word, and the number of occurrences normalised per million words in each dialect.
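The page does not spell out how the normalisation is computed. As a minimal sketch, assuming the usual corpus-linguistics convention (raw count divided by the size of the relevant corpus or dialect sub-corpus, multiplied by one million), the calculation might look like this. The function name and the figures are hypothetical and are not taken from the corpus.

```python
def per_million(raw_count: int, total_words: int) -> float:
    """Scale a raw word count to occurrences per million words.

    Assumption: this is the standard normalisation; the corpus page
    does not state its exact method.
    """
    if total_words <= 0:
        raise ValueError("total_words must be positive")
    # Multiply first so the intermediate value stays an exact integer.
    return raw_count * 1_000_000 / total_words


# Hypothetical example: a word seen 1,200 times in a 250,000-word sub-corpus.
rate = per_million(1_200, 250_000)
print(f"{rate:,.2f}")
```

Under this assumption, normalising lets frequencies be compared across dialect sub-corpora of different sizes, which is presumably why every dialect column is given per million words rather than as a raw count.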

Clicking on a column heading should sort the words by their frequency in that language variety.

A full dump of the raw word frequencies can be downloaded in CSV format (right-click the download link and choose "Save as").

| Word | Total Occurrences (normalised per million words) | Authors | Central / Lallans (normalised per million words) | Doric / Northern (normalised per million words) | Shetland (normalised per million words) | Orkney (normalised per million words) | Southern / Borders (normalised per million words) | Ulster (normalised per million words) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| the | 47,688.0 | 320 | 54,073.7 | 49,697.79 | 3,179.27 | 54,619.35 | 45,028.16 | 23,265.70 |
| a | 28,259.4 | 351 | 27,776.1 | 30,081.44 | 25,559.01 | 30,186.78 | 25,483.78 | 29,832.34 |
| an | 22,973.5 | 325 | 19,039.2 | 27,085.98 | 30,957.05 | 16,425.39 | 26,914.69 | 37,176.53 |
| tae | 20,555.1 | 331 | 22,408.1 | 17,138.31 | 7,222.99 | 22,514.50 | 19,249.11 | 23,618.03 |
| o | 17,581.8 | 311 | 16,814.4 | 17,336.14 | 16,645.54 | 19,393.83 | 17,659.22 | 23,575.51 |
| in | 13,873.9 | 344 | 14,795.2 | 13,770.00 | 10,248.58 | 14,035.41 | 11,481.33 | 10,934.27 |
| it | 10,592.2 | 316 | 12,112.5 | 10,528.51 | 6,915.63 | 2,846.66 | 5,825.84 | 7,660.07 |
| wis | 9,054.9 | 245 | 9,650.4 | 10,036.45 | 16,069.23 | 8,951.00 | 5,087.67 | 103.27 |
| that | 8,679.4 | 291 | 10,813.7 | 5,344.15 | 461.04 | 8,402.98 | 6,859.27 | 7,435.31 |
| and | 8,534.8 | 288 | 10,924.9 | 4,692.30 | 3,995.70 | 19,972.29 | 4,099.66 | 1,044.83 |
| he | 8,064.5 | 248 | 8,233.0 | 8,332.00 | 10,882.51 | 9,986.15 | 1,726.17 | 7,040.46 |
| wi | 6,417.9 | 278 | 5,682.3 | 7,824.73 | 8,615.72 | 6,165.23 | 6,518.58 | 7,052.61 |
| at | 6,230.6 | 321 | 5,934.6 | 6,196.37 | 11,113.03 | 5,586.76 | 6,768.42 | 5,345.64 |
| ah | 6,042.4 | 129 | 7,856.0 | 4,735.42 | 96.05 | 106.56 | 7,052.33 | 1,579.39 |
| ye | 5,996.7 | 226 | 5,922.9 | 8,233.08 | - | 1,507.06 | 3,395.57 | 8,152.11 |
| is | 5,932.6 | 311 | 6,418.5 | 5,232.55 | 6,723.53 | 7,383.05 | 6,677.57 | 2,599.93 |
| on | 5,905.3 | 300 | 5,571.9 | 6,675.75 | 5,878.28 | 7,596.17 | 4,860.54 | 6,384.40 |
| i | 5,566.1 | 262 | 4,248.1 | 6,508.35 | 15,973.18 | 15,846.92 | 5,848.55 | 2,053.21 |
| as | 5,516.4 | 307 | 5,884.5 | 5,062.61 | 7,222.99 | 5,404.09 | 5,144.45 | 3,091.97 |
| his | 5,008.3 | 245 | 4,840.7 | 6,688.43 | 5,407.64 | 6,515.35 | 3,702.19 | 2,047.14 |
| ma | 4,803.1 | 213 | 5,451.6 | 5,521.69 | 67.24 | 106.56 | 6,416.37 | 2,375.17 |
| be | 4,730.6 | 301 | 5,028.7 | 4,403.15 | 5,570.92 | 4,795.18 | 1,601.25 | 4,464.83 |
| she | 4,670.0 | 198 | 5,439.9 | 4,981.45 | 278.55 | 669.80 | 7,006.90 | 1,451.83 |
| oot | 4,496.2 | 296 | 4,101.9 | 5,415.17 | 4,648.84 | 4,886.51 | 3,531.84 | 5,424.61 |
| her | 4,390.4 | 220 | 4,410.2 | 6,041.65 | 5,129.09 | 5,815.10 | 1,555.83 | 771.47 |
| for | 4,387.9 | 258 | 4,736.2 | 4,070.89 | 4,523.97 | 2,663.99 | 1,941.95 | 4,525.57 |
| up | 4,244.3 | 299 | 4,216.3 | 4,895.21 | 4,379.90 | 2,618.32 | 4,099.66 | 3,529.34 |
| said | 4,176.8 | 190 | 4,427.8 | 3,036.04 | 7,501.54 | 1,765.84 | 5,610.07 | 3,177.01 |
| but | 4,162.9 | 278 | 5,018.7 | 1,572.55 | 3,534.65 | 3,775.25 | 3,906.60 | 4,835.38 |
| me | 3,933.4 | 264 | 4,244.8 | 2,549.06 | 5,455.66 | 10,397.16 | 1,919.23 | 2,520.96 |
| this | 3,718.8 | 265 | 4,563.2 | 2,384.20 | 422.62 | 4,003.59 | 3,384.21 | 2,927.96 |
| s | 3,675.1 | 232 | 3,676.5 | 3,061.41 | 2,141.92 | 2,694.43 | 3,827.11 | 6,414.77 |
| they | 3,587.7 | 246 | 4,135.3 | 3,332.80 | 297.76 | 3,546.91 | 3,338.78 | 2,448.06 |
| aboot | 3,324.9 | 272 | 3,149.1 | 3,789.35 | 4,418.32 | 3,029.33 | 2,861.82 | 3,164.86 |
| no | 3,242.5 | 257 | 3,964.8 | 623.95 | 3,842.02 | 6,241.34 | 3,009.45 | 2,812.54 |
| tha | 3,088.5 | 38 | 17.6 | 5.07 | - | - | 11.36 | 37,626.05 |
| da | 3,010.0 | 101 | 376.9 | 286.61 | 52,366.68 | 30.45 | 317.98 | 85.04 |
| fae | 2,945.0 | 239 | 3,053.0 | 3,302.36 | 3,976.49 | 3,668.69 | 760.88 | 1,530.80 |
| nae | 2,929.1 | 250 | 2,153.7 | 6,082.23 | 2,305.21 | 380.57 | 3,145.73 | 2,308.35 |
| or | 2,874.4 | 267 | 2,847.4 | 2,505.94 | 2,487.71 | 3,455.57 | 2,521.12 | 4,155.02 |
| we | 2,857.0 | 260 | 3,196.7 | 2,125.48 | 3,630.71 | 3,592.58 | 1,987.37 | 1,822.38 |
| aw | 2,745.8 | 148 | 4,310.0 | 172.47 | 67.24 | - | 3,020.80 | 182.24 |
| him | 2,736.3 | 207 | 2,766.3 | 3,132.43 | 4,879.36 | 2,237.75 | 1,340.06 | 1,160.25 |
| like | 2,733.9 | 253 | 2,805.6 | 4,261.12 | 297.76 | 2,740.10 | 840.37 | 1,105.58 |
| hae | 2,573.4 | 231 | 2,359.3 | 1,876.92 | 2,420.47 | 1,430.94 | 2,918.60 | 6,165.71 |
| wee | 2,570.9 | 227 | 2,979.4 | 2,308.10 | - | 106.56 | 2,498.41 | 2,879.36 |
| fur | 2,483.5 | 144 | 2,185.5 | 2,148.31 | 1,776.93 | 4,353.72 | 3,679.48 | 4,513.42 |
| scots | 2,451.2 | 138 | 3,612.9 | 641.70 | 259.34 | 1,111.26 | 567.82 | 1,269.59 |
| whit | 2,408.0 | 198 | 3,145.8 | 225.74 | 4,581.60 | 3,425.13 | 2,907.24 | 224.76 |
| yer | 2,406.0 | 190 | 2,061.8 | 3,715.79 | 38.42 | 608.91 | 2,169.07 | 4,112.50 |
| jist | 2,347.4 | 173 | 2,389.4 | 3,913.63 | - | 213.12 | 1,623.97 | 1,014.46 |
| doon | 2,278.9 | 238 | 1,792.7 | 3,216.13 | 2,420.47 | 2,466.09 | 2,430.27 | 3,322.80 |
| bit | 2,268.9 | 240 | 1,534.4 | 4,545.19 | 3,313.74 | 4,764.73 | 1,726.17 | 789.70 |
| you | 2,243.1 | 200 | 2,195.5 | 2,295.42 | 5,388.43 | 3,257.68 | 420.19 | 1,044.83 |
| aye | 2,186.0 | 236 | 2,318.4 | 2,417.17 | 1,210.24 | 1,552.72 | 1,487.69 | 1,913.50 |
| there | 2,163.1 | 229 | 2,641.8 | 1,973.30 | 220.92 | 2,009.41 | 1,839.74 | 601.39 |
| if | 2,038.5 | 235 | 2,112.8 | 1,876.92 | 2,987.17 | 1,720.17 | 1,737.53 | 1,573.32 |
| day | 2,036.5 | 259 | 1,544.5 | 3,548.39 | 2,305.21 | 2,283.42 | 1,726.17 | 1,889.20 |
| back | 1,966.4 | 234 | 2,068.5 | 2,303.03 | 1,901.80 | 2,405.20 | 1,987.37 | 273.36 |
| oan | 1,887.0 | 116 | 2,294.1 | 1,235.22 | - | 121.78 | 4,122.38 | 1,190.62 |
| ower | 1,884.0 | 229 | 1,673.2 | 2,543.99 | 2,526.13 | 1,796.29 | 1,544.47 | 1,646.22 |
| time | 1,880.5 | 261 | 1,885.4 | 2,021.49 | 2,286.00 | 1,826.73 | 1,726.17 | 1,354.63 |
| aa | 1,866.1 | 144 | 828.2 | 3,477.37 | 5,109.88 | 1,019.93 | 1,112.93 | 4,240.07 |
| mair | 1,829.9 | 222 | 1,854.5 | 1,808.44 | 1,584.83 | 76.11 | 1,521.76 | 2,721.42 |
| see | 1,826.4 | 253 | 1,798.5 | 2,064.61 | 2,276.39 | 1,689.73 | 840.37 | 1,755.56 |
| them | 1,801.0 | 204 | 1,867.1 | 2,673.34 | 192.10 | 1,933.29 | 511.04 | 886.89 |
| get | 1,799.1 | 218 | 2,008.3 | 1,930.18 | 883.66 | 1,217.82 | 1,385.48 | 996.23 |
| noo | 1,783.7 | 236 | 1,539.4 | 2,249.77 | 1,853.77 | 2,724.88 | 1,169.71 | 2,350.87 |
| it's | 1,770.2 | 146 | 1,942.3 | 2,051.93 | 1,046.95 | 487.13 | 590.53 | 1,445.75 |
| are | 1,671.4 | 213 | 1,921.4 | 1,752.64 | 509.07 | 1,507.06 | 1,385.48 | 613.53 |
| been | 1,650.5 | 226 | 1,972.4 | 1,339.21 | 1,738.51 | 1,978.96 | 953.94 | 242.98 |
| had | 1,641.1 | 169 | 2,371.0 | 314.51 | 374.60 | 289.23 | 249.84 | 1,597.62 |
| their | 1,586.0 | 197 | 1,723.3 | 2,079.83 | 124.87 | 761.14 | 1,442.26 | 735.03 |
| ae | 1,567.1 | 121 | 1,942.3 | 778.67 | 38.42 | 121.78 | 5,746.34 | 36.45 |
| can | 1,556.7 | 245 | 1,641.4 | 1,400.08 | 1,700.09 | 1,674.51 | 1,385.48 | 1,269.59 |
| dae | 1,538.3 | 192 | 1,943.1 | 380.46 | 48.03 | 761.14 | 1,499.05 | 2,642.45 |
| ken | 1,521.4 | 212 | 1,464.2 | 1,991.06 | 1,584.83 | 2,055.08 | 1,873.81 | 370.55 |
| intae | 1,502.5 | 193 | 1,807.7 | 1,019.62 | 614.72 | 1,659.28 | 1,305.99 | 1,044.83 |
| when | 1,455.3 | 176 | 1,965.7 | 296.76 | 874.06 | 2,435.65 | 1,839.74 | 291.58 |
| sae | 1,454.3 | 174 | 1,393.2 | 1,400.08 | 787.61 | 60.89 | 2,350.78 | 2,527.03 |
| aff | 1,441.4 | 209 | 1,393.2 | 1,600.45 | 1,978.64 | 1,050.37 | 579.18 | 1,688.74 |
| to | 1,436.0 | 205 | 1,476.8 | 1,511.68 | 2,276.39 | 1,248.27 | 1,056.15 | 704.65 |
| then | 1,417.6 | 200 | 1,598.0 | 951.14 | 86.45 | 1,826.73 | 1,760.24 | 1,719.11 |
| by | 1,388.3 | 227 | 1,727.5 | 783.74 | 931.69 | 1,461.39 | 806.31 | 941.56 |
| some | 1,363.9 | 234 | 1,401.5 | 1,402.62 | 1,421.55 | 1,994.18 | 1,237.85 | 777.55 |
| were | 1,356.0 | 178 | 1,628.9 | 1,179.42 | 422.62 | 867.70 | 1,601.25 | 449.52 |
| so | 1,347.6 | 215 | 1,261.1 | 1,285.94 | 3,352.16 | 3,196.78 | 420.19 | 613.53 |
| richt | 1,340.1 | 189 | 1,266.2 | 1,590.31 | 1,575.23 | - | 1,567.18 | 1,542.95 |
| here | 1,330.2 | 216 | 1,462.6 | 958.75 | 1,296.68 | 1,598.39 | 1,283.27 | 1,196.70 |
| fir | 1,321.2 | 117 | 1,204.3 | 1,739.96 | 3,227.29 | 730.69 | 1,203.78 | 261.21 |
| frae | 1,310.3 | 114 | 1,155.8 | 1,085.57 | 48.03 | - | 3,338.78 | 3,207.39 |
| awa | 1,271.6 | 185 | 980.3 | 2,163.53 | 1,258.26 | 106.56 | 1,215.14 | 1,755.56 |
| of | 1,270.1 | 216 | 1,380.7 | 1,100.79 | 1,498.39 | 989.48 | 1,601.25 | 662.13 |
| weel | 1,262.6 | 183 | 1,050.5 | 1,539.58 | 1,959.43 | 2,115.97 | 647.32 | 1,688.74 |
| got | 1,262.1 | 206 | 1,336.4 | 1,270.73 | 979.71 | 1,278.71 | 1,169.71 | 923.34 |
| oor | 1,259.6 | 191 | 1,510.2 | 973.97 | 67.24 | 182.67 | 1,340.06 | 1,263.52 |
| wid | 1,218.9 | 170 | 945.2 | 1,805.90 | 3,966.88 | 2,115.97 | 488.33 | 97.19 |
| guid | 1,212.9 | 167 | 1,401.5 | 469.23 | 1,431.15 | 274.01 | 1,283.27 | 1,822.38 |
| fit | 1,195.6 | 149 | 239.9 | 5,027.10 | 345.78 | 137.01 | 476.97 | 309.80 |
| say | 1,194.6 | 210 | 1,140.0 | 1,169.27 | 1,373.52 | 1,111.26 | 1,658.04 | 1,324.26 |
| fowk | 1,189.6 | 128 | 1,216.8 | 1,417.84 | 9.61 | 15.22 | 306.62 | 2,132.18 |
| m | 1,149.9 | 136 | 1,107.4 | 583.37 | 720.38 | 685.03 | 397.47 | 3,675.13 |
| hid | 1,131.0 | 99 | 124.5 | 3,703.11 | 105.66 | 9,803.47 | 68.14 | 42.52 |
| ti | 1,108.6 | 40 | 1,441.7 | 842.08 | - | - | 1,987.37 | - |
| n | 1,107.2 | 91 | 1,090.7 | 623.95 | 1,613.65 | 121.78 | 45.43 | 3,025.15 |
| will | 1,098.2 | 178 | 1,267.0 | 1,187.02 | 931.69 | 867.70 | 476.97 | 188.31 |
| auld | 1,096.7 | 167 | 1,323.8 | 979.04 | 211.31 | 152.23 | 2,135.00 | 109.34 |
| us | 1,075.9 | 182 | 1,430.0 | 568.15 | 115.26 | 608.91 | 897.16 | 607.46 |
| man | 1,074.9 | 174 | 1,043.0 | 580.83 | 1,911.40 | 1,126.49 | 1,192.42 | 1,877.05 |
| e | 1,069.9 | 56 | 112.8 | 4,859.70 | 57.63 | - | 919.87 | 97.19 |
| yin | 1,068.4 | 134 | 1,002.9 | 109.06 | - | 563.24 | 2,328.06 | 4,045.68 |
| its | 1,056.5 | 182 | 1,273.7 | 783.74 | 461.04 | 228.34 | 988.01 | 874.74 |
| twa | 1,046.1 | 183 | 850.8 | 1,775.46 | 979.71 | 1,172.15 | 317.98 | 1,099.50 |
| bi | 1,032.6 | 75 | 870.0 | 1,800.83 | 557.09 | 928.59 | 988.01 | 741.10 |
| afore | 1,032.1 | 195 | 839.1 | 1,278.33 | 1,498.39 | 1,217.82 | 1,408.19 | 1,275.67 |
| think | 1,007.8 | 189 | 1,040.5 | 1,199.71 | 374.60 | 1,111.26 | 783.59 | 789.70 |
| come | 984.0 | 193 | 880.9 | 1,123.62 | 1,940.22 | 1,004.70 | 783.59 | 892.97 |
| ither | 978.5 | 161 | 1,045.5 | 771.06 | 38.42 | 1,004.70 | 1,044.79 | 1,536.87 |
| heid | 971.1 | 182 | 933.5 | 1,065.28 | 144.08 | 654.58 | 1,351.41 | 1,463.98 |
| efter | 947.2 | 169 | 952.8 | 1,022.16 | 1,191.03 | 411.02 | 1,305.99 | 595.31 |
| hoose | 937.3 | 170 | 720.4 | 1,146.44 | 883.66 | 1,019.93 | 1,828.38 | 1,536.87 |
| than | 935.8 | 163 | 1,077.3 | 897.88 | 48.03 | 1,446.16 | 953.94 | 346.25 |
| ta | 931.3 | 57 | 44.3 | 27.90 | 17,212.23 | - | 56.78 | 85.04 |
| did | 927.8 | 181 | 969.5 | 991.72 | 883.66 | 776.36 | 1,078.86 | 479.89 |
| my | 924.9 | 170 | 703.7 | 1,255.51 | 3,573.08 | 334.90 | 261.20 | 656.06 |
| lang | 918.4 | 191 | 817.4 | 1,148.98 | 1,027.74 | 319.68 | 1,056.15 | 1,196.70 |
| d | 909.5 | 148 | 706.2 | 1,245.36 | 3,121.64 | 380.57 | 431.54 | 649.98 |
| even | 908.5 | 177 | 1,048.0 | 725.40 | 1,037.34 | 669.80 | 806.31 | 400.92 |
| muckle | 900.0 | 155 | 955.3 | 941.00 | 1,392.73 | 745.92 | 863.09 | 170.09 |
| could | 898.5 | 176 | 982.0 | 682.29 | 1,363.92 | 1,248.27 | 1,249.21 | 188.31 |
| th | 893.1 | 12 | 10.0 | 314.51 | - | - | 18,874.35 | - |
| was | 892.6 | 171 | 692.8 | 494.59 | 633.93 | 258.79 | 5,235.30 | 1,391.08 |
| ain | 882.6 | 175 | 922.7 | 834.47 | 528.28 | 15.22 | 931.23 | 1,251.37 |
| whan | 880.2 | 83 | 922.7 | 22.83 | 883.66 | 1,187.38 | 2,328.06 | 1,725.19 |
| ay | 852.8 | 58 | 1,108.2 | 608.73 | 57.63 | 15.22 | 1,067.50 | 303.73 |
| tak | 847.9 | 171 | 700.4 | 862.37 | 1,383.13 | 1,172.15 | 681.38 | 1,506.50 |
| pit | 822.0 | 181 | 856.6 | 862.37 | 739.59 | 517.57 | 295.27 | 929.41 |
| still | 816.1 | 207 | 866.7 | 697.50 | 941.29 | 1,187.38 | 840.37 | 492.04 |
| gie | 815.6 | 174 | 671.9 | 748.23 | 1,027.74 | 319.68 | 1,033.43 | 1,968.17 |
| first | 813.6 | 189 | 898.4 | 814.18 | 1,440.76 | 730.69 | 295.27 | 109.34 |
| thocht | 806.6 | 133 | 724.6 | 1,204.78 | 19.21 | - | 1,056.15 | 1,135.95 |
| lol | 796.2 | 13 | 1,321.3 | 15.22 | - | - | 147.63 | 18.22 |
| mak | 790.8 | 172 | 743.0 | 730.48 | 1,277.47 | 806.81 | 601.89 | 1,069.13 |
| big | 788.8 | 179 | 907.6 | 560.54 | 249.73 | 730.69 | 681.38 | 892.97 |
| again | 782.3 | 162 | 857.5 | 897.88 | 566.70 | 1,111.26 | 431.54 | 151.86 |
| cam | 781.3 | 116 | 713.7 | 989.19 | 2,161.13 | 928.59 | 442.90 | 24.30 |
| een | 780.8 | 154 | 392.0 | 1,843.95 | 1,824.96 | 608.91 | 340.69 | 704.65 |
| thaim | 770.4 | 69 | 737.1 | 2.54 | 28.82 | 350.12 | 715.45 | 3,517.19 |
| ah'm | 769.4 | 46 | 739.6 | 869.98 | 307.36 | 2,877.11 | 851.73 | 151.86 |
| tell | 768.9 | 182 | 743.8 | 730.48 | 854.85 | 593.69 | 783.59 | 1,050.91 |
| scotland | 759.5 | 133 | 1,128.3 | 294.22 | 96.05 | 167.45 | 295.27 | 97.19 |
| new | 753.5 | 188 | 736.3 | 829.40 | 758.80 | 1,035.15 | 908.51 | 498.12 |
| wan | 750.5 | 118 | 940.2 | 144.57 | 1,623.25 | 2,161.64 | 102.21 | 54.67 |
| te | 749.0 | 15 | 6.7 | 3,802.03 | - | - | 11.36 | - |
| fer | 748.5 | 32 | 648.5 | 948.61 | 28.82 | 1,552.72 | 283.91 | 1,378.93 |
| wae | 747.0 | 60 | 985.3 | 5.07 | 9.61 | 2,298.64 | 193.06 | 935.49 |
| says | 745.6 | 137 | 706.2 | 1,022.16 | 624.33 | 1,674.51 | 215.77 | 358.40 |
| thon | 743.1 | 117 | 651.9 | 573.22 | - | 30.45 | 317.98 | 2,794.31 |
| nicht | 733.1 | 130 | 588.4 | 1,410.23 | 249.73 | 30.45 | 635.96 | 801.85 |
| wey | 729.7 | 103 | 1,099.8 | 109.06 | 38.42 | 1,141.71 | 340.69 | 6.07 |
| that's | 716.2 | 105 | 927.7 | 215.59 | 38.42 | 928.59 | 499.68 | 838.29 |
| hame | 710.3 | 186 | 542.4 | 1,136.30 | 672.35 | 167.45 | 965.29 | 1,014.46 |
| ony | 707.8 | 143 | 653.6 | 956.21 | 1,171.81 | 319.68 | 590.53 | 431.30 |
| made | 705.8 | 173 | 753.8 | 834.47 | 268.94 | 137.01 | 613.25 | 601.39 |
| mind | 697.9 | 193 | 759.7 | 469.23 | 1,104.58 | 730.69 | 851.73 | 443.45 |
| dinnae | 691.4 | 111 | 836.6 | 172.47 | - | 15.22 | 1,124.28 | 1,354.63 |
| ane | 676.0 | 111 | 841.6 | 606.19 | 297.76 | 121.78 | 851.73 | 6.07 |
| year | 670.5 | 182 | 739.6 | 626.49 | 806.82 | 776.36 | 522.39 | 224.76 |
| go | 668.6 | 158 | 809.0 | 461.62 | 509.07 | 898.14 | 158.99 | 425.22 |
| thing | 645.2 | 169 | 665.3 | 575.76 | 816.43 | 867.70 | 283.91 | 662.13 |
| gaun | 643.2 | 93 | 753.0 | 654.39 | - | - | 1,169.71 | 200.46 |
| went | 635.8 | 142 | 679.5 | 370.31 | 38.42 | 45.67 | 1,181.07 | 1,275.67 |
| wus | 630.3 | 24 | 4.2 | 2.54 | - | - | 68.14 | 7,635.77 |
| haes | 625.3 | 75 | 619.3 | 20.29 | 778.01 | 487.13 | 340.69 | 2,229.38 |
| last | 624.4 | 181 | 619.3 | 659.46 | 653.14 | 730.69 | 692.74 | 479.89 |
| only | 623.4 | 135 | 653.6 | 530.10 | 826.03 | 806.81 | 760.88 | 352.33 |
| look | 622.9 | 134 | 702.0 | 694.97 | 768.40 | 91.34 | 522.39 | 48.60 |
| thair | 616.4 | 42 | 630.2 | - | 9.61 | 60.89 | 1,135.64 | 2,320.50 |
| seen | 606.5 | 172 | 592.5 | 740.62 | 509.07 | 745.92 | 488.33 | 455.59 |
| r | 605.0 | 64 | 328.4 | 65.95 | 672.35 | 121.78 | 227.13 | 4,258.29 |
| door | 601.5 | 130 | 600.9 | 801.50 | 624.33 | 715.47 | 681.38 | 24.30 |
| maist | 601.0 | 148 | 730.4 | 436.26 | 144.08 | 197.90 | 397.47 | 613.53 |
| how | 594.6 | 130 | 814.9 | 202.91 | 355.39 | 137.01 | 988.01 | 54.67 |
| anither | 594.6 | 131 | 551.6 | 669.60 | 19.21 | 974.26 | 670.03 | 899.04 |
| micht | 586.6 | 129 | 626.0 | 555.47 | 528.28 | - | 624.60 | 625.68 |
| folk | 580.1 | 123 | 590.9 | 436.26 | 326.57 | 1,233.05 | 420.19 | 832.22 |
| wur | 580.1 | 57 | 479.7 | 78.63 | 28.82 | 882.92 | 2,555.19 | 1,682.66 |
| till | 578.7 | 125 | 295.0 | 1,663.86 | 537.88 | 745.92 | 431.54 | 78.97 |
| dinna | 578.2 | 83 | 437.9 | 1,455.88 | 172.89 | 517.57 | 124.92 | 18.22 |
| bein | 577.7 | 147 | 687.0 | 474.30 | 605.12 | 91.34 | 420.19 | 291.58 |
| wir | 576.7 | 118 | 295.9 | 342.41 | 4,082.14 | 2,907.55 | 454.26 | 97.19 |
| language | 574.2 | 90 | 820.7 | 111.60 | 432.23 | 943.81 | 90.85 | 91.12 |
| need | 573.7 | 167 | 663.6 | 540.25 | 451.44 | 411.02 | 238.48 | 321.95 |
| he's | 571.7 | 85 | 617.6 | 563.08 | 576.30 | 882.92 | 56.78 | 407.00 |
| cannae | 569.2 | 118 | 697.0 | 124.28 | - | 15.22 | 874.44 | 1,123.80 |
| things | 568.2 | 139 | 593.4 | 509.81 | 797.22 | 1,004.70 | 238.48 | 382.70 |
| syne | 565.2 | 102 | 560.0 | 1,029.77 | 67.24 | - | 317.98 | 164.01 |
| kent | 564.3 | 134 | 511.5 | 748.23 | 874.06 | 578.47 | 1,022.08 | 60.75 |
| same | 563.8 | 162 | 623.5 | 504.74 | 115.26 | 274.01 | 647.32 | 625.68 |
| wha | 563.8 | 126 | 682.0 | 81.16 | 614.72 | 380.57 | 204.42 | 1,093.43 |