<!DOCTYPE html>
<html class="no-js">
<head lang="en">
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<title>
Joel Purra's master's thesis
</title>
<script>
(function() {
try {
var h = document.getElementsByTagName("html")[0];
h.className = ("" + h.className).replace("no-js", "js");
} catch (e) {}
}());
</script>
<link rel="stylesheet" type="text/css" href="http://fonts.googleapis.com/css?family=Quando" />
<style>
body {
padding-left: 10%;
padding-right: 10%;
}
h1,
h2,
h3,
h4,
h5,
h6 {
font-family: Quando, Helvetica, Verdana, sans-serif;
}
h2 {
margin-top: 2em;
border-bottom: 1px solid lightgrey;
}
.warning:before {
content: "Warning";
font-style: normal;
font-weight: bold;
font-size: 0.6em;
font-family: Helvetica, Verdana, sans-serif;
padding-left: 0.3em;
padding-right: 0.3em;
margin-right: 0.5em;
color: #514721;
background-color: #FFF6BF;
border: 1px solid #FFD324;
}
header {
margin-top: 2em;
margin-bottom: 5em;
}
footer {
margin-top: 5em;
margin-bottom: 2em;
}
h2 time {
font-size: small;
}
table {
width: 100%;
}
table col:nth-of-type(odd) {
background-color: #f6f6f6;
}
table col:nth-of-type(even) {
background-color: #f0f0f0;
}
table td,
table th {
vertical-align: top;
margin: 0;
}
table td.number {
text-align: right;
}
ul.no-bullets li,
ol.no-bullets li {
list-style-type: none;
}
ul.inline-menu {
margin: 0;
padding: 0;
}
ul.inline-menu li {
display: inline-block;
margin: 0;
padding: 0;
}
ul.inline-menu li:before {
content: " \00B7 ";
margin-left: 0.3em;
margin-right: 0.3em;
}
html.no-js .js-only {
display: none;
}
html.js .no-js-only {
display: none;
}
</style>
<style>
h1 small,
h2 small,
h3 small {
color: lightgrey;
font-size: 0.5em;
}
</style>
</head>
<body>
<header>
<h1>
Joel Purra's master's thesis
<small>work in progress</small>
</h1>
<nav>
<ul class="inline-menu">
<!--
<li>
<a href="./" rel="start">Home</a>
</li>
-->
<li>
<a href="http://joelpurra.com/projects/masters-thesis/" rel="home">Project page</a>
</li>
<li>
<a href="http://joelpurra.com/" rel="author me">joelpurra.com</a>
</li>
</ul>
</nav>
<hr />
</header>
<main>
<section>
<h1>
<em>Swedes Online: You Are More Tracked Than You Think</em>
</h1>
<p>
<em class="warning">The thesis is currently in the process of being researched and written!</em>
</p>
<h2>
Abstract
</h2>
<p>
When you are browsing websites, third-party resources record your online habits; such <em>tracking</em> can be considered an invasion of privacy. It was previously unknown how many third-party resources, trackers and tracker companies are present in the different classes of websites chosen: globally popular websites, random samples of .se/.dk/.com/.net domains and curated lists of websites of public interest in Sweden. The in-browser HTTP/HTTPS traffic was recorded while downloading over 150,000 websites, allowing comparison of HTTPS adoption and third-party tracking within and across the different classes of websites.
</p>
<p>
The data shows that known third-party resources, including known trackers, are present on <em>over 90%</em> of websites in most classes, that third-party hosted <em>content</em> such as video, scripts and fonts makes up a large portion of the known trackers seen on a typical website, and that tracking is just as prevalent on <em>secure</em> as on insecure sites.
</p>
<p>
Observations include that Google is the most widespread tracker organization <em>by far</em>, that content being served by known trackers may suggest that trackers are moving to providing services to the end user to <em>avoid being blocked</em> by privacy tools and ad blockers, and that the small difference in tracking between HTTP and HTTPS connections may suggest that users are given a <em>false sense of privacy</em> when using HTTPS.
</p>
</section>
<section>
<h2>
Stay updated
</h2>
<p>
I maintain two mailing lists for slightly different purposes; one for announcing major milestones, one for small releases and discussions.
</p>
<form method="post" action="http://lists.joelpurra.com/signup/">
<input type="hidden" name="maillist-signup-masters-thesis-announce" value="true" />
<input type="hidden" name="maillist-signup-masters-thesis-discussion" value="true" />
<fieldset>
<legend>
Quick sign up
</legend>
<input name="email" type="email" required="required" placeholder="Your email address" />
<button type="submit">
Subscribe
</button>
</fieldset>
</form>
<details>
<summary>
List descriptions and addresses
</summary>
<p>
You can also sign up manually by sending an email (with any subject and text) to the two email addresses below.
</p>
<h3>
Announcement mailing list
</h3>
<p>
<a href="mailto:masters-thesis-announce+subscribe@lists.joelpurra.com">masters-thesis-announce+subscribe@lists.joelpurra.com</a>
</p>
<p>
Announcements of milestones. If you want to hear about documents when they are finished and published, and be invited to the presentation(s), subscribe to this list.
</p>
<h3>
Discussion mailing list
</h3>
<p>
<a href="mailto:masters-thesis-discussion+subscribe@lists.joelpurra.com">masters-thesis-discussion+subscribe@lists.joelpurra.com</a>
</p>
<p>
Announcements of work in progress. More frequent updates and discussions surrounding the ongoing work. Input is appreciated!
</p>
</details>
</section>
<section>
<h2>
Work in progress
</h2>
<p>
Documents are published as they are being written. Comments, suggestions and pull requests are welcome!
</p>
<section>
<h3>
<!--
<time datetime="2015-02-06T14:07:35Z" title="2015-02-06T14:07:35Z">
2015-02-06T14:07:35Z
</time>
-->
<a href="http://joelpurra.com/projects/masters-thesis/files/documents/latest/joel-purra_masters-thesis_report.pdf">Thesis draft</a> (pdf)
</h3>
<p>
A document with ongoing thesis writing.
</p>
</section>
<details>
<summary>Finished documents</summary>
<section>
<h3>
<time datetime="2014-11-18T21:06:11Z" title="2014-11-18T21:06:11Z">
2014-11-18T21:06:11Z
</time>
<a href="http://joelpurra.com/projects/masters-thesis/files/documents/latest/joel-purra_masters-thesis_planning.pdf">Thesis planning and draft</a> (pdf)
</h3>
<p>
A document that combines planning with ongoing thesis writing. Read the abstract and background for an overview.
</p>
</section>
<section>
<h3>
<time datetime="2014-02-07T17:42:00Z" title="2014-02-07T17:42:00Z">
2014-02-07T17:42:00Z
</time> <a href="http://joelpurra.com/projects/masters-thesis/files/documents/latest/joepu444-thesis-proposal-2014-02-07T1742Z.pdf">Thesis subject proposal</a> (pdf)
</h3>
<p>
A broadly formulated subject proposal, which allowed .SE and me to specify direction and scope after an initial survey.
</p>
</section>
</details>
</section>
<section>
<h2>
People involved
</h2>
<ul>
<li>
Student: <a href="http://joelpurra.com">Joel Purra</a>, master's student for a <a href="https://www.liu.se/utbildning/program/informationsteknologi">Master of Science in Information Technology and Engineering</a>, <a href="https://liu.se/">Linköping University</a>, Sweden.
</li>
<li>
University examiner: <a href="https://www.ida.liu.se/~nikca/">Niklas Carlsson</a>, Associate Professor (Swedish: docent and universitetslektor) at <a href="https://www.ida.liu.se/divisions/adit/">Division for Database and Information Techniques (ADIT)</a>, <a href="https://www.ida.liu.se/">Department of Computer and Information Science (IDA)</a>, <a href="https://liu.se/">Linköping University</a>, Sweden.
</li>
<li>
Company supervisor: <a href="https://www.iis.se/bloggare/pawal/">Patrik Wallström</a>, Project Manager within R&amp;D, <a href="https://www.iis.se/">.SE (The Internet Infrastructure Foundation)</a>, Sweden.
</li>
<li>
Company supervisor: <a href="https://www.iis.se/bloggare/staffanh/">Staffan Hagnell</a>, Head of New Businesses, <a href="https://www.iis.se/">.SE (The Internet Infrastructure Foundation)</a>, Sweden.
</li>
</ul>
</section>
<section>
<h2>
Open source projects
</h2>
<p>
In the spirit of free and open source software as well as open data, the source code for both documents and tools is released to the public.
</p>
<p>
The power is in your hands.
<em>Use the source, Luke!</em>
</p>
<ul>
<li>
<a href="https://github.com/joelpurra/masters-thesis" rel="source">masters-thesis</a>: The source code for the thesis documents.
</li>
<li>
<a href="https://github.com/joelpurra/har-heedless">har-heedless</a>: Scriptable batch downloading of webpages to generate <a href="http://www.softwareishard.com/blog/har-12-spec/">HTTP Archive (HAR) files</a>, using <a href="http://phantomjs.org/">PhantomJS</a>.
</li>
<li>
<a href="https://github.com/joelpurra/har-dulcify">har-dulcify</a>: Extract data from <a href="http://www.softwareishard.com/blog/har-12-spec/">HTTP Archive (HAR) files</a> for some aggregate analysis.
</li>
<li>
<a href="https://github.com/joelpurra/har-portent">har-portent</a>: Using har-heedless to download and har-dulcify to analyze web pages in aggregate.
</li>
<li>
<a href="https://github.com/joelpurra/masters-thesis-site" rel="source">masters-thesis-site</a>: The source code for this page.
</li>
</ul>
</section>
</main>
<footer class="container">
<hr />
<p>
Part of <a href="http://joelpurra.com/projects/masters-thesis/" rel="home">Joel Purra's master's thesis</a> © 2014 <a href="http://joelpurra.com/" rel="author me">Joel Purra</a>.
</p>
<ul>
<li>Data is made available under the <a href="http://opendatacommons.org/licenses/odbl/1.0/" rel="external license">Open Database License 1.0</a>.</li>
<li>Any rights in individual contents of the database are licensed under the <a href="http://opendatacommons.org/licenses/dbcl/1.0/" rel="external license">Database Contents License 1.0</a>.</li>
<li>Documents and site content licensed under <a href="https://creativecommons.org/licenses/by-nc-nd/4.0/" rel="external license">Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0)</a>.</li>
<li>Code licensed according to terms in each separate project.</li>
</ul>
</footer>
<!-- DO NOT COPY THIS SCRIPT TAG -->
<!-- It's used for my statistics -->
<!-- DO NOT COPY THIS SCRIPT TAG -->
<script type="text/javascript">
//<![CDATA[
// DO NOT COPY THIS SCRIPT TAG
// It's used for my statistics
// DO NOT COPY THIS SCRIPT TAG
(function(i, s, o, g, r, a, m) {
i['GoogleAnalyticsObject'] = r;
i[r] = i[r] || function() {
(i[r].q = i[r].q || []).push(arguments)
}, i[r].l = 1 * new Date();
a = s.createElement(o),
m = s.getElementsByTagName(o)[0];
a.async = 1;
a.src = g;
m.parentNode.insertBefore(a, m)
})(window, document, 'script', '//www.google-analytics.com/analytics.js', 'ga');
ga('create', 'UA-15653943-1', 'joelpurra.com');
ga('send', 'pageview');
//]]>
</script>
</body>
<!-- jekyll-theme-demivolte, http://joelpurra.github.io/jekyll-theme-demivolte by Joel Purra, http://joelpurra.com/ -->
</html>