Supplementary materials used in the analysis for the CHIIR 2020 publication "Towards Search Strategies for Better Privacy and Information"
- Annotations can be found in this folder
- URL_TRACKER_DATA.csv contains the number of 3rd party trackers for each URL. This dataset also records where the tracking data was retrieved from. Details for the WhoTracks.me dataset are described further below.
- annotations_metadata.csv contains a unique ID, the URL visited, the search task number (as provided by Pogacar et al. ICTIR 2017), the Cochrane medical question, and the correct answer to the question.
- annotations_new.csv contains the annotations from each annotator and the final annotation reached after the annotators worked together to come to agreement. The ID in this file matches the ID in URL_TRACKER_DATA.csv.
- annotations_against_previous_studies.csv contains annotations against previous studies (Pogacar et al. ICTIR 2017 and Zimmerman et al. CHIIR 2019). The file contains the same fields as annotations_new.csv plus the annotation from previous studies. This data was used to test agreement between annotators and previous research.
- If you use our publication or the annotations, please cite the following BibTeX entry.
@inproceedings{Zimmerman2020Towards,
author = {Zimmerman, Steven and Thorpe, Alistair and Chamberlain, Jon and Kruschwitz, Udo},
title = {Towards Search Strategies for Better Privacy and Information},
booktitle = {Proceedings of the 2020 Conference on Human Information Interaction and Retrieval},
series = {CHIIR '20},
year = {2020},
location = {Vancouver, BC, Canada},
pages = {TBD}
}
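Because the ID in annotations_new.csv matches the ID in URL_TRACKER_DATA.csv, the two files can be joined row by row. The following is a minimal sketch of such a join using only the Python standard library. The column names (`id`, `final_annotation`, `url`, `tracker_count`) and the sample values are assumptions made for illustration; check the actual CSV headers before using it.

```python
import csv
import io


def read_rows(handle):
    """Read a CSV file handle into a list of dicts keyed by the header row."""
    return list(csv.DictReader(handle))


def join_on_id(annotation_rows, tracker_rows, id_field="id"):
    """Attach tracker fields to each annotation row that shares the same ID.

    `id_field` is an assumed column name, not taken from the real files.
    Annotation rows without a matching tracker row are skipped.
    """
    trackers_by_id = {row[id_field]: row for row in tracker_rows}
    joined = []
    for row in annotation_rows:
        match = trackers_by_id.get(row[id_field])
        if match is not None:
            # On a column-name clash, the annotation value is kept.
            joined.append({**match, **row})
    return joined


# Tiny in-memory stand-ins for the two files; headers and values are made up.
annotations_csv = io.StringIO(
    "id,final_annotation\n"
    "1,helpful\n"
    "2,unhelpful\n"
)
trackers_csv = io.StringIO(
    "id,url,tracker_count\n"
    "1,https://example.org/a,4\n"
    "3,https://example.org/c,9\n"
)

joined = join_on_id(read_rows(annotations_csv), read_rows(trackers_csv))
print(joined)
```

To run this against the real files, replace the `io.StringIO` objects with file handles, for example `open("annotations_new.csv", newline="")`, and set `id_field` to the actual ID column name.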
For the WhoTracks.me 10,000-website 3rd party tracking dataset:
- Data used for analysis can be found in this folder
- WhoTracks.me is a collaborative project with teams at Cliqz and Ghostery.
- This dataset was made available for our research via an endpoint provided by Cliqz and Ghostery.
- Please cite the paper "WhoTracks.Me: Shedding light on the opaque world of online tracking".
- Usage of this dataset for future research is allowed under the following license:
This dataset is licensed under a Creative Commons Attribution 4.0 International License and is attributed to https://whotracks.me/.