Python scripts used to analyse Google search results for the DuckDuckGo 2018 filter bubble study.
Switch branches/tags
Nothing to show
Clone or download
tagawa Merge pull request #1 from duckduckgo/tagawa/setup
Added link to filter bubble study report
Latest commit 3914b36 Dec 4, 2018
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data First commit Dec 3, 2018
LICENSE First commit Dec 3, 2018
README.md Added link to filter bubble study report Dec 4, 2018
count_domains.py First commit Dec 3, 2018
count_variations.py First commit Dec 3, 2018
measure_difference.py First commit Dec 3, 2018
requirements.txt First commit Dec 3, 2018
serp_parser.py First commit Dec 3, 2018
serp_parser.pyc First commit Dec 3, 2018

README.md

DuckDuckGo Filter Bubble Study (2018)

Python scripts used to analyse Google search results for the 2018 filter bubble study. This assumes Python version 2.7.

Usage

To count domain occurrences within organic links (i.e. excluding infoboxes):

$ python count_domains.py

To count variations of the search results and infoboxes:

$ python count_variations.py

To calculate the average differences (edit distances) of participants' search results:

$ python measure_difference.py

Dependencies

pyxDamerauLevenshtein >= 1.5

Reference

For the filter bubble study report, please see here:

See also the spreadsheets containing all data from the study here: