Website crawler for Adblock Plus
Python JavaScript
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
chrome/locale/en-US
lib
.hgignore
README.md
build.py
dependencies
ensure_dependencies.py
icon.png
icon64.png
metadata.gecko
run.py

README.md

abpcrawler

Firefox extension that loads a range of websites and records which elements are filtered by Adblock Plus.

Requirements

Running

Execute the following:

./run.py -b /usr/bin/firefox urls.txt outputdir

This will run the specified Firefox binary to crawl the URLs from urls.txt (one URL per line). The resulting data and screenshots will be written to the outputdir directory. Firefox will close automatically once all URLs have been processed.

Optionally, you can provide the path to the Adblock Plus repository - Adblock Plus will no longer be downloaded then.

License

This Source Code is subject to the terms of the Mozilla Public License version 2.0 (the "License"). You can obtain a copy of the License at http://mozilla.org/MPL/2.0/.