Skip to content
This repository has been archived by the owner on Jul 23, 2024. It is now read-only.
/ webcompat-crawls Public archive

INACTIVE - http://mzl.la/ghe-archive - MVP of a OpenWPM-based crawl setup for Webcompat analysis

Notifications You must be signed in to change notification settings

mozilla/webcompat-crawls

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Webcompat Crawls

Configuration and instructions used to crawls top sites using a specially instrumented version of Firefox gathering data for Webcompat analysis.

Generate the seed list via a series of pre-crawls

See ./crawl-prep/README.md.

Run an OpenWPM crawl in Google Cloud Platform

See ./crawl-engineering/gcp/README.md.

Developer notes

To update the OpenWPM Crawler and crawl-prep submodules to the latest commits in the remotely tracked branches:

git submodule update --remote

About

INACTIVE - http://mzl.la/ghe-archive - MVP of a OpenWPM-based crawl setup for Webcompat analysis

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages