Skip to content

josh-chamberlain/PDAP-Scrapers

 
 

Repository files navigation

Police Data Accessibility Project Scrapers

This repo contains the data scrapers for Police Data Accessibility Project. Thank you for being here!

Scrapers are one piece of the puzzle of making police data more accessible. For a broader look at the work we're doing, and where our attention is focused, start with the docs: https://docs.pdap.io/ and the roadmap: https://github.com/orgs/Police-Data-Accessibility-Project/projects/17

How to run a scraper

Right now, this requires some Python knowledge and patience. We're in the early stages: there's no automated scraper farm or fancy GUI yet.

  1. Install Python.
  2. Clone this repo.
  3. Find the scraper you wish to run. These are sorted geographically, so start by looking in /USA/....
  4. Run the scraper.py file with something like python3 <scraper path> depending on how you installed it.

Did it work?

If it worked, discuss your findings in our Discord. If it didn't, make an issue in this repo or reach out in Discord.

How to contribute

To write a scraper, start with CONTRIBUTING.md. Be sure to check out the /common folder!

For everything else, start with docs.pdap.io.

What data are we scraping?

The data sources listed here are our to-do list. If we should targeting a new data type, suggest it in Discord or make a DoltHub PR!

Resources

Potentially useful tools. If you find something useful, or if one of these is out of date, make a PR!

About

Code relating to scraping public police data.

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 97.1%
  • JavaScript 2.2%
  • Other 0.7%