Scrape the Austin Police Department's messy Indcident Reports Database (police reports) into a machine-readable CSV format. Reports come from the official APD website: https://www.austintexas.gov/police/reports/index.cfm
Pretty simple ...
apd_incident_reports.py [-h] [--verbose] STARTDATE ENDDATE FILENAME
Arguments: STARTDATE Date to start scraping from (furthest date from present) in MM/DD/YYYY format. ENDDATE Date to scrape to (nearest to now) in MM/DD/YYYY format. FILENAME Filename to write CSV data to. Optional Arguments: --verbose Increase output verbosity
The Austin Police Department only holds the past 18 months of police reports on their site, get it while it lasts.
There is no way to disinguish between non-Hispanic whites and Hispanic whites in the reports. The APD doesn't include ethnicity data here, so all Hispanic people are lumped into "White". One possible solution to doing a preliminary sort on this is to cross-reference these records with the Travis County Clerk's Misdemeanor Records, which does include ethnicity details. Another strategy would be to use something like the Census Bureau's "Building a Spanish Surname List for the 1990s" as a model.
Juvenile offenders' names are censored from the reports.