Challenge.gov is a government website that hosts prize competitions and challenges across the U.S. federal government.
All of the currently active challenges listed are on the homepage, with more details in permalinks for each.
AFAICT, there is no RSS feed or way to be notified where there new challenges posted. So this project is a way to get the challenges into a machine-readable format by scraping the homepage periodically.
This repo is set up to work as an automated, periodic process in the manner of
Git scraping as described by Simon
Willison. See
.github/workflows/scrape.yml
.
The scraper reads the Challenge.gov homepage, which appears to be the canonical
place for the list of currently active challenges. The scraper parses the HTML
and extracts details about each challenge, then serializes them to a formatted
JSON document, challenges.json
in the top-level directory.
The GitHub action that runs the scraper periodically automatically checks in
any differences in challenges.json
, producing a diff history over time.