-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Build scraper for Statements of Administrative Policy, connect to bill #34
Comments
The SAP may be connected to one or more bills (this will be described in the title) |
Related to #82 |
Obama administration SAP data archive link: https://obamawhitehouse.archives.gov/omb/legislative_sap_default. |
As promised, here's some feedback on the JSON metadata currently at https://github.com/aih/FlatGov/blob/FT_branch_1/server_py/flatgov/dump_statement.json.zip. I think it might be important to re-think the philosophy here in terms of building a permanent archive of this data (Statements of Administration Policy) first and then thinking about how it integrates into the flatgov application second. This creates a little buffer between the usefulness and value of your data-gathering efforts and the longevity of the Flatgov application --- a permanent archive will live forever and will create value for researchers so long as humans continue to exist, but Flatgov might not. As a permanent archive, you want to make sure the archive is complete and accurate and that the metadata is clear in meaning, well organized, has some documentation about what it is/where it's from/how it's organized, and doesn't have extraneous (e.g. flatgov-internal) information in it. I would move it out of a repository that has UI/front-end/server things. Then a second step is to integrate it into the UI --- being a consumer of your own data to prove that the data is consumable. The JSON currently looks like: (my specific comments are below) [
...
{
"model": "bills.statement",
"pk": 24,
"fields": {
"bill_number": "HR1140",
"bill": "H.R. 1140 Rights for Transportation Security Officers Act of 2020",
"congres": "116",
"date_issued": " March 2, 2020",
"pdf_link": null,
"link": "https://www.whitehouse.gov/wp-content/uploads/2020/03/SAP_HR-1140.pdf",
"created_at": "2021-01-16T17:11:53.026Z"
}
},
...
]
|
https://obamawhitehouse.archives.gov/omb/legislative_sap_default
https://www.presidency.ucsb.edu/documents/app-categories/written-statements/presidential/statements-administration-policy?items_per_page=10&page=127
https://www.presidency.ucsb.edu/documents/app-categories/written-statements/presidential/statements-administration-policy?items_per_page=10&page=127 George w bush saps https://georgewbush-whitehouse.archives.gov/omb/legislative/sap/index.html
The text was updated successfully, but these errors were encountered: