Rewrite EPA CEMS archiver to use API #68

Closed
zaneselvans opened this issue Feb 22, 2023 · 2 comments · Fixed by #138

@zaneselvans
Member

zaneselvans commented Feb 22, 2023

At some point in the last few months, EPA seems to have removed the bulk CSV files containing hourly emissions data from its site, and is now making the data available exclusively through an API.

Judging by their example bulk-download Python script, it seems possible to download CSVs from an S3 bucket. Hopefully they're not too different from the CSVs we were previously downloading directly from EPA!

Note that this API replaces the .zip files with one .csv file per state-year, so we will also need to adjust the extraction step in pudl to match the new data format.
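To make the state-year layout concrete, here is a rough sketch of how an archiver might index a file listing into one CSV per (state, year) and fetch each one. It is not based on EPA's script: the base URL is a placeholder, and the file-naming pattern, the `index_by_state_year` and `download` helpers, and the output file names are all hypothetical. The real paths and fields would need to come from EPA's example script or API docs.

```python
"""Sketch: index a bulk-file listing into one CSV per state-year.

Everything about the listing's shape is an assumption for illustration.
"""
import re
import urllib.request
from pathlib import Path
from urllib.parse import urljoin

# Placeholder, NOT the real EPA endpoint.
BASE_URL = "https://example.invalid/bulk-files/"

# Hypothetical naming scheme: emissions-hourly-<year>-<state>.csv
STATE_YEAR_CSV = re.compile(
    r"emissions-hourly-(?P<year>\d{4})-(?P<state>[a-z]{2})\.csv$"
)


def index_by_state_year(paths):
    """Map (state, year) -> relative path for every path matching the pattern."""
    partitions = {}
    for path in paths:
        match = STATE_YEAR_CSV.search(path)
        if match:
            key = (match["state"].upper(), int(match["year"]))
            partitions[key] = path
    return partitions


def download(partitions, out_dir):
    """Fetch each state-year CSV into out_dir (requires network access)."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    for (state, year), rel_path in sorted(partitions.items()):
        # Hypothetical local naming; one file per state-year partition.
        target = out_dir / f"epacems-{year}-{state.lower()}.csv"
        with urllib.request.urlopen(urljoin(BASE_URL, rel_path)) as response:
            target.write_bytes(response.read())
```

Keying on (state, year) rather than on archive names should make it easier for the extraction step in pudl to read a partition directly from a single CSV instead of unpacking a .zip first.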

@zaneselvans
Member Author

@bendnorman, as you work on the gridstatus data archiving, maybe keep this issue in mind too, since similar patterns may apply here.

@ggurjar333

@zaneselvans Can I take this?

@e-belfer e-belfer self-assigned this Jul 24, 2023
@e-belfer e-belfer linked a pull request Aug 1, 2023 that will close this issue