Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EPA Facility Level GHG emissions #28

Open
nickrsan opened this issue Jan 12, 2017 · 20 comments
Open

EPA Facility Level GHG emissions #28

nickrsan opened this issue Jan 12, 2017 · 20 comments

Comments

@nickrsan
Copy link
Member

Name: EPA Facility Level GHG emissions
Organization: EPA
Description URL: https://www.epa.gov/ghgreporting/ghg-reporting-program-data-sets
Download URL:
File Types:
Size:
Status: Done

@meyerzinn
Copy link

This dataset is not going to be mirrored soon. Please advise, we need a more permanent mirror.

@nickrsan
Copy link
Member Author

This shows as having at least one mirror that's hosted on private servers, so that's good, but I'd like to get a public URL for it before considering it mirrored. I'll look back through the form data and see if I can find if they submitted a URL. Thanks for flagging this.

@meyerzinn
Copy link

meyerzinn commented Jan 15, 2017

This was the one I submitted. I'm unable to keep it up for more than a few weeks. I will focus on moving it to IPFS.

@nickrsan
Copy link
Member Author

OK, good to know - I'm going to remove the One Mirror flag then so it appears as a higher priority. If nobody else steps in, I'll mirror it.

@colinbeier
Copy link

have the full Oracle db (.dmp) as well as .csv of summary data. cannot upload to mirror yet but can confirm there are multiple complete copies archived in safe places.

@ghost
Copy link

ghost commented Jan 26, 2017 via email

@mskallisti
Copy link

Pulling.

@dcmccabe
Copy link

Hi all,
I'm with an environmental NGO. We use this particular dataset all the time and we'd like to help get the mirrored data set up on a publicly available website. If those who have already pulled the data can contact me, we can start that process. (We are working separately to FOIA the information from EPA, but that could take weeks).

I am at Clean Air Task Force. You won't have any trouble finding my direct email address on the CATF website.

Thanks for all your work!

  • David

@sasignell
Copy link

When we talk about mirroring, are we talking about mirroring the (handy) GUI too? If so, how can we get the front-end code?

@meyerzinn
Copy link

I believe the Internet Archive can preserve GUIs, we are focused on the data (depending on how hard it is to retrieve the GUI).

@dcmccabe
Copy link

My partners have a skeletal plan for building a UI (just duplicating what EPA already built, scraping their html etc. to do so.)

There are a number of ways to access the GHGRP data. We may be more focused on the tools that the site uses for downloading all sorts of specific data, rather than the graphic UIs.

So, if you have the data downloaded (or will soon), drop me a line so I can partner you with the folks that are planning to build and host this.

@sasignell
Copy link

sasignell commented Jan 26, 2017 via email

@colinbeier
Copy link

colinbeier commented Jan 26, 2017 via email

@BethTrask
Copy link

Colin & all: I'm from Environmental Defense Fund. We'd be happy to stand up a publically accessible site to store these files and make them available for download. As Dave noted earlier today, we also just started talking internally about creating a UI with capabilities similar to EPA's Flight. That will take some time, but we want to start by providing a hosted site to keep these data public. Many thanks to all of you who are saving these vital data!

@sasignell
Copy link

sasignell commented Jan 26, 2017 via email

@meyerzinn
Copy link

I was interested in creating an index in a machine readable format to help your website efforts. I think we can have a mix of machine-readable and human-editable, tell me what you think:

In this repo, we create a folder for datasets that contains a bunch of files, each with its own file that has all the info pertaining to it. We also have a folder with mirrors + indices per mirror. Thus, we could have the website feed from there and also make a bot to check on mirrors.

@BethTrask
Copy link

Thank you, Steve and Meyer! I would love to have your help. I can start by getting some the infrastructure lined up with my IT person and then circle back with you to do some brainstorming.

@JeremiahCurtis
Copy link

I am assuming that all the subpart-level data have been mirrored in this issue?
https://www.epa.gov/enviro/greenhouse-gas-customized-search

@colinbeier
Copy link

colinbeier commented Apr 12, 2017 via email

@JeremiahCurtis
Copy link

@colinbeier Thanks for the update. Given that the LTM, CASTNET, and other data have been archived (which I assume means that an offline copy exists), does this mean that the industry and facility-level subpart data have also been mirrored from envirofacts (https://www.epa.gov/enviro/greenhouse-gas-customized-search) , such as the cement production emissions data accessed via https://oaspub.epa.gov/enviro/AD_HOC_TABLE_COLUMN_SELECT_V2.retrieval_list?

Would like to contribute a second mirror for this data, but don't know how to get the subpart-level data outside envirofacts (which appears to require a large number of url queries to get all the GHGRP subpart-level data). Is there a straightforward tool for these data? They don't seem to be on ftp://newftp.epa.gov/ or ftp://ftp.epa.gov/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

8 participants