Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

v0.1.2 #251

Merged
merged 40 commits into from
Oct 26, 2022
Merged

v0.1.2 #251

merged 40 commits into from
Oct 26, 2022

Conversation

grgmiller
Copy link
Collaborator

Merges v0.1.2 patch updates into main.

Fixes issue with assignment of subplant IDs
Fixes an issue that was causing some generators/units to be assigned missing or incorrect subplant IDs. This issue caused several downstream issues including inaccurate hourly shapes being assigned to certain subplants, or inaccurate conversion of gross to net generation in the CEMS data. This patch ensures that every generator_id and unitid is associated with a non-missing subplant ID, and that these subplant assignments account for boiler-generator associations from EIA-860 (full details).

Fixes anomalous spikes in emission rate data
Several grid regions were exhibiting anomalous dips in their regional emission intensity values due to an issue with the methodology used to shape data from plants with partial CEMS data. Specifically, the generation from certain non-emitting plants (e.g. nuclear, solar, etc) that had a fossil-fuel backup generator onsite were being assigned the intermittent hourly profile of the backup generator if that generator reported to CEMS. This resulted in data quality issues in both the generated and consumed emission rates for some regions. This patch fixes that issue by excluding all non-emitting generators and plants with subplants of mixed fuel types from using the partial CEMS methodology. (full details).

Other updates

  • Fixes an issue that was resulting in an infeasible conda environment by updating our dependency on the Public Utility Data Liberation Project to a stable branch of the project.
  • Improves the speed of running the part of the data pipeline that identifies subplant IDs.
  • Updates the plant_metadata.csv file to help users more easily identify the methodologies used for each plant.
  • Adds adjusted R2 values to the gross to net generation regression outputs available in the data/outputs/gross_to_net_conversions.csv file.
  • Renames the data/outputs/subplant_crosswalk.csv file to subplant_crosswalk_[YEAR].csv to clarify that subplant IDs are only valid for a specific year.

grgmiller and others added 30 commits September 15, 2022 17:03
* add notebook in progress for issue 230
(hourly_validation and map_visualization)
* update 930 time lag notebook for new dir structure
notebook was used during issue 230 investigation

notebook is from gailin/clean_cems branch, which can now be deleted
* A renewable generator (hydro, wind, solar, nuclear, geothermal) won't use the `partial_cems_plant` hourly shaping methodology
* A subplant that contains generators of mixed fuel types will choose the hourly shaping method of the generator with the largest generation
@gailin-p gailin-p merged commit bce51f7 into main Oct 26, 2022
@gailin-p gailin-p deleted the v0.1.2 branch October 26, 2022 22:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants