OGE v0.5.0 is a major release that expands the dataset's historical coverage back to 2005 and includes methodological enhancements that improve data quality in all years.
In addition to the new data, users should expect changes to the existing 2019-2022 data: NOx and SO2 totals and net generation totals may change for some plants, and data may change for CHP plants (see the "Methodological updates" section for more details).
Input data changes
- Updates to use the most recent data version of PUDL (v2024.5.0). This includes a re-release of the 2022 EIA-923 data, which may change some of the 2022 results.
- Updates reference tables, including the energy_source_groups file and the utility_name_ba_code_map file (#374), epa_eia_crosswalk_manual (#372), and emission_factors_for_co2_ch4_n2o (#377)
Output data changes
- Expands historical coverage of OGE to include monthly and annual data for 2005-2018 (#295 and #362)
- All output files (those in the outputs/ directory) are now saved as compressed .csv.zip files instead of .csv files. This reduces the disk space used by the outputs folder from approximately 16GB to 2.5GB (#366)
- Expands the data in the plant_static_attributes table to include location data (lat/long, address) and nameplate capacity (#364, #382, #385), as well as commercial operation dates and retirement dates (#367). We also screen for and correct erroneous lat/long data (#368)
- Fixes a bug where the "total" values in the outputs/annual_generation_averages_by_fuel file were not being calculated correctly
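Because the outputs are now zipped CSVs, existing scripts that open plain .csv files will need a small change. The following is a minimal sketch using only the Python standard library. The function name read_zipped_csv and the assumption that each archive holds a single CSV are illustrative, not part of OGE.

```python
import csv
import io
import zipfile


def read_zipped_csv(zip_path):
    """Return the rows of the first CSV inside a .csv.zip archive as dicts."""
    with zipfile.ZipFile(zip_path) as archive:
        # Assumption: each archive holds exactly one CSV member.
        member = archive.namelist()[0]
        with archive.open(member) as raw:
            # Decode the binary stream as text so csv can parse it.
            text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
            return list(csv.DictReader(text))
```

For pandas users, pandas.read_csv infers zip compression from a path ending in .zip when the archive contains a single file, so passing the .csv.zip path directly should generally be enough.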
Methodological updates
- When calculating the electric allocation factors for combined heat and power (CHP) plants, we previously calculated the factor at the generator level, which introduced bugs for certain combined cycle units when fuel and generation are reported for different generators at the same subplant. We now calculate this factor at the subplant level (#363)
- Fixes several bugs with the gross-to-net (GTN) generation conversions where anomalous fleet-average ratios were being introduced, and default factors were not being mapped to certain generators. Also fixes a bug where GTN ratios were being calculated when gross generation or net generation data was missing (#370, #375, #383)
- Updates uncontrolled NOx and SO2 factors to align assumptions with those used by the EIA Electric Power Annual, and to fix a bug where we were adjusting the SO2 values for fluidized bed boilers, even though the control efficiencies are already incorporated into the uncontrolled emission factors (#373). In addition, because fuel sulfur content data is not available pre-2008, we use sulfur content values averaged from 2008-2012 to backfill the missing data. When calculating backstop values for missing values in any year, we now use state-specific values (rather than national-average) to reflect differences in the sulfur contents of fuels being delivered in specific parts of the country (#376)
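The sulfur content backfill described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the OGE implementation: the function names, the dictionary-based inputs, and the fallback to a national average when no state value exists are all hypothetical.

```python
from statistics import mean

BACKFILL_SOURCE_YEARS = range(2008, 2013)  # 2008 through 2012 inclusive


def pre_2008_sulfur(reported_by_year):
    """Average the 2008-2012 sulfur contents to stand in for a pre-2008 year."""
    values = [
        reported_by_year[year]
        for year in BACKFILL_SOURCE_YEARS
        if year in reported_by_year
    ]
    # Return None when no 2008-2012 data exists rather than guessing a value.
    return mean(values) if values else None


def backstop_sulfur(state, fuel, state_averages, national_averages):
    """Prefer a state-specific average for a missing value.

    Falling back to a national average when the state has no value is an
    assumption made for this sketch.
    """
    return state_averages.get((state, fuel), national_averages.get(fuel))
```

For example, with made-up values, `backstop_sulfur("TX", "BIT", {("TX", "BIT"): 0.8}, {"BIT": 1.2})` picks the Texas-specific figure rather than the national one.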
Other minor fixes
- Remove the option to run the EIA-923 allocation at the plant level. This was an artifact that was no longer used (#361)
- Clean up function typehints and continue converting docstrings to Google format
- Updates where files are stored in and accessed from S3 (#384)
Pull Requests in this update
- Expand historical coverage pre-2019 by @grgmiller in #295
- Remove add_subplant_id optional argument by @rouille in #361
- Add 2005, 2006 and 2007 years by @rouille in #362
- Calculate electric_allocation_factor by subplant by @grgmiller in #363
- Compress OGE Outputs by @grgmiller in #366
- Add geographical information to the plant static attributes data frame by @rouille in #364
- Add operating and retirement dates to plant static attributes by @rouille in #367
- Update to use most recent version of pudl by @grgmiller in #369
- Fix issues with anomalous gross to net conversions by @grgmiller in #370
- Fix and add information to plant static attributes by @rouille in #368
- Fix function calculating averages of the fuel types by @rouille in #371
- Update manual epa eia crosswalk reference table by @rouille in #372
- Update Uncontrolled NOx and SO2 factors by @grgmiller in #373
- Update Energy Source Codes and Utility Name Map by @grgmiller in #374
- Correct Gross to Net Generation Bugs by @grgmiller in #375
- update co2 factors based on manual energy source group updates by @grgmiller in #377
- Add geopy to pyproject dependencies by @grgmiller in #378
- Add backstop sulfur content percentage for years 2005, 2006 and 2007 by @rouille in #376
- Compare plants coordinates from PUDL and EIA-860 by @rouille in #379
- Update warning message about validated years by @rouille in #381
- Discard non-operational generators when calculating plant capacity by @rouille in #382
- Revert removal of GTN shift factors by @grgmiller in #383
- Update to 0.5.0 and change s3 directory by @grgmiller in #384
- Fix missing capacity in plant static attributes by @grgmiller in #385
- Update documentation by @rouille in #380
- Historical coverage feature / v0.5.0 by @grgmiller in #386
- Update Citation by @grgmiller in #387
Full Changelog: v0.4.0...v0.5.0