Update data quality metric outputs #217
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
As part of the research paper for OGEI, I wanted to update some of the output data quality metrics and figured these changes would be useful to merge for our next release.
This PR makes the following changes:
diba_imputation_performance.csv
andnational_imputation_performance.csv
into a single filewind_solar_profile_imputation_performance.csv
which contains one column for the DIBA results and another column for the national results.cems_pollutant_measurement_quality.csv
which summarizes what percentage of CO2 mass, SO2 mass, and NOx mass reported in CEMS was directly measured versus imputed. This metric is based on the reported "mass_measurement_code" reported for each observation in the CEMS data.input_data_source.csv
andhourly_profile_method.csv
for CO2e, SO2, and NOx (both totals and _for_electricity values)input_data_source.csv
to distinguish between monthly-reported EIA data and annually reported EIA data.NOTE: I am requesting this to be merged into a new
development
branch where we can stage cumulative updates before releasing a new version onmain