Remove ProcessName column from consumption files by tsmbland · Pull Request #448 · EnergySystemsModellingLab/MUSE_OS

tsmbland · 2024-08-08T09:24:44Z

Description

One of my biggest gripes with the input files was that when specifying consumption data you had to specify a process (with the "ProcessName" column). This gives the impression that demand is for a particular process, but this is NOT true. Demand is for the commodities, and is agnostic to the process. Even though process must be specified in the file, this information isn't used in the model. If consumption data is specified for multiple processes, this just ends up being summed. Same thing with preset supply data which shares the same reader function (although this isn't used in any of the examples)

I think it would be much clearer to remove this column from the consumption files (or at least not to mandate it). However, we still need things to work if the column is present.

The main changes I've made are to the read_csv_outputs function (which I've also renamed), taking the summing operation from the PresetSector class and moving it here, and then dropping the ProcessName column. This will now work with or without the ProcessName column, and I think it makes it clearer what's actually going on.

I've also updated the documentation accordingly.

Fixes #391

Type of change

Please add a line in the relevant section of
CHANGELOG.md to
document the change (include PR #) - note reverse order of PR #s.

New feature (non-breaking change which adds functionality)
Optimization (non-breaking, back-end change that speeds up the code)
Bug fix (non-breaking change which fixes an issue)
Breaking change (whatever its nature)

Key checklist

All tests pass: $ python -m pytest
The documentation builds and looks OK: $ python -m sphinx -b html docs docs/build

Further checks

Code is commented, particularly in hard-to-understand areas
Tests added that prove fix is effective or that feature works

codecov · 2024-08-08T09:39:28Z

Codecov Report

Attention: Patch coverage is 50.00000% with 5 lines in your changes missing coverage. Please review.

Project coverage is 71.28%. Comparing base (b2220a6) to head (679f673).

Files	Patch %	Lines
src/muse/readers/csv.py	50.00%	3 Missing and 1 partial ⚠️
src/muse/sectors/preset_sector.py	50.00%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #448      +/-   ##
===========================================
- Coverage    71.31%   71.28%   -0.04%     
===========================================
  Files           44       44              
  Lines         5881     5885       +4     
  Branches      1153     1154       +1     
===========================================
+ Hits          4194     4195       +1     
- Misses        1366     1369       +3     
  Partials       321      321

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

alexdewar

LGTM. I would consider giving users a warning if they supply a ProcessName column, but other than that I think it's all good.

alexdewar · 2024-08-08T10:55:20Z

+        assert all(u in data.columns for u in indices)
+
+        # Legacy: drop ProcessName column and sum data (PR #448)
+        if "ProcessName" in data.columns:


How about warning users that having a ProcessName column is deprecated here?

alexdewar · 2024-08-08T10:56:26Z

    datas = {}
    for path in allfiles:
        data = pd.read_csv(path, low_memory=False)
+        assert all(u in data.columns for u in indices)


Note that assert statements aren't included in optimised Python bytecode. But I guess if one of these columns is missing an error will be raised below?

Ah ok I didn't know that, but yes, an error will be raised below

tsmbland added 2 commits August 8, 2024 10:23

Remove ProcessName column from consumption files

1580793

Update test

6a795c7

Simplify and rename function

5614fab

tsmbland marked this pull request as ready for review August 8, 2024 10:26

tsmbland requested a review from alexdewar August 8, 2024 10:26

Update tutorials

68892bb

alexdewar approved these changes Aug 8, 2024

View reviewed changes

Add warning message

679f673

tsmbland enabled auto-merge August 8, 2024 13:00

tsmbland merged commit 0f1f518 into develop Aug 8, 2024

tsmbland deleted the process_name branch August 8, 2024 13:16

tsmbland mentioned this pull request Aug 13, 2024

PR for v1.2.0rc2 release #452

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove ProcessName column from consumption files#448

Remove ProcessName column from consumption files#448
tsmbland merged 5 commits intodevelopfrom
process_name

tsmbland commented Aug 8, 2024 •

edited

Loading

Uh oh!

codecov Bot commented Aug 8, 2024 •

edited

Loading

Uh oh!

alexdewar left a comment

Uh oh!

alexdewar Aug 8, 2024

Uh oh!

alexdewar Aug 8, 2024

Uh oh!

tsmbland Aug 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tsmbland commented Aug 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Key checklist

Further checks

Uh oh!

codecov Bot commented Aug 8, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

alexdewar left a comment

Choose a reason for hiding this comment

Uh oh!

alexdewar Aug 8, 2024

Choose a reason for hiding this comment

Uh oh!

alexdewar Aug 8, 2024

Choose a reason for hiding this comment

Uh oh!

tsmbland Aug 8, 2024

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tsmbland commented Aug 8, 2024 •

edited

Loading

codecov Bot commented Aug 8, 2024 •

edited

Loading