fix: use max() for penalty scaling floor (closes #75)#77
Merged
Conversation
The penalty scaling base was computed with np.min([max_import_price, 0.1e-3]) despite the comment intending a positive floor. min() inverts penalties on negative market prices (turning them into rewards in the maximizing objective) and clamps them to the tiny floor whenever real prices are higher, making penalties too weak to dominate. Switch to np.max() for a proper positive floor that scales with the highest import price, and deduplicate the expression—repeated five times—into a single penalty_base. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
andig
commented
May 31, 2026
Member
Author
|
@ekkea lgtm? |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Fixes #75. The penalty-scaling base was computed with
np.min([self.max_import_price, 0.1e-3]), despite the comment stating the intent was to keep the penalty positive.min()does the opposite:1e-4floor get clamped down to that tiny floor, making penalties too weak to reliably dominate real costs.The intended behavior is a positive floor that scales with the highest import price, which is
np.max().Changes
np.min→np.maxfor the penalty base.penalty_basecomputed once (the "clean up the calculation of incentives" part of the issue).Behavior is unchanged for typical positive prices above the floor, and corrected for the zero/negative-price edge cases the issue flagged.
Testing
uv run pytest→ 13 passeduv run ruff check→ clean🤖 Generated with Claude Code