&benheid [BUG] allow `alpha` and `coverage` to be passed again via metrics to `evaluate` #5354

fkiraly · 2023-10-03T19:26:20Z

This PR ensures that the pre-existing syntax for passing `alpha` and `coverage` via metrics to `evaluate` works again, fixing #5336.

I am not commenting here on whether the status quo is a good idea (I think removing it was, or is, cleaner in the long run), but such a change should not happen without deprecation.

FYI @hazrulakmal.

Question to the reviewers, especially @hazrulakmal: do we need to change the naming convention to something else to ensure we do not change the names - or their order - again, accidentally?

Depends on #5337, so this change should trigger the test that is failing on main.

benHeid · 2023-10-05T09:36:06Z

The reindexing in line 290 (not changed) is the problem, since the indexes are named differently. See the pandas documentation: https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.reindex.html

Thus, `_get_column_order_and_datatype` should return the same names as the result DataFrame has. To do so, it also needs to use the for loop newly introduced in this PR.

This also requires some changes in the test: accessing columns via `df.col_name` no longer works, since the columns may be named like `test_PinballLoss_[0.1, 0.5, 0.9]`.

As an alternative, we could also try to rename all columns. However, this might lead to problems if multiple `PinballLoss` instances are used in `evaluate`.
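To illustrate the failure mode described above, here is a minimal sketch in plain pandas. It is not the actual `evaluate` code, and the column names and values are made up for illustration. The only point is that `DataFrame.reindex` matches labels exactly and fills any label it does not find with NaN.

```python
import pandas as pd

# Results as they might look with the new naming
# (column names and values are illustrative assumptions).
results = pd.DataFrame(
    {
        "test_PinballLoss_[0.1, 0.5, 0.9]": [0.12, 0.15],
        "fit_time": [0.01, 0.02],
    }
)

# Expected column order, still using the old name without the alpha suffix.
expected_columns = ["test_PinballLoss", "fit_time"]

reindexed = results.reindex(columns=expected_columns)

# The old-style label does not exist in `results`, so reindex creates it
# as an all-NaN column and drops the new-style column.
all_nan = bool(reindexed["test_PinballLoss"].isna().all())
kept_columns = list(reindexed.columns)

# Attribute access cannot express the new-style name; bracket access can.
new_style_values = results["test_PinballLoss_[0.1, 0.5, 0.9]"].tolist()
```

Bracket access is also what a test would need once column names contain characters such as `[` or spaces.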

fkiraly · 2023-10-05T12:36:11Z

I see. So we need to restore the expected column names at least until the deprecation has taken place.

For the same reason I asked @hazrulakmal to keep the old column names for now (not breaking anything for the user!), I think we need to keep them here as well.

More precisely, my preferred solution would be:

- Check whether, in the current PR state, there are multiple columns relating to a loss.
- If not, replace the column name with the one that does not contain `alpha` or `coverage`.

If this happens before the reindex, that should solve the issue?
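The check proposed above could be sketched roughly as follows. The helper name `legacy_column_names`, the `test_<metric>` prefix convention, and the example column names are assumptions for illustration, not the code in this PR.

```python
from collections import defaultdict


def legacy_column_names(columns, metric_names):
    """Map new-style column names to old-style ones where unambiguous.

    Hypothetical helper: `columns` are the result column names,
    `metric_names` the plain metric names such as "PinballLoss".
    """
    # Group new-style columns by the old-style name they would collapse to.
    candidates = defaultdict(list)
    for col in columns:
        for name in metric_names:
            prefix = f"test_{name}"
            # Match "test_<name>" itself or "test_<name>_<suffix>".
            if col == prefix or col.startswith(prefix + "_"):
                candidates[prefix].append(col)

    # Only rename when exactly one column relates to the loss.
    return {
        cols[0]: old
        for old, cols in candidates.items()
        if len(cols) == 1 and cols[0] != old
    }


single = legacy_column_names(
    ["test_PinballLoss_[0.1, 0.5, 0.9]", "fit_time"], ["PinballLoss"]
)
multiple = legacy_column_names(
    ["test_PinballLoss_[0.1]", "test_PinballLoss_[0.9]"], ["PinballLoss"]
)
```

With a single column per loss, the mapping restores the old name; with several columns for the same loss, nothing is renamed, which avoids name collisions.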

benHeid · 2023-10-05T13:24:01Z

> I see. So we need to restore the expected column names at least until the deprecation has taken place.

Yes. The challenge might be to ensure that we restore it correctly. I.e., we need to have a mapping somewhere from old to new.

> If this happens before the reindex, that should solve the issue?

yes.

Would you then also directly introduce a legacy parameter that allows controlling the naming of the columns?

benHeid · 2023-10-05T14:33:41Z

The fix is based on:

- remembering a mapping from new names to old ones
- introducing a flag to decide whether old or new naming should be used

I have not introduced a legacy parameter
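As a rough sketch of the idea (not the code merged in this PR), a remembered mapping plus a flag could be applied to the results before reindexing. The function name `apply_column_naming`, the flag name `use_old_names`, and the column names are hypothetical.

```python
import pandas as pd


def apply_column_naming(results, new_to_old, use_old_names=True):
    """Return `results` with old or new column names.

    Hypothetical sketch: `new_to_old` is the remembered mapping,
    `use_old_names` the flag deciding which naming is used.
    """
    if use_old_names:
        return results.rename(columns=new_to_old)
    return results


# Illustrative data; names and values are assumptions.
results = pd.DataFrame(
    {
        "test_PinballLoss_[0.1, 0.5, 0.9]": [0.12, 0.15],
        "fit_time": [0.01, 0.02],
    }
)
new_to_old = {"test_PinballLoss_[0.1, 0.5, 0.9]": "test_PinballLoss"}

# Renaming before the reindex keeps the values instead of producing NaN.
restored = apply_column_naming(results, new_to_old).reindex(
    columns=["test_PinballLoss", "fit_time"]
)
unchanged = apply_column_naming(results, new_to_old, use_old_names=False)
```

Keeping the flag as an ordinary argument of an internal helper, rather than a public parameter, matches the note above that no legacy parameter was introduced.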

fkiraly · 2023-10-05T17:05:32Z

amazing. Very elegant. I would have written sth more painful.

fkiraly · 2023-10-05T17:59:25Z

I approve your part, but I cannot approve my own code...

benHeid

Looks good to me now. The code is partly written by me, but technically I can approve it. If you are fine with my changes and I am fine with yours, it should be okay, I suppose.

fkiraly · 2023-10-05T18:23:46Z

I'm fine with your changes, as said - that implies the four eyes principle is satisfied.

fkiraly added 2 commits October 2, 2023 15:08

Update test_interval_wrappers.py

888e0ec

pass alpha and coverage on to evaluate from metric

0e0cbfb

fkiraly added the `bugfix` (Fixes a known bug or removes unintended behavior) and `module:metrics&benchmarking` (metrics and benchmarking modules) labels Oct 3, 2023

fkiraly requested review from achieveordie, benHeid and yarnabrina as code owners October 3, 2023 19:26

fkiraly added 2 commits October 3, 2023 21:38

Update _functions.py

e0f4e8d

Update _functions.py

fd4b4fe

fkiraly mentioned this pull request Oct 5, 2023

[MNT] python 3.12 compatibility #5364

Closed

Fix problem with renamed columns.

e92a4c2

benHeid approved these changes Oct 5, 2023

fkiraly changed the title ~~[BUG] allow alpha and coverage to be passed again via metrics to evaluate~~ &benheid [BUG] allow alpha and coverage to be passed again via metrics to evaluate Oct 5, 2023

fkiraly mentioned this pull request Oct 5, 2023

[ENH] Link test_interval_wrappers.py to changes in evaluate for conditional testing #5337

Merged

fkiraly merged commit c552caf into main Oct 5, 2023
24 checks passed

fkiraly deleted the fix-interval-evaluate branch October 5, 2023 18:26

fkiraly mentioned this pull request Dec 29, 2023

[ENH] forecasting evaluate utility failing with quantile forecasts #5336

Closed
