[BUG] Performance Metric: multilevel="raw_values" with either multioutput="uniform_average" or custom weights for multioutput #6413

kdekker-private · 2024-05-13T07:27:29Z

Describe the bug
Performance metric classes do not work when using the combination multilevel="raw_values" with either multioutput="uniform_average" or custom weights for multioutput.

To Reproduce

Using MeanAbsoluteError, but the error does not remain to this class.

import numpy as np
from sktime.performance_metrics.forecasting import MeanAbsoluteError
y_true = np.array([[0.5, 1], [-1, 1], [7, -6]])
y_pred = np.array([[0, 2], [-1, 2], [8, -5]])
mae = MeanAbsoluteError(multilevel="raw_values", multioutput="uniform_average")
mae(y_true, y_pred)
mae_custom_multitoutput_weights = MeanAbsoluteError(multilevel="raw_values", multioutput=[0.4,0.6])
mae_custom_multitoutput_weights(y_true, y_pred)

Output:

ValueError: DataFrame constructor not properly called!

Expected behavior
Both the output of mae and mae_custom_multitoutput_weights with custom multioutput weights should not throw an error.

Additional context
It seems like the wrapper class around the inner metric function fails to coerse the evaluations in the expected output format.

Versions
0.28.0

fkiraly · 2024-05-13T20:32:15Z

confirmed on current main, windows, python 3.11.

Note to devs: we need to check why this combination of inputs is not tested.

fkiraly · 2024-05-13T23:51:20Z

I am noting that the data in question has no hierarchy levels, as it is a "plain" series data container.

Still, the metric should not crash, so the coercion should be fixed.

…es is not hierarchical (#6418) This PR allows metric classes to be called with `multilevel` arg in all cases, if the series is not hierarchical. Previously, this would crash the metric. This PR changes behaviour in this degenerate case so a single-entry `pd.DataFrame` is returned. Fixes #6413

kdekker-private added the bug Something isn't working label May 13, 2024

fkiraly added the module:metrics&benchmarking metrics and benchmarking modules label May 13, 2024

fkiraly added this to Needs triage & validation in Bugfixing via automation May 13, 2024

fkiraly moved this from Needs triage & validation to Reproduced/confirmed in Bugfixing May 13, 2024

fkiraly mentioned this issue May 14, 2024

[BUG] allow metric classes to be called with multilevel arg if series is not hierarchical #6418

Merged

fkiraly moved this from Reproduced/confirmed to Under review in Bugfixing May 14, 2024

fkiraly closed this as completed in #6418 May 19, 2024

Bugfixing automation moved this from Under review to Fixed/resolved May 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Performance Metric: multilevel="raw_values" with either multioutput="uniform_average" or custom weights for multioutput #6413

[BUG] Performance Metric: multilevel="raw_values" with either multioutput="uniform_average" or custom weights for multioutput #6413

kdekker-private commented May 13, 2024

fkiraly commented May 13, 2024

fkiraly commented May 13, 2024

[BUG] Performance Metric: multilevel="raw_values" with either multioutput="uniform_average" or custom weights for multioutput #6413

[BUG] Performance Metric: multilevel="raw_values" with either multioutput="uniform_average" or custom weights for multioutput #6413

Comments

kdekker-private commented May 13, 2024

fkiraly commented May 13, 2024

fkiraly commented May 13, 2024