[ENH] Sync skpro and sktime probabilistic metrics modules: sample_weight and output consistency #674

Omswastik-11 · 2025-12-06T07:41:32Z

Summary

This PR brings the skpro probabilistic metrics module into alignment with sktime, as discussed in issue #367.
It adds sample_weight support across metrics, ensures consistent output types, and fixes weighted multioutput behavior.

Changes

1. Sample Weight Support

Added a sample_weight parameter to:
- BaseProbaMetric.evaluate
- BaseDistrMetric.evaluate
Updated the input validation methods to propagate sample_weight correctly:
- _check_consistent_input
- _check_ys
Implemented proper weighted averaging using np.average in:
- _evaluate (Proba metrics)
- evaluate (Distribution metrics)
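
To illustrate the weighted averaging described above, here is a minimal sketch using a hand-written pinball loss rather than the skpro metric classes. The function name `pinball_loss_per_sample` and the toy data are assumptions made for illustration; they are not code from this PR.

```python
import numpy as np


def pinball_loss_per_sample(y_true, y_pred, alpha):
    """Pinball (quantile) loss for each sample at quantile level alpha."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    # under-prediction is penalised by alpha, over-prediction by (1 - alpha)
    return np.where(diff >= 0, alpha * diff, (alpha - 1) * diff)


y_true = np.array([1.0, 2.0, 3.0, 4.0])
y_pred = np.array([1.5, 2.0, 2.0, 5.0])
# hypothetical weights: the last sample is excluded by a zero weight
sample_weight = np.array([1.0, 1.0, 1.0, 0.0])

losses = pinball_loss_per_sample(y_true, y_pred, alpha=0.5)

# unweighted mean: every sample counts equally
unweighted = np.average(losses)
# weighted mean: np.average normalises by the sum of the weights
weighted = np.average(losses, weights=sample_weight)
```

Because `np.average` divides by the sum of the weights, a zero weight is equivalent to dropping the sample, and uniform weights reproduce the plain mean.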

2. Output Consistency Improvements

BaseDistrMetric.evaluate now returns a pd.Series when:
- multioutput="raw_values"
- the metric is univariate (non-multivariate)
This matches:
- BaseProbaMetric behavior
- sktime conventions
Fixes the previous inconsistency where a 1-row DataFrame was returned.
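
The difference between the two return shapes can be seen in a small pandas-only sketch. The loss table and column names below are made up for illustration and do not come from skpro; the sketch only contrasts a 1-row DataFrame with a pd.Series indexed by variable name.

```python
import pandas as pd

# toy per-sample losses for two target variables (hypothetical values)
losses = pd.DataFrame({"y1": [0.2, 0.4], "y2": [1.0, 3.0]})

# a 1-row DataFrame: callers must index by row and column to get a value
as_frame = losses.mean(axis=0).to_frame().T

# a pd.Series indexed by variable name: one label per raw value
as_series = losses.mean(axis=0)
```

With the Series form, `as_series["y1"]` retrieves a per-variable score directly, whereas the DataFrame form needs an extra row lookup such as `as_frame.loc[0, "y1"]`.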

3. Weighted Multioutput Fixes

Improved handling of array-like multioutput weights in BaseDistrMetric.evaluate.
Ensures correct weighted aggregation across variables.
Fixes incorrect averaging when users pass custom weights.
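
As a rough sketch of what weighted aggregation across variables means, the helper below follows the common scikit-learn-style `multioutput` convention ("raw_values", "uniform_average", or an array of weights). The function name `aggregate_multioutput` is hypothetical and the logic is an assumption about the intended behavior, not the implementation in BaseDistrMetric.evaluate.

```python
import numpy as np
import pandas as pd


def aggregate_multioutput(per_variable, multioutput="uniform_average"):
    """Aggregate per-variable scores (a pd.Series) according to multioutput."""
    if isinstance(multioutput, str):
        if multioutput == "raw_values":
            return per_variable
        if multioutput == "uniform_average":
            return float(per_variable.mean())
        raise ValueError(f"unknown multioutput option: {multioutput!r}")
    # array-like: interpret entries as one weight per variable
    weights = np.asarray(multioutput, dtype=float)
    if weights.shape != (len(per_variable),):
        raise ValueError("multioutput weights must have one entry per variable")
    return float(np.average(per_variable.to_numpy(), weights=weights))


# toy per-variable scores (hypothetical values)
scores = pd.Series({"y1": 0.3, "y2": 2.0})

uniform = aggregate_multioutput(scores)
custom = aggregate_multioutput(scores, multioutput=[3, 1])
raw = aggregate_multioutput(scores, multioutput="raw_values")
```

The shape check guards against the failure mode the PR mentions, where custom weights are silently averaged incorrectly because they do not line up with the variables.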

Tests Added / Updated

- test_sample_weight_pinball (in test_probabilistic_metrics.py): verifies sample_weight support for quantile and interval metrics.
- test_sample_weight_logloss (in test_distr_metrics.py): verifies sample_weight support for distribution-based metrics.
- test_multioutput_weights_logloss (in test_distr_metrics.py): ensures weighted multioutput aggregation behaves correctly.
- Updated test_distr_evaluate to assert that raw-value outputs are returned as pd.Series.

Fixes #367

fkiraly

Sorry if the description of the task was not precise enough. I think you did not quite catch what should happen in the TestAll classes.

The TestAll classes run their tests on all the metrics, and should check API elements that are common to all. So, if you add sample_weight as an argument, the TestAll classes should pass examples where sample_weight is being passed.
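
A possible shape for such a common check is sketched below. It does not use skpro's actual TestAll fixtures; the two toy metric functions, the `ALL_METRICS` list, and `check_sample_weight_contract` are all hypothetical stand-ins meant only to show one contract being run against every metric with sample_weight passed.

```python
import numpy as np


def mean_abs_error(y_true, y_pred, sample_weight=None):
    """Toy metric: (weighted) mean absolute error."""
    return float(np.average(np.abs(y_true - y_pred), weights=sample_weight))


def mean_sq_error(y_true, y_pred, sample_weight=None):
    """Toy metric: (weighted) mean squared error."""
    return float(np.average((y_true - y_pred) ** 2, weights=sample_weight))


# stand-in for "all metrics" collected by a TestAll-style class
ALL_METRICS = [mean_abs_error, mean_sq_error]


def check_sample_weight_contract(metric):
    """Checks that should hold for any metric accepting sample_weight."""
    y_true = np.array([0.0, 1.0, 2.0])
    y_pred = np.array([0.5, 1.0, 4.0])

    # uniform weights must reproduce the unweighted result
    unweighted = metric(y_true, y_pred)
    uniform = metric(y_true, y_pred, sample_weight=np.ones(3))
    assert np.isclose(unweighted, uniform)

    # a zero weight must be equivalent to dropping that sample
    masked = metric(y_true, y_pred, sample_weight=np.array([1.0, 1.0, 0.0]))
    dropped = metric(y_true[:2], y_pred[:2])
    assert np.isclose(masked, dropped)
    return True


results = {m.__name__: check_sample_weight_contract(m) for m in ALL_METRICS}
```

The point of the design is that the contract is written once and applied to every metric, so a newly added metric is covered without a dedicated test.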

sync skpro probabilistic modules with sktime

c682165

Omswastik-11 changed the title ~~[ENH] sync skpro probabilistic modules with sktime~~ [ENH] sync skpro probabilistic metric modules with sktime Dec 6, 2025

ensure consistent output handling

e9a166a

Omswastik-11 changed the title ~~[ENH] sync skpro probabilistic metric modules with sktime~~ [ENH] Sync skpro and sktime probabilistic metrics modules: sample_weight and output consistency* Dec 6, 2025

Omswastik-11 changed the title [ENH] Sync skpro and sktime probabilistic metrics modules: sample_weight and output consistency* [ENH] Sync skpro and sktime probabilistic metrics modules: sample_weight and output consistency Dec 6, 2025

Omswastik-11 marked this pull request as ready for review December 6, 2025 18:30

Omswastik-11 requested review from SaiRevanth25, felipeangelimvieira and fkiraly as code owners December 6, 2025 18:30

Omswastik-11 mentioned this pull request Dec 7, 2025

[ENH] merge test_probabilistic_metrics into TestAllDistrMetrics #675

Draft

fkiraly requested changes Dec 22, 2025
