Prevent zero-variance instability in BaseProbaRegressor.predict_proba #956
kindler-king wants to merge 1 commit into sktime:main
Conversation
I would say this is a hack. Instead of clipping it, I would instead return a Delta distribution if the variance is below machine epsilon (possibly times a factor).
Also, the code formatting tests are failing. Please look at the dev guide and at pre-commit.
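A minimal sketch of the suggested check, under assumptions: the `degenerate_rows` helper and its `factor` parameter are illustrative, not skpro's API, and the actual Delta/Normal construction is left to skpro:

```python
import numpy as np

def degenerate_rows(pred_var, factor=1.0):
    """Boolean mask of rows whose predicted variance is numerically zero.

    A row counts as degenerate when its variance falls below machine
    epsilon times a tolerance factor; such rows would receive a Delta
    distribution instead of a Normal with sigma close to 0.
    """
    pred_var = np.asarray(pred_var, dtype=float)
    return pred_var < np.finfo(float).eps * factor
```

The mask could then be used to dispatch rows to Delta versus Normal distributions, which is where the mixed-row case discussed below arises.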
Force-pushed from 8fc8178 to 8c79ee4
Hello @fkiraly, thanks for the suggestion, I've updated the implementation.

For the mixed case, I explored returning per-row heterogeneous distributions, but to my understanding skpro doesn't support that (no concat in …). So for now, I fall back to …. Also fixed the formatting issues using pre-commit. Would love your feedback on this approach.
Reference Issues/PRs
Fixes #955
What does this implement/fix?
This PR fixes a numerical instability in `BaseProbaRegressor.predict_proba`.

When `predict_var` returns 0, the fallback Normal distribution is constructed with `sigma=0`, which leads to divide-by-zero warnings and NaN values when evaluating `pdf` or `log_pdf`. To prevent this, the predicted variance is clipped to machine epsilon before computing the standard deviation:

```python
pred_var = np.clip(pred_var, np.finfo(float).eps, None)
```

This ensures the resulting Normal distribution always has a strictly positive scale while leaving normal model outputs effectively unchanged.
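To illustrate the failure mode outside skpro, here is a numpy-only sketch; the `normal_logpdf` helper is illustrative, not skpro code:

```python
import numpy as np

def normal_logpdf(x, mu, sigma):
    # textbook Normal log-density; sigma == 0 divides by zero
    return -0.5 * np.log(2 * np.pi * sigma**2) - (x - mu) ** 2 / (2 * sigma**2)

pred_var = np.array([0.0, 0.25])

with np.errstate(divide="ignore", invalid="ignore"):
    bad = normal_logpdf(0.0, 0.0, np.sqrt(pred_var))  # first entry is nan

# the PR's fix: clip variance to machine epsilon before taking sqrt
safe_sigma = np.sqrt(np.clip(pred_var, np.finfo(float).eps, None))
good = normal_logpdf(0.0, 0.0, safe_sigma)            # all entries finite
```

The clipped sigma is on the order of 1e-8, so the resulting density is extremely peaked but every evaluation stays finite.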
Does your contribution introduce a new dependency?
No.
What should a reviewer concentrate their feedback on?
Did you add any tests for the change?
Yes.
A regression test was added that uses a mock regressor returning zero variance and verifies that:

- `predict_proba().pdf()` and `log_pdf()` remain finite
- no numerical warnings are raised
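The test described above could look roughly like this; a sketch only, where `predict_proba_sigma` stands in for the mock-regressor path and is not the actual skpro test:

```python
import warnings
import numpy as np

def predict_proba_sigma(pred_var):
    # stand-in for the fixed fallback: clip variance, then take sqrt
    pred_var = np.clip(pred_var, np.finfo(float).eps, None)
    return np.sqrt(pred_var)

def test_zero_variance_stays_finite_and_warning_free():
    with warnings.catch_warnings():
        warnings.simplefilter("error")            # promote numerical warnings
        sigma = predict_proba_sigma(np.zeros(5))  # mock regressor: var == 0
        log_pdf = -0.5 * np.log(2 * np.pi * sigma**2)
        assert np.isfinite(log_pdf).all()
```

Promoting warnings to errors inside the test is what catches the original divide-by-zero symptom, not just the NaN output.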