[BUG] Fix probability mass loss and temporal alignment in _SksurvAdapter predict_proba function by MurtuzaShaikh26 · Pull Request #990 · sktime/skpro

MurtuzaShaikh26 · 2026-03-23T23:03:05Z

Reference Issues/PRs

Fixes #958

What does this implement/fix? Explain your changes.

The predict_proba method in _SksurvAdapter (used by estimators like CoxPHSkSurv) calculated distribution weights via raw np.diff ignoring boundaries, resulting in massive probability drops and time regressions:

Initial Mass Loss: The drop from 1.0 to the first timepoint was discarded.
Tail Mass Loss: The remaining survival probability after the last valid timepoint was discarded instead of mapped to the last step, resulting in Empirical distribution weights summing to far less than 1.0.
Temporal Alignment: It incorrectly stripped the last recorded survival times using [:-1], stripping context and effectively dragging all probability masses backwards in time to previous disconnected timesteps.

**Proposed Solution:
This PR replaces np.diff with _clip_surv from _common.py which leverages _surv_diff (applying prepend=1.0, append=0.0) handling both margin boundaries. This efficiently validates that every mass is conserved, monotonically scaled, and sums exactly to 1.0. We also preserve the full unmodified unique_times_ for perfectly matched event distribution timestamps.

Verification Script

When tracking an estimator with survival probabilities [0.8, 0.5, 0.5] descending over timestamps [10.0, 20.0, 30.0]:

import numpy as np
import pandas as pd
from unittest.mock import MagicMock
from skpro.survival.adapters.sksurv import _SksurvAdapter

class MockSksurvAdapter(_SksurvAdapter):
    def _get_sksurv_class(self): return MagicMock()
    def get_params(self, deep=True): return {}

X = pd.DataFrame({"feature1": [1.0]})
adapter = MockSksurvAdapter()
adapter._estimator = MagicMock()
adapter._estimator.predict_survival_function = MagicMock(return_value=np.array([[0.8, 0.5, 0.5]]))
adapter._estimator.unique_times_ = np.array([10.0, 20.0, 30.0])
adapter._y_cols = ["time"]

dist = adapter._predict_proba(X)

print("\n--- RESULTS ---")
print("Times in resulting Empirical distribution:", dist.spl.values.flatten())
print("Weights in resulting Empirical distribution:", dist.weights.values)
print(f"Total mass: {dist.weights.sum()}")

Does your contribution introduce a new dependency? If yes, which one?

No.

What should a reviewer concentrate their feedback on?

Reviewing the np.diff/[:-1] replacement logic using _clip_surv to preserve the probability boundaries.
The new adapter test test_sksurv_adapter.py validating the resulting dist.weights, timeline boundary assignments, and asserting total mass strictly evaluates to 1.0.

Did you add any tests for the change?

Yes, added test_sksurv_adapter_probability_mass_and_alignment in skpro/tests/test_sksurv_adapter.py.

Any other comments?

N/A

PR checklist

For all contributions

I've added myself to the list of contributors with any new badges I've earned :-)
The PR title starts with either [ENH], [MNT], [DOC], or [BUG].

For new estimators

I've added the estimator to the API reference
I've added one or more illustrative usage examples to the docstring
If the estimator relies on a soft dependency, I've set the python_dependencies tag

…ter predict_proba

MurtuzaShaikh26 added 2 commits March 24, 2026 04:06

fix(survival): preserve probability mass and alignment in _SksurvAdap…

5abf514

…ter predict_proba

docs: add MurtuzaShaikh26 as a contributor

56bb21f

MurtuzaShaikh26 requested review from felipeangelimvieira and fkiraly as code owners March 23, 2026 23:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Fix probability mass loss and temporal alignment in _SksurvAdapter predict_proba function#990

[BUG] Fix probability mass loss and temporal alignment in _SksurvAdapter predict_proba function#990
MurtuzaShaikh26 wants to merge 2 commits intosktime:mainfrom
MurtuzaShaikh26:fix-sksurv-mass-loss

MurtuzaShaikh26 commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

MurtuzaShaikh26 commented Mar 23, 2026

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

Any other comments?

PR checklist

For all contributions

For new estimators

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant