Model evaluation metric #79

peiyaoli · 2019-08-22T01:59:26Z

I have several questions regard evaluation of survival prediction model, which are extensions of my former question #75
In my project, I would like to build a survival model using GradientBoosting. Since the gradientboosting in scikit-survival is slow, I choose XGBoost to implement the model. Two metrics are used to evaluate and compare model： C-index and time ROC, as you suggested in tutorial.

For the XGBoost, one answer from stackoverflow suggested I could use

xgb_model.predict(x_test, margin=True)

to get comparable result with scikit-survival prediction result. Then I could use your implementation of c-index to compare two models. However, I am not sure if this work.

In Shap's official tutorial notebook, they implemented C-index as bellow:

def c_statistic_harrell(pred, labels):
    total = 0
    matches = 0
    for i in range(len(labels)):
        for j in range(len(labels)):
            if labels[j] > 0 and abs(labels[i]) > labels[j]:
                total += 1
                if pred[j] > pred[i]:
                    matches += 1
    return matches/total

So what the difference between this and yours? I tried both, there are some difference.

Thanks for your answer

Best
Peiyao

sebp · 2019-08-22T05:57:59Z

AFAICT, the code you posted differs in 2 aspects from concordance_index_censored.

It does not consider tied risk scores.
Assuming labels is the time of the event, then two scores pred[i] and pred[j] are concordant if the i-th patient survived longer and has a lower predicted score, whereas for concordance_index_censored:

If the estimated risk is larger for the sample with a higher time of event/censoring, the predictions of that pair are said to be concordant.

If you get a c-index smaller 0.5, you might need to flip the sign of the predictions to obtain the correct order.

sebp closed this as completed Sep 1, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model evaluation metric #79

Model evaluation metric #79

peiyaoli commented Aug 22, 2019

sebp commented Aug 22, 2019

Model evaluation metric #79

Model evaluation metric #79

Comments

peiyaoli commented Aug 22, 2019

sebp commented Aug 22, 2019