More general predict proba. #6817

trivialfis · 2021-03-31T12:24:48Z

Use output_margin for softmax.
Add test for dask binary cls.
Remove unused output margin.

trivialfis · 2021-03-31T14:44:41Z

Marked as blocking for 2 reasons.

I found a weird case that the output shape is incorrectly inferred by dask. I'm not sure what's the cause yet. This happens when output prediction is 2-dim (multi-class), which has the same shape as input data and map_blocks seems to be ignoring the change of output shape (from n_features to n_classes). I don't know how exactly is it triggered since before this PR it ~~works fine~~ workes fine on direct load of data where shape is known. ~~Still investigating.~~
multi:softmax can be used safely with classifier, which seems to be a nice thing to have.

hcho3 · 2021-03-31T23:13:28Z

python-package/xgboost/sklearn.py

+        # softprob:        Do nothing, output is proba.
+        # softmax:         Use output margin to remove the argmax in PredTransform.
+        # binary:logistic: Expand the prob vector into 2-class matrix after predict.
+        # binary:logitraw: Unsupported, let's deprecate this objective when possible.


I've seen at least a few users using this objective. Should we actually deprecate it?

It's not clear to me in what scenario is it useful.

If you want to have an untransformed prediction, use output_margin instead.

So far we've avoided breaking existing model files, i.e. you could load a very old model file and run predictions. Removing support for binary:logitraw will make old models files inaccessible. We don't yet have a formal policy for changes affecting old model files. (e.g. Wait 3 major versions?)

The objective binary:logitraw is clearly used in the wild already. For example: #6509

I removed the note.

* Use `output_margin` for `softmax`. * Add test for dask binary cls.

python-package/xgboost/sklearn.py

Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>

* Use `output_margin` for `softmax`. * Add test for dask binary cls. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>

trivialfis added the Blocking label Mar 31, 2021

hcho3 reviewed Mar 31, 2021

View reviewed changes

hcho3 approved these changes Mar 31, 2021

View reviewed changes

RAMitchell approved these changes Apr 1, 2021

View reviewed changes

trivialfis added 6 commits April 1, 2021 16:23

More general predict proba.

557a17e

* Use `output_margin` for `softmax`. * Add test for dask binary cls.

Fix special case.

6190ba4

lint.

efd5e02

Fix loading from a booster model.

287b3cd

Fix.

1b36419

Remove the deprecation note.

8d4f14f

trivialfis force-pushed the predict-proba branch from 82e3f58 to 8d4f14f Compare April 1, 2021 08:25

hcho3 reviewed Apr 1, 2021

View reviewed changes

python-package/xgboost/sklearn.py Outdated Show resolved Hide resolved

Update python-package/xgboost/sklearn.py

41abc17

Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>

trivialfis merged commit 47b6248 into dmlc:master Apr 1, 2021

trivialfis deleted the predict-proba branch April 1, 2021 11:52

trivialfis mentioned this pull request Apr 2, 2021

1.4.0 Release Candidate #6793

Closed

8 tasks

trivialfis added a commit to trivialfis/xgboost that referenced this pull request Apr 6, 2021

More general predict proba. (dmlc#6817)

73f3111

* Use `output_margin` for `softmax`. * Add test for dask binary cls. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>

trivialfis added a commit that referenced this pull request Apr 6, 2021

[back port] More general predict proba. (#6817) (#6831)

c6a0bdb

* Use `output_margin` for `softmax`. * Add test for dask binary cls. Co-authored-by: Philip Hyunsu Cho <chohyu01@cs.washington.edu>

trivialfis mentioned this pull request Apr 7, 2021

Throw error when using unsupported objective with predict_proba. #6835

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More general predict proba. #6817

More general predict proba. #6817

trivialfis commented Mar 31, 2021

trivialfis commented Mar 31, 2021 •

edited

hcho3 Mar 31, 2021

trivialfis Apr 1, 2021

trivialfis Apr 1, 2021

hcho3 Apr 1, 2021 •

edited

trivialfis Apr 1, 2021

More general predict proba. #6817

More general predict proba. #6817

Conversation

trivialfis commented Mar 31, 2021

trivialfis commented Mar 31, 2021 • edited

hcho3 Mar 31, 2021

Choose a reason for hiding this comment

trivialfis Apr 1, 2021

Choose a reason for hiding this comment

trivialfis Apr 1, 2021

Choose a reason for hiding this comment

hcho3 Apr 1, 2021 • edited

Choose a reason for hiding this comment

trivialfis Apr 1, 2021

Choose a reason for hiding this comment

trivialfis commented Mar 31, 2021 •

edited

hcho3 Apr 1, 2021 •

edited