
scikit-learn api section documentation correction #3967

Merged 3 commits on Dec 14, 2018

Conversation

@mbouznif (Contributor) commented Dec 5, 2018

The documentation is inconsistent in the scikit-learn API section: the fit paragraph says that when early stopping occurs, the last iteration is returned rather than the best one, while the predict paragraph says that when predict is called without ntree_limit specified, ntree_limit defaults to best_ntree_limit.

Thus, reading the fit part, one could think the best iteration has to be specified explicitly when calling predict; reading the predict part, the best iteration is used by default, and it is the last iteration that has to be specified if needed.
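
For context, here is a minimal sketch of the behaviour described above, assuming the XGBoost scikit-learn wrapper API from around this release; the toy data and parameter values are made up:

```python
# Sketch only: assumes the scikit-learn wrapper API of this era
# (early_stopping_rounds/eval_metric passed to fit, ntree_limit on predict).
import numpy as np
import xgboost as xgb
from sklearn.model_selection import train_test_split

# Made-up toy data just to exercise the API.
X = np.random.rand(1000, 10)
y = (X[:, 0] + X[:, 1] > 1.0).astype(int)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

clf = xgb.XGBClassifier(n_estimators=500)
clf.fit(
    X_train, y_train,
    eval_set=[(X_val, y_val)],
    eval_metric="auc",
    early_stopping_rounds=10,  # fit() keeps every tree up to the last iteration
)

# Without ntree_limit, predict() uses best_ntree_limit by default,
# i.e. the best iteration rather than the last one ...
preds_best = clf.predict(X_val)

# ... whereas ntree_limit=0 means "use all trees", i.e. the last iteration.
preds_last = clf.predict(X_val, ntree_limit=0)
```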

@hcho3 (Collaborator) commented Dec 14, 2018

Thanks!

@Edvard88

What about this documentation:
https://xgboost.readthedocs.io/en/latest/python/python_intro.html?highlight=early%20stopping
Will it be fixed too?

@lyxthe, I've written in https://stackoverflow.com/questions/53483648/is-the-xgboost-documentation-wrong-early-stopping-rounds-and-best-and-last-it:
If you fit with the "best iteration" from the early-stopping summary, for example:

Stopping. Best iteration:
[109] validation_0-auc:0.996667

then fitting with 109 will not give you the best score. You should fit with "plus one" iteration, i.e. 110, because iterations start from 0. Then you will get the best score, and it will indeed be the best iteration. Is this an issue?
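
A minimal sketch of the off-by-one described here, using the native Booster API from the linked docs page; the data, parameters, and the example log line above are illustrative only:

```python
# Sketch only: illustrates that best_iteration is a 0-based round index
# while ntree_limit / best_ntree_limit count trees.
import numpy as np
import xgboost as xgb

# Made-up toy data just to exercise the API.
X = np.random.rand(1000, 10)
y = (X[:, 0] > 0.5).astype(int)
dtrain = xgb.DMatrix(X[:800], label=y[:800])
dval = xgb.DMatrix(X[800:], label=y[800:])

bst = xgb.train(
    {"objective": "binary:logistic", "eval_metric": "auc"},
    dtrain,
    num_boost_round=500,
    evals=[(dval, "validation_0")],
    early_stopping_rounds=10,
)

# If the log reports "Best iteration: [109]", then bst.best_iteration == 109
# (0-based index of the round) while bst.best_ntree_limit == 110 (a tree
# count), so reproducing the best score requires best_iteration + 1 trees.
preds_best = bst.predict(dval, ntree_limit=bst.best_iteration + 1)
# equivalently:
preds_best2 = bst.predict(dval, ntree_limit=bst.best_ntree_limit)
```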

lock bot locked as resolved and limited conversation to collaborators Mar 27, 2019