
Loading…
| Commit has since been removed from the repository and is no longer available. |
|
|
yanirs |
Addressing issue #1802 (Gradient Boosting Out-of-bag Estimates): chan…
…
…ged oob_score_ to be calculated based only on trees where the OOB instances weren't used for training |
c28aed7
|
This issue was brought up on the ML by Yanir .
currently
y_predis updated for each sample (in and out bag) at each iteration. This way the OOB estimates are overly optimistic. When using OOB, we might use anothery_pred(sayy_oob_pred) that only contains the cumulative scores of those trees where the i-th sample was out-of-bag.I've check gbm, it seems that they use the same implementation that we use ATM - I'll check the difference between the OOB scores on some toy examples...