How does xgboost handle instance weights #144

Closed
BlindApe opened this issue Jan 18, 2015 · 10 comments

@BlindApe

Hi,

I think the case weights should enter the calcGain and calcWeight functions in some way (multiplying the grad and hess).
Where are the case weights used?

@tqchen
Member

tqchen commented Jan 18, 2015

This is a good question :) The instance weights are blended into the first- and second-order gradients. Think about how you would calculate g and h when you have a weighted loss function.
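
As a concrete sketch (my own illustration, using squared error as one example of a weighted loss): for l_i = w_i * (pred_i - label_i)^2 / 2, differentiating twice with respect to pred_i gives

import numpy as np

# weighted squared-error loss: l = w * (pred - label)**2 / 2
def weighted_squared_error(preds, labels, weights):
    grad = weights * (preds - labels)      # dl/dpred: the weight multiplies g
    hess = weights * np.ones_like(preds)   # d2l/dpred2: and multiplies h as well
    return grad, hess

so the weight simply scales both statistics per instance.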

@BlindApe
Author

Think of case weights as repeated cases, so:

calcGain = Sqr(case_weight * sum_grad) / (case_weight * sum_hess + reg_lambda);
calcWeight = - case_weight * sum_grad / (case_weight * sum_hess + reg_lambda);

See the R gbm implementation (point 4.3, terminal node estimates):

https://r-forge.r-project.org/scm/viewvc.php/*checkout*/pkg/inst/doc/gbm.pdf?revision=18&root=gbm&pathrev=22

@tqchen
Member

tqchen commented Jan 18, 2015

In xgboost, when you feed things into the booster, for each instance i:

grad_xgb[i] = grad[i] * case_weight[i]
hess_xgb[i] = hess[i] * case_weight[i]

grad_xgb and hess_xgb are the statistics you feed into the tree constructor. In other words, case_weight is already "blended" into the gradient statistics.

@tqchen
Member

tqchen commented Jan 18, 2015

In your simplified case, sum_grad * case_weight is the actual sum_grad that the calcGain function will see.
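
A quick numeric check of that equivalence for the uniform-weight case (illustrative numbers of my own):

import numpy as np

# three instances in one leaf, all sharing the same case weight w
g = np.array([0.8, -0.2, 0.5])   # per-instance first-order gradients
h = np.array([0.5, 0.4, 0.6])    # per-instance second-order gradients
w, lam = 3.0, 1.0

# xgboost's view: the weight is blended into each instance's statistics
gain_blended = (w * g).sum() ** 2 / ((w * h).sum() + lam)

# repeated-case view from the formulas above: the weight multiplies the sums
gain_repeated = (w * g.sum()) ** 2 / (w * h.sum() + lam)

assert np.isclose(gain_blended, gain_repeated)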

@tqchen
Member

tqchen commented Jan 18, 2015

Interestingly, this question is also covered in these slides: http://homes.cs.washington.edu/~tqchen/pdf/BoostedTree.pdf

See the last few slides, which address questions about weighted training.

@BlindApe
Author

I've searched the code and didn't find the weights in the grad and hessian. I'll take a second look.

@tqchen
Member

tqchen commented Jan 19, 2015

@tqchen tqchen changed the title from "case weights" to "How xgboost handle instance weights" Jan 19, 2015
@tqchen tqchen changed the title from "How xgboost handle instance weights" to "How does xgboost handle instance weights" Jan 19, 2015
@BlindApe
Author

Thank you. This seems OK, and your terminal node estimates are congruent with those given in the R gbm document I posted yesterday.
The 'j = i % nstep' line is, I suppose, for the multinomial loss, where there are k preds for each label, right?

One last thing:
Is sum_hess computed the same way for weighted instances? How does this affect the min_child_weight restriction? If all instances have big weights, should this restriction be manually increased, or is sum_hess computed without the weights?

@tqchen
Member

tqchen commented Jan 19, 2015

sum_hess is also computed with the weights.
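
So, as you guessed, uniformly scaling all the weights also scales sum_hess, and min_child_weight would have to be scaled along with it to keep the same effective restriction. A tiny illustration with made-up numbers (binary logistic, where h_i = p_i * (1 - p_i)):

import numpy as np

p = np.array([0.3, 0.5, 0.7])   # predicted probabilities in a node
h = p * (1.0 - p)               # logistic hessian per instance

print((1.0 * h).sum())    # sum_hess = 0.67 with unit weights
print((10.0 * h).sum())   # sum_hess = 6.7 when every weight is 10

# scaling all weights by 10 makes the default min_child_weight = 1
# ten times easier to satisfy, so raise it too to keep the same restriction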

@tqchen tqchen closed this as completed Jan 25, 2015
@vatsan
Contributor

vatsan commented Dec 15, 2016

@tqchen
For a user-supplied objective, should we manually blend the weights into the gradients and hessians? Sorry, I am not able to verify this myself, as the link you pointed to doesn't work anymore: (https://github.com/tqchen/xgboost/blob/master/src/learner/objective-inl.hpp#L149).
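
For reference, a minimal sketch of what the manual blending would look like in a Python custom objective (my own illustration: it assumes the (preds, dtrain) callback signature, with weights fetched via DMatrix.get_weight(); whether the library applies weights automatically for user-supplied objectives should be checked against the current source):

import numpy as np
import xgboost as xgb

def weighted_logistic(preds, dtrain):
    # binary logistic objective with the instance weights blended in by hand
    labels = dtrain.get_label()
    weights = dtrain.get_weight()           # empty array if no weights were set
    probs = 1.0 / (1.0 + np.exp(-preds))    # raw margin -> probability
    grad = probs - labels                   # unweighted g
    hess = probs * (1.0 - probs)            # unweighted h
    if weights.size > 0:
        grad = grad * weights               # grad_xgb[i] = grad[i] * case_weight[i]
        hess = hess * weights               # hess_xgb[i] = hess[i] * case_weight[i]
    return grad, hess

# usage: bst = xgb.train(params, dtrain, num_boost_round=10, obj=weighted_logistic)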

@lock lock bot locked as resolved and limited conversation to collaborators Oct 26, 2018