
What is the mechanism of using param ‘scale_pos_weight’? #1299

Closed

pr0x2b opened this issue Apr 3, 2018 · 5 comments

pr0x2b commented Apr 3, 2018

It seems that XGBoost and LightGBM both have a scale_pos_weight argument, but the calculation is done completely differently. I couldn't find any authoritative answer on how to calculate it in LightGBM, so I'm asking here.

I'm using the following method to calculate scale_pos_weight. I am not sure if it's correct.

Number of positive: 143540, number of negative: 59856460
Number of data: 60000000, number of used features: 11

scale_pos_weight = 100 - ([number of positive samples / total samples] * 100)
scale_pos_weight = 100 - ([143540 / 60000000] * 100)
scale_pos_weight = 99.76
bbennett36 commented Apr 5, 2018

I thought scale_pos_weight is always just negatives / positives?

https://stats.stackexchange.com/questions/243207/what-is-the-proper-usage-of-scale-pos-weight-in-xgboost-for-imbalanced-datasets

This answer is for XGBoost, but it should be the same for both implementations. If you're still unsure, I think 'is_unbalance' essentially does the same thing but calculates 'scale_pos_weight' by itself.
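For illustration, a minimal sketch of the two alternatives as LightGBM params dicts (how you would wire this up is my assumption, not something stated in this thread; the dicts would be passed to lgb.train or an LGBMClassifier):

```python
# Option A: let LightGBM compute the positive-class weight internally
params_auto = {
    "objective": "binary",
    "is_unbalance": True,
}

# Option B: supply the weight yourself (negatives / positives, per the linked answer)
params_manual = {
    "objective": "binary",
    "scale_pos_weight": 59856460 / 143540,  # counts from this issue, roughly 417
}

# Use one or the other: both reweight the positive class, so they
# should not be combined in the same params dict.
```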

pr0x2b (Author) commented Apr 5, 2018

Thanks @bbennett36
Actually, that formula is also mentioned in the XGBoost documentation, but the LightGBM documentation lacks detail on this parameter. That's why I wanted to confirm whether the formula stays the same for XGBoost and LightGBM or differs.
I took the formula from this post.

samratp-zz commented
@pranavpandya84
negatives / positives looks more accurate to me.

From the documentation we can see:

> scale_pos_weight, default=1.0, type=double
> – weight of positive class in binary classification task

The default value of '1' implies that the positive class has a weight equal to the negative class. So, in your case, since the positive class is smaller than the negative class, the number should be less than '1', not more than '1'.

This is just my understanding and I may not be correct...

Laurae2 (Contributor) commented Apr 6, 2018

For both xgboost and LightGBM, scale_pos_weight starts from the observation that perfectly balanced positive/negative samples mean:

number of positive samples = number of negative samples

which also means the following when reweighting through scale_pos_weight:

number of positive samples * scale_pos_weight = number of negative samples

Therefore, its value, if asking for balance, is the following:

scale_pos_weight = number of negative samples / number of positive samples
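Applied to the counts from the original post, a quick sketch of that arithmetic (the ~417 result follows from the formula above; it is not a value quoted elsewhere in the thread):

```python
n_pos = 143_540
n_neg = 59_856_460

# Balance condition from above: n_pos * scale_pos_weight == n_neg
scale_pos_weight = n_neg / n_pos
print(scale_pos_weight)  # ~417.0, not the 99.76 computed in the question
```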

Simpler explanation: https://sites.google.com/view/lauraepp/parameters (type "scale" in the search box, then click on "Positive Binary Scaling").

[Screenshot: "Positive Binary Scaling" entry from the parameter documentation linked above]

Related C++ code:

  • xgboost proof: `w += y * ((param_.scale_pos_weight * w) - w);`, where `y` is the label (0 for negative, 1 for positive), in src/objective/regression_obj.cc

  • LightGBM proof: `label_weights_[1] *= scale_pos_weight_;`, where the 2nd index (1) is for positive labels, in src/objective/binary_objective.hpp
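To make the effect of those two lines concrete, here is a rough Python re-expression of the xgboost form (a sketch for illustration only; the helper function is hypothetical, not part of either library):

```python
# Mirrors the xgboost line above: w += y * ((scale_pos_weight * w) - w)
def sample_weight(y, scale_pos_weight, w=1.0):
    # Negatives (y = 0) keep their weight; positives (y = 1) are scaled.
    return w + y * ((scale_pos_weight * w) - w)

assert sample_weight(0, 417.0) == 1.0    # negative label: weight unchanged
assert sample_weight(1, 417.0) == 417.0  # positive label: weight multiplied
```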

Laurae2 closed this as completed Apr 6, 2018
pr0x2b (Author) commented Apr 6, 2018

Thanks a lot. Perfect!

lock bot locked as resolved and limited conversation to collaborators Mar 12, 2020