add pos_weight hyperparameter for dealing with imbalanced dataset #2703 #8557

hulkds · 2024-02-29T14:43:42Z

Related issue: Is it possible to change class weight? #2703
Description:
This PR adds a new feature to Ultralytics YOLOv8 Detection: a pos_weight argument that is passed to BCEWithLogitsLoss to help with imbalanced datasets.
Example:

model = YOLO("yolov8s.pt")
model.train(data="coco.yaml", pos_weight=[0.5, 2]) # pos_weight must be the same length as class dimension.
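For readers unfamiliar with the argument, here is a minimal pure-Python sketch of the per-element loss that PyTorch documents for BCEWithLogitsLoss with pos_weight: the weight scales only the positive-target term. The helper names `softplus` and `weighted_bce_with_logits` are made up for illustration; they are not part of this PR or of Ultralytics.

```python
import math


def softplus(x: float) -> float:
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def weighted_bce_with_logits(logit: float, target: float, pos_weight: float = 1.0) -> float:
    """Per-element loss: -[pos_weight * y * log(sigmoid(x)) + (1 - y) * log(1 - sigmoid(x))]."""
    log_sig = -softplus(-logit)           # log(sigmoid(x))
    log_one_minus_sig = -softplus(logit)  # log(1 - sigmoid(x))
    return -(pos_weight * target * log_sig + (1.0 - target) * log_one_minus_sig)


# Same logit, positive target: doubling pos_weight doubles the loss.
plain_pos = weighted_bce_with_logits(0.3, 1.0)
heavy_pos = weighted_bce_with_logits(0.3, 1.0, pos_weight=2.0)

# Same logit, negative target: pos_weight has no effect.
neg_default = weighted_bce_with_logits(0.3, 0.0)
neg_weighted = weighted_bce_with_logits(0.3, 0.0, pos_weight=2.0)
```

In a multi-class setting each class gets its own weight, which is why the list passed to pos_weight needs one entry per class.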

I have read the CLA Document and I hereby sign the CLA

🛠️ PR Summary

Made with ❤️ by Ultralytics Actions

🌟 Summary

Addition of class imbalance handling with pos_weight in loss calculation.

📊 Key Changes

Introduced a new configuration option pos_weight in the default settings (default.yaml).
Updated the loss calculation logic in loss.py to consider the pos_weight for the binary cross-entropy calculation.

🎯 Purpose & Impact

🎯 Purpose: To improve model performance on imbalanced datasets by adjusting the weight of positive samples in loss computation.
👥 Impact: Users working with datasets that have class imbalances may notice improved model training results as the model now accounts for this imbalance, potentially leading to better generalization and performance.


github-actions · 2024-02-29T14:44:00Z

CLA Assistant Lite bot: All Contributors have signed the CLA. ✅

hulkds · 2024-02-29T14:45:46Z

I have read the CLA Document and I sign the CLA

codecov · 2024-02-29T14:52:01Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 75.75%. Comparing base (36408c9) to head (605e503).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #8557      +/-   ##
==========================================
+ Coverage   75.74%   75.75%   +0.01%     
==========================================
  Files         117      117              
  Lines       14693    14695       +2     
==========================================
+ Hits        11129    11132       +3     
+ Misses       3564     3563       -1

| Flag | Coverage Δ |
|---|---|
| Benchmarks | `36.29% <0.00%> (-0.01%)` ⬇️ |
| GPU | `39.02% <100.00%> (+0.02%)` ⬆️ |
| Tests | `70.86% <100.00%> (+0.01%)` ⬆️ |


Burhan-Q · 2024-03-01T16:42:35Z

@hulkds thank you for your work!

I don't have final say on if your changes will get integrated or not, but I'd like to provide some feedback for you to help improve the chances that it could get accepted. I have not tested the new code, these are just pointers from my review of the changes made.

1. You have included a new initialization for self.bce on L171, but the original on L156 remains, which means it would be initialized twice. It's probably better to move your new line up to replace the old one and insert the line for self.pos_weight above it.
2. I see you added this new argument to the configuration YAML, but have you verified that it works? When you add the argument for training like you show in your example
```
model = YOLO("yolov8s.pt")
model.train(
    data="coco.yaml",
    pos_weight=[0.5, 2],  # pos_weight must be the same length as class dimension.
)
```
are you certain that this change is applied? If so, can you share results that demonstrate the applied change?
3. Have you tested what implications there are when changing the pos_weight argument for a balanced dataset?
4. Changes that update arguments should also be incorporated into the documentation. Please be sure to add your changes to the docs as well.

satyrmipt · 2024-03-03T14:18:11Z

@hulkds, please answer Burhan-Q's questions. We are all waiting for your improvement to be implemented. In particular, those of us who use Google Colab currently have no workaround other than over/undersampling. As for his 3rd question, I think it's the user's responsibility to use correct weights. The default weights must be ones anyway.
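A rough sketch of what "default weights must be ones" could look like, together with the length check implied by the PR example. The helper `resolve_pos_weight` and its behaviour for a single-element list are assumptions for illustration, not code from this PR.

```python
from typing import List, Optional, Sequence


def resolve_pos_weight(pos_weight: Optional[Sequence[float]], nc: int) -> List[float]:
    """Return one weight per class (hypothetical helper, not code from this PR)."""
    if pos_weight is None:
        return [1.0] * nc  # neutral default: same result as unweighted BCE
    weights = [float(w) for w in pos_weight]
    if len(weights) == 1:
        return weights * nc  # assumption: a single value is shared by every class
    if len(weights) != nc:
        raise ValueError(
            f"pos_weight has {len(weights)} entries but the dataset has {nc} classes"
        )
    return weights


print(resolve_pos_weight(None, 2))      # [1.0, 1.0]
print(resolve_pos_weight([0.5, 2], 2))  # [0.5, 2.0]
```

With a check like this, a two-element list passed alongside a dataset with a different class count would fail early with a readable message instead of a shape error inside the loss.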

hulkds · 2024-03-03T15:33:13Z

@Burhan-Q thanks for your feedback!

1. This is my bad, I'll fix it :)
2. I confirmed that the pos_weight argument works as expected. After printing self.hyp, I can see that pos_weight is included, and I can access self.hyp.pos_weight without any issues, so adding pos_weight to the configuration YAML file does take effect.
3. I successfully trained my model with this modification and observed a significant improvement in detecting vehicles and license plates (the car to license plate ratio in my dataset is nearly 6:1), thanks to the pos_weight parameter.
4. I plan to submit another pull request soon. That PR will address your first suggestion and include documentation updates that reflect these changes.
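The comment above does not say which weights were used for the roughly 6:1 dataset. One common heuristic, shown here only as a sketch and not as the author's method, is to weight each class by the largest class count divided by its own count. The function name and the absolute counts below are made up; only the 6:1 ratio comes from the thread.

```python
from typing import Dict


def inverse_frequency_weights(instance_counts: Dict[str, int]) -> Dict[str, float]:
    """Weight each class by (largest class count / its own count)."""
    if not instance_counts or min(instance_counts.values()) <= 0:
        raise ValueError("every class needs a positive instance count")
    largest = max(instance_counts.values())
    return {name: largest / count for name, count in instance_counts.items()}


# Made-up counts that only reproduce the roughly 6:1 ratio mentioned above.
weights = inverse_frequency_weights({"car": 6000, "license_plate": 1000})
print(weights)  # {'car': 1.0, 'license_plate': 6.0}
```

Weights this large can over-correct, so treating them as a starting point and validating on held-out data is a reasonable precaution.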

Burhan-Q · 2024-03-03T18:22:53Z

@hulkds thanks for the follow up! I checked out this PR and was able to verify that it works, both when adding weights and when using the default value. I tested with the coco128 dataset and did not encounter any issues. I did not test any other task (segment, classify, obb, or pose), but since the CI tests are passing, it seems unlikely to be an issue for those tasks. Obviously, with a limited test like mine, it's difficult to observe the differences. Later I will compare training with and without your changes to ensure that the default produces no change in outcomes.

It would be preferable if you could check out this PR or push commits to your fork for the documentation updates, so that everything is included in a single PR. This will help give your PR the best chance of being accepted (I can't say for certain it will, though).

hulkds · 2024-03-03T19:41:02Z

@Burhan-Q
The change was made specifically to the v8DetectionLoss class, so I don't expect it to affect other tasks, but it's still good to double-check. I've created a new branch and pushed a commit there. If everything looks fine, I will then create a new PR so that everything is included in a single PR.

Burhan-Q · 2024-03-03T20:18:41Z

@hulkds yes, your updates look like they should be okay. The reason I mentioned the other tasks is that they inherit from v8DetectionLoss. For example, the Segmentation Loss:

ultralytics/ultralytics/utils/loss.py

Line 250 in 906b8d3

class v8SegmentationLoss(v8DetectionLoss):

I checked the results from the existing training code (without pos_weight) against the training code with pos_weight=[1], and the results from my limited test seem to match. Seems good to me overall. How you commit your changes is up to you, but if you open a new PR, it would be good to close this one and mention it in the new one. 🚀
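To illustrate the inheritance point, here is a small stand-in sketch. The class names are hypothetical and the bodies are not the real Ultralytics code; the sketch assumes the subclass calls the parent constructor, so anything the base class sets up from pos_weight is also present on the subclass.

```python
class DetectionLossSketch:
    """Stand-in for a base detection loss; not the real v8DetectionLoss."""

    def __init__(self, nc, pos_weight=None):
        self.nc = nc
        # Resolve the weights once; a real loss would build its criterion from them here.
        self.pos_weight = [1.0] * nc if pos_weight is None else list(pos_weight)


class SegmentationLossSketch(DetectionLossSketch):
    """Stand-in for a task-specific loss that reuses the parent constructor."""

    def __init__(self, nc, pos_weight=None, overlap=True):
        super().__init__(nc, pos_weight)  # inherits the pos_weight handling
        self.overlap = overlap


seg = SegmentationLossSketch(nc=2, pos_weight=[0.5, 2])
print(seg.pos_weight)  # [0.5, 2]
```

This is why a change in the base constructor is worth checking against the other tasks even when only detection was the target.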

hulkds · 2024-03-03T21:35:36Z

@Burhan-Q
Here is the PR: #8620

feat(): add pos_weight hyperparameter for dealing with imbalanced dat…

237b054

…aset.

Auto-format by https://ultralytics.com/actions

67ae6b2

hulkds mentioned this pull request Feb 29, 2024

Is it possible to change class weight? #2703

Closed

1 task

Merge branch 'main' into fix#2703

9277f30

hulkds mentioned this pull request Mar 1, 2024

pos_weight for dealing with imbalanced dataset #8578

Closed

2 tasks

glenn-jocher added 2 commits March 2, 2024 16:18

Merge branch 'main' into fix#2703

d8d645d

Merge branch 'main' into fix#2703

605e503

satyrmipt mentioned this pull request Mar 3, 2024

Yolov8 training custom dataset utility or Error using this mode? #2358

Closed

1 task

hulkds mentioned this pull request Mar 3, 2024

feat(Detection): add pos_weight parameter in BCEWithLogitsLoss #8620

Open

hulkds closed this Mar 3, 2024
