The size of the model input causes prediction results problems #6062

largestcabbage · 2023-11-01T07:31:44Z

Search before asking

I have searched the YOLOv8 issues and discussions and found no similar questions.

Question

When the model is trained with an input size of 1024 or 960, the defect location is predicted correctly. With an input size of 1280, however, the predicted bounding box is smaller than the actual defect.

Please take a look at the predicted pictures.

Additional

No response

largestcabbage · 2023-11-07T07:27:24Z

Hello, can this problem be solved? What is the specific reason?

glenn-jocher · 2023-11-07T09:11:14Z

@largestcabbage hello,

Thank you for bringing this issue to our attention. The difference in bounding box sizes at different input resolutions could have several causes. The most likely one is that the network interprets features differently at different scales. This is not uncommon, especially when a model is trained at one input size and then run for inference at another.
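To make the scale effect concrete, here is a minimal sketch of how the same object occupies a different number of network-input pixels at each input size. It assumes an aspect-preserving resize in which the long side of the image is mapped to the input size; the function name and the example dimensions (a 1500 px wide defect in a 2000 px wide image) are hypothetical and not taken from this issue.

```python
def scaled_extent(obj_px, image_long_side, imgsz):
    """Object extent in network-input pixels after an aspect-preserving resize.

    Assumes the long side of the original image is resized to `imgsz`.
    """
    return obj_px * imgsz / image_long_side


# Hypothetical example: a defect 1500 px wide in an image whose long side is 2000 px.
for imgsz in (960, 1024, 1280):
    print(imgsz, scaled_extent(1500, 2000, imgsz))
# 960 720.0
# 1024 768.0
# 1280 960.0
```

The object itself does not change, but the extent the detection head has to regress grows in proportion to the input size.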

Here are some steps you could take to mitigate the issue:

1. Ensure Consistent Training and Inference Resolutions: If you trained your model at a lower resolution, keep the inference resolution the same as, or close to, the training resolution at which the model performs best.
2. Revisit Data Augmentation: Consider data augmentations that expose the model to a variety of scales during training. This helps the model become more robust to object size and generalize better across input sizes.
3. Fine-tuning: If higher resolution is your intended use case, fine-tune the model at that resolution. Starting from the weights learned at the lower resolution and continuing training at the higher resolution can help the model adapt.
4. Check Box Regression Limits: YOLOv8 detection heads are anchor-free, so there are no anchor box sizes to tune. Instead, check whether your objects at the higher resolution exceed the range the regression head can express, which depends on reg_max and on the stride of the coarsest detection layer.
5. Post-processing Adjustments: Adjust the non-maximum suppression (NMS) IoU threshold and the confidence threshold during inference to see whether a better-fitting box is being suppressed or filtered out. These settings select which boxes are kept; they do not change box coordinates.
6. Inspect Training Data: Ensure your training data is representative of the resolution at which you wish to perform inference. The model's performance can degrade if there is a large discrepancy.
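One possible explanation, related to the reg_max issue linked from this thread (#11634), is that the box regression range is bounded. The sketch below assumes the default YOLOv8 detection head, where each of the four distances from a grid point to the box edges is decoded as an expectation over bins 0 to reg_max - 1 (reg_max = 16 by default) and multiplied by the layer stride (8, 16, 32). Under those assumptions, a box cannot be wider or taller than 2 * (reg_max - 1) * stride. The function name is hypothetical, and this has not been verified against the reporter's model.

```python
def max_box_side(stride, reg_max=16):
    """Largest box width or height expressible at a given stride.

    Assumption: each side distance is at most (reg_max - 1) * stride pixels,
    and a box spans two such distances (left + right, or top + bottom).
    """
    return 2 * (reg_max - 1) * stride


# Assumed default strides of a P3/P4/P5 detection head.
for stride in (8, 16, 32):
    print(stride, max_box_side(stride))
# 8 240
# 16 480
# 32 960
```

If this assumption holds, objects larger than about 960 px in network-input pixels would be predicted too small. That would be consistent with correct boxes at 960 or 1024 and undersized boxes at 1280, where large defects can exceed that limit.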

Keep in mind that each model may respond differently to changes in input resolution, so finding what works for your scenario will take some experimentation. Try these steps and observe which combination gives the best results at your desired inference resolution.
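When comparing runs, it can help to quantify how much smaller the predicted boxes are than the labels instead of judging by eye. The following is a minimal sketch with hypothetical helper names and made-up box coordinates in (x1, y1, x2, y2) pixel format; it is not part of the Ultralytics API.

```python
def box_iou(a, b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def size_ratio(pred, gt):
    """(width ratio, height ratio) of a predicted box relative to its label."""
    return (
        (pred[2] - pred[0]) / (gt[2] - gt[0]),
        (pred[3] - pred[1]) / (gt[3] - gt[1]),
    )


# Made-up example: the prediction is 10% narrower than the label.
gt = (100, 100, 1100, 600)
pred = (150, 100, 1050, 600)
print(size_ratio(pred, gt), box_iou(pred, gt))
```

Ratios consistently below 1.0 for large objects at 1280, but not at 960 or 1024, would point to a size limit rather than to a thresholding problem.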

Best regards,
The Ultralytics Team

github-actions · 2023-12-08T01:04:01Z

👋 Hello there! We wanted to give you a friendly reminder that this issue has not had any recent activity and may be closed soon, but don't worry - you can always reopen it if needed. If you still have any questions or concerns, please feel free to let us know how we can help.

For additional resources and information, please see the links below:

Docs: https://docs.ultralytics.com
HUB: https://hub.ultralytics.com
Community: https://community.ultralytics.com

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLO 🚀 and Vision AI ⭐

largestcabbage added the question label on Nov 1, 2023

github-actions bot added the Stale label Dec 8, 2023

github-actions bot closed this as not planned on Dec 19, 2023

Y-T-G mentioned this issue on May 5, 2024:

TaskAlignedAssigner doesn't take into account reg_max which affects prediction of large objects #11634 (Open)
