Kidney stone detection with yolo26 #24093
-
|
Hello, I'm trying to train a model for my graduation project, with this model I am trying to detect kidney stones in CT images. I used yolo26(I did some benchmarks and results are better rather than rt-detr). I'm currently using a dataset that contains 2564 train images. mAP50 : 0.85 I wanted to get better results so I did some research: what I found was bunch of different custom yolo models (different backbone, P2 aux head or attention block etc.) and their recall or mAP50 increased even if it's not so big so I tried to implement these changes. Results decrased pretty dramatically. I couldnt figure it out why so how I can achieve what I aim? Am I looking the wrong way ? What could be the reason for getting low results ? P.S. I'm fine-tuning the model |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 6 replies
-
|
👋 Hello @CihangirEmre, thank you for your interest in Ultralytics 🚀 and for sharing details about your kidney stone detection project with YOLO26. This is an automated response to help get the discussion moving quickly, and an Ultralytics engineer will also assist you soon. We recommend a visit to the Docs for new users where you can find many Python and CLI usage examples and where many of the most common questions may already be answered. If this is a 🐛 Bug Report, please provide a minimum reproducible example to help us debug it. Since this appears to be a custom training ❓ question, please provide as much information as possible, including dataset image examples and training logs, and verify you are following our Tips for Best Training Results. In particular, it would help if you could share:
Join the Ultralytics community where it suits you best. For real-time chat, head to Discord 🎧. Prefer in-depth discussions? Check out Discourse. Or dive into threads on our Subreddit to share knowledge with the community. UpgradeUpgrade to the latest pip install -U ultralyticsEnvironmentsYOLO may be run in any of the following up-to-date verified environments (with all dependencies including CUDA/CUDNN, Python and PyTorch preinstalled):
StatusIf this badge is green, all Ultralytics CI tests are currently passing. CI tests verify correct operation of all YOLO Modes and Tasks on macOS, Windows, and Ubuntu every 24 hours and on every commit. |
Beta Was this translation helpful? Give feedback.
-
|
I've hit this exact wall with medical datasets before. Standard YOLO architectures often struggle when the target is only a few pixels wide. The reason those custom P2 heads and attention blocks likely tanked your results is that they are notoriously hard to converge on smaller datasets, like your 2.5k images, without massive pre-training. Instead of fighting the architecture, I would look at SAHI (Slicing Aided Hyper Inference). By tiling your CT scans during inference, the model sees the stones at their original resolution instead of downsampled blobs. It is usually the fastest way to spike recall for tiny objects. Also, check your One gotcha: Watch your |
Beta Was this translation helpful? Give feedback.
-
|
Thank you for advices while I was searching I saw the SAHI too but didnt try it I will defenitly check it out. @glenn-jocher there are some validation and train batch results and training args. I never tried imgsz=1024 with this dataset so I take that as a note. hyperparameters = { } Again thank you for your advices I just found out this QA section and I think I will be spent time in here. |
Beta Was this translation helpful? Give feedback.




Suggested reply:
Thanks for sharing the batches — the
mAP50tomAP50-95gap suggests the main issue is likely localization on very small targets, so I’d pause backbone/head changes and instead check annotation tightness, review the class-wisemodel.val()outputs plus the confusion matrix and PR/F1 curves, and for fine-tuning trywarmup_epochs=0and training-time image tiling before making architecture edits; the model evaluation guide and performance metrics guide are the best references here. (docs.ultralytics.com)