
Changing model size doesn't seem to improve detection, actually makes it worse? #37

Closed
mellerbeck opened this issue Aug 4, 2024 · 1 comment

@mellerbeck

> model_size: str. The size of the model to use. It can be 'n' (nano), 's' (small), 'm' (medium) or 'l' (large). Larger models are more accurate but slower. Default: 's'.

I haven't tested with a large sample size, but at first pass, if I change the model size to anything besides 's', it stops detecting.
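
For reference, a minimal sketch of the kind of side-by-side check being described, assuming the qrdet QRDetector API (the detect() call and the test image path are assumptions, not taken from this thread):

```python
import cv2
from qrdet import QRDetector

# Hypothetical test image; substitute any photo containing a QR code.
image = cv2.imread('sample_qr.png')

# Run the same image through each documented model size and compare
# how many QR codes are found.
for size in ('n', 's', 'm', 'l'):
    detector = QRDetector(model_size=size)
    detections = detector.detect(image=image, is_bgr=True)
    print(f"model_size={size!r}: {len(detections)} QR code(s) detected")
```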

@Eric-Canas
Owner

Hi! Yes, you are right, that can actually happen.

Larger models are usually more capable on difficult tasks, but they are also more prone to overfitting when the training data is not varied enough.

I manually tag the dataset I use for detection, so its size grows with every tagging-training round. But its current size may not be large enough to give the larger models the data they need to exploit their full capacity.

So it is possible that, for now, while the dataset is still too small, the larger models have overfitted, which actually decreases their performance on real-world images.
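
As a quick illustration of that effect (a generic sketch, not this project's training code): a higher-capacity model fitted on too little data can match the training set almost perfectly while doing worse on unseen data.

```python
# Fit polynomials of increasing capacity to a handful of noisy samples.
# The highest-degree fit passes through the training points almost exactly,
# yet its test error is typically far worse -- the same failure mode
# described above for larger detection models on a small dataset.
import numpy as np

rng = np.random.default_rng(0)
x_train = np.linspace(0, 1, 8)
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0, 0.15, x_train.size)
x_test = np.linspace(0, 1, 100)
y_test = np.sin(2 * np.pi * x_test)

for degree in (1, 3, 7):  # increasing model capacity
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree={degree}: train MSE={train_mse:.4f}, test MSE={test_mse:.4f}")
```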

Sorry for the inconvenience, I'll modify that part of the documentation. While that claim should become true in the long run, it likely doesn't hold right now.

PS: Thanks for the sponsoring <3
