Skip to content
Discussion options

You must be logged in to vote

hi @Praso28, thanks for the detailed write-up.

on your first question - yes. with the default imgsz=640, a 1056x1600 image is letterboxed down to 640x640: aspect ratio is preserved and the rest is padded, so the image is not stretched. a list like imgsz=[1056,1600] (both already multiples of the stride 32) feeds the image at close to native size instead.

a few general points that usually explain this kind of result:

  • a model detects objects best at roughly the pixel scale it saw during training. you trained at 640, so inference at 640 stays closest to that, while a much larger inference size shows the object at a very different scale than the model learned. so higher inference resolution …

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@Praso28
Comment options

@raimbekovm
Comment options

Answer selected by Praso28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question Further information is requested detect Object Detection issues, PR's
3 participants