Confusing Resizing in the Pipelines #8254

sarmientoj24 · 2022-06-23T15:51:57Z

I would like to understand how mmdetection resizes and pads images, and how keep_ratio comes into play.

Consider the config

transforms=[
    dict(type='Resize', img_scale=(640, 640), keep_ratio=True),
    dict(type='RandomFlip', flip_ratio=0.0),
    dict(
        type='Normalize',
        mean=[123.675, 116.28, 103.53],
        std=[58.395, 57.12, 57.375],
        to_rgb=True),
    dict(type='Pad', size_divisor=32),
    dict(type='DefaultFormatBundle', keys=['img']),
    dict(type='Collect', keys=['img'])
])

If I have an image that is 1400x1900, what happens here?

Does it resize it to Mx640 where M is just the short side converted from the ratio?
Does it get padded?
What is the output here?

Is it possible to mimic square training the way YOLOv5 does it, where the image is resized to fit the 640x640 target and the aspect ratio is kept by padding?

Additional

It seems that pytorch2onnx, for both the ONNX and PyTorch results, resizes the input image to the test_cfg image size, and that the predictions are for that image size rather than rescaled to the original image resolution. It also seems that it doesn't pad the image.

Meanwhile, image_demo provides outputs rescaled to the original input shape.

Could you please elaborate on this? It is rather confusing.


sarmientoj24 · 2022-06-28T12:06:04Z

@RangiLyu any update on this?

RangiLyu · 2022-09-15T08:05:01Z

Sorry for the late reply.

Does it resize it to Mx640 where M is just the short side converted from the ratio?

Yes.

Does it get padded?

Resize won't pad the image. The Pad transform, dict(type='Pad', size_divisor=32), will.

What is the output here?

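The long side is scaled to 640, so the short side becomes 640 / 1900 * 1400 ≈ 472.
Pad then rounds 472 up to the next multiple of 32 (size_divisor=32): 472 + 8 = 480. The long side, 640, is already a multiple of 32, so the padded image has a short side of 480 and a long side of 640.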
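The arithmetic above can be checked with a small standalone sketch. It does not call mmdetection or mmcv; the helper names `rescaled_size` and `padded_size` are hypothetical, and the round-to-nearest-pixel rule is an assumption about how the library rounds.

```python
import math


def rescaled_size(width, height, img_scale):
    """Size after an aspect-preserving resize that fits inside img_scale."""
    max_long, max_short = max(img_scale), min(img_scale)
    scale = min(max_long / max(width, height), max_short / min(width, height))
    # Round to the nearest pixel (assumed rounding rule).
    return int(width * scale + 0.5), int(height * scale + 0.5)


def padded_size(width, height, size_divisor):
    """Round each side up to the next multiple of size_divisor."""
    return (math.ceil(width / size_divisor) * size_divisor,
            math.ceil(height / size_divisor) * size_divisor)


w, h = rescaled_size(1900, 1400, (640, 640))
print((w, h))                 # (640, 472)
print(padded_size(w, h, 32))  # (640, 480)
```

With keep_ratio=True the image is never stretched to 640x640; only the Pad step changes the final shape, and only up to the next multiple of 32.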

Is it possible to mimic square training like how YOLOv5 does it where it resizes it to the target 640x640, keeps ratio by padding it?

Just use square padding: dict(type='Pad', pad_to_square=True).
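As a rough illustration of that suggestion, the fragment below pairs the Resize from the question with the square Pad from this reply and computes the resulting shape. The `square_padded_size` helper is hypothetical and only models the output size, on the assumption that pad_to_square pads both sides up to the longer one; it does not model where the padding is placed or which fill value is used.

```python
# Pipeline fragment: aspect-preserving resize followed by square padding.
transforms = [
    dict(type='Resize', img_scale=(640, 640), keep_ratio=True),
    dict(type='Pad', pad_to_square=True),
]


def square_padded_size(width, height):
    """Assumed behaviour of pad_to_square: both sides become the longer side."""
    side = max(width, height)
    return side, side


# The 1400x1900 image is resized to 472x640, then padded to a square.
print(square_padded_size(640, 472))  # (640, 640)
```

Note that YOLOv5's letterbox centers the image inside the padded canvas; whether Pad places the padding the same way is worth checking in the mmdetection version in use.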

mm-assistant bot assigned RangiLyu Jun 23, 2022

RangiLyu closed this as completed Sep 15, 2022
