Scaling YOLOv3 predictions instead of resize may have a better performance? #14

xtyDoge · 2019-07-25T09:47:21Z

A YOLOv3 prediction is usually a non-square rectangle, and I see that you use a transforms method to resize the crop before feeding it to HRNet. This stretches the image and makes the pose estimation network perform badly.
I tried making the YOLOv3 prediction regions have the same h/w ratio as the HRNet input. For example, HRNet requires image_resolution (256, 256), but YOLOv3 gives a rectangular region. What we should do is compute the center of the YOLO prediction region and expand the region into a square around it.
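
To make the stretching concrete, here is a small sketch of the arithmetic. The function name and the example box size are made up for illustration; only the 256 x 256 target comes from the thread.

```python
def resize_scale_factors(box_w, box_h, out_w=256, out_h=256):
    """Return (sx, sy, anisotropy) for a plain resize of a box to out_w x out_h.

    anisotropy is the ratio between the larger and the smaller scale factor;
    1.0 means the crop is scaled uniformly (no stretching).
    """
    sx = out_w / box_w  # horizontal scale factor
    sy = out_h / box_h  # vertical scale factor
    anisotropy = max(sx, sy) / min(sx, sy)
    return sx, sy, anisotropy


# Hypothetical detection of a standing person: 100 px wide, 300 px tall.
sx, sy, anisotropy = resize_scale_factors(100, 300)
print(f"sx={sx:.3f} sy={sy:.3f} anisotropy={anisotropy:.2f}")
```

For this tall box the horizontal scale is three times the vertical one, so the person is squeezed vertically and widened horizontally. A box that already matches the target ratio has an anisotropy of 1.0.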

In SimpleHRNet._predict_single I modified it by adding:

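# Assumes image is a NumPy array of shape (height, width, channels).
# squareLen is half the side of the square that encloses the detection.
squareLen = max(x2 - x1, y2 - y1) // 2
centerX = x1 + (x2 - x1) // 2
centerY = y1 + (y2 - y1) // 2
# Clip to the image borders: x against the width, y against the height.
x1 = max(0, centerX - squareLen)
x2 = min(image.shape[1], centerX + squareLen)
y1 = max(0, centerY - squareLen)
y2 = min(image.shape[0], centerY + squareLen)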

It works with pose_hrnet_w32_256x256 and MPII annotations.
Maybe you can add it to your project :D


stefanopini · 2019-08-01T17:43:50Z

That's a cool idea, thank you! I will surely add it to the project, probably as an option.

Have you also tested it with models pre-trained on COCO and bounding boxes with a 4/3 aspect ratio (such as pose_hrnet_w32_256x192 and pose_hrnet_w48_384x288)?
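
For non-square inputs, the same center-and-expand idea can be generalized to any target ratio. The following is a minimal sketch, not the code that was later added to the project: the function name and its parameters are hypothetical, and it assumes that 256x192 in the model name means height x width.

```python
def expand_box_to_aspect(x1, y1, x2, y2, img_w, img_h, out_w=192, out_h=256):
    """Grow a box around its center until width/height equals out_w/out_h,
    then clip it to the image. Hypothetical helper for illustration only."""
    w, h = x2 - x1, y2 - y1
    if w <= 0 or h <= 0:
        raise ValueError("box must have positive width and height")
    cx, cy = x1 + w / 2, y1 + h / 2

    # Compare w/h with out_w/out_h by cross-multiplying (no division needed).
    if w * out_h > h * out_w:
        h = w * out_h / out_w  # box is too wide: grow the height
    else:
        w = h * out_w / out_h  # box is too tall (or exact): grow the width

    # Clip to the image borders; a clipped box no longer has the exact ratio.
    nx1 = max(0, int(round(cx - w / 2)))
    ny1 = max(0, int(round(cy - h / 2)))
    nx2 = min(img_w, int(round(cx + w / 2)))
    ny2 = min(img_h, int(round(cy + h / 2)))
    return nx1, ny1, nx2, ny2


# Tall 100 x 256 box in a 640 x 480 image, widened to 192 x 256.
print(expand_box_to_aspect(100, 100, 200, 356, img_w=640, img_h=480))
```

Passing out_w=256, out_h=256 gives the square case described earlier in the thread. When the expanded box crosses an image border it is clipped, so the crop loses the exact ratio; padding the crop instead of clipping would be one way to preserve it.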

xtyDoge · 2019-08-04T13:25:39Z

@stefanopini Sorry, I haven't. Maybe you can try it.

Adapt detection bounding boxes to match HRNet input aspect ratio (as suggested by xtyDoge in issue #14). Huge accuracy improvement in the multiperson setting.

stefanopini · 2019-09-29T23:50:42Z

Added in commit af40b39.
Thank you for the suggestion!

stefanopini added the enhancement (New feature or request) and priority (Issues with priority) labels Aug 1, 2019

xtyDoge closed this as completed Aug 4, 2019

stefanopini added a commit that referenced this issue Sep 29, 2019

Improve accuracy in multiperson setting

af40b39


stefanopini mentioned this issue Sep 29, 2019

Question :Performance against other 2D detectors (openpose) #6

Closed
