MonoFlex: Objects are Different: Flexible Monocular 3D Object Detection

April 2021

tl;dr: Decouple the prediction of truncated objects in mono3D.

Overall impression

How to deal with truncated objects remains a key challenge for mono3D. Most existing methods (especially CenterNet-based ones) do not treat truncated objects specially, and their predictions on truncated objects look distorted. This is the first paper to explicitly address this challenge.

The idea is to decouple the learning of truncated objects (outside objects) from that of untruncated objects (inside objects) by using different representative points (or anchor points).

Key ideas

Representative points
- For inside objects, the projected 3D center is a better representative point than the 2D center. --> different from what RTM3D uses. RTM3D refers to this representative anchor point as the "main center".
- For outside objects, the projected 3D center falls outside the image, so the representative point is the intersection between the image edge and the line connecting $x_b$ (2D center) to $x_c$ (3D center).
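The choice of representative point for outside objects can be sketched as a segment-to-border intersection. This is a minimal illustration, not the authors' code: the function name `edge_representative_point` and the convention that valid pixels span `[0, width - 1] x [0, height - 1]` are assumptions, and $x_b$ is assumed to lie inside the image.

```python
def edge_representative_point(x_b, x_c, width, height):
    """Walk from x_b (2D center, inside the image) toward x_c (projected
    3D center) and stop at the image border if x_c lies outside.

    Assumed convention: valid pixels span [0, width-1] x [0, height-1].
    """
    (bx, by), (cx, cy) = x_b, x_c
    dx, dy = cx - bx, cy - by
    t = 1.0  # fraction of the way from x_b to x_c; 1.0 means x_c itself
    # Shrink t so the point does not cross any of the four borders.
    if dx > 0:
        t = min(t, (width - 1 - bx) / dx)
    elif dx < 0:
        t = min(t, (0 - bx) / dx)
    if dy > 0:
        t = min(t, (height - 1 - by) / dy)
    elif dy < 0:
        t = min(t, (0 - by) / dy)
    return (bx + t * dx, by + t * dy)


# Outside object: the 3D center projects beyond the right image edge.
point = edge_representative_point((90.0, 50.0), (120.0, 50.0), 101, 101)
```

For an inside object the function returns $x_c$ unchanged, so a single code path covers both cases.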
Edge fusion
- Extract the boundary features from the feature map, process them with a 1D conv, then add the result back to the feature map at the same boundary locations
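The extract, convolve, and add-back steps can be sketched on a single-channel feature map in plain Python. This is a toy illustration under stated assumptions rather than the paper's module: the clockwise traversal order, the circular padding, the fixed kernel, and the name `edge_fusion` are all hypothetical choices made for the example, and a real network would use learned multi-channel convolutions.

```python
def edge_fusion(feat, kernel):
    """Toy edge fusion on one channel.

    feat:   H x W list of lists (H, W >= 2).
    kernel: odd-length list of 1D convolution weights (assumed fixed here).
    """
    h, w = len(feat), len(feat[0])
    # Border coordinates in clockwise order, each cell visited once.
    coords = ([(0, j) for j in range(w)]
              + [(i, w - 1) for i in range(1, h)]
              + [(h - 1, j) for j in range(w - 2, -1, -1)]
              + [(i, 0) for i in range(h - 2, 0, -1)])
    edge = [feat[i][j] for i, j in coords]
    n, r = len(edge), len(kernel) // 2
    # 1D convolution along the border; circular padding is an assumption.
    conv = [sum(kernel[k] * edge[(p + k - r) % n] for k in range(len(kernel)))
            for p in range(n)]
    # Add the processed border features back at their original positions.
    out = [row[:] for row in feat]
    for (i, j), v in zip(coords, conv):
        out[i][j] += v
    return out


fused = edge_fusion([[1.0] * 3 for _ in range(3)], [1.0, 1.0, 1.0])
```

Only border cells change; interior cells pass through untouched, which matches the "adds back to the feature map" step in the note.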
Depth prediction via height
- Direct prediction with an inverse-sigmoid transform: $d = \frac{1}{\sigma(x)} - 1$, where $x$ is the raw network output and $\sigma$ is the sigmoid
- Depth from predicted keypoints: three pixel heights (the line between the top-surface and bottom-surface centers, and the two diagonal heights) are each scaled into a depth estimate. --> If a certain height is not fully visible due to cropping, discard it.
- Soft ensemble according to predicted aleatoric uncertainty
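The three depth ingredients above can be sketched as follows. The inverse-sigmoid decoding follows the formula in the note. The height-based depth uses the standard pinhole relation $z = fH/h$, and the ensemble weights each estimate by the inverse of its predicted uncertainty; both are written here as assumptions about the details, and all function names are hypothetical.

```python
import math


def decode_direct_depth(x):
    """Inverse-sigmoid transform from the note: d = 1 / sigmoid(x) - 1."""
    sigma = 1.0 / (1.0 + math.exp(-x))
    return 1.0 / sigma - 1.0


def depth_from_height(focal_px, height_3d_m, height_px):
    """Pinhole similar triangles (assumed): z = f * H / h."""
    return focal_px * height_3d_m / height_px


def soft_ensemble(depths, uncertainties):
    """Inverse-uncertainty weighted average of depth estimates (assumed form).

    Estimates with a larger predicted uncertainty contribute less.
    """
    weights = [1.0 / u for u in uncertainties]
    return sum(w * z for w, z in zip(weights, depths)) / sum(weights)


# A 1.5 m tall object spanning 35 px under a 700 px focal length.
z_height = depth_from_height(700.0, 1.5, 35.0)
# Combine a confident estimate with a less confident one.
z_final = soft_ensemble([10.0, 20.0], [1.0, 3.0])
```

A height that is cut off by the image border would simply be left out of the `depths` and `uncertainties` lists before calling `soft_ensemble`.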
Loss
- gIoU loss
- corner loss, following MonoDIS.
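For reference, a generic gIoU loss on axis-aligned 2D boxes can be sketched as below. The note does not say which boxes the loss is applied to or how it is weighted, so this shows only the standard gIoU definition; the `(x1, y1, x2, y2)` box format and the function names are assumptions.

```python
def giou(a, b):
    """Generalized IoU of two axis-aligned boxes (x1, y1, x2, y2),
    assuming x1 < x2 and y1 < y2 for both boxes."""
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    # Intersection (zero if the boxes do not overlap).
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = area_a + area_b - inter
    # Smallest axis-aligned box enclosing both inputs.
    cw = max(a[2], b[2]) - min(a[0], b[0])
    ch = max(a[3], b[3]) - min(a[1], b[1])
    enclose = cw * ch
    return inter / union - (enclose - union) / enclose


def giou_loss(pred, target):
    """Loss is 0 for a perfect match and approaches 2 for distant boxes."""
    return 1.0 - giou(pred, target)


loss = giou_loss((0.0, 0.0, 1.0, 1.0), (2.0, 0.0, 3.0, 1.0))
```

Unlike plain IoU, the enclosing-box term still gives a useful signal when the predicted and target boxes do not overlap.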

Technical details

The 2D center is not the center of the tight 2D bbox annotated on the image, but the center of the 2D bbox around the part of the projected 3D bbox that lies inside the image, as can be seen in Fig. 4(c).
Only random horizontal flip is used for data augmentation.
Simply discarding outside objects can improve the performance compared to the baseline, demonstrating the necessity of decoupling outside objects.

Notes

Code on github
