Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
imvoxelnet_4x8_kitti-3d-car.py		imvoxelnet_4x8_kitti-3d-car.py
metafile.yml		metafile.yml

README.md

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Abstract

In this paper, we introduce the task of multi-view RGB-based 3D object detection as an end-to-end optimization problem. To address this problem, we propose ImVoxelNet, a novel fully convolutional method of 3D object detection based on posed monocular or multi-view RGB images. The number of monocular images in each multiview input can variate during training and inference; actually, this number might be unique for each multi-view input. ImVoxelNet successfully handles both indoor and outdoor scenes, which makes it general-purpose. Specifically, it achieves state-of-the-art results in car detection on KITTI (monocular) and nuScenes (multi-view) benchmarks among all methods that accept RGB images. Moreover, it surpasses existing RGB-based 3D object detection methods on the SUN RGB-D dataset. On ScanNet, ImVoxelNet sets a new benchmark for multi-view 3D object detection.

Introduction

We implement a monocular 3D detector ImVoxelNet and provide its results and checkpoints on KITTI dataset. Results for SUN RGB-D, ScanNet and nuScenes are currently available in ImVoxelNet authors repo (based on mmdetection3d).

Results and models

KITTI

Backbone	Class	Lr schd	Mem (GB)	Inf time (fps)	mAP	Download
ResNet-50	Car	3x			17.26	model \| log

Citation

@article{rukhovich2021imvoxelnet,
  title={ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection},
  author={Danila Rukhovich, Anna Vorontsova, Anton Konushin},
  journal={arXiv preprint arXiv:2106.01178},
  year={2021}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

imvoxelnet

imvoxelnet

README.md

README.md

imvoxelnet_4x8_kitti-3d-car.py

imvoxelnet_4x8_kitti-3d-car.py

metafile.yml

metafile.yml

README.md

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Abstract

Introduction

Results and models

KITTI

Citation

Files

imvoxelnet

Directory actions

More options

Directory actions

More options

Latest commit

History

imvoxelnet

Folders and files

parent directory

README.md

README.md

imvoxelnet_4x8_kitti-3d-car.py

imvoxelnet_4x8_kitti-3d-car.py

metafile.yml

metafile.yml

README.md

ImVoxelNet: Image to Voxels Projection for Monocular and Multi-View General-Purpose 3D Object Detection

Abstract

Introduction

Results and models

KITTI

Citation