M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

Garrick Brazil, Xiaoming Liu

This code uses the newer AP3D metric with 40 points at IoU= 0.7 for evaluation. The code also reports the newer AP3D evaluation with 40 points at IoU= 0.5.

Introduction

Monocular 3D region proposal network source code as detailed in arXiv report, accepted to ICCV 2019 (Oral). Please also visit their project page.

Their framework is implemented and tested with Ubuntu 16.04, CUDA 8.0, Python 3, NVIDIA 1080 Ti GPU. Unless otherwise stated the below scripts and instructions assume working directory is the project root.

If you utilize this framework, please cite their ICCV 2019 paper.

@inproceedings{brazil2019m3drpn,
    title={M3D-RPN: Monocular 3D Region Proposal Network for Object Detection},
    author={Brazil, Garrick and Liu, Xiaoming},
    booktitle={Proceedings of the IEEE International Conference on Computer Vision},
    address={Seoul, South Korea},
    year={2019}
}

Setup

Cuda & Python

In this project we utilize Pytorch with Python 3, Cuda 8, and a few Anaconda packages. Please review and follow this installation guide. However, feel free to try alternative versions or modes of installation.
Data

Download the full KITTI detection dataset. Then place a softlink (or the actual data) in M3D-RPN/data/kitti.
```
 cd M3D-RPN
 ln -s /path/to/kitti data/kitti
```
Then use the following scripts to extract the data splits, which use softlinks to the above directory for efficient storage.
```
python data/kitti_split1/setup_split.py
python data/kitti_split2/setup_split.py
```
Next, build the KITTI devkit eval for each split.
```
 sh data/kitti_split1/devkit/cpp/build.sh
 sh data/kitti_split2/devkit/cpp/build.sh
```
Lastly, build the nms modules
```
cd lib/nms
make
```

Training

We use visdom for visualization and graphs. Optionally, start the server by command line

python -m visdom.server -port 8100 -readonly

The port can be customized in scripts/config files. The training monitor can be viewed at http://localhost:8100

Training is split into a warmup and main configurations. Review the configurations in scripts/config for details.

// First train the warmup (without depth-aware)
python scripts/train_rpn_3d.py --config=kitti_3d_multi_warmup

// Then train the main experiment (with depth-aware)
python scripts/train_rpn_3d.py --config=kitti_3d_multi_main

If your training is accidentally stopped, you can resume at a checkpoint based on the snapshot with the restore flag. For example to resume training starting at iteration 10k, use the following command.

python scripts/train_rpn_3d.py --config=kitti_3d_multi_main --restore=10000

Testing

The authors provide models for the main experiments on val1 / val2 / test data splits available to download here M3D-RPN-Release.zip.

Testing requires paths to the configuration file and model weights, exposed variables near the top scripts/test_rpn_3d.py. To test a configuration and model, simply update the variables and run the test file as below.

python scripts/test_rpn_3d.py

Contact

For questions regarding M3D-RPN, feel free to post here or directly contact the authors {brazilga, liuxm}@msu.edu.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
lib		lib
models		models
scripts		scripts
LICENSE		LICENSE
python_packages.txt		python_packages.txt
readme.md		readme.md
setup.md		setup.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data

data

lib

lib

models

models

scripts

scripts

LICENSE

LICENSE

python_packages.txt

python_packages.txt

readme.md

readme.md

setup.md

setup.md

Repository files navigation

M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

Introduction

Setup

Training

Testing

Contact

About

Releases

Packages

Contributors 2

Languages

License

abhi1kumar/M3D-RPN

Folders and files

Latest commit

History

Repository files navigation

M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

Introduction

Setup

Training

Testing

Contact

About

Resources

License

Stars

Watchers

Forks

Languages