inverse σSRN

This is the official code repository for the paper "Generalizable Pose Estimation Using Implicit Scene Representations" presented at ICRA 2023.

Please see our project website for demo videos and more details about our method!

Usage

The code for the σSRN model is based on the original SRN codebase. The modified code is located in the sigma_srn folder.

Installation

git clone --recurse-submodules https://github.com/vaibhavsaxena11/inverse-sigma-srn.git

conda env create -f environment.yml

conda activate sigma-srns

Data

Download the ShapeNet v2 cars and chairs datasets from here. Fetch the cars_train.zip, cars_train_val.zip, chairs_train.zip, and chairs_train_val.zip files.

Training a σSRN Model

See python train.py --help for all train options. Example train call:

python train.py --data_root [path to directory with dataset] \
                --logging_root [path to directory where tensorboard summaries and checkpoints should be written to] \
                --no_validation \
                --img_sidelengths 64 \
                --batch_size_per_img_sidelength 10

To monitor progress, the training code writes tensorboard summaries every 100 steps into an "events" subdirectory in the logging_root.

Pose Estimation through inverse σSRN

See python pose.py --help for all pose evaluation options. Example pose call:

python pose.py --train_data_root [path to directory with dataset] \
               --val_data_root [path to directory with train_val dataset - unseen poses for evaluation] \
               --logging_root [path to directory where the results are stored] \
               --checkpoint_path [path to the σSRN model checkpoint *.pth file] \
               --loss l1

For each image in the validation set (taken from an unseen camera pose), pose.py optimizes the camera pose parameters so that the rendered view matches the input image. Optimization is run from multiple starting poses, and the pose with the lowest loss is chosen as the estimated camera pose for that image. For each optimization run, the views rendered as the camera parameters move toward the target pose are stored in the logging directory. If plotting is enabled (on by default), error plots are stored there as well. For each input target image, a pose.txt file records the best estimate.
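
The multi-start procedure described above can be illustrated with a small, self-contained toy. This is only a sketch under stated assumptions, not the code in pose.py: the "renderer" is a hypothetical one-parameter function of a camera azimuth angle rather than a σSRN, the gradient is taken by finite differences rather than by backpropagation, and a squared-error loss is used for smoothness instead of the l1 loss selected in the command above. The names render, optimize_pose, and estimate_pose are made up for this illustration.

```python
import math

def render(azimuth, n_pixels=8):
    """Hypothetical toy renderer: maps a 1-D camera azimuth to a list of 'pixels'."""
    return [math.cos(azimuth - 2 * math.pi * k / n_pixels) for k in range(n_pixels)]

def image_loss(rendered, target):
    """Mean squared error between two toy images (an assumption; not the l1 loss)."""
    return sum((r - t) ** 2 for r, t in zip(rendered, target)) / len(target)

def optimize_pose(target, init, steps=200, lr=0.5, eps=1e-4):
    """Gradient descent on the pose from one starting point."""
    pose = init
    for _ in range(steps):
        # Central finite-difference estimate of d(loss)/d(pose).
        grad = (image_loss(render(pose + eps), target)
                - image_loss(render(pose - eps), target)) / (2 * eps)
        pose -= lr * grad
    return pose, image_loss(render(pose), target)

def estimate_pose(target, n_starts=8):
    """Run the optimization from several starting poses and keep the lowest loss."""
    starts = [2 * math.pi * i / n_starts for i in range(n_starts)]
    results = [optimize_pose(target, s) for s in starts]
    return min(results, key=lambda r: r[1])

# Usage: recover a known azimuth from its rendered toy image.
target_pose = 1.0
target_image = render(target_pose)
best_pose, best_loss = estimate_pose(target_image)
print(f"estimated azimuth: {best_pose % (2 * math.pi):.4f} rad, loss: {best_loss:.2e}")
```

Keeping the result with the lowest final loss guards against a starting pose that stalls far from the target, such as one facing the object from nearly the opposite side, where the gradient in this toy is close to zero.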

Citation

If you find this repository useful, please cite our work:

@misc{saxena2023generalizable,
      title={Generalizable Pose Estimation Using Implicit Scene Representations}, 
      author={Vaibhav Saxena and Kamal Rahimi Malekshan and Linh Tran and Yotto Koga},
      year={2023},
      eprint={2305.17252},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}
