Pytorch implementation of "Self-supervised Vision Transformers for 3D Pose Estimation of Novel Objects"
Stefan Thalhammer, Jean-Baptiste Weibel, Markus Vincze and Jose Garcia-Rodriguez
If our project is helpful for your research, please consider citing:
@article{thalhammer2023selfsupervised,
title={Self-supervised Vision Transformers for 3D Pose Estimation of Novel Objects},
author={Thalhammer, Stefan and Weibel, Jean-Baptiste and Vincze, Markus and Garcia-Rodriguez, Jose},
journal={Image and Vision Computing},
volume={139},
pages={104816},
year={2023},
publisher={Elsevier},
}
Either set up an Anaconda environment:
conda env create -f environment.yml
conda activate template
or use a Docker container. Please modify the paths in "docker_launch.sh" and run:
./docker_launch.sh
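After either setup, a quick sanity check (not part of the repository) can confirm that PyTorch and the GPU are visible inside the environment or container:

# Optional sanity check: confirm PyTorch and CUDA are available
# in the activated conda environment or the running container.
import torch

print("torch version:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("device:", torch.cuda.get_device_name(0))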
For data retrieval, please refer to this repo or to this branch.
To train on the LINEMOD splits, run:
./train_vit_LM_splits.sh
To train on T-LESS, run:
python train_tless.py --config_path ./config_run/TLESS.json
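The T-LESS run is driven by a JSON configuration file. As a hypothetical illustration (the schema of config_run/TLESS.json is not reproduced here), the configuration can be inspected before launching training:

# Minimal sketch: load and print the run configuration.
# The file path is taken from the command above; any key names you
# look up afterwards are assumptions, not the actual schema.
import json
from pathlib import Path

config_path = Path("config_run/TLESS.json")
with config_path.open() as f:
    cfg = json.load(f)
print(json.dumps(cfg, indent=2))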
To visualize the self-attention maps, run:
python vizualize_SA.py --config_path config_run/<config_file> --pretrained_weights <your_weights>.pth --image_path <image_to_visualize_SA>.png --mask_path <corresponding_template_mask>.png --output_dir <path_to_save_SA> --threshold 0.75
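The --threshold flag suggests a DINO-style visualization, where only the patch tokens that jointly account for a given fraction of the [CLS] attention mass are kept per head. Below is a minimal sketch of that thresholding step, assuming the standard ViT attention layout; the function name and shapes are our assumptions, not code from vizualize_SA.py:

import torch

def threshold_cls_attention(attn, h_feat, w_feat, threshold=0.75):
    # attn: last-layer attention, shape (num_heads, num_tokens, num_tokens).
    # Take the [CLS] token's attention over the patch tokens, one map per head.
    cls_attn = attn[:, 0, 1:]                                  # (num_heads, num_patches)
    # Sort patches by attention and keep the smallest set covering `threshold` of the mass.
    vals, idx = torch.sort(cls_attn, dim=-1, descending=True)
    cum = torch.cumsum(vals, dim=-1) / vals.sum(dim=-1, keepdim=True)
    keep_sorted = (cum <= threshold).float()
    keep = torch.zeros_like(cls_attn).scatter_(1, idx, keep_sorted) > 0.5
    # One binary mask per head, reshaped to the patch grid for overlaying on the image.
    return keep.reshape(-1, h_feat, w_feat)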
The code is adapted from template-pose. Please also cite the original paper if you use the provided code:
@inproceedings{nguyen2022template,
title={Templates for 3D Object Pose Estimation Revisited: Generalization to New Objects and Robustness to Occlusions},
author={Nguyen, Van Nguyen and Hu, Yinlin and Xiao, Yang and Salzmann, Mathieu and Lepetit, Vincent},
booktitle={Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
year={2022},
}