Authors: Shun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab and Federico Tombari
This repository contains the network part of the SceneGraphFusion work. For the incremental framework, please check here.
The code has been tested on Ubuntu 18.04 and gcc 7.5. You can either create a conda environment by
conda env create --name <env_name> --file environment.yml
or install the dependencies manually:
# Dependencies
# for training and evaluation:
# - Pytorch, Pytorch Geometric, Trimesh, Tensorboard
# for tracing:
# - onnxruntime
# for data generation:
# - open3d
# Install commands
# Main env
conda create -n 3dssg pytorch=1.8 cudatoolkit=10.2 -c pytorch tensorboard trimesh -c conda-forge
# Onnxruntime
pip install onnxruntime==1.7.2
# Pytorch Geometric
export CUDA=10.2
pip install torch-scatter -f${CUDA}.html
pip install torch-sparse -f${CUDA}.html
pip install torch-cluster -f${CUDA}.html
pip install torch-spline-conv -f${CUDA}.html
pip install torch-geometric
# open3d
pip install open3d
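After installation, it can help to confirm that the packages listed above are importable from the active environment. The following is a minimal sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and the import names (`torch`, `torch_geometric`, `trimesh`, `tensorboard`, `onnxruntime`, `open3d`) are assumed to be the usual ones for these packages.

```python
import importlib.util

# Import names assumed for the dependencies listed above.
REQUIRED = ["torch", "torch_geometric", "trimesh", "tensorboard", "onnxruntime", "open3d"]


def missing_modules(names):
    """Return the subset of `names` that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

The check only looks the modules up without importing them, so it does not verify CUDA support or version compatibility.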
Download data
cd files
Run a toy example:
python --mode [train,eval,trace] --config ./config_example.json
The script creates a folder in the same directory as the config file, named after the NAME in config_[NAME].json, together with a log folder that stores the Tensorboard logs. Trained models, evaluation results, and traced models are all stored inside the NAME folder.
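The naming rule for the output folder can be sketched as follows. This is an illustration under assumptions, not the repository's code: the helper `output_dir_for` is hypothetical, and it assumes NAME is whatever follows the `config_` prefix in the config file name.

```python
from pathlib import Path


def output_dir_for(config_path):
    """Derive the NAME folder placed next to a config file named config_[NAME].json."""
    config = Path(config_path)
    stem = config.stem  # file name without extension, e.g. "config_example"
    prefix = "config_"
    if not stem.startswith(prefix) or len(stem) == len(prefix):
        raise ValueError(f"expected a file named config_[NAME], got {config.name!r}")
    # The output folder sits in the same directory as the config file.
    return config.parent / stem[len(prefix):]


if __name__ == "__main__":
    print(output_dir_for("./config_example.json"))
    print(output_dir_for("./CVPR21/config_CVPR21.json"))
```

Under these assumptions, the toy config maps to a folder called `example` next to it, and the pre-trained config maps to a `CVPR21` folder inside `./CVPR21`.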
We provide a trained model here. The model achieves results equivalent to those reported in the SceneGraphFusion paper. Note: the model is trained with the 20 NYUv2 object classes used in the ScanNet benchmark and with 8 support types of predicates.
The trained model can be traced and then used with our SceneGraphFusion framework.
python --mode trace --config ./path/to/config
For example, to trace our pre-trained model:
python --mode trace --config ./CVPR21/config_CVPR21.json
The traced model will be stored at
See the data_processing folder.
Files under ./src/pointnet/* are licensed under the Apache License, Version 2.0.
is modified from under the MIT License
# main file
src/ # main code
src/network_* # basic network/layers/operations
src/model_* # a network model consisting of multiple layers
src/dataset_* # data related
src/* # basic class templates
src/*_util*.py # utilities
src/[name].py # top-level class to train/eval/trace a model
data_processing/ # code to generate training data from 3RScan/ScanNet
utils/ # utilities
scripts/ # evaluation script/ scene reconstruction script/ etc.
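The file naming convention inside src/ can be expressed as a small lookup, which may help when locating a component. This is only a sketch for orientation: the function `describe` and any file names passed to it are hypothetical, and the patterns are copied from the listing rather than from the repository's code.

```python
from fnmatch import fnmatch

# Patterns for files inside src/, checked in order; roles follow the listing above.
SRC_PATTERNS = [
    ("network_*", "basic network/layers/operations"),
    ("model_*", "a network model consisting of multiple layers"),
    ("dataset_*", "data related"),
    ("*_util*.py", "utilities"),
]


def describe(filename):
    """Return the role of a file in src/ based on its name, or None if no pattern matches."""
    for pattern, role in SRC_PATTERNS:
        if fnmatch(filename, pattern):
            return role
    return None


if __name__ == "__main__":
    # Hypothetical file names, used only to exercise the patterns.
    for name in ["network_example.py", "dataset_example.py", "op_utils.py"]:
        print(name, "->", describe(name))
```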
If you find the code useful, please consider citing our papers:
title = {{SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences}},
author = {Shun-Cheng Wu and Johanna Wald and Keisuke Tateno and Nassir Navab and Federico Tombari},
booktitle = {Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2021}
title = {{Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions}},
author = {Wald, Johanna and Dhamo, Helisa and Navab, Nassir and Tombari, Federico},
booktitle = {Proceedings IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2020}
This work is supported by the German Research Foundation (DFG, project number 407378162) and the Bavarian State Ministry of Education, Science and the Arts in the framework of the Centre Digitisation Bavaria (ZD.B).