
Surface Normal Clustering for Implicit Representation of Manhattan Scenes

This repository contains the implementation of the method proposed in the ICCV 2023 paper "Surface Normal Clustering for Implicit Representation of Manhattan Scenes" by Nikola Popovic, Danda Pani Paudel, and Luc Van Gool.


Abstract

Novel view synthesis and 3D modeling using implicit neural field representations have been shown to be very effective for calibrated multi-view cameras. Such representations are known to benefit from additional geometric and semantic supervision. Most existing methods that exploit additional supervision require dense pixel-wise labels or localized scene priors; these methods cannot benefit from high-level, vague scene priors provided in terms of scene descriptions. In this work, we aim to leverage the geometric prior of Manhattan scenes to improve implicit neural radiance field representations. More precisely, we assume only that the indoor scene under investigation is Manhattan -- with no additional information whatsoever -- and that the Manhattan coordinate frame is unknown. This high-level prior is used to self-supervise the surface normals derived explicitly in the implicit neural fields. Our modeling allows us to cluster the derived normals and exploit their orthogonality constraints for self-supervision. Our exhaustive experiments on datasets of diverse indoor scenes demonstrate the significant benefit of the proposed method over the established baselines.
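
To give a rough intuition for this self-supervision signal, the toy sketch below (my own illustration, not the paper's actual loss or code) clusters predicted normals into three dominant directions and penalizes both poor alignment of normals with their cluster and deviation of the cluster directions from mutual orthogonality:

# Toy sketch (NOT the paper's code): score a set of predicted surface
# normals against the Manhattan prior of three orthogonal directions.
import torch

def toy_manhattan_loss(normals, centers):
    # normals: (N, 3) unit surface normals; centers: (3, 3) unit cluster directions.
    sim = (normals @ centers.T).abs()           # sign-invariant cosine similarity
    align = 1.0 - sim.max(dim=1).values.mean()  # each normal should match one direction
    gram = (centers @ centers.T).abs()
    ortho = (gram - torch.eye(3)).sum()         # off-diagonal dot products should vanish
    return align + ortho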

Python environment

conda create --name ENV_NAME python=3.8.5
conda activate ENV_NAME
cd CODE_ROOT
pip install --ignore-installed torch==1.11.0+cu113 torchvision==0.12.0+cu113 torchaudio==0.11.0 --extra-index-url https://download.pytorch.org/whl/cu113
pip install -r requirements.txt
pip install torch-scatter -f https://data.pyg.org/whl/torch-1.11.0+cu113
pip install git+https://github.com/NVlabs/tiny-cuda-nn/#subdirectory=bindings/torch
git clone https://github.com/NVIDIA/apex
cd apex
pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./
# Note: a line had to be commented out for this to work; see https://github.com/NVIDIA/apex/pull/323
cd ..
pip install plotly==5.10.0
pip install imgviz
pip install h5py
pip install pandas
pip install wandb
# The line below needs to be re-run every time the CUDA scripts are modified
pip install models/csrc/
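
After installation, a quick import check (a minimal sanity script of my own, not part of the repo) can confirm that the CUDA build of PyTorch and the compiled extensions installed above are usable:

import torch
import torch_scatter
import tinycudann as tcnn
import apex

assert torch.cuda.is_available(), "CUDA-enabled PyTorch build not found"
print(torch.__version__, torch.version.cuda)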

Data

Hypersim

  1. Download all Hypersim scenes using the script at code_root/datasets/hypersim_src/_utils/ . Provide the necessary arguments when calling the script. Note that the argument args.contains is already hardcoded in main() so that only the files used by this repo are downloaded (each scene comes with many other accompanying files that are not used and are therefore skipped). If you want to download only a subset of scenes (e.g. only 20 scenes from split A), you need to modify the script; one way is to comment out the defined variable URLS = [..] and create a new one with the desired scenes, as shown in the sketch after this list.
  2. After downloading, the dataset root should contain a folder for every scene, d_root/scene (e.g. d_root/ai_053_004). Every scene's folder should contain a folder d_root/scene/_detail and a folder d_root/scene/images. Inside d_root/scene/images there should be d_root/scene/images/scene_cam_00_final_hdf5, where images are stored as HDF5 files, as well as d_root/scene/images/scene_cam_00_geometry_hdf5, where depth maps, normal maps, and semantics are stored. Some scenes have multiple cameras, with cam_01, cam_02, ... versions of these two folders; these are never used, since only the first camera trajectory is used.
  3. The following files contained in code_root/datasets/hypersim_src/metadata/ should be copied to the Hypersim root d_root/: hypersim_A_scenes.json; hypersim_B_scenes.json; hypersim_C_scenes.json; all_scenes_metadata.json; most_scenes_list.json; scene_boundaries.json; scene_semantic_classes.json; metadata_camera_parameters.csv.
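
For step 1, restricting the download to a subset of scenes could look roughly like the sketch below. This is hypothetical: it assumes the script's hardcoded URLS list holds one download URL per file, with the scene name appearing somewhere in each URL (the placeholder URLs and scene names are examples only):

# Hypothetical sketch: keep only the URLs belonging to the desired scenes.
URLS = [
    "https://example.com/hypersim/ai_053_004.zip",  # placeholder entries;
    "https://example.com/hypersim/ai_001_010.zip",  # the real list is hardcoded
    "https://example.com/hypersim/ai_002_003.zip",  # in the download script
]
WANTED_SCENES = {"ai_053_004", "ai_001_010"}        # example subset
URLS = [url for url in URLS if any(scene in url for scene in WANTED_SCENES)]
print(URLS)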

ScanNet

  1. Download the prepared scenes available at this repo, and place them in the root of the ScanNet dataset directory.

Replica

  1. Download the pre-rendered scenes available at this repo, and place them in the root of the Replica dataset directory.

Experiments

Multiple useful scripts for batching experiments can be found in code_root/experiments/hypersim/, code_root/experiments/scannet_man, and code_root/experiments/replica_semnerf. The following explains how to run the Hypersim experiments; the scripts for the other datasets are very similar.

The script code_root/experiments/hypersim/train_one_euler.py reads hyperparameter values from code_root/experiments/hypersim/hyperparameters.py and creates a Linux bash script with instructions to train a NeRF for one specified scene with the specified hyperparameters. Similarly, code_root/experiments/hypersim/train_all_euler.py creates many such bash scripts to train NeRFs for all Hypersim scenes, and code_root/experiments/hypersim/train_ABC_euler.py creates bash scripts with instructions to train on the specified Hypersim-{A/B/C} splits. These scripts assume that jobs are batched via the SLURM or LSF cluster batching system, which is reflected in how the bash scripts are generated. The recommended workflow is to generate a script with code_root/experiments/hypersim/train_one_euler.py, adapt it to your computing environment, remove all unnecessary cluster-related commands, and run it; a minimal helper for this is sketched below. When running directly from the terminal, most likely everything before the line that calls python train_nerf.py ... can be removed. For clarity, some of the additional things these scripts do for the original environment are: specifying cluster job requirements; activating the environment and dependencies on the cluster; sending compressed data to the cluster and uncompressing it; and so on.
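
As mentioned above, stripping a generated job script down to the actual training call could be done along these lines (my own sketch; it only assumes the generated script contains a line invoking python train_nerf.py):

from pathlib import Path

lines = Path("generated_job.sh").read_text().splitlines()
# Keep everything from the "python train_nerf.py ..." invocation onward.
start = next(i for i, line in enumerate(lines) if "train_nerf.py" in line)
Path("run_local.sh").write_text("\n".join(lines[start:]) + "\n")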

Running code_root/train_nerf.py will train a NeRF for the specified scene on the training split and evaluate it on the test split, while logging everything to Weights & Biases. It will also create a folder for every experiment inside the experiment root directory, containing the result summary, network weights, rendered images, etc.

Contact

Please feel free to reach out if there are any questions, suggestions, or issues with the code. My e-mail is nipopovic@vision.ee.ethz.ch.

Code starting point

The code in this repo is built on top of the excellent ngp_pl repository.

Citation

If you use this code, please consider citing the following paper:

@inproceedings{Popovic23ManhattanDF,
  title     = {Surface Normal Clustering for Implicit Representation of Manhattan Scenes},
  author    = {Popovic, Nikola and Paudel, Danda Pani and Van Gool, Luc},
  booktitle = {IEEE/CVF International Conference on Computer Vision (ICCV)},
  year      = {2023}
}
