TMPI: Tiled Multiplane Images

PyTorch implementation of the ICCV 2023 paper.

Tiled Multiplane Images for Practical 3D Photography
Numair Khan¹, Douglas Lanman¹, Lei Xiao¹
¹Reality Labs Research

Setup

The code has been tested in the following setup

Linux (Ubuntu 20.04.04/Fedora 34)
Python 3.7
PyTorch 1.10.2
CUDA 11.3

We recommend running the code in a Conda environment: after cloning the repo, run the following commands from the base directory:

$ conda env create -f environment.yml
$ conda activate tmpi

Run the initialization script to download model checkpoints and set up the monocular depth estimator; we use DPT:

$ sh ./initialize.sh

Running the Code

To execute 3D photos from the test images provided in the test_data folder run:

$ python ./run.py

The results are written to the output folder.

To execute the code on your own images use:

$ python ./run.py --indir=/PATH_TO_INPUT_IMAGES_DIR --outdir=/PATH_TO_OUTPUT_DIR

By default, the code uses OpenGL to render the tiled multiplane images efficiently. We notice that on some implementations this may cause flickering around a small number of tile edges. To avoid this (or in case your system does not have OpenGL installed) the differentiable PyTorch renderer we use for training may be utilized by providing the --pytorch_renderer flag.

$ python ./run.py --pytorch_renderer --indir=/PATH_TO_INPUT_IMAGES_DIR --outdir=/PATH_TO_OUTPUT_DIR

Note, however, that this will run much slower.

While we use DPT as the depth estimator, the method can work with any depth input. To use a different source, we suggest replacing the DPT depth loader on L87 of the dataset.py file with the desired (inverse) depth input.

Citation

If you find our work useful for your research, please cite the following paper:

@article{khan2023tcod,
  title={Tiled Multiplane Images for Practical 3D Photography},
  author={Numair Khan, Eric Penner, Douglas Lanman, Lei Xiao},
  journal={International Conference on Computer Vision (ICCV)},
  year={2023},
}

License

Our source code is CC-BY-NC licensed, as found in the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
DPT @ f43ef9e		DPT @ f43ef9e
imgs		imgs
test_data		test_data
utils		utils
weights		weights
.gitignore		.gitignore
.gitmodules		.gitmodules
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
README.md		README.md
config.py		config.py
dataset.py		dataset.py
dpt_wrapper.py		dpt_wrapper.py
environment.yml		environment.yml
homography_sampler.py		homography_sampler.py
initialize.sh		initialize.sh
kmeans.py		kmeans.py
mpi_rendering.py		mpi_rendering.py
networks.py		networks.py
rendering_utils.py		rendering_utils.py
run.py		run.py
tmpi.py		tmpi.py
tmpi_renderer.py		tmpi_renderer.py
tmpi_renderer_gl.py		tmpi_renderer_gl.py

License

facebookresearch/TMPI

Folders and files

Latest commit

History

Repository files navigation

TMPI: Tiled Multiplane Images

Setup

Running the Code

Citation

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Languages