A Statistical Manifold Framework for Point Cloud Data

The official repository for <A Statistical Manifold Framework for Point Cloud Data> (Yonghyeon Lee*, Seungyeon Kim*, Jinwon Choi, and Frank C. Park, ICML 2022).

^* The two lead co-authors contributed equally.

This paper proposes a new Riemannian geometric structure for the space of point cloud data, using the theory of statistical manifolds and information geometry, with applications to point cloud autoencoders.

Preview

Geodesic Point Cloud Interpolation

Figure 1: Left: Latent space with linear and geodesic interpolants. The orange interpolants connect a wide cylinder to a tall cylinder, while the magenta interpolants connect a cylinder to a cone. Linear interpolants and geodesic interpolants under the Euclidean and info-Riemannian metrics are drawn as dotted, dashed, and solid lines, respectively. Right: Generated point clouds from those interpolants.

Point Cloud Autoencoder Regularization

Figure 2: From left to right: latent spaces with equidistant ellipses centered on selected points, Gaussian Mixture Model (GMM) fitting results, samples generated from the GMM, and the heat map of the pairwise Euclidean distances in the latent space over all test data. The upper row shows a vanilla autoencoder trained without regularization, while the lower row shows an autoencoder trained with regularization (using the proposed info-Riemannian metric).

Requirements

Environment

The project is developed under a standard PyTorch environment.

python 3.8.8
numpy
scipy
matplotlib
pytorch 1.8.0
CUDA 11.3
tensorboard 2.4.1
scikit-learn 0.24.1
torchcubicspline
Open3D 0.13.0

Warning: We confirmed that the latest version of Open3D has slightly different syntax from the version we used. We highly recommend using Open3D 0.13.0, as noted above.
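Since the warning above hinges on the installed Open3D version, a quick check before running anything can save time. The following is a minimal sketch, not part of the repository: the helper name `find_mismatches` is hypothetical, and it assumes the packages were installed under their usual pip names (`open3d`, `tensorboard`, `scikit-learn`). PyTorch is left out on purpose, because its installed version string may carry a build suffix.

```python
from importlib import metadata

# Versions listed in the Environment section above.
PINNED = {"open3d": "0.13.0", "tensorboard": "2.4.1", "scikit-learn": "0.24.1"}


def find_mismatches(pinned, get_version=metadata.version):
    """Return {package: (wanted, installed_or_None)} for every package that differs.

    Hypothetical helper: `get_version` is injectable so the check can be tested
    without the packages being installed.
    """
    mismatches = {}
    for name, wanted in pinned.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, installed) in find_mismatches(PINNED).items():
        print(f"{name}: expected {wanted}, found {installed}")
```

If the script prints nothing, the three pinned packages match the versions listed above.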

Datasets

Datasets should be stored in the datasets/ directory and can be set up as follows.

The synthetic 3D basic shape dataset can be downloaded through the Google Drive link.
The standard benchmark datasets (ModelNet10, ModelNet40, and ShapeNetCore.v2) can be downloaded through the GitHub link (https://github.com/antao97/PointCloudDatasets).

After setup, the datasets/ directory should look as follows.

datasets
├── interpolation_dataset
├── modelnet10_hdf5_2048
├── modelnet40_hdf5_2048
├── regularization_dataset
└── shapenetcorev2_hdf5_2048
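To confirm that the layout above is in place before training, something like the following could be used. This is a sketch under the assumption that it is run from the repository root; the helper `missing_subdirs` is hypothetical and not part of the repository.

```python
from pathlib import Path

# Sub-directory names taken from the listing above.
EXPECTED_DATASETS = [
    "interpolation_dataset",
    "modelnet10_hdf5_2048",
    "modelnet40_hdf5_2048",
    "regularization_dataset",
    "shapenetcorev2_hdf5_2048",
]


def missing_subdirs(root, expected):
    """Return the expected sub-directory names that do not exist under root."""
    root = Path(root)
    return [name for name in expected if not (root / name).is_dir()]


if __name__ == "__main__":
    missing = missing_subdirs("datasets", EXPECTED_DATASETS)
    print("missing:", missing if missing else "none")
```

Only the datasets needed for a given experiment have to be present, so a non-empty result is not necessarily an error.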

(Optional) If you want to generate your own custom basic 3D shape dataset, run the following script:

python data_generation.py

The above script generates the regularization dataset. The interpolation dataset comes from our other research (https://github.com/seungyeon-k/DSQNet-public).

Pretrained model

Pre-trained models should be stored in pretrained/. The pre-trained models are provided through the Google Drive link. After setup, the pretrained/ directory should look as follows.

pretrained
├── interpolation_config
│   ├── vanilla
│   └── regularized
└── regularization_config
    ├── vanilla
    └── regularized

Running

Training

The training script is train.py.

--config specifies a path to a configuration yml file.
--logdir specifies a directory where the results will be saved.
--run specifies a name for an experiment.
--device specifies a GPU number to use.

Training on the synthetic 3D basic shape dataset for the interpolation experiment (Section 4.1.1):

python train.py --config configs/interpolation_config.yml --model.fm_reg {F}

F is either None or 10000000.
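The `--model.fm_reg` flag reads like a dotted override of a key in the YAML configuration. How train.py actually handles it is not described here, so the following is only a sketch of one way such overrides could be merged into a nested config. The functions `parse_value` and `apply_overrides`, and the example key `arch`, are hypothetical.

```python
import ast


def parse_value(text):
    """Turn a CLI string such as 'None' or '10000000' into a Python value."""
    try:
        return ast.literal_eval(text)
    except (ValueError, SyntaxError):
        return text  # keep anything that is not a Python literal as a string


def apply_overrides(config, args):
    """Merge ['--a.b', 'value', ...] pairs into the nested dict `config` in place."""
    for flag, raw in zip(args[0::2], args[1::2]):
        keys = flag.lstrip("-").split(".")
        node = config
        for key in keys[:-1]:
            node = node.setdefault(key, {})  # create intermediate levels if absent
        node[keys[-1]] = parse_value(raw)
    return config


if __name__ == "__main__":
    # Hypothetical config contents, for illustration only.
    cfg = {"model": {"arch": "fcnet", "fm_reg": None}}
    print(apply_overrides(cfg, ["--model.fm_reg", "10000000"]))
```

Under this reading, passing `None` on the command line leaves the regularization switched off, while a number sets its weight.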

Training on the synthetic 3D basic shape dataset for the regularization experiment (Section 4.1.2):

python train.py --config configs/regularization_config.yml --model.fm_reg {F}

F is either None or 10000000.

Training on the standard benchmark datasets (Section 4.2, Table 2):

python train.py --config configs/{M}_config.yml --model.fm_reg {F}

M is either fcnet, foldingnet, pointcapsnet, or dgcnnfcnet.
F is either None or a positive real number (see Appendix D.2.1 for the values used).

Training on the standard benchmark datasets for the noisy and semi-supervised settings (Section 4.2, Table 3 and Table 4):

python train.py --config configs/fcnet_{E}_{V}_config.yml --model.fm_reg {F}

E is either noise or semi.
If E is noise, then V is either 1, 5, 10, or 20. If E is semi, then V is either 50, 10, 5, or 1.
F is either None or 8000.
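The combinations of E, V, and F above add up to 16 training runs. As a convenience, the command lines can be generated rather than typed by hand. This is a minimal sketch that only builds the strings from the pattern shown above; the function name `build_commands` is hypothetical, and it does not check that each config file exists.

```python
# Values listed above for the noisy (E = noise) and semi-supervised (E = semi) settings.
SETTINGS = {"noise": [1, 5, 10, 20], "semi": [50, 10, 5, 1]}
FM_REG_VALUES = ["None", "8000"]


def build_commands(settings=SETTINGS, fm_reg_values=FM_REG_VALUES):
    """Return one train.py command line per (E, V, F) combination."""
    commands = []
    for experiment, values in settings.items():
        for value in values:
            for fm_reg in fm_reg_values:
                config = f"configs/fcnet_{experiment}_{value}_config.yml"
                commands.append(
                    f"python train.py --config {config} --model.fm_reg {fm_reg}"
                )
    return commands


if __name__ == "__main__":
    for command in build_commands():
        print(command)
```

The printed lines can be pasted into a shell script or passed to a job scheduler.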

Interpolation

The interpolation code consists of two Python scripts: interpolation.py and interpolation_renderer.py.

interpolation.py makes a tensorboard file in interpolation_results/tensorboard where the results are visualized and a npy file in interpolation_results/data for rendering the (beautiful) result figures.
- --example specifies an index for the interpolation examples. The value should be 0, 1, or 2.
- --device specifies a GPU number to use.
- --run specifies a name for an experiment.
- If you want to see the results in tensorboard after code execution, run this code:
```
tensorboard --logdir interpolation_results/tensorboard --host {ip address}
```
interpolation_renderer.py reads a npy file in interpolation_results/data and makes a png figure in interpolation_results/png and a simple animated gif in interpolation_results/gif.
- --run specifies a name for the experiment to render figures.

Example code:

python interpolation.py --example 0 --run run1
python interpolation_renderer.py --run run1

Sample gif images for interpolation:

Figure 3: Linear interpolants (left) and geodesic interpolants under the Euclidean (middle) and info-Riemannian metrics (right).

Regularization

Similarly, the regularization code consists of two Python scripts: regularization.py and regularization_renderer.py.

regularization.py makes a tensorboard file in regularization_results/tensorboard and a npy file in regularization_results/data.
- --device specifies a GPU number to use.
- --run specifies a name for an experiment.
- If you want to see the results in tensorboard after code execution, run this code:
```
tensorboard --logdir regularization_results/tensorboard --host {ip address}
```
regularization_renderer.py reads a npy file in regularization_results/data and makes a png figure in regularization_results/png.
- --run specifies a name for the experiment to render figures.

Example code:

python regularization.py --run run2
python regularization_renderer.py --run run2

Warning: The rendering scripts interpolation_renderer.py and regularization_renderer.py do not work on a server (i.e., without a connected display). If you want to render figures on a server, try Open3D headless rendering.
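Before launching a renderer script, it can help to check whether a display is reachable at all. The sketch below is a best-effort heuristic and not part of the repository: the function `display_available` is hypothetical, and it assumes that on Linux a graphical session exposes either the `DISPLAY` (X11) or `WAYLAND_DISPLAY` (Wayland) environment variable.

```python
import os
import sys


def display_available(environ=None, platform=None):
    """Best-effort guess at whether a graphical display is reachable."""
    environ = os.environ if environ is None else environ
    platform = sys.platform if platform is None else platform
    if platform.startswith("linux"):
        # X11 sessions set DISPLAY; Wayland sessions set WAYLAND_DISPLAY.
        return bool(environ.get("DISPLAY") or environ.get("WAYLAND_DISPLAY"))
    # On other platforms this sketch simply assumes a desktop session.
    return True


if __name__ == "__main__":
    if display_available():
        print("Display found: the renderer scripts should be able to open a window.")
    else:
        print("No display found: consider Open3D headless rendering instead.")
```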

Citation

If you found this repository useful in your research, please consider citing:

@inproceedings{lee2022statistical,
  title={A Statistical Manifold Framework for Point Cloud Data},
  author={Lee, Yonghyeon and Kim, Seungyeon and Choi, Jinwon and Park, Frank},
  booktitle={International Conference on Machine Learning},
  pages={12378--12402},
  year={2022},
  organization={PMLR}
}
