PointNet: Point Cloud Processing Network

KAIST CS479: Machine Learning for 3D Data (Fall 2023)
Programming Assignment 1

Instructor: Minhyuk Sung (mhsung [at] kaist.ac.kr)
TA: Hyunjin Kim (rlaguswls98 [at] kaist.ac.kr)

Due: September 17, 2023 (Sunday) 23:59 KST

Where to Submit: Gradescope

Abstract

PointNet is a fundamental yet powerful neural network for processing point cloud data. In this first tutorial, we will learn how to use PointNet for different tasks, including classification, auto-encoding, and segmentation, by implementing them. Since we aim to make you familiar with implementing neural network models and losses using PyTorch, we provide skeleton code, and all you have to do is fill in the TODO parts. Before you start implementing, please read the PointNet paper together with our brief summary and the provided code carefully, and check how the code flows. We also recommend reading up on how to write code with PyTorch (PyTorch Tutorial link).

Table of Contents

Abstract
Setup
Code Structure
Tasks
What to Submit
Grading
Further Readings

Setup

We recommend creating a virtual environment using conda. You can create and activate the conda environment with the following commands:

conda create -n pointnet python=3.9
conda activate pointnet

After that, install PyTorch 1.13.0 and the other essential packages by running:

conda install pytorch=1.13.0 torchvision pytorch-cuda=11.6 -c pytorch -c nvidia
conda install -c fvcore -c iopath -c conda-forge fvcore iopath
conda install pytorch3d -c pytorch3d

Lastly, install the remaining packages using pip:

pip install tqdm h5py matplotlib
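After installation, a quick check that the packages can be found may save time later. The following is a minimal sketch using only the standard library; the package list simply mirrors the install commands above, and the helper name `find_missing` is hypothetical rather than part of the repository.

```python
import importlib.util

# Import names of the packages installed by the commands above.
REQUIRED = ["torch", "torchvision", "fvcore", "iopath",
            "pytorch3d", "tqdm", "h5py", "matplotlib"]


def find_missing(packages):
    """Return the names in `packages` that cannot be located for import."""
    return [name for name in packages
            if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

This only confirms that each package is discoverable in the active environment; it does not verify versions or CUDA support.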

Code Structure

The overall structure of this repository is shown below. Basically, all you have to do in this tutorial is implement the models and losses by filling in the TODO parts of the four files below.

TODOs

- model.py
- train_cls.py
- train_ae.py
- train_seg.py

pointnet
│ 
├── model.py              <- PointNet models implementation. <TODO>
│ 
├── dataloaders 
│   ├── modelnet.py         <- Dataloader of ModelNet40 dataset.
│   └── shapenet_partseg.py <- Dataloader of ShapeNet Part Annotation dataset. 
│
├── utils
│   ├── metrics.py          <- Easy-to-use code to compute metrics.
│   ├── misc.py             <- Point cloud normalization ft. and code to save rendered point clouds. 
│   └── model_checkpoint.py <- Automatically save model checkpoints during training.
│
├── train_cls.py          <- Run classification. <TODO>
├── train_ae.py           <- Run auto-encoding. <TODO>
├── train_seg.py          <- Run part segmentation. <TODO>
├── visualization.ipynb   <- Simple point cloud visualization example code.
│
├── data                  <- Project data.
│   ├── modelnet40_ply_hdf5_2048     <- ModelNet40   
│   └── shapenet_part_seg_hdf5_data  <- ShapeNet Part Annotation
│
└── checkpoints           <- Directory storing checkpoints. 
    ├── classification
    │    └── mm-dd_HH-MM-SS/epoch=16-val_acc=88.6.ckpt
    ├── auto_encoding
    └── segmentation

Tasks

Task 0. Global Feature Extraction

PointNet takes 3D point clouds (# points, 3) as input and extracts a 1024-dimensional global feature vector, which encodes the geometric information of the input point cloud. This global feature vector is used in the downstream tasks: point cloud classification, segmentation, and auto-encoding. In this part, you will implement the PointNetFeat model, which outputs only the global feature vector, so that you can reuse it when implementing the remaining three tasks.

💡 The figure above is a guideline for the implementation; your code does not need to match it exactly. You can assume that each MLP layer in the figure consists of an MLP, batch normalization, and an activation.

TODOs

- model.py

Fill in the TODO in model.py > PointNetFeat class

※ When implementing PointNetFeat, you can use STDkd, which we provide in model.py.
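The core idea of the global feature extractor, a per-point MLP followed by max pooling over points, can be sketched as below. This is an illustration under assumptions, not the skeleton's PointNetFeat: the class name `GlobalFeatureSketch` is hypothetical, the layer widths (64, 128, 1024) are an illustrative choice, and the input and feature transforms are left out.

```python
import torch
import torch.nn as nn


class GlobalFeatureSketch(nn.Module):
    """Hypothetical stand-in for a global feature extractor (no T-Nets)."""

    def __init__(self, out_dim: int = 1024):
        super().__init__()
        # Conv1d with kernel size 1 applies the same MLP to every point.
        self.shared_mlp = nn.Sequential(
            nn.Conv1d(3, 64, 1), nn.BatchNorm1d(64), nn.ReLU(),
            nn.Conv1d(64, 128, 1), nn.BatchNorm1d(128), nn.ReLU(),
            nn.Conv1d(128, out_dim, 1), nn.BatchNorm1d(out_dim), nn.ReLU(),
        )

    def forward(self, points: torch.Tensor) -> torch.Tensor:
        # points: (B, N, 3) -> (B, 3, N), the layout Conv1d expects.
        x = self.shared_mlp(points.transpose(1, 2))
        # Max over the point dimension makes the result order-invariant.
        return x.max(dim=2).values  # (B, out_dim)


model = GlobalFeatureSketch().eval()
feat = model(torch.rand(2, 128, 3))  # two clouds of 128 points each
print(feat.shape)
```

Because the max is taken over the point dimension, shuffling the points of a cloud leaves the global feature unchanged, which is the property that lets PointNet consume unordered point sets.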

Task 1. Point Cloud Classification

In point cloud classification, PointNet takes point clouds (# points, 3) as input and generates a 1024-dimensional global feature vector, which is then reduced to the number of categories (k) through a multi-layer perceptron, producing a logit for each category.

💡 The figure above is a guideline for the implementation; your code does not need to match it exactly.

TODOs

- model.py
- train_cls.py

Fill in the TODO in model.py > PointNetCls
Fill in the TODO in train_cls.py > step and train_step
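As a rough illustration of what the classification head and its training objective might look like, here is a sketch under assumptions. The names `cls_head` and `loss_and_accuracy` are hypothetical and do not correspond to the skeleton's step or train_step signatures; the hidden widths and dropout rate are illustrative choices.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

NUM_CLASSES = 40  # ModelNet40 has 40 object categories

# Hypothetical head mapping the 1024-dim global feature to k logits.
cls_head = nn.Sequential(
    nn.Linear(1024, 512), nn.BatchNorm1d(512), nn.ReLU(),
    nn.Linear(512, 256), nn.BatchNorm1d(256), nn.ReLU(),
    nn.Dropout(p=0.3),
    nn.Linear(256, NUM_CLASSES),
)


def loss_and_accuracy(logits: torch.Tensor, labels: torch.Tensor):
    """Cross-entropy loss and accuracy for one batch.

    logits: (B, k) raw scores; labels: (B,) integer class indices.
    """
    loss = F.cross_entropy(logits, labels)
    acc = (logits.argmax(dim=1) == labels).float().mean()
    return loss, acc


logits = cls_head(torch.rand(8, 1024))            # (8, 40)
labels = torch.randint(0, NUM_CLASSES, (8,))
loss, acc = loss_and_accuracy(logits, labels)
```

In a training loop, `loss.backward()` followed by an optimizer step would update the parameters; `F.cross_entropy` expects raw logits, so no softmax is applied beforehand.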

You can start training the model with the following command. At the end of training, the model is automatically tested on the ModelNet40 dataset.

python train_cls.py

Also, you can change batch_size, lr, and epochs by using the command below.

python train_cls.py --batch_size {batch_size you want} --lr {lr you want} --epochs {epochs you want}
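For readers unfamiliar with command-line flags, the sketch below shows how options like these are commonly parsed with `argparse`. It is not the training script's actual parser, and the default values are placeholders rather than the skeleton's defaults.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="Training options (sketch)")
    # Default values are placeholders, not the skeleton's defaults.
    parser.add_argument("--batch_size", type=int, default=32)
    parser.add_argument("--lr", type=float, default=1e-3)
    parser.add_argument("--epochs", type=int, default=100)
    return parser


# Equivalent to running: python script.py --batch_size 64 --lr 0.0005
args = build_parser().parse_args(["--batch_size", "64", "--lr", "0.0005"])
print(args.batch_size, args.lr, args.epochs)
```

Any flag that is omitted on the command line falls back to its default, so all three options can be changed independently.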

During training, whenever your model achieves a new best result, a model checkpoint is saved automatically as pointnet/classification/MM-DD_HH-MM-SS/Classification_ckpt_epoch{epoch}_metric:{val_Acc}.ckpt.

On ModelNet40 test set:

	Overall Acc
Paper	89.2 %
Ours (w/o feature trans.)	88.6 %
Ours (w/ feature trans.)	87.7 %

Task 2. Point Cloud Part Segmentation

For segmentation tasks, PointNet concatenates the second transformed feature with the global latent vector to form a point-wise feature tensor, which is then passed through an MLP to produce logits for m part labels.

💡 The figure above is a guideline for the implementation; your code does not need to match it exactly.
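The concatenation step described for this task can be illustrated with tensor shapes alone. In the sketch below, the 64-dimensional per-point feature and the head widths are assumptions, random tensors stand in for real network outputs, and m = 50 reflects the 50 part labels of the ShapeNet part dataset.

```python
import torch
import torch.nn as nn

B, N, m = 2, 128, 50  # batch size, points per cloud, number of part labels

point_feat = torch.rand(B, 64, N)   # per-point features (assumed 64-dim)
global_feat = torch.rand(B, 1024)   # global feature from max pooling

# Repeat the global feature for every point, then concatenate on channels.
global_tiled = global_feat.unsqueeze(2).expand(-1, -1, N)   # (B, 1024, N)
seg_input = torch.cat([point_feat, global_tiled], dim=1)    # (B, 1088, N)

# A shared per-point MLP (Conv1d with kernel size 1) maps to m logits.
seg_head = nn.Sequential(
    nn.Conv1d(1088, 256, 1), nn.ReLU(),
    nn.Conv1d(256, m, 1),
)
logits = seg_head(seg_input)        # (B, m, N): one logit per part per point
```

Each point thus sees both its own local feature and a summary of the whole shape, which is what allows a per-point classifier to assign part labels.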

TODOs

- model.py
- train_seg.py

Fill in the TODO in model.py > PointNetPartSeg
Fill in the TODO in train_seg.py > step and train_step

You can start training the model with the following command. At the end of training, the model is automatically tested on the ShapeNet part dataset.

python train_seg.py

Also, you can change batch_size, lr, and epochs by using the command below.

python train_seg.py --batch_size {batch_size you want} --lr {lr you want} --epochs {epochs you want}

While train_seg.py is running, you will see progress bars.

The ShapeNet part dataset is downloaded automatically to the data directory the first time train_seg.py is executed.

We provide the code to measure instance mIoU in utils/metrics.py.
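To make the metric concrete, here is a plain-Python sketch of one common definition of instance mIoU: compute the IoU for each part of a shape's category, average over parts, then average over shapes. The provided code in utils/metrics.py may differ in details, for example in how parts absent from both prediction and ground truth are handled; the function names here are hypothetical.

```python
def shape_iou(pred, target, part_ids):
    """Mean IoU over `part_ids` for one shape.

    pred, target: per-point part labels (sequences of ints, same length).
    part_ids: the part labels that belong to this shape's category.
    """
    ious = []
    for part in part_ids:
        inter = sum(1 for p, t in zip(pred, target) if p == part and t == part)
        union = sum(1 for p, t in zip(pred, target) if p == part or t == part)
        # Convention assumed here: a part absent from both counts as IoU 1.
        ious.append(1.0 if union == 0 else inter / union)
    return sum(ious) / len(ious)


def instance_miou(preds, targets, part_ids_per_shape):
    """Average the per-shape IoU over all shapes (instances)."""
    scores = [shape_iou(p, t, ids)
              for p, t, ids in zip(preds, targets, part_ids_per_shape)]
    return sum(scores) / len(scores)


# One shape with four points and two possible parts (0 and 1).
print(shape_iou([0, 0, 1, 1], [0, 1, 1, 1], part_ids=[0, 1]))
```

In the example, part 0 has IoU 1/2 and part 1 has IoU 2/3, so the shape's score is their mean.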

During training, whenever your model achieves a new best result, a model checkpoint is saved automatically as pointnet/segmentation/MM-DD_HH-MM-SS/Segmentation_ckpt_epoch{epoch}_metric:{val_mIoU}.ckpt.

On ShapeNet Part test set:

	ins. mIoU
Paper	83.7 %
Ours	83.6 %

Task 3. Point Cloud Auto-Encoding

The PointNet auto-encoder comprises an encoder that takes point clouds as input and produces a 1024-dimensional global feature vector, and an MLP decoder that expands this latent vector incrementally until it reaches N*3 values. This tensor is then reshaped into (N, 3), representing N points in 3D coordinates.

💡 The figure above is a guideline for the implementation; your code does not need to match it exactly.

TODOs

- model.py
- train_ae.py

Fill in the TODO in model.py > PointNetAutoEncoder
Fill in the TODO in train_ae.py > step and train_step

💡 We recommend not using the T-Net (input transform and feature transform) in the AE task. That's why we provide the PointNetFeat class without T-Net inside the PointNetAutoEncoder class definition.
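The decoder described in this task, an MLP that widens the latent vector to N*3 values and reshapes it, might look roughly like the sketch below. The class name `DecoderSketch`, the hidden widths, and the default N = 2048 are assumptions for illustration and are not taken from the skeleton.

```python
import torch
import torch.nn as nn


class DecoderSketch(nn.Module):
    """Hypothetical MLP decoder: (B, latent_dim) -> (B, num_points, 3)."""

    def __init__(self, num_points=2048, latent_dim=1024, hidden=(2048, 4096)):
        super().__init__()
        self.num_points = num_points
        layers, in_dim = [], latent_dim
        for width in hidden:
            # Each hidden stage: linear layer, batch norm, activation.
            layers += [nn.Linear(in_dim, width), nn.BatchNorm1d(width), nn.ReLU()]
            in_dim = width
        # The final layer has no activation so coordinates are unconstrained.
        layers.append(nn.Linear(in_dim, num_points * 3))
        self.mlp = nn.Sequential(*layers)

    def forward(self, latent: torch.Tensor) -> torch.Tensor:
        out = self.mlp(latent)                   # (B, num_points * 3)
        return out.view(-1, self.num_points, 3)  # (B, num_points, 3)


decoder = DecoderSketch(num_points=16, latent_dim=32, hidden=(64, 128))
points = decoder(torch.rand(4, 32))  # small sizes keep the demo light
print(points.shape)
```

The small sizes in the usage lines are only there to keep the demo cheap; the defaults correspond to a 1024-dimensional latent vector decoded into 2048 points.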

You can start training the model with the following command. At the end of training, the model is automatically tested on the ModelNet40 dataset.

python train_ae.py

Also, you can change batch_size, lr, and epochs by using the command below.

python train_ae.py --batch_size {batch_size you want} --lr {lr you want} --epochs {epochs you want}

During training, whenever your model achieves a new best result, a model checkpoint is saved automatically as pointnet/auto_encoding/MM-DD_HH-MM-SS/AutoEncoding_ckpt_epoch{epoch}_metric:{val_CD}.ckpt.

On ModelNet40 test set:

	Chamfer Dist.
Ours	0.0043
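For reference, the Chamfer distance used as the metric above compares two point sets by matching each point to its nearest neighbor in the other set. The plain-Python sketch below uses squared Euclidean distances averaged over each set, which is one common convention; whether the reported number uses exactly this variant is not stated here. The environment set up earlier installs pytorch3d, whose `pytorch3d.loss.chamfer_distance` offers a batched implementation.

```python
def chamfer_distance(a, b):
    """Symmetric Chamfer distance between two non-empty point sets.

    a, b: lists of (x, y, z) tuples. Uses squared Euclidean distance
    and averages over each set; other conventions exist.
    """
    def sq_dist(p, q):
        return sum((pi - qi) ** 2 for pi, qi in zip(p, q))

    def one_way(src, dst):
        # Mean squared distance from each source point to its nearest target.
        return sum(min(sq_dist(p, q) for q in dst) for p in src) / len(src)

    return one_way(a, b) + one_way(b, a)


cloud_a = [(0.0, 0.0, 0.0)]
cloud_b = [(1.0, 0.0, 0.0), (0.0, 2.0, 0.0)]
print(chamfer_distance(cloud_a, cloud_b))
```

This version is quadratic in the number of points and is meant only to show the definition; for training, a vectorized implementation is the practical choice.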

What to Submit

Compile the following files as a ZIP file named {NAME}_{STUDENT_ID}.zip and submit the file via Gradescope.

The four code files that you implemented: model.py, train_ae.py, train_cls.py, and train_seg.py;
The model checkpoint file that achieves the best performance for each of classification, segmentation, and auto-encoding;
A screenshot taken at the end of training for each of classification, segmentation, and auto-encoding.
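If you prefer to build the archive from a script, the sketch below shows one way to do it with the standard library. The helper `make_submission` is hypothetical, and the checkpoint and screenshot paths in the usage comment are placeholders that you would replace with your own files.

```python
import zipfile
from pathlib import Path


def make_submission(zip_name, files):
    """Write `files` into `zip_name`, storing each under its base name."""
    with zipfile.ZipFile(zip_name, "w", zipfile.ZIP_DEFLATED) as zf:
        for f in files:
            path = Path(f)
            if not path.is_file():
                # Fail early so a missing item is noticed before submitting.
                raise FileNotFoundError(f"Missing submission item: {path}")
            zf.write(path, arcname=path.name)
    return zip_name


# Example call (placeholder names; substitute your own name, ID, and files):
# make_submission("NAME_STUDENTID.zip",
#                 ["model.py", "train_ae.py", "train_cls.py", "train_seg.py",
#                  "path/to/cls.ckpt", "path/to/seg.ckpt", "path/to/ae.ckpt",
#                  "cls.png", "seg.png", "ae.png"])
```

Because every file is stored under its base name, make sure that no two items share the same file name.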

Screenshot Example:

Grading

You will receive a zero score if:

you do not submit,
your code is not executable in the Python environment we provided, or
you modify any code outside of the section marked with TODO.

Plagiarism in any form will also result in a zero score and will be reported to the university.

Your score will incur a 10% deduction for each missing item listed in the What to Submit section.

Otherwise, you will receive up to 30 points from this assignment that count toward your final grade.

Evaluation Criterion	Classification (Acc)	Segmentation (mIoU)	Auto-Encoding (CD)
Success Condition (100%)	0.85	0.80	0.005
Success Condition (50%)	0.55	0.60	0.030

As shown in the table above, each evaluation metric is assigned up to 10 points. In particular,

Classification (Task 1)
- You will receive 10 points if the reported value is greater than or equal to the success condition (100%);
- Otherwise, you will receive 5 points if the reported value is greater than or equal to the success condition (50%).
Segmentation (Task 2)
- You will receive 10 points if the reported value is greater than or equal to the success condition (100%);
- Otherwise, you will receive 5 points if the reported value is greater than or equal to the success condition (50%).
Auto-Encoding (Task 3)
- You will receive 10 points if the reported value is less than or equal to the success condition (100%);
- Otherwise, you will receive 5 points if the reported value is less than or equal to the success condition (50%).
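The thresholds above can be written as a small function, which may help when checking your own results. This is only an illustrative sketch of the per-task rule (the function name `task_points` is hypothetical) and it ignores the zero-score conditions and the deductions for missing items. The example values are the "Ours" numbers reported in the task sections.

```python
def task_points(value, full, half, higher_is_better=True):
    """Return 10, 5, or 0 points for one task's reported metric."""
    def meets(threshold):
        # Accuracy and mIoU must reach the threshold; Chamfer distance
        # must not exceed it.
        return value >= threshold if higher_is_better else value <= threshold

    if meets(full):
        return 10
    if meets(half):
        return 5
    return 0


total = (
    task_points(0.886, full=0.85, half=0.55)      # classification accuracy
    + task_points(0.836, full=0.80, half=0.60)    # segmentation mIoU
    + task_points(0.0043, full=0.005, half=0.030, higher_is_better=False)
)
print(total)
```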
