
RandLA-Net example #5117

Merged: 77 commits into pyg-team:master on Dec 2, 2022

Conversation

CharlesGaydon
Contributor

@CharlesGaydon CharlesGaydon commented Aug 2, 2022

The paper: RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Context

A good PyTorch implementation of RandLA-Net that leverages PyTorch Geometric standards and modules is still missing.
In torch-points3d, the current modules are outdated, which causes confusion among users.

The implementation with the most stars on GitHub is aRI0U/RandLA-Net-pytorch, which has nasty dependencies (torch_points or torch_points_kernels), performs slow back-and-forth transfers between CPU and GPU when calling KNNs, and only accepts fixed-size point clouds.

Proposal

I would like to implement RandLA-Net as part of PyG's examples. For now I would tackle the ModelNet classification task, following the structure of the other examples (pointnet2_classification in particular).

The RandLA-Net paper focuses on segmentation, but for classification I would simply add an MLP + global max pooling after the first DilatedResidualBlocks.
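A minimal sketch of that pooling step (plain Python, illustrative only; it mirrors the semantics of PyG's global_max_pool, where batch[i] is the index of the sample that point i belongs to):

```python
def global_max_pool(x, batch):
    # x: per-point feature vectors; batch[i]: sample index of point i.
    num_graphs = max(batch) + 1
    out = [[float("-inf")] * len(x[0]) for _ in range(num_graphs)]
    for feats, b in zip(x, batch):
        # Element-wise maximum over all points of the same sample.
        out[b] = [max(cur, f) for cur, f in zip(out[b], feats)]
    return out
```

For example, global_max_pool([[1., 5.], [3., 2.], [0., 7.]], [0, 0, 1]) returns [[3.0, 5.0], [0.0, 7.0]]; an MLP would then map each pooled vector to class logits.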

The RandLA-Net architecture is conceptually close to PointNet++, augmented with several tricks to speed things up (random sampling instead of farthest point sampling), use more context (a sort of dilated KNN), and encode local information better (by explicitly computing absolute positions, relative positions, and Euclidean distances between points in a neighborhood, and by applying self-attention to these features).
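For the local encoding, the paper concatenates, for each neighbor pair, the two positions, their relative position, and their Euclidean distance before feeding an MLP. A minimal sketch (plain Python; relative_pos_encoding is a hypothetical name, and edge_index is taken as (target, source) pairs as in message passing):

```python
import math

def relative_pos_encoding(pos, edge_index):
    # pos: 3D coordinates per point; edge_index: (i, j) pairs, j a neighbor of i.
    feats = []
    for i, j in edge_index:
        pi, pj = pos[i], pos[j]
        diff = [a - b for a, b in zip(pi, pj)]      # relative position p_i - p_j
        dist = math.sqrt(sum(d * d for d in diff))  # Euclidean distance
        feats.append(pi + pj + diff + [dist])       # 3 + 3 + 3 + 1 = 10 features
    return feats
```

In RandLA-Net these raw geometric features are passed through a shared MLP and then reweighted by attention scores.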

If I have some success, I will take on the segmentation task as well (which is what interests me for my own project anyway).

Where I am at

I have a working implementation at examples/randlanet_classification.py. I still have to review it to make sure that I am following the paper as closely as possible, but I think I am on the right track.

I would love some guidance on how to move forward. In particular:

  • Am I using the MessagePassing modules correctly?
  • What should I aim for in terms of accuracy on ModelNet?
  • Should I stick strictly to the paper, or adapt the architecture to ModelNet?

Indeed, the hyperparameters were chosen by the authors for large-scale Lidar data rather than for small objects, which could make convergence take much longer than needed.

With 4 DilatedResidualBlocks (as in the paper), we reach ~57% accuracy at epoch 200.

With 3 DilatedResidualBlocks, we reach up to 75% accuracy by epoch 20.

With only 2 DilatedResidualBlocks, we reach 90% accuracy at epoch 81, getting closer to the leaderboard for the ModelNet10 challenge.

@CharlesGaydon CharlesGaydon changed the title [WIP] Implementation of RandLa-Net in pytorch geometric's examples [WIP] RandLa-Net in pytorch geometric's examples Aug 2, 2022
@codecov

codecov bot commented Aug 2, 2022

Codecov Report

Merging #5117 (545b2cb) into master (07ba384) will decrease coverage by 1.86%.
The diff coverage is 100.00%.

❗ Current head 545b2cb differs from pull request most recent head cb66f4b. Consider uploading reports for the commit cb66f4b to get more accurate results

@@            Coverage Diff             @@
##           master    #5117      +/-   ##
==========================================
- Coverage   86.20%   84.34%   -1.87%     
==========================================
  Files         362      363       +1     
  Lines       20477    20487      +10     
==========================================
- Hits        17653    17279     -374     
- Misses       2824     3208     +384     
Impacted Files Coverage Δ
torch_geometric/nn/pool/decimation.py 100.00% <100.00%> (ø)
torch_geometric/nn/models/dimenet_utils.py 0.00% <0.00%> (-75.52%) ⬇️
torch_geometric/nn/models/dimenet.py 14.90% <0.00%> (-52.76%) ⬇️
torch_geometric/profile/profile.py 36.27% <0.00%> (-26.48%) ⬇️
torch_geometric/nn/conv/utils/typing.py 81.25% <0.00%> (-17.50%) ⬇️
torch_geometric/nn/pool/asap.py 92.10% <0.00%> (-7.90%) ⬇️
torch_geometric/nn/inits.py 67.85% <0.00%> (-7.15%) ⬇️
torch_geometric/nn/dense/linear.py 87.40% <0.00%> (-5.93%) ⬇️
torch_geometric/transforms/add_self_loops.py 94.44% <0.00%> (-5.56%) ⬇️
torch_geometric/nn/models/attentive_fp.py 95.83% <0.00%> (-4.17%) ⬇️
... and 13 more


@CharlesGaydon
Contributor Author

CharlesGaydon commented Aug 4, 2022

I implemented RandLA-Net for segmentation as well, and did some small refactoring.
The model seems to learn quite well, reaching 70% accuracy after 3 epochs. It takes ~1s to run on CPU.

Unfortunately, I am not able to fully test it on ShapeNet's Airplane task due to PyTorch conflicts that prevent me from using CUDA :/

I need to install the master branch of PyG to run the segmentation training. When I follow the instructions to build PyTorch Geometric from source, I see that I am working on a machine with CUDA 11.4, which means I have to build PyTorch against CUDA 11.4 before installing the dependencies (torch_scatter, etc.) and then PyTorch Geometric from the master branch. However, PyTorch + CUDA 11.4 does not seem to be really supported: I could not find out how to build it from source, and using cudatoolkit=11.4 in PyTorch's conda install does not work.

Maybe I am missing something here... Any help would be appreciated :)

@rusty1s
Member

rusty1s commented Aug 5, 2022

I suggest simply installing the wheels built for CUDA 11.3; they should work even with CUDA 11.4.
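Concretely, that would look something like the following (versions are illustrative; TORCH must match the installed PyTorch release, per the PyG installation instructions of that time):

```shell
# Install the CUDA 11.3 wheels of the compiled dependencies
# (they run fine on a CUDA 11.4 driver), then PyG from master.
TORCH=1.12.0
pip install torch-scatter torch-sparse -f https://data.pyg.org/whl/torch-${TORCH}+cu113.html
pip install git+https://github.com/pyg-team/pytorch_geometric.git
```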

@CharlesGaydon
Contributor Author

Thank you, this worked like a charm. I think this is ready for review. :)

Right now both scripts follow the paper's architecture (in terms of hyperparameters, depth, and number of channels in the MLPs). Those were chosen by the authors for large-scale aerial Lidar, not for ModelNet and ShapeNet. For ModelNet, removing a few layers enables good accuracy. I was not able to replicate this for ShapeNet, which quickly plateaus around 70% train accuracy (vs. 90% train accuracy / 79% test IoU for PointNet++).

I think it is cleaner to keep everything as it is, following the paper. We could also change the benchmark for this model (to e.g. S3DIS), but I am not sure that is worth the extra work for an example.

@CharlesGaydon CharlesGaydon marked this pull request as ready for review August 6, 2022 10:46
@CharlesGaydon CharlesGaydon marked this pull request as draft August 25, 2022 16:31
@CharlesGaydon
Contributor Author

I identified some differences between my implementation and the original paper (in particular in terms of batch norms, activations, and number of channels). Will come back with fixes and modifications!

@rusty1s
Member

rusty1s commented Aug 26, 2022

Thanks! Sorry for the delay in review. Please keep me posted.

@CharlesGaydon
Contributor Author

@rusty1s Hi!
Back at the office! I moved the function that computes decimation indices to its own decimation module under torch_geometric.nn.pool, along with a few simple tests.
On a side note, I always wonder what ptr stands for (pointer, maybe?). Feel free to edit its docstring if needed (currently: ptr (LongTensor): indices of samples in the batch.).
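For what it's worth, ptr is a CSR-style pointer: the points of sample i occupy positions ptr[i]:ptr[i+1] of the batched tensor. This is not PyG's actual implementation, but per-sample random decimation driven by ptr can be sketched like this (plain Python, hypothetical helper):

```python
import random

def decimation_indices(ptr, decimation_factor):
    # ptr[i]:ptr[i+1] delimits the points of sample i in the batched tensor.
    idx = []
    for start, end in zip(ptr[:-1], ptr[1:]):
        keep = max(1, (end - start) // decimation_factor)  # keep at least one point
        idx.extend(sorted(random.sample(range(start, end), keep)))
    return idx
```

Each sample keeps 1/decimation_factor of its points, so variable-size clouds are decimated consistently within one batch.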

@CharlesGaydon
Contributor Author

@rusty1s @saedrna The gentlest bump on this :)

@rusty1s
Member

rusty1s commented Oct 21, 2022

Yes, I will merge this over the weekend. Sorry for the delay.

CharlesGaydon added a commit to IGNF/myria3d that referenced this pull request Oct 21, 2022
* Update with pyg-team/pytorch_geometric#5117

* Bump minor version to indicate no-model-compatibility

* Update signature for pyg randlanet

* Fix old randlanet signature

* Get rid of legacy implementation of RandLA-Net

* Disable example run until release of a model that is compatible

* Fix misleading batch_size indication for multi-GPUs setting.

* Pyg RandLaNet XP with min/max num_nodes and gradient accumulation.

* Rename XP.

* NoRS XP inherits from base XP.

* 5 epochs of cooldown before reducing lr

* 20 epochs of patience  before reducing lr

* Flake8 corrections.

* Correct model version name in CICD workflow

* Correct config  name in CICD workflow
@CharlesGaydon
Contributor Author

@rusty1s Resolved the conflict in Changelog :)

@CharlesGaydon
Contributor Author

@rusty1s Small bump :)

Member

@rusty1s rusty1s left a comment


Looks all good to me, just the remaining two comments.

examples/randlanet_classification.py — 2 resolved review threads
@rusty1s rusty1s changed the title RandLA-Net in pytorch geometric's examples RandLA-Net example Dec 2, 2022
@rusty1s rusty1s enabled auto-merge (squash) December 2, 2022 06:54
@rusty1s rusty1s merged commit 11c8cbd into pyg-team:master Dec 2, 2022
@CharlesGaydon CharlesGaydon deleted the randlanet branch December 23, 2022 12:43
CharlesGaydon added a commit to IGNF/myria3d that referenced this pull request Jan 23, 2023
* WIP V3.*.* with torch-geometric RandLA-Net implementation (#39)

Development of PyG-RandLA-Net

Co-authored-by: Michel Daab <michel.daab@ign.fr>

Co-authored-by: Charles Gaydon <11660435+CharlesGaydon@users.noreply.github.com>
Co-authored-by: CharlesGaydon <charles.gaydon@gmail.com>