
SplAttN logo

SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion

arXiv GitHub License PyTorch

ICML 2026 Spotlight

Zhaoyang Li, Zhichao You, Tianrui Li

Paper | Pretrained Models

SplAttN overview

Official PyTorch implementation of SplAttN, accepted to ICML 2026 as a Spotlight paper.

📌 Abstract

Although multi-modal learning has advanced point cloud completion, the theoretical mechanisms remain unclear. Recent works attribute success to the connection between modalities, yet we identify that standard hard projection severs this connection: projecting a sparse point cloud onto the image plane yields an extremely sparse support, which hinders visual prior propagation, a failure mode we term Cross-Modal Entropy Collapse. To address this limitation, we propose SplAttN, which replaces hard projection with Differentiable Gaussian Splatting to produce a dense, continuous image-plane representation. By reformulating projection as continuous density estimation, SplAttN avoids collapsed sparse support, facilitates gradient flow, and improves the learnability of the cross-modal connection. Extensive experiments show that SplAttN achieves state-of-the-art performance on PCN and ShapeNet-55/34. Crucially, we use the real-world KITTI benchmark as a stress test of multi-modal reliance. Counterfactual evaluation reveals that while baselines degenerate into unimodal template retrievers insensitive to visual removal, SplAttN maintains a robust dependency on visual cues, validating that our method establishes an effective cross-modal connection.

🔥 News

  • 2026.05.05: The arXiv paper link and citation have been updated.
  • 2026.04.30: SplAttN has been accepted to ICML 2026 and selected as a Spotlight paper.
  • Code and pretrained checkpoints are released. Additional project materials will be added to this repository.

✨ Highlights

  • We identify Cross-Modal Entropy Collapse, a sparse-support failure mode caused by hard 2D projection.
  • We introduce Differentiable Gaussian Splatting to form dense, continuous image-plane representations from sparse 3D observations.
  • SplAttN improves cross-modal connection learnability and achieves state-of-the-art performance on PCN and ShapeNet-55/34.
  • Counterfactual evaluation on KITTI shows stronger reliance on visual cues than prior multi-modal baselines.

🧠 Method Overview

SplAttN is designed for image-guided point cloud completion. The core idea is to maintain a learnable and differentiable connection between sparse 3D geometry and 2D visual priors.

  • Differentiable Gaussian Splatting converts sparse projected points into a dense image-plane representation (see the first sketch below).
  • Attention-based fusion learns cross-modal dependencies between geometric features and visual features.
  • Counterfactual evaluation measures whether a model genuinely relies on visual cues instead of treating images as incidental inputs (see the second sketch below).
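
The exact splatting kernel and fusion blocks are defined in this repository's model code; the sketch below is only a rough, self-contained illustration under simplifying assumptions (isotropic Gaussians, vanilla multi-head attention, and function/class names of our own choosing). It shows why soft splatting yields a dense, differentiable image-plane support, whereas hard projection touches at most N of the H×W pixels:

import torch

def gaussian_splat(uv, height, width, sigma=1.5):
    # Splat projected 2D points as isotropic Gaussians onto a dense map.
    # uv: (N, 2) pixel coordinates; output: (H, W) density map that is
    # differentiable w.r.t. both uv and sigma, unlike hard rasterization.
    ys = torch.arange(height, dtype=uv.dtype, device=uv.device)
    xs = torch.arange(width, dtype=uv.dtype, device=uv.device)
    gy, gx = torch.meshgrid(ys, xs, indexing="ij")            # (H, W)
    d2 = (gx[None] - uv[:, 0, None, None]) ** 2 \
       + (gy[None] - uv[:, 1, None, None]) ** 2               # (N, H, W)
    return torch.exp(-d2 / (2.0 * sigma ** 2)).sum(dim=0)     # dense support

class CrossModalFusion(torch.nn.Module):
    # Point tokens (queries) attend to image tokens (keys/values).
    def __init__(self, dim, heads=4):
        super().__init__()
        self.attn = torch.nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, point_feats, image_feats):
        fused, _ = self.attn(point_feats, image_feats, image_feats)
        return fused

# toy usage: 512 projected points splatted onto a 64x64 plane
uv = torch.rand(512, 2) * 64
density = gaussian_splat(uv, 64, 64)          # (64, 64), no empty "holes"
fusion = CrossModalFusion(dim=256)
pts, img = torch.rand(1, 128, 256), torch.rand(1, 196, 256)
out = fusion(pts, img)                         # (1, 128, 256)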

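Counterfactual evaluation, in the sense used above, can be sketched as running one checkpoint twice, once with the real image and once with the visual input removed, and comparing completion error. The interface below (model(partial, image), chamfer_distance) is a placeholder, not this repository's API:

import torch

def visual_reliance_gap(model, partial, image, gt, chamfer_distance):
    # A model that genuinely exploits the image should degrade when it is
    # removed; a gap near zero suggests the image is an incidental input.
    model.eval()
    with torch.no_grad():
        cd_real = chamfer_distance(model(partial, image), gt)
        cd_blank = chamfer_distance(model(partial, torch.zeros_like(image)), gt)
    return cd_blank - cd_real
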
🛠️ Installation

Requirements

  • Python >= 3.8
  • PyTorch >= 1.8.0
  • CUDA >= 11.1

Key Python dependencies include:

  • torchvision
  • timm
  • open3d
  • h5py
  • opencv-python
  • easydict
  • transforms3d
  • tensorboardX
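
These can typically be installed with pip; versions are not pinned here, so match them to your CUDA and PyTorch setup:

pip install torchvision timm open3d h5py opencv-python easydict transforms3d tensorboardX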

CUDA Extensions

Install PointNet++ operations, KNN_CUDA, and Chamfer Distance:

# PointNet++ ops (sampling/grouping CUDA kernels)
cd pointnet2_ops_lib
python setup.py install --user

# CUDA k-nearest-neighbour search
cd ../KNN_CUDA
python setup.py install --user

# Chamfer Distance metric
cd ../metrics/CD/chamfer3D
python setup.py install --user

📂 Datasets

Download the PCN and ShapeNet-55/34 datasets, then update the dataset paths in the corresponding configuration files.

Example configuration:

# PCN dataset (config_pcn.py)
__C.DATASETS.SHAPENET.PARTIAL_POINTS_PATH = 'data/PCN/%s/partial/%s/%s/%02d.pcd'
__C.DATASETS.SHAPENET.COMPLETE_POINTS_PATH = 'data/PCN/%s/complete/%s/%s.pcd'

# ShapeNet-55 dataset (config_55.py)
__C.DATASETS.SHAPENET55.COMPLETE_POINTS_PATH = 'data/ShapeNet55-34/shapenet_pc/%s'

# ShapeNet-34 / Unseen-21 split
__C.DATASETS.SHAPENET55.CATEGORY_FILE_PATH = 'datasets/ShapeNet34'
# or
__C.DATASETS.SHAPENET55.CATEGORY_FILE_PATH = 'datasets/ShapeNet-Unseen21'
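
The %s and %02d placeholders are filled in by the data loader at runtime, so only the path prefix needs to point at your data root. As a hypothetical illustration of how such a template expands (the concrete fields are typically subset, category id, model id, and view index):

# hypothetical expansion of the partial-scan template
template = 'data/PCN/%s/partial/%s/%s/%02d.pcd'
path = template % ('train', '02691156', 'some_model_id', 0)
# -> 'data/PCN/train/partial/02691156/some_model_id/00.pcd'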

📦 Pretrained Models

Pretrained checkpoints for PCN, ShapeNet-55, and ShapeNet-34 are available on Google Drive.

The pretrained TinyViT backbone can be downloaded from the TinyViT repository.

🚀 Evaluation

Set the checkpoint path in the corresponding configuration file before evaluation:

__C.CONST.WEIGHTS = "path/to/checkpoint.pth"

Run evaluation:

# Single-GPU evaluation
python main_pcn.py --test
python main_55.py --test
python main_34.py --test

# Distributed evaluation
CUDA_VISIBLE_DEVICES=0 python -m torch.distributed.launch \
  --master_port=13222 \
  --nproc_per_node=1 \
  main_pcn.py --test
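
On recent PyTorch releases, torch.distributed.launch is deprecated in favor of torchrun. If you hit deprecation errors, the equivalent invocation is sketched below; note that torchrun passes the local rank via the LOCAL_RANK environment variable rather than a --local_rank argument, so the entry script may need a small adjustment:

# torchrun equivalent (newer PyTorch)
CUDA_VISIBLE_DEVICES=0 torchrun \
  --master_port=13222 \
  --nproc_per_node=1 \
  main_pcn.py --test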

🏋️ Training

# Single-GPU training
python main_pcn.py
python main_55.py
python main_34.py

# Multi-GPU distributed training
CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch \
  --master_port=13222 \
  --nproc_per_node=4 \
  main_pcn.py

CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch \
  --master_port=13222 \
  --nproc_per_node=4 \
  main_55.py

CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch \
  --master_port=13222 \
  --nproc_per_node=4 \
  main_34.py

📖 Citation

If you find this work useful, please cite:

@misc{li2026splattnbridging2d3d,
      title={SplAttN: Bridging 2D and 3D with Gaussian Soft Splatting and Attention for Point Cloud Completion}, 
      author={Zhaoyang Li and Zhichao You and Tianrui Li},
      year={2026},
      eprint={2605.01466},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2605.01466}, 
}

🙏 Acknowledgements

This repository builds upon several excellent open-source projects. We use Mitsuba 3 to visualize point cloud completion results.
