GST: 2D Gaussians Spatial Transport for Point-supervised Density-Regression (AAAI 2026)

Official Implement of AAAI 2026 paper '2D Gaussians Spatial Transport for Point-supervised Density Regression' (GST).

This paper introduces Gaussian Spatial Transport (GST), a novel framework that leverages Gaussian splatting to facilitate transport from the probability measure in the image coordinate space to the annotation map. We propose a Gaussian splatting-based method to estimate pixel-annotation correspondence, which is then used to compute a transport plan derived from Bayesian probability. To integrate the resulting transport plan into standard network optimization in typical computer vision tasks, we derive a loss function that measures discrepancy after transport. Extensive experiments on representative computer vision tasks, including crowd counting and landmark detection, validate the effectiveness of our approach. Compared to conventional optimal transport schemes, GST eliminates iterative transport plan computation during training, significantly improving efficiency.

Overview

We propose Gaussian Spatial Transport (GST), a novel framework that efficiently transports probability measures from image coordinates to annotation space. At its core, GST establishes an interpretable probabilistic correspondence between pixels and annotations by leveraging 2D Gaussian Splatting. This correspondence is then distilled into a fixed transport kernel, a matrix that encodes the static mapping from each pixel to all target annotations, which can be pre-computed before training. During optimization, the model’s predicted density map is pushed forward to the space of ground truth annotations via a single matrix multiplication with this pre-computed kernel. Whereas OT-based methods require iteratively solving a costly optimization problem to determine the transport plan and compute the loss, our loss is simply the discrepancy between the transported density and the ground truth. Extensive experiments demonstrate that GST achieves a strong balance of high accuracy and computational efficiency.

_{Comparison of Direct Regression, OT, and the proposed GST. Direct Regression (a) rely on handcrafted Gaussian kernels to generate pseudo-ground-truth density maps for direct pixel-to-pixel regression, creating a critical dependency on the quality of these synthetic approximations.
In contrast, Optimal Transport (b) is dynamic and more sophisticated; it iteratively re-calculates the best matching plan and its associated cost within every single training loop, making it accurate but computationally slow.
Our proposed GST (c) offers a "best-of-both-worlds" solution: it pre-computes a high-quality, fixed transport pattern based on Gaussian Splatting once before training begins. This allows the loss to be calculated in a single, highly efficient step during training, achieving the accuracy of a transport-based method without the costly iterative overhead.}

_{The GST Pipeline comprises two main components: transport kernel generation and model training. First, the transport kernel K is generated before training by reconstructing the RGB image via 2D Gaussian splatting and establishing pixel-toannotation correspondences (Eq. 7) to then form K (Eq. 5).
Second, during training, K transports the estimated density map to annotations, allowing for the computation of the transported mass discrepancy loss (Eq. 6).}

_{Visualizations. Our GST yielded a more accurate count estimate and a sharp density distribution as OT.}

Requirements

python 3.9 (We recommend to use Anaconda, since many python libs like numpy and sklearn are needed in our code.)
PyTorch 2.0.1-cuda11.8 (we run the code under version 2.0.1 with CUDA-11.8)
gsplat: we modified original gsplat for Gaussian Splatting on 2D images. Please refer to here to install the package.

Dataset preparation

Dataset download

QNRF can be downloaded here
JHU-Crowd++ can be downloaded here
NWPU can be downloaded here

Data preprocess

Due to large sizes of images in the datasets, we preprocess these datasets.

# for qnrf dataset
python preprocess_dataset_ucf.py --input-dataset-path <original data directory> --output-dataset-path <processed data directory>

The folder of the datasets is as follows (taking qnrf for example):

UCF-Train-Val-Test
├── train
│   ├── IMG_0001.jpg
│   ├── IMG_0001.npy
│   ├── IMG_0002.jpg
│   ├── IMG_0002.npy
│   └── ...
├── val
│   ├── IMG_0007.jpg
│   ├── IMG_0007.npy
│   ├── IMG_0012.jpg
│   ├── IMG_0012.npy
│   └── ...
├── test
... ...

GST Pre-computation

Perform 2d Gaussian Splatting for all images in the datasets, to obtain and store Gaussian primitives.

# for qnrf dataset
python 2dgs_counting_1gs_gsplat.py --config 'GS2D/config_qnrf_gsplat.yml'

The hyperparameters for other datasets are identical, just modify the dataset path in ``config_qnrf_gsplat.yml''.

Process distorted Gaussians

python GS2D/process_gaussians.py

After GST Pre-computation, the folder of the datasets is as follows:

UCF-Train-Val-Test
├── train
│   ├──gs_params
│   │  ├──IMG_0001_gs_params.h5
│   │  ├──IMG_0002_gs_params.h5
│   │  └──...
│   ├── IMG_0001.jpg
│   ├── IMG_0001.npy
│   ├── IMG_0002.jpg
│   ├── IMG_0002.npy
│   └── ...
├── val
│   ├── IMG_0007.jpg
│   ├── IMG_0007.npy
│   ├── IMG_0012.jpg
│   ├── IMG_0012.npy
│   └── ...
├── test
... ...

We have released our processed Gaussian primitives here, which can be used directly for task-tailored network training.

Task-tailored Network Training

python train_gs.py --model-name <backbon: vgg or vgg_trans> --dataset <dataset name: qnrf, jhu or nwpu> --data-dir <path to dataset> --device <gpu device id>

Evaluation

download our pretrained checkpoints here and prepare the dataset.

python test.py --save-path <path of the model to be evaluated> --data-dir <directory for the dataset> --device <gpu device id>

License

Please check the MIT license that is listed in this repository.

Acknowledgments

We thank the following repos providing helpful components/functions in our work.

Citation

If you use any content of this repo for your work, please cite the following bib entry:

@inproceedings{shang2026GST,
  title={2D Gaussians Spatial Transport for Point-supervised Density Regression},
  author={Shang, Miao and Hong, Xiaopeng},
  booktitle={Proceedings of the 40th Annual AAAI Conference on Artificial Intelligence},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
GS2D		GS2D
datasets		datasets
figs		figs
losses		losses
models		models
utils		utils
2dgs_counting_1gs_gsplat.py		2dgs_counting_1gs_gsplat.py
LICENSE		LICENSE
README.md		README.md
preprocess_dataset_jhu.py		preprocess_dataset_jhu.py
preprocess_dataset_ucf.py		preprocess_dataset_ucf.py
test.py		test.py
train_gs.py		train_gs.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GST: 2D Gaussians Spatial Transport for Point-supervised Density-Regression (AAAI 2026)

Overview

Requirements

Dataset preparation

GST Pre-computation

Task-tailored Network Training

Evaluation

License

Acknowledgments

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GST: 2D Gaussians Spatial Transport for Point-supervised Density-Regression (AAAI 2026)

Overview

Requirements

Dataset preparation

GST Pre-computation

Task-tailored Network Training

Evaluation

License

Acknowledgments

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages