Skip to content

saeed-anwar/SAT3D

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 

Repository files navigation

SAT3D

This repository is an official implementation for SAT3D by Muhammad Ibrahim, Naveed Akhtar, Saeed Anwar, and Ajmal Mian, "SAT3D: Slot Attention Transformer for 3D Point Cloud Semantic Segmentation", IEEE Transactions on Intelligent Transportation Systems.

Coming soon

  • Segmentation Code
  • Segmentation Model
  • Swan Dataset
  • Results

Introduction

SAT3D, We introduce the first-ever Slot Attention Transformer based technique to effectively model object-centric features in point cloud data. Our method uses cylindrical splits of space for voxelization and computes channel-wise positional embeddings before repetitively encoding the point cloud with slot attentions. Our second major contribution is a Large-Scale Outdoor Point Cloud dataset (SWAN), collected in a dense urban environment, driving 150km distance. It provides 16 billion points in more than 200K frames. The dataset also provides annotations for 10K frames for 24 classes. We also contribute a data augmentation scheme to handle rare object classes in real-world point clouds. Besides benchmarking popular existing methods on SWAN for the first time, we thoroughly evaluate our technique on the existing large-scale datasets, Semantic KITTI and nuScenes.

The Proposed SWAN dataset

Annotated points per class (millions), instances per class (thousands), and average points per instance (thousands) in the proposed SWAN dataset

Car Truck Ped Bicycle Motor Cycle Bus Bridge Tree Bushes Building Road Rubbish Bin Bus Stop Light Pole Traffic Signal Road Workcone Letter-Box SidePath Road Exit Advertisement Board RoadSign Board Wall Road Divider
Points per class (M) 18.4 0.29 0.6 0.4 0.03 2.91 1.12 124.7 2.41 162.1 122.7 0.9 1.1 3.7 2.7 0.03 0.01 0.8 0.01 0.2 2.8 9.8 9.41
Instances per class (K) 46.4 0.4 8.3 2.1 0.2 1.8 0.4 139.5 1.3 22.8 8.4 6.6 2.4 34.8 28 1.7 0.2 0.3 0.01 0.7 45.4 3.3 5.8
Avg points per instance (K) 0.4 1.0 0.070 0.3 0.2 1.7 0.7 0.9 6.9 10.4 {14.6 0.5 0.1 0.1 0.1 0.02 0.06 2.6 0.04 0.2 0.062 3.0 1.6

Dataset Comparisons

Comparison of the SWAN (proposed) with Semantic KITTI and nuScenes dataset

Properties Ours Semantic KITTI nuScenes
Classes 24 19 16
Max points per frame 131K 131K 65K
Data Collection vehicle car, trolley car car
Point-wise labels Yes Yes Yes
Instance labels Yes No No
Labeled frames 10K 23K 35K
Location Perth,WA Karlsruhe Boston, SG
Sensor Ouster-64 Velodyne-64E LiDAR-32
Vertical resolution 64 64 32
Sensor Vertical FoV 45 26.9 40

Citations

If this work is helpful, please consider citing the following

@article{Ibrahim2023SAT3D,
  title={SAT3D: Slot Attention Transformer for 3D Point Cloud Semantic Segmentation},
  author={Ibrahim, Muhammad and Akhtar, Naveed and Anwar, Saeed and Mian, Ajmal},
  journal={IEEE Transactions on Intelligent Transportation Systems (T-ITS)},
  year={2023}
}

@article{Ibrahim2023slice,
  title={Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps},
  author={Ibrahim, Muhammad and Akhtar, Naveed and Anwar, Saeed and Wise, Michael and Mian, Ajmal},
  journal={IEEE International Conference on Robotics and Automation (ICRA)},
  year={2023}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published