Skip to content

Jun-Pu/CSMA-Net

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 

Repository files navigation

Authors: Yi Zhang, Wassim Hamidouche, Olivier Deforges


Introduction


Figure 1: The architecture of our CSMA-Net. The short names in the figure are detailed as follows: CSMA = the proposed channel-spatial mutual attention module. E2C/C2E = the projection interaction module which transforms the equirectangular (ER) image/cube maps to cube maps/ER image, respectively. ASPP = atrous spatial pyramid pooling module. Enc.ER = the hybrid-ViT-based encoder for ER image. Enc.CM = the Res2Net-based encoder for cube maps. Dec. = the decoder from RCRNet.

In this work, we conduct 360° panoramic salient object detection by taking advantage of both the global and local visual cues of 360° images, with a novel channel-spatial mutual attention network (CSMA-Net). The key component of the CSMA-Net is the proposed CSMA module, which cascades channel-/spatial weighting-based mutual attentions. The objective of our CSMA module is to refine and fuse the bottleneck features from two separate encoders with different planar representa- tions of 360° panorama as inputs, i.e., equirectangular image and cube map. Our CSMA-Net outperforms 10 state-of-the-art segmentation methods based on the proposed 360° SOD benchmark where multiple fine-tuning and testing strategies are applied to the widely-used 360° datasets. Extensive experimental results illustrate the effectiveness and robustness of the proposed CSMA-Net.


Performance


Figure 2: Performance comparison between CSMA-Net and the SOTAs.


Figure 3: Visual results of CSMA-Net and SOTAs. Refer to "Implementation" for whole visualization.


Implementation

The source codes are available at codes.

The pretrained models of our CSMA-Net can be downloaded at CSMA-Net-models.

The results of our CSMA-Net on 360-SOD and 360-SSOD can be downloaded at CSMA-Net-results.


Contact

E-mail address: yi23zhang.2022@gmail.com

About

Salient object segmentation in 360° images

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages