Self-Calibrated Cross Attention Network for Few-Shot Segmentation

This repository contains the code for our ICCV 2023 paper "Self-Calibrated Cross Attention Network for Few-Shot Segmentation".

Abstract: The key to the success of few-shot segmentation (FSS) lies in how to effectively utilize support samples. Most solutions compress support foreground (FG) features into prototypes, but lose some spatial details. Instead, others use cross attention to fuse query features with uncompressed support FG. Query FG could be fused with support FG, however, query background (BG) cannot find matched BG features in support FG, yet inevitably integrates dissimilar features. Besides, as both query FG and BG are combined with support FG, they get entangled, thereby leading to ineffective segmentation. To cope with these issues, we design a self-calibrated cross attention (SCCA) block. For efficient patch-based attention, query and support features are firstly split into patches. Then, we design a patch alignment module to align each query patch with its most similar support patch for better cross attention. Specifically, SCCA takes a query patch as Q, and groups the patches from the same query image and the aligned patches from the support image as K&V. In this way, the query BG features are fused with matched BG features (from query patches), and thus the aforementioned issues will be mitigated. Moreover, when calculating SCCA, we design a scaled-cosine mechanism to better utilize the support features for similarity calculation. Extensive experiments conducted on PASCAL-5ⁱ and COCO-20ⁱ demonstrate the superiority of our model, e.g., the mIoU score under 5-shot setting on COCO-20ⁱ is 5.6%+ better than previous state-of-the-arts.

Dependencies

Python 3.8
PyTorch 1.7.0
cuda 11.0
torchvision 0.8.0

> conda env create -f env_{ubuntu,windows}.yaml

Datasets

PASCAL-5ⁱ: VOC2012 + SBD
COCO-20ⁱ: COCO2014

The directory structure is:

../
├── SCCAN/
└── data/
    ├── VOCdevkit2012/
    │   └── VOC2012/
    │       ├── JPEGImages/
    │       ├── ...
    │       └── SegmentationClassAug/
    └── MSCOCO2014/           
        ├── annotations/
        │   ├── train2014/ 
        │   └── val2014/
        ├── train2014/
        └── val2014/

Models

Download the pre-trained backbones from here and put them into the initmodel directory.
Download exp.zip and compress it to obtain all pretrained models for PASCAL-5ⁱ and COCO-20ⁱ.

Usage

Change configuration via the .yaml files in config, then run the following commands for training and testing.

Meta-training

1/5-shot for PASCAL-5ⁱ

CUDA_VISIBLE_DEVICES=0 python train_sccan.py --config=config/pascal/pascal_split{0,1,2,3}_resnet{50,101}{_5s}.yaml

1/5-shot for COCO-20ⁱ

CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --nproc_per_node=4 --master_port=1234 train_sccan.py --config=config/coco/coco_split{0,1,2,3}_resnet{50,101}{_5s}.yaml

Meta-testing

1-shot

CUDA_VISIBLE_DEVICES=0 python test_sccan.py --config=config/{pascal,coco}/{pascal,coco}_split{0,1,2,3}_resnet{50,101}.yaml

5-shot

CUDA_VISIBLE_DEVICES=0 python test_sccan.py --config=config/{pascal,coco}/{pascal,coco}_split{0,1,2,3}_resnet{50,101}_5s.yaml

Performance

Performance comparison with the state-of-the-arts.

PASCAL-5ⁱ

COCO-20ⁱ

Visualization

References

This repo is mainly built based on BAM. Thanks for their great work!

BibTeX

If you find our work and this repository useful. Please consider giving a star ⭐ and citation 📚.

@InProceedings{Xu_2023_ICCV,
    author    = {Xu, Qianxiong and Zhao, Wenting and Lin, Guosheng and Long, Cheng},
    title     = {Self-Calibrated Cross Attention Network for Few-Shot Segmentation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {655-665}
}

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
config		config
figure		figure
lists		lists
model		model
util		util
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
env_ubuntu.yaml		env_ubuntu.yaml
env_windows.yaml		env_windows.yaml
test_sccan.py		test_sccan.py
train_sccan.py		train_sccan.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self-Calibrated Cross Attention Network for Few-Shot Segmentation

Dependencies

Datasets

Models

Usage

Performance

PASCAL-5ⁱ

COCO-20ⁱ

Visualization

References

BibTeX

About

Releases

Packages

Languages

License

Sam1224/SCCAN

Folders and files

Latest commit

History

Repository files navigation

Self-Calibrated Cross Attention Network for Few-Shot Segmentation

Dependencies

Datasets

Models

Usage

Performance

PASCAL-5i

COCO-20i

Visualization

References

BibTeX

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

PASCAL-5ⁱ

COCO-20ⁱ

Packages