This is the official repository for the CVPR 2024 paper Advancing Saliency Ranking with Human Fixations: Dataset, Models and Benchmarks. [Paper Link]
We present the first saliency ranking dataset, SIFR, using genuine human fixations rather than mouse movements (ASSR and IRSR).
Download SIFR dataset (Google Drive).
To establish a baseline for this dataset, we propose QAGNet, a novel model that leverages salient instance query features from a query-based transformer detector (Mask2Former) within a tri-tiered nested graph.
Our proposed QAGNet is based on Mask2Former. Please follow the instructions to install the environment (Pytorch, Detectron2, MSDeformAttn etc). The main codes of QAGNet are written in ./mask2former/modeling/transformer_decoder/mask2former_transformer_decoder.py.
- Download ASSR, IRSR and SIFR dataset (Google Drive).
- Configure the dataset path in train_net.py/def main(args).
- Download the pretrained weights of Mask2Former
- Configure the corresponding yaml scripts (./configs/coco/instance-segmentation/) - Base-COCO-InstanceSegmentation/R50_bs16_50ep/swin_base_IN21k_384_bs16_50ep/swin_large_IN21k_384_bs16_100ep
- Configure and run train_net.py.
- Download our pretrained models for Res50, Swin-B and Swin-L.
- Configure the pretrained weights path in corresponding yaml files.
- Configure the evaluation settings in Base-COCO-InstanceSegmentation.yaml.
- Configure and run plain-test.py.
We opensource all SRD models' predicted saliency ranking maps (Google Drive). For more quantitative comparison and qualitative comparisons, please refer to our our paper and supplementary material.
1. ImportError: /lib64/libstdc++.so.6: version `CXXABI_1.3.9' not found
Check if you can load cuda-11.1/cudnn-v8.1.1.33/gcc-10.2.0
2. LooseVersion = distutils.version.LooseVersionAttributeError: module 'distutils' has no attribute 'version'
Please install setuptools 59.5.0
pip install setuptools==59.5.0
If you find the SIFR dataset or QAGNet beneficial for your work, please consider citing our research:
@inproceedings{deng2024advancing,
title={Advancing Saliency Ranking with Human Fixations: Dataset Models and Benchmarks},
author={Deng, Bowen and Song, Siyang and French, Andrew P and Schluppeck, Denis and Pound, Michael P},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={28348--28357},
year={2024}
}