🌟 CMoE

This repository provides the official implementation of our paper entitled “Taming Cascaded Mixture-of-Experts for Modality-missing Multi-modal Salient Object Detection”, accepted by AAAI 2026.

We propose a Cascaded Mixture-of-Experts (CMoE) framework that effectively handles the modality-missing challenge in multi-modal salient object detection.

📰 Paper & Resources:
The camera-ready paper, pre-trained models, and benchmark results will be released soon.

📖 Citation

If you find this work useful in your research, please cite:

@inproceedings{wang2026taming,
  title={Taming Cascaded Mixture-of-Experts for Modality-missing Multi-modal Salient Object Detection},
  author={Wang, Kunpeng and Sun, Feifan and Chen, Keke},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={40},
  number={12},
  pages={9939--9947},
  year={2026}
}

⚙️ Usage

Ⅰ. Environment Setup

Install PyTorch and torchvision (recommended via conda):

conda install pytorch==1.12.0 torchvision==0.13.0 -c pytorch

Install additional dependencies:
```
pip install -r requirements.txt
```
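Before moving on, it can help to confirm that the pinned versions were actually installed. The following is a minimal sketch, not part of the repository: `check_versions` and `EXPECTED` are hypothetical names, and it assumes Python 3.8+ (for `importlib.metadata`) and that PyTorch is registered under the distribution name `torch`.

```python
from importlib import metadata

# Versions pinned in the install command above; "torch" is assumed to be the
# distribution name PyTorch registers with Python's packaging metadata.
EXPECTED = {"torch": "1.12.0", "torchvision": "0.13.0"}


def check_versions(expected):
    """Return {package: (wanted, installed_or_None, matches)}."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # Builds may carry a local suffix such as "+cu113", so only the part
        # before "+" is compared.
        ok = installed is not None and installed.split("+")[0] == wanted
        report[name] = (wanted, installed, ok)
    return report


if __name__ == "__main__":
    for name, (wanted, installed, ok) in check_versions(EXPECTED).items():
        status = "OK" if ok else "MISMATCH"
        print(f"{name}: wanted {wanted}, found {installed} [{status}]")
```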
Download datasets:
- RGB-T datasets: VT821, VT1000, VT5000
- RGB-D datasets: STERE, SIP, ReDWeb-S, NJUD, NLPR, DUTLF-Depth

Download pre-trained backbone:
- Swin-B model: swin_base_patch4_window12_384_22k.pth

Configure dataset paths:
- Modify ./CMoE-main/options.py to set the paths for all datasets and models.

Prepare directories for saving logs, checkpoints, and outputs as needed.
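A quick way to catch a mistyped path before a long training run is to check that every configured location exists. The sketch below is illustrative only: `missing_paths` is a hypothetical helper, and the example paths are placeholders rather than the option names or locations used in options.py.

```python
from pathlib import Path


def missing_paths(paths):
    """Return the labels whose path does not exist on disk."""
    return [label for label, p in paths.items() if not Path(p).exists()]


if __name__ == "__main__":
    # Hypothetical locations; replace them with the values set in options.py.
    configured = {
        "VT5000": "./data/VT5000",
        "Swin-B weights": "./pretrained/swin_base_patch4_window12_384_22k.pth",
        "checkpoints": "./checkpoints",
    }
    for label in missing_paths(configured):
        print(f"Missing: {label} -> {configured[label]}")
```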

Ⅱ. Training Procedure

Pre-train Uni-modal Experts

python -m torch.distributed.launch --nproc_per_node=2 --master_port=2024 ./CMoE-main/train_parallel_rgb.py
python -m torch.distributed.launch --nproc_per_node=2 --master_port=2026 ./CMoE-main/train_parallel_t.py
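For readers adapting the training scripts, the sketch below shows what a script started by `torch.distributed.launch` receives: the launcher appends a `--local_rank` argument (spelled `--local-rank` in newer PyTorch releases) and exports `WORLD_SIZE` and `MASTER_PORT` as environment variables. The function names are hypothetical and the repository's scripts may handle this differently. Giving the two pre-training jobs different `--master_port` values presumably lets them run side by side on one machine.

```python
import argparse
import os


def parse_launch_args(argv=None):
    """Parse the rank argument appended by torch.distributed.launch."""
    parser = argparse.ArgumentParser()
    # Accept both spellings so the script works across PyTorch versions.
    parser.add_argument("--local_rank", "--local-rank", type=int, default=0)
    return parser.parse_args(argv)


def describe_process(args, env=os.environ):
    """Summarise the rendezvous settings the launcher exports."""
    return {
        "local_rank": args.local_rank,
        "world_size": int(env.get("WORLD_SIZE", 1)),
        "master_port": env.get("MASTER_PORT"),
    }
```

With `--nproc_per_node=2`, two processes are started, one with local rank 0 and one with local rank 1, and each typically binds to the GPU of the same index.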

Fine-tune Multi-modal Model

Before starting, set the paths for the pre-trained uni-modal weights in ./CMoE-main/options.py. Then, run:
```
python -m torch.distributed.launch --nproc_per_node=2 --master_port=2024 ./CMoE-main/train_parallel_multi.py
```

Ⅲ. Testing

To evaluate the model under both modality-complete and modality-missing conditions, follow these steps:

Prepare Black Modality Inputs:

For each test dataset, run the following script to generate zero-value (black) images as the missing modality input:
```
python ./CMoE-main/black.py
```
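To illustrate what a zero-value input looks like, here is a standard-library sketch that writes an all-black 8-bit RGB PNG with the same size and file name as each source image. It is not the repository's black.py: the function names are hypothetical, and it assumes the source images are PNG files, whereas the actual script may use an imaging library and other formats.

```python
import struct
import zlib
from pathlib import Path

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def _chunk(tag, data):
    """Build one PNG chunk: length, type, payload, CRC-32 of type+payload."""
    crc = zlib.crc32(tag + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + tag + data + struct.pack(">I", crc)


def black_png(width, height):
    """Encode an all-zero 8-bit RGB image as PNG bytes."""
    # IHDR: width, height, bit depth 8, colour type 2 (RGB), no interlace.
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    # Each scanline is one filter byte (0) followed by 3 * width zero bytes.
    raw = bytes((1 + 3 * width) * height)
    return (PNG_SIGNATURE + _chunk(b"IHDR", ihdr)
            + _chunk(b"IDAT", zlib.compress(raw)) + _chunk(b"IEND", b""))


def png_size(data):
    """Read (width, height) from the IHDR chunk of PNG bytes."""
    return struct.unpack(">II", data[16:24])


def write_black_copies(src_dir, dst_dir):
    """For every PNG in src_dir, write a same-named black PNG to dst_dir."""
    dst = Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    count = 0
    for src in sorted(Path(src_dir).glob("*.png")):
        width, height = png_size(src.read_bytes())
        (dst / src.name).write_bytes(black_png(width, height))
        count += 1
    return count
```

Keeping the original file names means the black images can be substituted for a modality simply by pointing the loader at the new folder.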
Set Paths:

In test_produce_maps.py, configure the paths to the trained model checkpoint, the test dataset folder, and the save directory.

Run Testing:

The model will automatically predict saliency results under modality-complete and modality-missing settings:
```
python test_produce_maps.py
```
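Conceptually, the settings differ only in which input folder is swapped for its black counterpart. The sketch below makes that explicit, assuming either modality may be missing. All names are hypothetical and the layout is an assumption; test_produce_maps.py may organise its inputs and outputs differently.

```python
from pathlib import Path


def build_settings(rgb_dir, aux_dir, black_rgb_dir, black_aux_dir):
    """Map each test setting to its (RGB input dir, auxiliary input dir)."""
    return {
        # Both modalities are real images.
        "complete": (rgb_dir, aux_dir),
        # RGB is replaced by black images.
        "missing_rgb": (black_rgb_dir, aux_dir),
        # The thermal or depth input is replaced by black images.
        "missing_aux": (rgb_dir, black_aux_dir),
    }


def output_dir(save_root, dataset, setting):
    """Keep the saliency maps of each setting in their own folder."""
    return Path(save_root) / dataset / setting
```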

Ⅳ. Evaluation

Place the ground-truth masks and predicted saliency maps into the ./Evaluation/GT/ and ./Evaluation/sal_map/ folders, respectively.
Open ./Evaluation/main.m using MATLAB.
Specify the evaluation dataset and run the script to compute performance metrics.
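Before opening MATLAB, it may be worth checking that every ground-truth mask has a matching prediction. The sketch below is a hypothetical helper, not part of the evaluation code; it assumes that masks and maps sit directly in the two folders passed in and are paired by file name without extension.

```python
from pathlib import Path


def unpaired_stems(gt_dir, pred_dir):
    """Return (GT names without a prediction, predictions without a GT)."""
    gt = {p.stem for p in Path(gt_dir).iterdir() if p.is_file()}
    pred = {p.stem for p in Path(pred_dir).iterdir() if p.is_file()}
    return sorted(gt - pred), sorted(pred - gt)
```

Two empty lists mean the folders line up; any listed name points to a file that is missing on the other side.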

🙏 Acknowledgement

The implementation of this project builds on the following resource:

SOD Literature Tracking

📬 Contact

If you have any questions, please contact us (kp.wang@foxmail.com).
