Skip to content

zht8506/QMVOS

Repository files navigation

Video Object Segmentation with Dynamic Query Modulation

📢 Introduction

This is the official implementation of our paper titled "Video Object Segmentation with Dynamic Query Modulation", abbreviated as QMVOS. This paper has been accepted by 2024 IEEE International Conference on Multimedia and Expo (ICME).

For more details, please refer to our paper. This repo is based on PyTorch.

The code of our SIM and QCIM please refer to model/QMVOS_trainer.py.

📂 Result and Weight

Our QMVOS can bring significant improvements to memory-based SVOS method and achieve competitive performance on standard SVOS bench- marks.

We release the weight of models used in our paper. You may need to log out of your Google account to download them.

Download the weight(s) from corresponding links below.

🚀 Training

More detail please see Xmem.

bash train.sh
python -m torch.distributed.launch --master_port 25764 --nproc_per_node=4 --use_env train_QMVOS.py --exp_id train_QMVOS --stage 3

🎡 Visualization

Examples of segmentation results obtained by our QMVOS and baseline (Xmem).

📚 Citation

Please cite our work if you find our work and codes helpful for your research.

@article{zhou2024video,
  title={Video Object Segmentation with Dynamic Query Modulation},
  author={Zhou, Hantao and Hu, Runze and Li, Xiu},
  journal={arXiv preprint arXiv:2403.11529},
  year={2024}
}

Acknowledgement

This project is built upon numerous previous projects. We'd like to thank the contributors of Xmem.

About

Code of ICME2024 Paper: Video Object Segmentation with Dynamic Query Modulation

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published