Skip to content

zixuan-chen/M3VOS_Experiment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

66 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

M3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation (CVPR2025)

📝Paper | 🌍Project Page | 🤗Tools | 🛢️Data

alt text

1. Installation

# Clone this repo
git clone https://github.com/zixuan-chen/M3VOS_Experiment.git
cd M3VOS_Experiment

# Create a Conda environment
conda create -n mvos python=3.10.0
conda activate mvos

# Install pytorch
# Look up https://pytorch.org/get-started/previous-versions/ with your cuda version for a correct command
pip install torch==2.1.2 torchvision==0.16.2  --index-url https://download.pytorch.org/whl/cu121

pip install git+git://github.com/cheind/py-thin-plate-spline
# Install other prequisites
pip install -r requirements.txt
# for other methods you should look for their requirements in their respective folders.

2. Annotation Tool

We put our annotation tool in an independent GitHub repository.

3. Evaluation

Running Cutie_ReVOS and other methods on video object segmentation data.

4. Training

Training Cutie_ReVOS are similar to training Cutie

5. Useful scripts

in the ./scripts

  • merge_signle_video.py: merge the mask and image into a video , args:
    • images_folder: a folder contains images: 001.jpg , 002.jpg, ...
    • masks_folder: a folder contains masks: 001.png , 002.png, ...
    • output_video*.mp4
  • merge_png2video.py: process the dataset whose file structure just like VOST, get the merge videos folder
    • images_folder: a folder contains the images of seqs: just like ROVES_summary/ROVES_week_0/JPEGIMages
    • masks_folder: a folder contains he masks of seqs: ROVES_summary/ROVES_week_0/Annotations
    • output_video:a target folder contains merge videos : exp/merge_videos
  • align_direction.py: If you find the width of your video is more than its height , it will rotate it 90 degree counterclockwisely.

Citation

@InProceedings{chen2024m3vos_2025_CVPR,
    author    = {Zixuan Chen and Jiaxin Li and Liming Tan and Yejie Guo and Junxuan Liang and Cewu Lu and Yong-Lu Li},
    title     = {M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2025}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors