Skip to content

lyssyl157/MOSSTrack

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

🎯 MOSSTrack: Modality-Specific Spatio-Temporal Context Learning for RGB-T Tracking

MOSSTrack: Modality-Specific Spatio-Temporal Context Learning for RGB-T Tracking

Yisong Liu, He Yao, Junlong Cheng, Yujie Lu, Junqi Bai, Min Zhu*
CVPR 2026 Findings

This repository contains the official implementation of MOSSTrack. Unlike traditional methods that use identical spatio-temporal modeling for both modalities, MOSSTrack leverages modality-specific visual cues to guide the generation of feature-level spatio-temporal information. By integrating this generated information into feature representation learning and cross-modal fusion, MOSSTrack provides more accurate target references, maintaining stable performance in highly dynamic scenarios.

[Models], [Raw Results]

🚀 News

  • 🎉 [2026.03] Our paper has been accepted by CVPR 2026 Findings!
  • 📦 [2026.04] Training code, testing scripts, and configurations are officially released.

✨ Key Features & Contributions

MOSSTrack Framework

  • Spatio-Temporal Information Generator (STIG): Employs learnable modality-specific tokens to select representative visual features for each modality, establishing robust cross-frame spatio-temporal associations.
  • Spatio-Temporal Context Fusion (STCF): A simple yet effective module that leverages spatio-temporal cues to refine target-related features and facilitates efficient cross-modal interaction.
  • State-of-the-art Performance: MOSSTrack achieves superior results on four challenging RGB-T tracking benchmarks, validating the effectiveness of modality-specific spatio-temporal modeling.

⚙️ Installation

Create and activate a Conda environment:

conda create -n MoSSTrack python=3.8
conda activate MoSSTrack

Install the required packages:

bash install.sh

📂 Data Preparation

Download the following datasets and place them under ./data/:

$<PATH_of_MOSSTrack>
-- data
    -- GTOT
        |-- BlackCar
        |-- Black5wan1
        ...
    -- RGBT210
        |-- afterrain
        |-- aftertree
        ...
    -- RGBT234
        |-- afterrain
        |-- aftertree
        ...
    -- LasHeR/train
        |-- 1boygo
        |-- 1handsth
        ...
    -- LasHeR/test
        |-- 1blackteacher
        |-- 1boycoming
        ...
    -- VTUAV/train
        |-- animal_002
        |-- bike_002
        ...
    -- VTUAV/test_ST
        |-- animal_001
        |-- bike_003
        ...
    -- VTUAV/test_LT
        |-- animal_003
        |-- animal_004
        ...

Training

Dowmload the pretrained model (DUTrack) and put it under ./pretrained_models/.

python tracking/train.py --script mosstrack --config mosstrack_256_full --save_dir ./output --mode multiple --nproc_per_node 2 --use_wandb 0

Testing

[LasHeR & RGBT234 & VTUAVST]
Modify the <DATASET_PATH> and <SAVE_PATH> in./RGBT_workspace/test_rgbt_mgpus.py, then run:

bash test_rgbt.sh

Acknowledgment

  • This repo is based on ODTrack and DUTrack which are excellent works.
  • We thank for the PyTracking library, which helps us to quickly implement our ideas.

About

Official implementation of MOSSTrack : Modality-Specific Spatio-Temporal Context Learning for RGB-T Tracking

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages