Code for paper: View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification (CVPR2026)
Please refer to INSTALL.md.
Download the ViT-base Pre-trained model and modify the path. Line 13 in configs:
PRETRAIN_PATH: xxx
Training ViSA on the CARGO dataset with one GPU:
CUDA_VISIBLE_DEVICES=0 python3 tools/train_net.py --config-file ./configs/CARGO/visa.yml MODEL.DEVICE "cuda:0" SOLVER.IMS_PER_BATCH 256Training ViSA on the CARGO dataset with 4 GPU:
CUDA_VISIBLE_DEVICES=0,1,2,3 python3 tools/train_net.py --config-file ./configs/CARGO/visa.yml --num-gpus 4 SOLVER.IMS_PER_BATCH 512Testing ViSA on the CARGO dataset:
CUDA_VISIBLE_DEVICES=0 python3 tools/train_net.py --config-file ./configs/CARGO/visa.yml --eval-only MODEL.WEIGHTS xxx @INPROCEEDINGS{zhang2026visa,
author={Quan Zhang, Zeqiang Cai, Peiming Zhao, Jingze Wu, Cailun Wu, Hongbo Chen, Jianhuang Lai},
booktitle={2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
title={View-Aware Semantic Alignment for Aerial-Ground Person Re-Identification},
year={2026},
volume={},
number={},
pages={1-10},
}