This repo contains the code for our paper, MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation.
MaXTron is a simple yet effective unified meta-architecture for video segmentation. It enriches existing clip-level segmenters with a within-clip tracking module and a cross-clip tracking module, yielding more temporally consistent segmentation results.
For detailed usage of MaXTron, see
If you use MaXTron in your research, please cite it using the following BibTeX entry.
@misc{he2023maxtron,
  title={MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation},
  author={Ju He and Qihang Yu and Inkyu Shin and Xueqing Deng and Xiaohui Shen and Alan Yuille and Liang-Chieh Chen},
  year={2023},
  eprint={2311.18537},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}