TFFormer: Cross Time-Frequency Transformer for Temporal Action Localization

[TCSVT 2023] TFFormer: Cross Time-Frequency Transformer for Temporal Action Localization

Introduction

This repository implements TFFormer (TCSVT 2023), the first attempt to explore the cross-fusion of multi-scale time-domain and frequency-domain features for temporal action localization. TFFormer achieves an average mAP of 23.2% on Ego4D and 25.6% on EPIC-Kitchens 100, outperforming previous state-of-the-art methods by a large margin. It also obtains competitive results on ActivityNet v1.3 and THUMOS14, with average mAPs of 36.2% and 67.8%, respectively.
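For readers new to the metric: in temporal action localization, "average mAP" conventionally means the mean average precision averaged over several temporal IoU (tIoU) thresholds. The sketch below illustrates only the two building blocks, the tIoU between a predicted and a ground-truth segment and the averaging step. The function names and the numbers in the usage lines are hypothetical and are not taken from this repository or the paper; the threshold set used for each dataset is not stated here.

```python
def temporal_iou(pred, gt):
    """Temporal IoU between two (start, end) segments given in seconds."""
    (ps, pe), (gs, ge) = pred, gt
    # Overlap is zero when the segments are disjoint.
    inter = max(0.0, min(pe, ge) - max(ps, gs))
    union = (pe - ps) + (ge - gs) - inter
    return inter / union if union > 0 else 0.0


def average_map(map_per_threshold):
    """Average the per-threshold mAP values (dict: tIoU threshold -> mAP)."""
    return sum(map_per_threshold.values()) / len(map_per_threshold)


if __name__ == "__main__":
    # Illustrative inputs only, not results from the paper.
    print(temporal_iou((0.0, 10.0), (5.0, 15.0)))
    print(average_map({0.3: 0.5, 0.5: 0.3}))
```

A prediction counts as a true positive at a given threshold only if its tIoU with a matching ground-truth segment reaches that threshold, which is why stricter thresholds yield lower mAP.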

Data Preparation/Installation/More Details

For data preparation, installation, and further details, please refer to the ActionFormer repository.

Training and Evaluation

Training (taking epic_slowfast_verb as an example)

python ./train.py --config ./configs/epic_slowfast_verb.yaml --output model

Evaluation (taking epic_slowfast_verb as an example)

python ./eval.py ./configs/epic_slowfast_verb.yaml ./ckpt/epic_slowfast_verb_model
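To run the same two commands for another configuration, a small helper can assemble them from a config name. This is a minimal sketch, not part of the repository: `build_train_cmd` and `build_eval_cmd` are hypothetical names, and the checkpoint folder pattern `./ckpt/<config>_<output>` is an assumption inferred from the example commands above rather than confirmed behavior.

```python
import shlex


def build_train_cmd(config_name, output_tag="model"):
    """Training command as an argument list, mirroring the example above."""
    return [
        "python", "./train.py",
        "--config", f"./configs/{config_name}.yaml",
        "--output", output_tag,
    ]


def build_eval_cmd(config_name, output_tag="model"):
    """Evaluation command; the ckpt folder naming is an assumption."""
    return [
        "python", "./eval.py",
        f"./configs/{config_name}.yaml",
        f"./ckpt/{config_name}_{output_tag}",
    ]


if __name__ == "__main__":
    # Print the commands; pass the lists to subprocess.run to execute them.
    for cmd in (build_train_cmd("epic_slowfast_verb"),
                build_eval_cmd("epic_slowfast_verb")):
        print(shlex.join(cmd))
```

Building the commands as lists avoids shell-quoting problems if a config name or output tag ever contains spaces.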

References

If you use our code, please consider citing the following papers.

@article{yang2023cross,
  title={Cross Time-Frequency Transformer for Temporal Action Localization},
  author={Yang, Jin and Wei, Ping and Zheng, Nanning},
  journal={IEEE Transactions on Circuits and Systems for Video Technology},
  year={2023},
  publisher={IEEE}
}

@inproceedings{zhang2022actionformer,
  title={ActionFormer: Localizing Moments of Actions with Transformers},
  author={Zhang, Chen-Lin and Wu, Jianxin and Li, Yin},
  booktitle={European Conference on Computer Vision},
  series={LNCS},
  volume={13664},
  pages={492-510},
  year={2022}
}

The implementation of this code is built upon ActionFormer and our TransGMC, and we would like to express gratitude for the open-source contribution of ActionFormer.
