Skip to content

xiaogangpeng/HyperVD

Repository files navigation

This is a PyTorch implementation for Learning Weakly Supervised Audio-Visual Violence Detection in Hyperbolic Space. The original paper is under review.

PWC

Abstract

In recent years, the field of weakly supervised audio-visual violence detection has gained substantial attention. The goal of this task is to identify multimodal violent snippets based on the video-level label. Despite the progress made, traditional Euclidean neural networks used in previous methods still face challenges in capturing discriminative representations. To overcome this limitation, we propose HyperVD, a novel framework that learns snippet embeddings in hyperbolic space to improve model discrimination. Specifically, our framework comprises a detour fusion module for multimodal fusion, which effectively alleviates modality inconsistency, and two branches of fully hyperbolic graph convolutional networks, which excavate feature similarities and temporal relationships among snippets in hyperbolic space. Extensive experiments on the XD-Violence benchmark demonstrate that our method outperforms the state-of-the-art methods by a sizable margin.

Training Stage

  • Download the extracted I3D features of XD-Violence dataset from here.
  • Change the file paths of make_list.py in the list folder to generate the training and test list.
  • The hyperparameters are saved in option.py, where we keep default settings as mentioned in our paper.
  • Run the following command for training:
python main.py

Test Stage

  • Change the checkpoint path of infer.py.
  • Run the following command for test:
python infer.py

Acknowledgements

The implementation mainly references the repositories of XDVioDet and fully-hyperbolic-nn . We greatly appreciate their excellent contribution.

If this work is helpful for your research, please consider citing the following BibTeX entry.

@misc{peng2023learning,
      title={Learning Weakly Supervised Audio-Visual Violence Detection in Hyperbolic Space}, 
      author={Xiaogang Peng and Hao Wen and Yikai Luo and Xiao Zhou and Keyang Yu and Yigang Wang and Zizhao Wu},
      year={2023},
      eprint={2305.18797},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages