SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks

Yi Pan (panyi_jsjy@nudt.edu.cn), Jun-Jie Huang* (jjhuang@nudt.edu.cn), Zihan Chen, Wentao Zhao, and Ziyue Wang (*corresponding author)

PyTorch implementation of "SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks" (ICME 2024).

Robust and imperceptible adversarial video attacks are challenging due to the spatial and temporal characteristics of videos. Existing video adversarial attack methods mainly take a gradient-based approach and generate adversarial videos with noticeable perturbations. In this paper, we propose a novel Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks (SVASTIN) to generate adversarial videos through spatio-temporal feature space information exchange. It consists of a Guided Target Video Learning (GTVL) module, which balances the perturbation budget and optimization speed, and a Spatio-Temporal Invertible Neural Network (STIN) module, which exchanges spatio-temporal feature space information between a source video and the target feature tensor learned by the GTVL module. Extensive experiments on UCF-101 and Kinetics-400 demonstrate that the proposed SVASTIN generates adversarial examples with higher imperceptibility than state-of-the-art methods while achieving a higher fooling rate.
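To give a feel for what "invertible" means in the description above, here is a minimal sketch of a generic additive coupling step, a common building block of invertible networks. It is not taken from this repository: the function names couple and uncouple and the toy functions f and g are assumptions made for illustration only, and the actual STIN architecture is the one defined in model/.

```python
def couple(source, target, f, g):
    """Forward pass: each branch receives information from the other."""
    mixed_source = source + f(target)
    mixed_target = target + g(mixed_source)
    return mixed_source, mixed_target


def uncouple(mixed_source, mixed_target, f, g):
    """Inverse pass: undo couple() by reversing its two steps."""
    target = mixed_target - g(mixed_source)
    source = mixed_source - f(target)
    return source, target


# Hypothetical stand-ins for learned sub-networks. They do not need to be
# invertible themselves; the coupling structure guarantees invertibility.
def f(t):
    return 0.5 * t


def g(s):
    return s * s


y1, y2 = couple(2.0, 4.0, f, g)
x1, x2 = uncouple(y1, y2, f, g)
print(x1, x2)  # prints "2.0 4.0", the original inputs
```

Because the two functions only use + and -, the same sketch should also work element-wise on array or tensor inputs, provided f and g accept them.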

Requisites

PyTorch >= 2.1.1
Python3 >= 3.10.13
pywavelets >= 1.5.0
mmaction2 >= 1.2.0
NVIDIA GPU with CUDA and cuDNN
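The version requirements above can be checked before running anything. The following is a small sketch using only the Python standard library; it is not part of this repository. The pip distribution names ("torch", "PyWavelets", "mmaction2") and the helper names parse_version and check_requirements are assumptions, and the script does not check the GPU, CUDA, or cuDNN setup.

```python
import re
import sys
from importlib import metadata

# Minimum versions copied from the list above. The distribution names are
# the usual pip names and are an assumption, not something this repo states.
REQUIRED = {"torch": "2.1.1", "PyWavelets": "1.5.0", "mmaction2": "1.2.0"}
MIN_PYTHON = (3, 10, 13)


def parse_version(text):
    """Turn '2.1.1+cu121' into (2, 1, 1); local build suffixes are ignored."""
    return tuple(int(n) for n in re.findall(r"\d+", text.split("+")[0])[:3])


def check_requirements(required=REQUIRED, min_python=MIN_PYTHON):
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    if sys.version_info[:3] < min_python:
        wanted = ".".join(str(n) for n in min_python)
        problems.append(f"Python is older than {wanted}")
    for name, minimum in required.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed (need >= {minimum})")
            continue
        if parse_version(installed) < parse_version(minimum):
            problems.append(f"{name} {installed} is older than {minimum}")
    return problems


if __name__ == "__main__":
    for line in check_requirements() or ["All listed requirements are satisfied."]:
        print(line)
```

The version comparison is deliberately simple (the first three numeric components only), so pre-release tags such as "rc1" are not ordered correctly.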

Preparation

Dataset

Download the datasets (UCF-101 and Kinetics-400) and prepare the data by following the mmaction2 data preparation instructions.

Models

Models trained on the Kinetics-400 dataset are all available in the mmaction2 library. We consider three models (MViT, SlowFast, and TSN). In addition, we fine-tune these models on the UCF-101 dataset. The fine-tuned models can be found on Google Drive.

Run

Set the video and model parameters in args.py and the hyperparameters in config.py, then run attack.py directly.

Description of the files in this repository

attack.py: Run this file to launch the attack
args.py: Video and model parameter settings
config.py: Hyperparameter settings
model/: Architecture of the Spatio-Temporal Invertible Neural Network
checkpoints/: Pre-trained model parameters

Citation

If you find this code and data useful, please consider citing the original work by the authors:

@INPROCEEDINGS{10688258,
  author={Pan, Yi and Huang, Jun-Jie and Chen, Zihan and Zhao, Wentao and Wang, Ziyue},
  booktitle={2024 IEEE International Conference on Multimedia and Expo (ICME)}, 
  title={SVASTIN: Sparse Video Adversarial Attack via Spatio-Temporal Invertible Neural Networks}, 
  year={2024},
  volume={},
  number={},
  pages={1-6},
  keywords={Tensors;Codes;Perturbation methods;Neural networks;Streaming media;Optimization;Sparse Video Adversarial Attack;Invertible Neural Networks;Spatio-Temporal},
  doi={10.1109/ICME57554.2024.10688258}
}

Contact

If you have any questions, please contact panyi_jsjy@nudt.edu.cn.
