Oral Presentation

This repository contains the PyTorch implementation of the paper
"CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection".

Paper

CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection (Oral, ICIP 2023)

Kevin Hyekang Joo, Khoa Vo, Kashu Yamazaki, Ngan Le

Requirements

  • pytorch
  • matplotlib
  • tqdm
  • scipy
  • scikit-learn
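
A minimal way to install the dependencies with pip is sketched below (the common PyPI package names are assumed; this README does not pin exact versions):

pip install torch matplotlib tqdm scipy scikit-learn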

CLIP Features

FAQ

Q1) I get the following error: "RuntimeError: Expected a 'cuda' device type for generator but found 'cpu'"

  • A1) Open venv/lib/python3.8/site-packages/torch/utils/data/sampler.py and find the __iter__ method of the RandomSampler class. Then change the line generator = torch.Generator() to generator = torch.Generator(device="cuda"), as sketched below.
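
For reference, a minimal sketch of the patched __iter__ follows; the surrounding lines are paraphrased from PyTorch's RandomSampler and may differ slightly between PyTorch versions, and only the torch.Generator(...) line needs to change:

    # venv/lib/python3.8/site-packages/torch/utils/data/sampler.py, class RandomSampler
    def __iter__(self):
        n = len(self.data_source)
        if self.generator is None:
            seed = int(torch.empty((), dtype=torch.int64).random_().item())
            # was: generator = torch.Generator()
            generator = torch.Generator(device="cuda")
            generator.manual_seed(seed)
        else:
            generator = self.generator
        # ... rest of the method unchanged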

Q2) I keep getting a CUDA OUT OF MEMORY error

  • A2) Each dataset requires a different amount of VRAM, and enabling the TSA module uses a significant amount of it, so plan accordingly when running on large public datasets such as ShanghaiTech Campus, XD-Violence, and UCF-Crime. If you only want to test the contribution of CLIP within the model, disable TSA by adding --disable_HA to the command; this requires less VRAM and should run on most GPUs (see the example below).
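
For example, combining the flag with the default entry point from the How to Run section:

python main.py --disable_HA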

How to Run

python main.py

Please review main.py first and adjust the hyperparameters and other parameters accordingly; otherwise, the model runs with the default settings.
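
If main.py exposes its options through argparse (as the --disable_HA flag above suggests), the available parameters and their defaults can be listed with:

python main.py --help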


Citations

@inproceedings{joo2023cliptsa,
  title={CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection},
  author={Joo, Hyekang Kevin and Vo, Khoa and Yamazaki, Kashu and Le, Ngan},
  doi={10.1109/ICIP49359.2023.10222289},
  url={https://ieeexplore.ieee.org/document/10222289},
  booktitle={IEEE International Conference on Image Processing (ICIP)},
  pages={3230--3234},
  year={2023},
  organization={IEEE}
}

Contacts

Kevin Hyekang Joo - hkjoo@cs.umd.edu


The code has been adapted in part from Yu Tian's RTFM.