Alessandro Flaborea*, Luca Collorone*, Guido D'Amely*, Stefano D'Arrigo*, Bardh Prenkaj, Fabio Galasso
The official PyTorch implementation of the IEEE/CVF International Conference on Computer Vision (ICCV) '23 paper Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection.
conda env create -f environment.yaml
conda activate mocodad
You can download the extracted poses for the datasets HR-Avenue, HR-ShanghaiTech and HR-UBnormal from the GDRive.
Place the extracted folder in a ./data
folder and change the configs accordingly.
To adapt your custom dataset to work with MoCoDAD, you can follow the structure below or look at the UBnormal dataset description here.
We also provide the code to extract poses and track actors in videos in the _annotations
folder.
{your_custom_dataset}
|
|__________ training
| |
| |__________ trajectories
| |
| |_________{scene_id}_{clip_id}
| |
| |_________00001.csv
| |_________...
| |_________0000{n}.csv
|
|__________ testing
| |
| |__________ trajectories
| | |
| | |_________{scene_id}_{clip_id}
| | |
| | |_________00001.csv
| | |_________...
| | |_________0000{n}.csv
| |
| |__________ test_frame_mask
| |
| |_______________{scene_id}_{clip_id}.npy
| |_______________...
| |_______________{scene_id}_{clip_id}.npy
|
|__________ validating
|
|__________ trajectories
| |
| |_________{scene_id}_{clip_id}
| |
| |_________00001.csv
| |_________...
| |_________0000{n}.csv
|
|__________ test_frame_mask
|
|_______________{scene_id}_{clip_id}.npy
|_______________...
|_______________{scene_id}_{clip_id}.npy
To train MoCoDAD, you can select the different type of conditioning of the model. The default parameters achieve the best results reported in the paper
In each config file you can choose the conditioning strategy and change the diffusion process parameters:
-
conditioning_strategy
- 'inject': Inject condition information into the model. The indices to be used as conditioning can be set using the 'conditioning_indices' parameter. Enabled by default.
- 'concat': concat conditioning and noised data to be passed to the model. The indices to be used as conditioning can be set using the 'conditioning_indices' parameter.
- 'inbetween_imp': Uses the list of indices of the 'conditioning_indices' parameter to select the indices to be used as conditioning.
- 'random_imp': 'conditioning_indices' must be int and it is used as the number of random indices that will be selected
- 'no_condition': if enabled, no motion condition is passed to the model
-
Diffusion Process
- noise_steps: how many diffusion steps have to be performed
Update the args 'data_dir', 'test_path', 'dataset_path_to_robust' with the path where you stored the datasets. To better track your experiments, change 'dir_name' and the wandb parameters.
To train MoCoDAD:
python train_MoCoDAD.py --config config/[Avenue/UBnormal/STC]/{config_name}.yaml
The training config is saved the associated experiment directory (/args.exp_dir/args.dataset_choice/args.dir_name
).
To evaluate the model on the test set, you need to change the following parameters in the config:
- split: 'Test'
- validation: 'False'
- load_ckpt: 'name_of_ckpt'
Test MoCoDAD
python eval_MoCoDAD.py --config /args.exp_dir/args.dataset_choice/args.dir_name/config.yaml
additional flag you can use:
- use_hr: False -> just for test. Use the entire version of the dataset or the Human-Related one.
The checkpoints for the pretrained models on the three datasets can be found HERE. To evaluate them follow the following steps:
- Download the checkpoints
- Add them to the corresponding folder
/checkpoints/[Avenue/UBnormal/STC]/pretrained_model
- Copy the config file /config/[Avenue/UBnormal/STC]/mocodad_test.yaml in the correct checkpoint folder
- Update the 'load_ckpt' field with the downloaded ckpt
- run
python eval_MoCoDAD.py --config `/checkpoints/[Avenue/UBnormal/STC]/pretrained_model/mocodad_test.yaml]
We provide the code to visualize frames, poses and anomaly scores. Follow the instruction in visualize for further details.
@InProceedings{Flaborea_2023_ICCV,
author = {Flaborea, Alessandro and Collorone, Luca and di Melendugno, Guido Maria D'Amely and D'Arrigo, Stefano and Prenkaj, Bardh and Galasso, Fabio},
title = {Multimodal Motion Conditioned Diffusion Model for Skeleton-based Video Anomaly Detection},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2023},
pages = {10318-10329}
}