Movie Genre Classification by Language Augmentation and Shot Sampling

Movie-CLIP is a video-based movie genre classification model. This repository contains the PyTorch implementation for our paper. If you find this code useful in your research, please consider citing:

@InProceedings{zhangMovie2024,
     author={Zhongping Zhang and Yiwen Gu and Bryan A. Plummer and Xin Miao and Jiayi Liu and Huayan Wang},
     title={Movie Genre Classification by Language Augmentation and Shot Sampling},
     booktitle={IEEE Winter Conference on Applications of Computer Vision (WACV)},
     year={2024}}

Set up environment

Libraries to train Movie-CLIP can be installed by:

pip install -r requirements.txt

Datasets:

We performed our experiments on two datasets, MovieNet and CondensedMovies. We provided the data we employed for model training and evaluation through the following links.

Datasets	Google Drive Link
MovieNet (Trailer30K)	Trailer30K Link
CondensedMovies	Condensed Movies Link

If you would like to obtain the original data, we also provide the code to collect videos using PyTube and youtube-transcript-api.

Take Trailer30K as an example:

Step 1 download trailer URLs and meta data

 wget https://download.openmmlab.com/datasets/movienet/trailer.urls.unique30K.v1.json # version 1.0
 wget https://download.openmmlab.com/datasets/movienet/meta.v1.zip # version 1.0

Step 2 download source videos from YouTube

python download_youtube_videos_captions.py

Train & Evaluate the Movie-CLIP model

Train and Evaluate mAP

Run the following script to train and evaluate Movie-CLIP on trailer30K:

sh scc_scripts/run_train_trailer30K.sh

Run the following script to train and evaluate Movie-CLIP on Condensed Movies:

sh scc_scripts/run_train_condensed_movies.sh

Note: To smoothly run the scripts, variables including DATA_DIR, VISUAL_FOLDER, VISUAL_FEATURE_VERSION, AUDIO_FEATURE_FILE, TEXT_TOKEN_FILE, TEXT_FEATURE_FILE need to be correctly defined.

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
dataloader		dataloader
models		models
scc_scripts		scc_scripts
LICENSE		LICENSE
README.md		README.md
download_youtube_videos_captions.py		download_youtube_videos_captions.py
requirements.txt		requirements.txt
shot_genre_classification.py		shot_genre_classification.py
test.py		test.py
train.py		train.py
train_audio_PANNs.py		train_audio_PANNs.py
train_text.py		train_text.py
train_visual_audio.py		train_visual_audio.py
train_visual_audio_text.py		train_visual_audio_text.py

License

Zhongping-Zhang/Movie-CLIP

Folders and files

Latest commit

History

Repository files navigation

Movie Genre Classification by Language Augmentation and Shot Sampling

Set up environment

Datasets:

Train & Evaluate the Movie-CLIP model

Train and Evaluate mAP

About

Resources

License

Stars

Watchers

Forks

Languages