GitHub - xiaobai1217/Unseen-Modality-Interaction: This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"

Learning Unseen Modality Interaction (NeurIPS 2023)

Yunhua Zhang, Hazel Doughty, Cees G.M. Snoek

This is the demo code for the video classification task using EPIC-Kitchens, with RGB and audio modalities.

Demo Code

Environment

Python 3.8.5
torch 1.12.1+cu113
torchaudio 0.12.1+cu113
torchvision 0.13.1+cu113
mmcv-full 1.7.0
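
As a convenience, the pinned versions above can be compared against the local installation before training. The following is a minimal sketch, not part of the repository: the helper names `installed_version` and `check_environment` are hypothetical, and it assumes the packages were installed with pip so that their metadata is visible to `importlib.metadata`.

```python
from importlib import metadata

# Versions listed in the Environment section of this README.
EXPECTED = {
    "torch": "1.12.1+cu113",
    "torchaudio": "0.12.1+cu113",
    "torchvision": "0.13.1+cu113",
    "mmcv-full": "1.7.0",
}


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def check_environment(expected=EXPECTED, lookup=installed_version):
    """Map each package whose version differs to an (expected, found) pair."""
    mismatches = {}
    for package, wanted in expected.items():
        found = lookup(package)
        if found != wanted:
            mismatches[package] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    # Hypothetical usage: print one line per mismatching package.
    for package, (wanted, found) in check_environment().items():
        print(f"{package}: expected {wanted}, found {found or 'not installed'}")
```

An empty result means every listed package matches. The Python version itself (3.8.5) is not checked here.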

Dataset

We download the RGB and optical flow frames from the official EPIC-Kitchens website and extract the audio from the videos ourselves with extract_audio.py.
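
For readers who want a rough idea of what audio extraction can look like, here is a minimal sketch that calls the `ffmpeg` command-line tool from Python. It is not the contents of extract_audio.py. The function names, the mono 16 kHz WAV output, and the example file name are all assumptions made for illustration, and `ffmpeg` must be installed separately.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path, audio_path, sample_rate=16000):
    """Build an ffmpeg command that writes the audio track of a video to a file.

    The mono channel layout and 16 kHz sample rate are assumptions, not
    settings taken from the repository.
    """
    return [
        "ffmpeg", "-y",            # overwrite the output file if it exists
        "-i", str(video_path),     # input video
        "-vn",                     # drop the video stream
        "-ac", "1",                # mono audio (assumption)
        "-ar", str(sample_rate),   # output sample rate in Hz (assumption)
        str(audio_path),
    ]


def extract_audio(video_path, output_dir, sample_rate=16000):
    """Extract audio to <output_dir>/<video stem>.wav and return that path."""
    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    audio_path = output_dir / (Path(video_path).stem + ".wav")
    command = build_ffmpeg_command(video_path, audio_path, sample_rate)
    subprocess.run(command, check=True)  # raises CalledProcessError on failure
    return audio_path
```

A call such as `extract_audio("P01_01.MP4", "audio")` (hypothetical file name) would write `audio/P01_01.wav`. Use the repository's extract_audio.py to reproduce the authors' setup.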

Run Demo

We provide the splits for training, validation and testing in the epic-annotations folder.
To run the code: python train.py --lr 1e-1 --batch_size 96 --save_name 1e-1
We then fine-tuned the model with reduced learning rates, as specified in bash.sh.
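
When launching several runs, it can help to assemble the training command programmatically. The sketch below only uses the three flags shown in the command above (`--lr`, `--batch_size`, `--save_name`). The helper `build_train_command` is hypothetical, and the reduced learning rates used for fine-tuning are not reproduced here; see bash.sh for the actual values and any additional flags.

```python
import shlex


def build_train_command(lr, batch_size=96, save_name=None):
    """Build the argument list for train.py using the flags from this README.

    `lr` is passed as a string so that its formatting (e.g. "1e-1") is kept.
    By default the run is saved under the learning rate's name, mirroring the
    example command.
    """
    if save_name is None:
        save_name = lr
    return [
        "python", "train.py",
        "--lr", lr,
        "--batch_size", str(batch_size),
        "--save_name", save_name,
    ]


# Reproduce the command given in this README as a shell-ready string.
command = build_train_command("1e-1")
print(shlex.join(command))
```

The list form can be passed directly to `subprocess.run(command, check=True)` from the repository root.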

Folders and files

checkpoints
epic-annotations
logs
mmaction
omnivision
omnivore_models
pretrained_models
.DS_Store
README.md
ast_configs.py
ast_model.py
bash.sh
cutmixup.py
dataloader_test.py
dataloader_train.py
dataloader_validation.py
extract_audio.py
feature_reorganization.py
rand_auto_aug.py
randomerasing.py
test.py
train.py
transforms.py
vit.py
