GitHub - WISION-Lab/eventful-transformer: Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"

Overview

This is the PyTorch code for our ICCV 2023 paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers." Please see our paper webpage and the arXiv paper.

Disclaimer

This is research-grade code, so it's possible you will encounter some hiccups. Contact me if you encounter problems or if the documentation is unclear, and I will do my best to help.

TLDR

Most of the interesting code (implementation of our core contributions) is in eventful_transformer/blocks.py, eventful_transformer/modules.py, and eventful_transformer/policies.py.

Dependencies

Dependencies are managed using Conda. The environment is defined in environment.yml.

To create the environment, run:

conda env create -f environment.yml

Then activate the environment with:

conda activate eventful-transformer

Running Scripts

Scripts should be run from the repo's base directory.

Many scripts expect a .yml configuration file as a command-line argument. These configuration files are in configs. The structure of the configs folder is set to mirror the structure of the scripts folder. For example, to run the base_672 evaluation for the ViTDet VID model:

./scripts/evaluate/vitdet_vid.py ./configs/evaluate/vitdet_vid/base_672.yml

Weights

Weights for the ViViT action recognition model (on Kinetics-400 and EPIC-Kitchens) are available here. We use the "ViViT Fact. Enc." weights.

Weights for the ViTDet object detection model (on COCO) are available here. We use the "Cascade Mask R-CNN, ViTDet, ViT-B" weights. Weights on ImageNet VID are available here (frcnn_vitdet_final.pth).

The weight names need to be remapped to work with this codebase. To remap the ViViT weights, run:

./scripts/convert/vivit.py <old_weights> <new_weights> ./configs/convert/vivit_b.txt

with <old_weights> and <new_weigtht> replaced by the path of the downloaded weights and the path where the converted weights should be saved, respectively.

To remap the ViTDet weights, run:

./scripts/convert/vitdet.py <old_weights> <new_weights> ./configs/convert/vitdet_b.txt

Some ViViT evaluation scripts assume a fine-tuned temporal sub-model. Fine-tuned weights can be downloaded here.

Alternatively, you can run the fine-tuning yourself. To do this, run a spatial configuration (to cache the forward pass of the spatial sub-model), followed by a train configuration. For example:

./scripts/spatial/vivit_epic_kitchens.py ./configs/spatial/vivit_epic_kitchens/50.yml

then

./scripts/train/vivit_epic_kitchens.py ./configs/train/vivit_epic_kitchens/final_50.yml

This will produce weights/vivit_b_epic_kitchens_final_50.pth.

Data

The datasets folder defines PyTorch Dataset classes for Kinetics-400, VID, and EPIC-Kitchens.

The Kinetics-400 class will automatically download and prepare the dataset on first use.

VID requires a manual download. Download vid_data.tar from here and place it at ./data/vid/data.tar. The VID class will take care of unpacking and preparing the data on first use.

EPIC-Kitchens also requires a manual download. Download the videos from here and place them in ./data/epic_kitchens/videos. Download the labels EPIC_100_train.csv and EPIC_100_validation.csv from here and place them in ./data/epic_kitchens. The EPICKitchens class will prepare the data on first use.

Other Setup

Scripts assume that the current working directory is on the Python path. In the Bash shell, run

export PYTHONPATH="$PYTHONPATH:."

Or in the Fish shell:

set -ax PYTHONPATH .

Code Style

Format all code using Black. Use a line limit of 88 characters (the default). To format a file, use the command:

black <FILE>

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
configs		configs
datasets		datasets
eventful_transformer		eventful_transformer
models		models
scripts		scripts
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

configs

configs

datasets

datasets

eventful_transformer

eventful_transformer

models

models

scripts

scripts

utils

utils

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

environment.yml

environment.yml

Repository files navigation

Overview

Disclaimer

TLDR

Dependencies

Running Scripts

Weights

Data

Other Setup

Code Style

About

Releases

Packages

Languages

License

WISION-Lab/eventful-transformer

Folders and files

Latest commit

History

Repository files navigation

Overview

Disclaimer

TLDR

Dependencies

Running Scripts

Weights

Data

Other Setup

Code Style

About

Resources

License

Stars

Watchers

Forks

Languages