I3D models trained on Vidor

Overview

This repo is the trunk net 4 2nd task of VidVRD: Video Relation Prediction

The 1st stage project: Video Object Detection

The Grand Challenge MM2019

This repository contains trained models reported in the paper "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset" by Joao Carreira and Andrew Zisserman.

This code is based on Deepmind's Kinetics-I3D. Including PyTorch versions of their models.

Download

Charades_v1_rgb

Vidor

Fine-tuning and Feature Extraction

We provide code to extract I3D features and fine-tune I3D for vidor. Our fine-tuned models on Vidor are also available in the models director (in addition to Deepmind's trained models). The Charades pre-trained models on Pytorch were saved to (flow_charades.pt and rgb_charades.pt). The deepmind pre-trained models were converted to PyTorch and give identical results (flow_imagenet.pt and rgb_imagenet.pt). These models were pretrained on imagenet and kinetics (see Kinetics-I3D for details).

Fine-tuning I3D

train_i3d.py contains the code to fine-tune I3D based on the details in the paper and obtained from the authors. Specifically, this version follows the settings to fine-tune on the Charades dataset based on the author's implementation that won the Charades 2017 challenge. The charades fine-tuned RGB and Flow I3D models are available in the model directory (rgb_charades.pt and flow_charades.pt).

This relied on having the optical flow and RGB frames extracted and saved as images on dist. vidor_dataset.py script VidorPytorchTrain Class contains the code to load charades video segments for training.

E.g.

python train_i3d.py -anno_rpath /storage/dldi/PyProjects/vidor/annotation -video_rpath /storage/dldi/PyProjects/vidor/train_vids -num_workers 0

This relied on having the optical flow and RGB frames extracted and saved as images on dist. charades_dataset.py contains the code to load charades video segments for training.

Feature Extraction

extract_features.py contains the code to load a pre-trained I3D model and extract the features and save the features as numpy arrays.

E.g.

python extract_features.py -anno_rpath /storage/dldi/PyProjects/vidor/annotation -video_rpath /storage/dldi/PyProjects/vidor/train_vids

The vidor_dataset.py script VidorPytorchExtract Class loads an entire video to extract per-segment features. The charades_dataset_full.py script loads an entire video to extract per-segment features.

Name		Name	Last commit message	Last commit date
Latest commit History 101 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
data		data
dataset		dataset
models		models
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
VORDInstance.py		VORDInstance.py
actions.json		actions.json
charades_dataset.py		charades_dataset.py
charades_dataset_full.py		charades_dataset_full.py
charades_extract_features.py		charades_extract_features.py
charades_train_i3d.py		charades_train_i3d.py
extract_features.py		extract_features.py
frames.py		frames.py
pytorch_i3d.py		pytorch_i3d.py
requirement.txt		requirement.txt
train_i3d.py		train_i3d.py
videotransforms.py		videotransforms.py
vidor_dataset.py		vidor_dataset.py
vidvrd_dataset.py		vidvrd_dataset.py
vidvrd_extract.py		vidvrd_extract.py
vord_utils.py		vord_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

I3D models trained on Vidor

Overview

Download

Fine-tuning and Feature Extraction

Fine-tuning I3D

Feature Extraction

About

Releases

Packages

Languages

License

ddl-donglin/I3D_pytorch

Folders and files

Latest commit

History

Repository files navigation

I3D models trained on Vidor

Overview

Download

Fine-tuning and Feature Extraction

Fine-tuning I3D

Feature Extraction

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages