SCALE: Spatio-Temporal Crop Aggregation for Video Representation Learning

Paper

This repo contains code for extracting teacher representations, training the proposed method, and probing the resulting representation from our SCALE paper (ICCV 2023).

Spatio-Temporal Crop Aggregation for Video Representation Learning
Sepehr Sameni, Simon Jenni, Paolo Favaro
University of Bern, Adobe Research, University of Bern
ICCV 2023

Getting Started

To load videos quickly, we use the video_reader backend, which requires building TorchVision from source. We used CUDA 11.6 and Python 3.8; please follow this guide to install the required packages.
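After building TorchVision, it can be useful to confirm that the video_reader backend is actually usable before launching a long job. The following is a minimal sketch, not part of this repo: the helper name `video_reader_available` is hypothetical, and it relies only on TorchVision's `set_video_backend` and `get_video_backend` functions.

```python
def video_reader_available():
    """Return True if TorchVision can use the video_reader backend."""
    try:
        import torchvision
    except (ImportError, OSError):
        # TorchVision is missing or its native libraries failed to load.
        return False
    try:
        # Newer releases raise RuntimeError when the backend was not compiled
        # in; older releases only warn and keep the previous backend.
        torchvision.set_video_backend("video_reader")
    except RuntimeError:
        return False
    # Confirm that the switch really took effect.
    return torchvision.get_video_backend() == "video_reader"


if __name__ == "__main__":
    print("video_reader available:", video_reader_available())
```

If this prints `False`, TorchVision was most likely installed from a prebuilt wheel without video_reader support and needs to be rebuilt from source.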

Usage

Below are example commands for each stage: feature extraction, training, and probing.

Extracting Pretrained Features

python extract_features.py --model train --dataset ucf # for all the pretrained teachers

Training SCALE

python train.py --exp_name BYOL-UCF --epochs 1000 --initialization_model byol --dataset ucf

Probing SCALE

python eval.py --exp_name BYOL-UCF --dataset ucf --load -1 --freeze
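The three stages above run in a fixed order (extract, train, probe), so they can be chained from one script. The sketch below is an illustration only and is not shipped with this repo: `build_pipeline` and `run_pipeline` are hypothetical helpers, the flags are copied from the commands above, and the value `"byol"` for `--initialization_model` is an assumed spelling that should match whatever `train.py` accepts. It defaults to a dry run that only prints the commands.

```python
import shlex
import subprocess
import sys


def build_pipeline(exp_name, dataset, init_model, epochs=1000):
    """Return the three README commands as argv lists, in execution order."""
    py = sys.executable  # reuse the interpreter running this script
    return [
        # 1. Extract features from the pretrained teachers.
        [py, "extract_features.py", "--model", "train", "--dataset", dataset],
        # 2. Train SCALE on top of the extracted features.
        [py, "train.py", "--exp_name", exp_name, "--epochs", str(epochs),
         "--initialization_model", init_model, "--dataset", dataset],
        # 3. Probe the trained representation.
        [py, "eval.py", "--exp_name", exp_name, "--dataset", dataset,
         "--load", "-1", "--freeze"],
    ]


def run_pipeline(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            # check=True stops the pipeline at the first failing stage.
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # "byol" is an assumed identifier; use the value train.py expects.
    cmds = build_pipeline("BYOL-UCF", "ucf", "byol")
    run_pipeline(cmds, dry_run=True)
```

Passing `dry_run=False` executes the stages in sequence from the directory that contains the three scripts.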

Reference

If you find our code useful for your research, please cite our paper.

@article{Sameni2022SpatioTemporalCA,
  title={Spatio-Temporal Crop Aggregation for Video Representation Learning},
  author={Sepehr Sameni and Simon Jenni and Paolo Favaro},
  journal={ArXiv},
  year={2022},
  volume={abs/2211.17042},
  url={https://api.semanticscholar.org/CorpusID:254096149}
}
