AViD Dataset: Anonymized Videos from Diverse Countries

AViD is a large-scale video dataset published at NeurIPS 2020 (AViD NeurIPS details for video, poster and presentation information). It has 467k videos and 887 action classes. Importantly, AViD has several key attributes:

Static

The collected videos have a Creative-Commons License, allowing us to create and distribute a static dataset collected from various web sources (e.g., Flickr, Instagram, etc.). Unlike other YouTube-based datasets (e.g., Kinetics), the dataset is static and easily downloadable enabling reproducible research. We further release this dataset under a flexible MIT license, unlike more restrictive video datasets (e.g., Moments-in-Time and SomethingSomething). The dataset has similar size to the other standard video datasets.

Anonymized

All the faces in the videos have been blurred so that no person can be identified.

Diverse

The videos have been collected from a wide range of countries and sources. This is important as some actions, for example greeting, are performed differently in different cultures. Other actions, like news broadcasts, can have different text depending on the country. We find the model is unable to recognize videos from different countries without diverse training data (Tables 3, 4 and 5 in the paper).

Classes

The AViD dataset consists of 887 activity classes, capturing similiar actions to those in Kinetics, plus some additional actions such as talking, explosion, boating, etc. The classes follow a long-tailed distribution. More details on the classes and hierarchy of actions are described in the paper.

Baselines

Method	AViD Accuracy
2D ResNet-50	36.2
I3D	46.8
3D ResNet-50	48.2
Two-Stream 3D ResNet-50	50.1
RepFlow	50.5
(2+1)D ResNet-50	48.8
SlowFast-50 4x4	48.5
SlowFast-50 8x8	50.4
SlowFast-101 16x8	50.9

Dataset

The annotations are provided in this repository, in the dataset directory. This contains the full dataset with labels and weak tags as well as classification-only train and validation sets.

The videos can be downloaded. https://vision.cs.stonybrook.edu/~mryoo/avid/avid.tar.gz MD5 sum is available at https://vision.cs.stonybrook.edu/~mryoo/avid/avid.tar.gz.md5

Paper

AJ Piergiovanni and Michael S. Ryoo "AViD Dataset: Anonymized Videos from Diverse Countries" in NeurIPS 2020

arXiv

@inproceedings{aviddataset,
      title={AViD Dataset: Anonymized Videos from Diverse Countries},
      booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
      author={AJ Piergiovanni and Michael S. Ryoo},
      year={2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
dataset		dataset
LICENSE		LICENSE
README.md		README.md
arch_excv.gif		arch_excv.gif
datasetsize.png		datasetsize.png
geographic.png		geographic.png
ice_climb.gif		ice_climb.gif
longtail.png		longtail.png
shake_head.gif		shake_head.gif
tractor.gif		tractor.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AViD Dataset: Anonymized Videos from Diverse Countries

Static

Anonymized

Diverse

Classes

Baselines

Dataset

Paper

About

Releases

Packages

License

piergiaj/AViD

Folders and files

Latest commit

History

Repository files navigation

AViD Dataset: Anonymized Videos from Diverse Countries

Static

Anonymized

Diverse

Classes

Baselines

Dataset

Paper

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages