Skip to content
2.5D visual sound dataset
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
binaural_audios
splits
videos
CODE_OF_CONDUCT.md
LICENSE
README.md
data_collection_rig.png

README.md

FAIR-Play Dataset from 2.5D Visual Sound

[Project Page] [arXiv] [Video]


2.5D Visual Sound
Ruohan Gao1 and Kristen Grauman2
1UT Austin, 2Facebook AI Research
In Conference on Computer Vision and Pattern Recognition (CVPR), 2019


This repository (~100G) contains the FAIR-Play dataset we collected and used in our CVPR 2019 paper. It contains 1,871 video clips and their corresponding binaural audio clips recorded in a music room. The video clip and binaural clip of the same index are roughly aligned. The splits directory contains the 10 random splits used in the paper.

If you find our data or project useful in your research, please cite:

    @inproceedings{gao2019visualsound,
      title={2.5D Visual Sound},
      author={Gao, Ruohan and Grauman, Kristen},
      booktitle={CVPR},
      year={2019}
    }

Acknowlegements

We would like to thank Tony Miller, Jacob Donley, Pablo Hoffmann and Vladimir Tourbabin from Facebook for helpful discussions and the volunteers who participate in our data collection.

Licence

FAIR-Play is CC BY 4.0 licensed, as found in the LICENSE file.

You can’t perform that action at this time.