Video Person Clustering

Repo for the Video Person Clustering dataset, and code for the associated paper. This reporsitory contains the Video Person Clustering Dataset (below), and the code (coming soon...) from the associated paper, for the task of video person-clustering

Video Person Clustering Dataset (VPCD)

The dataset can be downloaded here. The tar.gz file contains the dataset, and a README detailing the contents

VPCD is built upon popular video datasets that are commonly used in the Computer Vision community (e.g. TBBT, Buffy, Friends, Sherlock, About Last Night, Hidden Figures)

Code

The code to produce video person-clustering results: Coming soon...

Important Notes

Details for the raw resolution of the videos, and the frame rates used in the dataset, can be found in this document

Currently the available dataset does not have the exact statistics quoted in the paper. A corrected version will be made available soon

Paper

If you find VPCD, or the code useful, please consider citing:

@misc{brown2021face,
      title={Face, Body, Voice: Video Person-Clustering with Multiple Modalities}, 
      author={Andrew Brown and Vicky Kalogeiton and Andrew Zisserman},
      year={2021},
      eprint={2105.09939},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
LICENSE		LICENSE
README.md		README.md
VPCD.png		VPCD.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENSE

LICENSE

README.md

README.md

VPCD.png

VPCD.png

Repository files navigation

Video Person Clustering