The dataset is built from videos with permissive licences (mostly open source movies).
Each video in the dataset is identified by a hash computed over the first 32 KiB of the video file (source code). Use the hash to verify that a downloaded file is the same one that was used to build the dataset.
Video | Download page | Hash | Metadata |
---|---|---|---|
Tears of Steel | New version (4k rendered) - HD | 3da6f3053e50b704bca44da452f01643535259a7 | tears-of-steel.json |
Valkaama | Valkaama HD - 720p | d048ba7479562dc188c85a72c92ca57e18a4c3b0 | valkaama.json |
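
The verification described above can be sketched in a few lines of Python. This is a minimal sketch, not the dataset's own hashing code: the function names `video_hash` and `matches_dataset` are hypothetical, and the choice of SHA-1 is an assumption based only on the 40-hex-digit length of the hashes in the table. If the check fails for a file you trust, consult the dataset's source code for the exact algorithm.

```python
import hashlib

# The dataset hashes only the beginning of each file: the first 32 KiB.
CHUNK_SIZE = 32 * 1024


def video_hash(path, algorithm="sha1"):
    """Return the hex digest of the first 32 KiB of the file at `path`.

    ASSUMPTION: SHA-1 is a guess inferred from the 40-character hashes
    in the table; the dataset's source code is the authority.
    """
    with open(path, "rb") as f:
        head = f.read(CHUNK_SIZE)  # shorter files are hashed in full
    return hashlib.new(algorithm, head).hexdigest()


def matches_dataset(path, expected_hash):
    """Check a downloaded file against a value from the Hash column."""
    return video_hash(path) == expected_hash.strip().lower()
```

For example, `matches_dataset("valkaama.mp4", "d048ba7479562dc188c85a72c92ca57e18a4c3b0")` would compare a local download (the file name here is a placeholder) against the Valkaama entry. Because only the first 32 KiB are read, the check is fast but cannot detect corruption later in the file.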