Hierarchical-Model-for-Long-term-Video-Prediction

Given the preceding frames of a video as input, the goal is to predict frames far into the future (long-term frame prediction).
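As a minimal illustration of this task setup, the sketch below splits a frame sequence into the observed context frames and the future frames to be predicted. The function name `split_context_future` and its parameters are hypothetical and are not part of this repository.

```python
def split_context_future(frames, n_context, n_future):
    """Split a time-ordered frame sequence into inputs and prediction targets.

    `frames` can be any sliceable sequence (a list or an array of frames).
    All names here are illustrative assumptions, not this repository's API.
    """
    if n_context <= 0 or n_future <= 0:
        raise ValueError("n_context and n_future must be positive")
    if n_context + n_future > len(frames):
        raise ValueError("sequence is too short for the requested split")
    # The first n_context frames are observed by the model.
    context = frames[:n_context]
    # The next n_future frames are what the model should predict.
    future = frames[n_context:n_context + n_future]
    return context, future
```

For example, a 10-frame clip with `n_context=4` and `n_future=3` yields frames 0 to 3 as input and frames 4 to 6 as targets.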

We adopt the human pose prediction method of: Villegas, Ruben, et al. "Learning to Generate Long-term Future via Hierarchical Prediction." arXiv:1704.05831 (2017)

Authors: Peter Wang, Zhongxia Yan, Jeffrey Zhang

Note: The latest version of TensorFlow is required.

To run our code, first download the Penn Action Dataset: http://dreamdragon.github.io/PennAction/ (Weiyu Zhang, Menglong Zhu and Konstantinos Derpanis, "From Actemes to Action: A Strongly-supervised Representation for Detailed Action Understanding," International Conference on Computer Vision (ICCV), Dec 2013).

Then download the pretrained AlexNet model by running: ./models/download.sh

Next, download the pretrained VGG model from https://mega.nz/#!YU1FWJrA!O1ywiCS2IiOlUCtCpI6HTJOMrneN-Qdv3ywQP5poecM and rename the file to vgg.npy.

After that, run preposs.ipynb
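The rename step for the VGG weights can be scripted. The sketch below is one possible helper, not part of this repository: the function name `install_vgg_weights` is hypothetical, and because this README does not say which directory the code expects vgg.npy in, the target directory is left as a parameter.

```python
import shutil
from pathlib import Path


def install_vgg_weights(downloaded_file, models_dir):
    """Move a downloaded VGG weight file to <models_dir>/vgg.npy.

    Assumption: `models_dir` is wherever the code looks for vgg.npy;
    the README does not specify this location.
    """
    src = Path(downloaded_file)
    if not src.is_file():
        raise FileNotFoundError(f"downloaded weights not found: {src}")
    dst_dir = Path(models_dir)
    dst_dir.mkdir(parents=True, exist_ok=True)
    dst = dst_dir / "vgg.npy"
    # Move (rename) the file so it carries the required name.
    shutil.move(str(src), str(dst))
    return dst
```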

Run our LSTM model and analogy network model separately.
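These two models correspond to the two stages of the hierarchical approach in Villegas et al.: an LSTM predicts future poses from the observed poses, and the analogy network generates each future frame from the last observed frame, its pose, and a predicted pose. The sketch below only illustrates how the two stages might be chained. `pose_predictor` and `frame_generator` are hypothetical callables standing in for the trained models, and do not reflect this repository's actual interfaces.

```python
def predict_future_frames(context_frames, context_poses,
                          pose_predictor, frame_generator, n_future):
    """Chain pose prediction and frame generation (illustrative only).

    Assumed, hypothetical interfaces:
      pose_predictor(context_poses, n_future) -> sequence of n_future poses
      frame_generator(ref_frame, ref_pose, target_pose) -> one frame
    """
    if not context_frames:
        raise ValueError("at least one context frame is required")
    if len(context_frames) != len(context_poses):
        raise ValueError("need exactly one pose per context frame")
    # Stage 1: predict the future poses from the observed ones.
    future_poses = pose_predictor(context_poses, n_future)
    # Stage 2: generate each future frame from the last observed
    # frame and pose together with one predicted pose.
    ref_frame, ref_pose = context_frames[-1], context_poses[-1]
    return [frame_generator(ref_frame, ref_pose, p) for p in future_poses]
```

Keeping the stages as separate callables mirrors the instruction to run the two models separately: predicted poses can be saved by the first stage and consumed later by the second.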
