Skip to content
GCPR 2019: Learning to Disentangle Latent Physical Factors for Video Prediction
Python
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.README
scripts
video_prediction
LICENSE.md
README.md

README.md

Learning to Disentangle Latent Physical Factors for Video Prediction

This repository contains datasets, code for dataset initialization and MIG evaluation scripts corresponding to:

D. Zhu, M. Munderloh, B. Rosenhahn, J. Stückler. Learning to Disentangle Latent Physical Factors for Video Prediction. German Conference on Pattern Recognition (GCPR) 2019.

A video demonstrating the results can be found here

Datasets Description

Three video datasets describing physical scenarios. Each sequence in these datasets has 10 frames in 1 second. Resolution is 128x128.

Sliding Set

  • Objects sliding on a plane
  • Varying discrete shape, scale, friction, speed and position
  • 26000 sequences with 20000/3000/3000 for training, validation, and test.

Wall Set

  • Objects sliding into a wall
  • Varying discrete shape, scale, material (density, restitution, friction, color), initial speed and position
  • 10125 sequences with 7425/1350/1350 for training, validation, and test.

Collision Set

  • Two objects sliding into each other
  • Varying discrete shape, scale, material (density, restitution, friction, color), initial speed and position
  • 30000 sequences with 25000/2500/2500 for training, validation, and test.

How to Use

Datasets can be downloaded here: Datasets.zip (md5sum: 27ca28c4646c4fa77911338061f0c820)

Data are in the '.tfrecord' form. The code to load datasets can be found in the folder 'video_prediction/datastes'. The file 'scripts/eval_mig.py' demonstrates how to initialize these datasets. Besides, it is also our implementation for Mutual Information Gap evaluation. TensorFlow version is v1.12.

Our code is based on Alex X. Lee's SAVP and Ricky Tian Qi Chen's beta-TCVAE. Their License can also be found in the license file.

Citation

If you find this useful for your research, please cite the following:

@article{Deyao2019GCPR,
    author    = {Deyao Zhu and Marco Munderloh and Bodo Rosenhahn and Jörg Stückler},
    title     = {Learning to Disentangle Latent  Physical Factors for Video Prediction},
    journal   = {German Conference on Pattern Recognition (GCPR)},
    year      = {2019},
}
You can’t perform that action at this time.