Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control

Yaofeng Desmond Zhong, Naomi Ehrich Leonard | 2020

This repository is the official implementation of Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control. This implementation is refactored from the original implementation for readability.

Update

In the 2020 NeurIPS Proceedings, there is an error in Equation (3), which is corrected in this commit. The main results and conclusion of the paper are not affected. We thank Rembert Daems for spotting the error in Equation (3). The results shown on this page and the notebooks are updated based on training with the correct version of Equation (3).

The updated pretrained models can be downloaded .

Requirements

This implementation is written with PyTorch and handles training with PyTorch-Lightning, which makes our code easy to read and our results easy to reproduce.

Please use requirements.txt to install required dependencies:

pip install -r requirements.txt

In case you encounter problems, we also provide a requirements_freeze.txt which contains the output by pip freeze.

Dataset

The data are hosted on Zenodo.

Please see for more details.

Training

To train the model, run these commands:

# to train the pendulum example
python examples/pend_lag_cavae_trainer.py 
# to train the fully-actuated cartpole example
python examples/cart_lag_cavae_trainer.py 
# to train the fully-actuated acrobot example
python examples/acro_lag_cavae_trainer.py

The commands above train the model on CPUs. Due to a bug in torch.nn.functional.grid_sample, you might encounter a segmentation fault if you train the model on GPUs. This bug has not been fixed in the latest PyTorch version (1.6.0) when this work has been done.

However, I successfully trained the pendulum example on GPU without error. Thanks to PyTorch-Lightning, you can train it on GPU with

python examples/pend_lag_cavae_trainer.py --gpus 1

For more details, please check out the .

Evaluation

To analyze results, please check out the jupyter notebooks in .

Pre-trained models

Pre-trained models can be downloaded from github releases ( and )

Please unzip the downloaded file and put the checkpoints in .

Thanks to Pytorch-Lightning, after loading the checkpoints, you can check the training hyperparameters in the hparams property. For example, please check out the last cell in .

Results

The following results can be reproduced by the notebooks in .

Prediction

Prediction up to 60 time steps. The models here are trained with T_pred=4.

Examples	True	Lagrangian + caVAE	MLPdyn + caVAE	Lagrangian + VAE	HGN
Pendulum
CartPole

Our model (Lagrangian + caVAE) generate realistic long term prediction.

Control

One image of each target position is given in each task. The target positions here are inverted positions of the systems.

Energy-based controllers are able to conrol the pendulum, fully-actuated CartPole and fully-actuated Acrobot to the target position based on the learned dynamical and embedding (Lagrangian_caVAE) model.

Acknowledgement

This research has been supported in part by ONR grant #N00014-18-1-2873 and by the School of Engineering and Applied Science at Princeton University through the generosity of William Addy ’82. Yaofeng Desmond Zhong would like to thank Christine Allen-Blanchette, Shinkyu Park, Sushant Veer and Anirudha Majumdar for helpful discussions.

This implemention uses by Nicola De Cao for modelling rotational coordiantes.

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
ablations		ablations
analysis		analysis
checkpoints		checkpoints
datasets		datasets
examples		examples
figures		figures
hyperspherical_vae		hyperspherical_vae
lag_caVAE		lag_caVAE
myenv		myenv
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
requirements_freeze.txt		requirements_freeze.txt
utils.py		utils.py

License

DesmondZhong/Lagrangian_caVAE

Folders and files

Latest commit

History

Repository files navigation

Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control

Update

Requirements

Dataset

Training

Evaluation

Pre-trained models

Results

Prediction

Control

Acknowledgement

About

Topics

Resources

License

Stars

Watchers

Forks

Languages