dynamics-learn

Working directory for my work on model-based reinforcement learning for novel robots. Best for robots with high test cost and difficult to model dynamics. Contact: nol@berkeley.edu First paper website: https://sites.google.com/berkeley.edu/mbrl-quadrotor/ There is current future work using this library, such as attempting to control the Ionocraft with model-based RL. https://sites.google.com/berkeley.edu/mbrl-ionocraft/

Note that I have been very actively developing in this repo, please reach out if you have any questions of accuracy in the readme.

This directory is working towards an implementation of many simulated model-based approaches on real robots. For current state of the art in simulation, see this work from Prof Sergey Levine's group: Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models.

Future implementations work towards controlled flight of the ionocraft, with a recent publication in Robotics and Automation Letters and in the future for transfer learning of dynamics on the Crazyflie 2.0 Platform.

Some potentially noteable implementations include:

probablistic nueral network in pytorch
gaussian loss function for said pytorch probablistic neural network
random shooting MPC implementation with customizable cost / reward function (See cousin repo: https://github.com/natolambert/ros-crazyflie-mbrl)

Usage is generally of the form, with hydra enabling more options:

$ python learn/trainer.py robot=iono

Main Scripts:

learn/trainer.py: is for training dynamics models (P,PE,D,DE) on experimental data. The training process uses Hydra to allow easy configuration of which states are used and how the predictions are formatted.
learn/simulate_mpc.py: a script that runs MBRL with a MPC on a simulated environment.
learn/bo.py: For generating PID parameters using a dynamics model as a simulation environment. This will eventually extend beyond PID control. See the controllers directory learn/control. I am working to integrate opto.
learn/plot.py: a script for viewing different types of predictions, under improvement

In Development:

learn/pipps_experiment.py: A reimplementation of the paper "PIPPS: Flexible Model-Based Policy Search Robust to the Curse of Chaos". I wrote a blog post summarizing the main derivation behind this work here.

Related Code for Experiments:

CF Firmware: https://github.com/natolambert/crazyflie-firmware-pwm-control

forked from: https://github.com/bitcraze/crazyflie-firmware

Ros code: https://github.com/natolambert/ros-crazyflie-mbrl

uh from: https://github.com/whoenig/crazyflie_ros
want to use: https://github.com/USC-ACTLab/crazyswarm

Name		Name	Last commit message	Last commit date
Latest commit History 315 Commits
ex_data		ex_data
learn		learn
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dynamics-learn

Main Scripts:

In Development:

Related Code for Experiments:

About

Releases

Packages

Contributors 6

Languages

License

natolambert/dynamicslearn

Folders and files

Latest commit

History

Repository files navigation

dynamics-learn

Main Scripts:

In Development:

Related Code for Experiments:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 6

Languages

Packages