Safe Reinforcement Learning with Stability Guarantees
This code accompanies the paper  and implements the code for estimating the region of attraction for a policy and optimizing the policy subject to stability constraints. For the old numpy-based code to estimate the region of attraction in  see the lyapunov-learning repository.
|||F. Berkenkamp, M. Turchetta, A. P. Schoellig, A. Krause, Safe Model-based Reinforcement Learning with Stability Guarantees in Proc. of the Conference on Neural Information Processing Systems (NIPS), 2017.|
|||F. Berkenkamp, R. Moriconi, A. P. Schoellig, A. Krause, Safe Learning of Regions of Attraction in Uncertain, Nonlinear Systems with Gaussian Processes in Proc. of the Conference on Decision and Control (CDC), 2016.|
You can install the library by cloning the repository and installing it with
pip install . --process-dependency-links
To run the tests in scripts/test_code.sh, you need to install additional dependencies with
pip install .[test] --process-dependency-links
--process-dependency-links flag is needed to install
gpflow==0.4.0, which is not on pypi. You can skip it if that particular version of the library is already installed.
You can the find example jupyter notebooks and the experiments in the paper in the examples folder.