Skip to content

Convergent Policy Optimization for Safe Reinforcement Learning

Notifications You must be signed in to change notification settings

ming93/Safe_reinforcement_learning

Repository files navigation

Description

Codes for the constrained Linear-Quadratic Regulator (LQR) experiment.

Reference

Ming Yu, Zhuoran Yang, Mladen Kolar, and Zhaoran Wang. Convergent Policy Optimization for Safe Reinforcement Learning. In NeurIPS 2019.

Run codes

Run "Safe_RL_LQR_experiment.m"

About

Convergent Policy Optimization for Safe Reinforcement Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published