Risk averse reinforcement learning.

This code combines recent progress in Distributional Reinforcement Learning with Quantile Regression (https://arxiv.org/pdf/1710.10044.pdf) with classic theory to create risk averse, safe algorithm.

To download the code:

git clone https://github.com/sannebh/riskaverse/

The cartpole environment is implemented using OpenAI Gym. This is a toolkit for developing and comparing reinforcement learning algorithms. Installation guidelines and other information can be found at https://github.com/openai/gym.

We use Pytorch to build the networks, for installation and other information please see https://pytorch.org/.

The common folder contains helper functions and classes: layers, replay buffert and wrappers. They are all from Open AI Baselines (https://github.com/openai/baselines).

The risk averse strategies can be found in risk_strategies.py. wind_world.py and contains the Windy World environment.

qr-dqn_cart and qr-dqn_windy contains the implementations themselves, in Cartpole Environment and Windy Gridworld environment.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
common		common
README.md		README.md
adjusted_env.py		adjusted_env.py
qr-dqn_cart.py		qr-dqn_cart.py
qr-dqn_windy.py		qr-dqn_windy.py
risk_strategies.py		risk_strategies.py
wind_world.py		wind_world.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Risk averse reinforcement learning.

About

Releases

Packages

Languages

sannebh/riskaverse

Folders and files

Latest commit

History

Repository files navigation

Risk averse reinforcement learning.

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages