RL-Adventure-2: Policy Gradients

PyTorch tutorial of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

The deep reinforcement learning community has made several improvements to the policy gradient algorithms. This tutorial presents latest extensions in the following order:

Advantage Actor Critic (A2C)

actor-critic.ipynb
A3C Paper
OpenAI blog

High-Dimensional Continuous Control Using Generalized Advantage Estimation

gae.ipynb
GAE Paper

Proximal Policy Optimization Algorithms

ppo.ipynb
PPO Paper
OpenAI blog

Sample Efficient Actor-Critic with Experience Replay

acer.ipynb
ACER Paper

Continuous control with deep reinforcement learning

ddpg.ipynb
DDPG Paper

Addressing Function Approximation Error in Actor-Critic Methods

td3.ipynb
Twin Dueling DDPG Paper

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

soft actor-critic.ipynb
Soft Actor-Critic Paper

Generative Adversarial Imitation Learning

gail.ipynb
GAIL Paper

Hindsight Experience Replay

her.ipynb
HER Paper
OpenAI Blog

If you get stuck…

Remember you are not stuck unless you have spent more than a week on a single algorithm. It is perfectly normal if you do not have all the required knowledge of mathematics and CS.
Carefully go through the paper. Try to see what is the problem the authors are solving. Understand a high-level idea of the approach, then read the code (skipping the proofs), and after go over the mathematical details and proofs.

RL Algorithms

Deep Q Learning tutorial: DQN Adventure: from Zero to State of the Art Awesome RL libs: rlkit @vitchyr, pytorch-a2c-ppo-acktr @ikostrikov, ACER @Kaixhin

Best RL courses

Berkeley deep RL link
Deep RL Bootcamp link
David Silver's course link
Practical RL link

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

RL-Adventure-2: Policy Gradients

If you get stuck…

RL Algorithms

Best RL courses

Files

README.md

Latest commit

History

README.md

File metadata and controls

RL-Adventure-2: Policy Gradients

If you get stuck…

RL Algorithms

Best RL courses