peiranli/RL

RLalgorithms

RL agents implemented with various reinforcement learning algorithms, tested mainly on OpenAI Gym environments. Both the discrete and continuous action-space versions currently work. The continuous versions can solve Pendulum in around 1000 episodes.

Dependencies: OpenAI Gym, PyTorch

  1. Advantage Actor Critic (A2C)
  2. Proximal Policy Optimization
  3. Deep Deterministic Policy Gradient
  4. Deep Q Learning and Double Q Learning
  5. Policy Gradient
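
As a rough illustration of a quantity the policy-gradient family above relies on (Policy Gradient, A2C), the sketch below computes discounted returns for one episode and subtracts a value baseline to get advantages. It is not taken from this repository: the function names `discounted_returns` and `advantages`, the default `gamma`, and the plain-list inputs are all assumptions made for the example, and it uses only the standard library rather than PyTorch.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Compute G_t = r_t + gamma * G_{t+1} for every step of one episode.

    Hypothetical helper for illustration; not part of this repository.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def advantages(returns: List[float], values: List[float]) -> List[float]:
    """Advantage estimate A_t = G_t - V(s_t), given critic value estimates."""
    return [g - v for g, v in zip(returns, values)]


if __name__ == "__main__":
    episode_rewards = [1.0, 1.0, 1.0]
    g = discounted_returns(episode_rewards, gamma=0.5)
    print(g)
    print(advantages(g, [1.0, 1.0, 1.0]))
```

In a plain policy-gradient agent the returns would typically weight the log-probabilities of the chosen actions, while an actor-critic agent would use the advantages instead. How the agents in this repository actually compute these terms may differ.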