A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
-
Updated
Apr 8, 2021 - Python
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
Actor Critic using Kronecker-Factored Trust Region
Solving CartPole-v1 environment in Keras with Actor Critic algorithm an Deep Reinforcement Learning algorithm
Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)
Implementations of some of the most well known Deep Reinforcement Learning algorithms
Deep Reinforcement Learning: On-Policy Actor Critic methods. An implementation of Advantage Actor-Critic (A2C) and Proximal Policy Optimization (PPO) on the PyTorch Lightning framework.
unRL (AKA "unreal") is a set of libraries providing Reinforcement Learning algorithms implemented in PyTorch or Jax.
Add a description, image, and links to the actor-critic-methods topic page so that developers can more easily learn about it.
To associate your repository with the actor-critic-methods topic, visit your repo's landing page and select "manage topics."