Build software better, together

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

reinforcement-learning deep-reinforcement-learning q-learning dqn policy-gradient sarsa a3c ddpg imitation-learning double-dqn dueling-dqn ppo td3 easy-rl

Updated Sep 6, 2025
Jupyter Notebook

MorvanZhou / Reinforcement-learning-with-tensorflow

Star

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Updated Mar 31, 2024
Python

thu-ml / tianshou

Star

An elegant PyTorch deep reinforcement learning library.

pytorch dqn policy-gradient rl cql atari ddpg imitation-learning sac drl npg double-dqn trpo mujoco ppo a2c td3 bcq transferlab

Updated Nov 18, 2025
Python

vwxyzjn / cleanrl

Star

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

python machine-learning reinforcement-learning deep-learning deep-reinforcement-learning pytorch gym atari actor-critic ale proximal-policy-optimization ppo advantage-actor-critic a2c wandb phasic-policy-gradient

Updated Jul 8, 2025
Python

udacity / deep-reinforcement-learning

Star

Repo for the Deep Reinforcement Learning Nanodegree program

reinforcement-learning deep-reinforcement-learning openai-gym pytorch dqn neural-networks reinforcement-learning-algorithms dynamic-programming hill-climbing ddpg cross-entropy openai-gym-solutions pytorch-rl ppo ml-agents rl-algorithms

Updated Nov 16, 2023
Jupyter Notebook

sweetice / Deep-reinforcement-learning-with-pytorch

Star

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

algorithm deep-learning deep-reinforcement-learning pytorch dqn policy-gradient sarsa resnet a3c reinforce sac alphago actor-critic trpo ppo a2c actor-critic-algorithm td3

Updated Mar 24, 2023
Python

andri27-ts / Reinforcement-Learning

Star

Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning

machine-learning reinforcement-learning qlearning deep-learning deep-reinforcement-learning artificial-intelligence dqn deepmind evolution-strategies ppo a2c policy-gradients

Updated Jun 30, 2020
Jupyter Notebook

AI4Finance-Foundation / ElegantRL

Star

Massively Parallel Deep Reinforcement Learning. 🔥

lightweight reinforcement-learning gae efficient pytorch stable dqn ddpg sac per multiple-gpu ppo a2c td3 model-free-rl drl-pytorch bipedalwalkerhardcore

Updated Oct 26, 2025
Python

simoninithomas / Deep_reinforcement_learning_Course

Star

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

qlearning deep-learning unity tensorflow deep-reinforcement-learning pytorch tensorflow-tutorials deep-q-network actor-critic deep-q-learning ppo a2c

Updated May 2, 2023
Jupyter Notebook

ikostrikov / pytorch-a2c-ppo-acktr-gail

Star

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Updated May 29, 2022
Python

ShangtongZhang / DeepRL

Star

Modularized Implementation of Deep RL Algorithms in PyTorch

deep-reinforcement-learning rainbow pytorch dqn ddpg double-dqn dueling-network-architecture quantile-regression option-critic-architecture deeprl categorical-dqn ppo a2c prioritized-experience-replay option-critic td3

Updated Apr 16, 2024
Python

XinJingHao / DRL-Pytorch

Star

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

machine-learning reinforcement-learning asl deep-reinforcement-learning q-learning pytorch ddpg sac double-dqn c51 dueling-dqn categorical-dqn ppo prioritized-experience-replay noisynet-dqn td3

Updated Jun 11, 2025
Python

seungeunrho / minimalRL

Star

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

machine-learning reinforcement-learning deep-learning simple deep-reinforcement-learning pytorch dqn a3c reinforce ddpg sac acer ppo a2c policy-gradients

Updated Apr 22, 2023
Python

AI4Finance-Foundation / FinRL-Trading

Star

For trading. Please star.

deep-reinforcement-learning openai-gym sharpe-ratio ddpg stock-trading ppo a2c-algorithm ensemble-strategy stock-trading-strategy automated-stock-trading

Updated Nov 23, 2025
Python

nikhilbarhate99 / PPO-PyTorch

Star

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

reinforcement-learning deep-learning deep-reinforcement-learning pytorch policy-gradient reinforcement-learning-algorithms pytorch-tutorial proximal-policy-optimization ppo pytorch-implmention ppo-pytorch

Updated Jul 9, 2024
Python

marlbenchmark / on-policy

Star

This is the official implementation of Multi-Agent PPO (MAPPO).

algorithms multi-agent hanabi smac ppo mpes starcraftii mappo

Updated Jul 18, 2024
Python

kengz / SLM-Lab

Star

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

benchmark reinforcement-learning deep-reinforcement-learning pytorch dqn policy-gradient a3c sac ppo a2c

Updated Nov 27, 2025
Python

Khrylx / PyTorch-RL

Star

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

reinforcement-learning deep-reinforcement-learning pytorch generative-adversarial-network policy-gradient trpo fisher-vectors pytorch-rl proximal-policy-optimization ppo a2c

Updated Feb 9, 2021
Python

vietnh1009 / Super-mario-bros-PPO-pytorch

Star

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros

python mario reinforcement-learning ai deep-learning openai-gym python3 pytorch openai gym super-mario-bros proximal-policy-optimization ppo ppo2

Updated Jul 24, 2021
Python

ericyangyu / PPO-for-Beginners

Star

A simple and well styled PPO implementation. Based on my Medium series: https://medium.com/@eyyu/coding-ppo-from-scratch-with-pytorch-part-1-4-613dfc1b14c8.

machine-learning reinforcement-learning pytorch reinforcement-learning-algorithms ppo

Updated Oct 1, 2024
Python

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ppo

Here are 922 public repositories matching this topic...

datawhalechina / easy-rl

MorvanZhou / Reinforcement-learning-with-tensorflow

thu-ml / tianshou

vwxyzjn / cleanrl

udacity / deep-reinforcement-learning

sweetice / Deep-reinforcement-learning-with-pytorch

andri27-ts / Reinforcement-Learning

AI4Finance-Foundation / ElegantRL

simoninithomas / Deep_reinforcement_learning_Course

ikostrikov / pytorch-a2c-ppo-acktr-gail

ShangtongZhang / DeepRL

XinJingHao / DRL-Pytorch

seungeunrho / minimalRL

AI4Finance-Foundation / FinRL-Trading

nikhilbarhate99 / PPO-PyTorch

marlbenchmark / on-policy

kengz / SLM-Lab

Khrylx / PyTorch-RL

vietnh1009 / Super-mario-bros-PPO-pytorch

ericyangyu / PPO-for-Beginners

Improve this page

Add this topic to your repo