Implementation of some of the policy gradient methods in PyTorch.
pytorch
policy-gradient
reinforce
actor-critic
ppo
online-supervised-learning
gradient-bandit
batch-reinforce
-
Updated
Jul 27, 2022 - Python