Pytorch implementation of the Persistent Advantage reinforcement learning operator proposed in paper 'Increasing the Action Gap: New Operators for Reinforcement Learning'
reinforcement-learning
atari2600
deep-reinforcement-learning
dqn
persistent-advantage-learning
advantage-learning
al-algorithm
pal-algorithm
-
Updated
Dec 8, 2018 - Python