Distributed RL platform built on a modified IMPALA architecture. Implements the CLEAR and LASER V-trace modifications, along with the Attentive and Elite sampling experience replay methods.
machine-learning
impala
deep-reinforcement-learning
pytorch
policy-gradient
arcade-learning-environment
experience-replay
distributed-reinforcement-learning
actor-critic-with-experience-replay
elite-sampling
mixing-on-and-off-policy-data
Updated Apr 8, 2022 - Python