This repo contains implementations of algorithms such as Q-learning, SARSA, TD methods, and policy gradient.
q-learning
pytorch
dqn
epsilon-greedy
breakout
sarsa
policy-iteration
value-iteration
monte-carlo-methods
deep-q-learning
model-based-rl
model-free-rl
td-methods
model-free-control
Updated Dec 8, 2019 - Python