Adaptive Temporal Difference Learning (AdaTD)

Implemented AdaTD and compared it with other optimization methods in temporal difference learning. Used in the paper Adaptive Temporal Difference Learning with Linear Function Approximation.

Getting started:

Dependencies: Python (3.7.3), OpenAI gym (0.10.5), numpy (1.14.5), matplotlib (3.0.3), tensorflow (2.0.0), multiagent
To install dpendency 'multiagent': Download 'multiagent-particle-envs-master' provided in folder 'AdaTD-master'. Then cd into its root directory and install it using 'pip install -e multiagent-particle-envs-master'

optimizers_linear.py: Code for running AdaTD, ALRR-TD, and vanilla-TD simultaneously under linear value function approximation.
optimzers_nonlinear.py: Code for running AdaTD and vanilla-TD simultaneously under nonlinear value function approximation.

The dependency 'multiagent' used in this repo is a modified version of Multi-Agent Particle Environment

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Acrobot		Acrobot
CartPole		CartPole
MountainCar		MountainCar
Navigation		Navigation
README.md		README.md