Implementation of Recurrent Deterministic Policy Gradient (https://arxiv.org/abs/1512.04455) in TensorFlow.
git clone https://github.com/stevenpjg/RDPG.git
cd RDPG
python main.py
- TensorFlow
- OpenAI Gym
- MuJoCo
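
Before running main.py, it can help to confirm that the dependencies listed above are importable. The following is a minimal sketch, not part of this repository. The import names `tensorflow`, `gym`, and `mujoco_py` are assumptions (this README only gives the package names), so adjust them to match your installation.

```python
import importlib.util

# Assumed import names for the listed dependencies; they are not
# confirmed by this repository and may differ on your setup.
ASSUMED_MODULES = {
    "TensorFlow": "tensorflow",
    "OpenAI Gym": "gym",
    "MuJoCo": "mujoco_py",
}


def find_missing(modules):
    """Return the display names whose top-level module cannot be located."""
    missing = []
    for display_name, module_name in modules.items():
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module_name) is None:
            missing.append(display_name)
    return missing


if __name__ == "__main__":
    missing = find_missing(ASSUMED_MODULES)
    if missing:
        print("Missing dependencies: " + ", ".join(missing))
    else:
        print("All listed dependencies were found.")
```

The check only locates each module without importing it, so it says nothing about version compatibility or whether a MuJoCo license is configured.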
Let me know if you run into any issues or need clarification.