Recurrent Stochastic Value Gradient (RSVG) implementation in TensorFlow.
https://arxiv.org/pdf/1512.04455.pdf
- Python3
- tensorflow
- gym[atari]
- opencv-python
- git+https://github.com/imai-laboratory/lightsaber
$ python train.py [--render] --final-steps 10000000
$ python train.py [--render] --load {path of models} --demo
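The flags in the commands above could be parsed along the following lines. This is only a minimal sketch with `argparse`, not the actual contents of `train.py`: the flag names come from the commands, while the types, defaults, help strings, the `build_parser` helper, and the `saved_models/example` path are assumptions made for illustration.

```python
import argparse


def build_parser():
    # Hypothetical parser mirroring the flags shown in the commands above.
    # Types, defaults, and help texts are assumptions, not taken from train.py.
    parser = argparse.ArgumentParser(description="RSVG training sketch")
    parser.add_argument("--render", action="store_true",
                        help="render the environment while running")
    parser.add_argument("--final-steps", type=int, default=10 ** 7,
                        help="total number of environment steps (assumed default)")
    parser.add_argument("--load", type=str, default=None,
                        help="path of a saved model to restore")
    parser.add_argument("--demo", action="store_true",
                        help="run the loaded model without training")
    return parser


# Training run, as in the first command.
train_args = build_parser().parse_args(["--final-steps", "10000000"])

# Demo run, as in the second command; the path is a made-up placeholder.
demo_args = build_parser().parse_args(["--load", "saved_models/example", "--demo"])
```

With this layout, `--demo` and `--render` are plain switches that default to off, so a training run only needs `--final-steps`.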
This implementation is inspired by the following projects.