Normalized Advantage Functions (NAF) in TensorFlow

<<<<<<< HEAD

Normalized Advantage Functions (NAF) in TensorFlow

TensorFlow implementation of Continuous Deep q-Learning with Model-based Acceleration.

Requirements

Python 2.7
gym
TensorFlow 0.9+

Usage

First, install prerequisites with:

$ pip install tqdm gym[all]

To train a model for an environment with a continuous action space:

$ python main.py --env=Pendulum-v0 --is_train=True
$ python main.py --env=Pendulum-v0 --is_train=True --display=True

To test and record the screens with gym:

$ python main.py --env=Pendulum-v0 --is_train=False
$ python main.py --env=Pendulum-v0 --is_train=False --display=True

Results

Training details of Pendulum-v0 with different hyperparameters.

$ python main.py --env=Pendulum-v0 # dark green
$ python main.py --env=Pendulum-v0 --action_fn=tanh # light green
$ python main.py --env=Pendulum-v0 --use_batch_norm=True # yellow
$ python main.py --env=Pendulum-v0 --use_seperate_networks=True # green

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
__pycache__		__pycache__
assets		assets
checkpoints		checkpoints
gym		gym
logs		logs
screenshots		screenshots
src		src
.screenshot2018-0516_14-40-15-891568.png		.screenshot2018-0516_14-40-15-891568.png
.screenshot2018-0517_20-46-58-009173.png		.screenshot2018-0517_20-46-58-009173.png
InstallationNote		InstallationNote
LICENSE		LICENSE
MUJOCO_LOG.TXT		MUJOCO_LOG.TXT
README.md		README.md
data.txt		data.txt
in_memory_to_disk.png		in_memory_to_disk.png
main.py		main.py
plots.py		plots.py
run_mujoco.sh		run_mujoco.sh
take_screenshot.py		take_screenshot.py
utils.py		utils.py

License

adarshsehgal/DeepLearning

Folders and files

Latest commit

History

Repository files navigation

Normalized Advantage Functions (NAF) in TensorFlow

Requirements

Usage

Results

References

Author

Taehoon Kim / @carpedm20

DeepLearning

About

Resources

License

Stars

Watchers

Forks

Languages