GitHub - rd-tobias-sunderdiek/hyperparameter-tuning-demo: Demo for hyperparameter tuning with Tune using a DQN example from Udacity for OpenAI Gym

Hyperparametertuning Demo

This is a demo for hyperparameter tuning with Tune[1], using a DQN example from Udacity[2] for the OpenAI Gym environment[3] LunarLander[4].

This demo is designed to be trained locally on a CPU (training took ~40 min on a 2.5 GHz quad-core i7).

Goal

Land on the moon. The agent earns reward for landing properly and loses reward for using fuel or for landing outside the landing pad. In this example, we use the metric mean_reward to measure this.
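As a rough illustration of such a metric, the sketch below averages per-episode returns over a sliding window. The helper name `MeanRewardTracker` and the window size are assumptions made for this example; the way the demo actually computes mean_reward may differ.

```python
from collections import deque


class MeanRewardTracker:
    """Hypothetical helper: mean return over the most recent episodes."""

    def __init__(self, window=100):
        # Assumed window size; the oldest return is dropped once the window is full.
        self.returns = deque(maxlen=window)

    def add_episode(self, episode_return):
        # Record the total reward collected in one finished episode.
        self.returns.append(episode_return)

    @property
    def mean_reward(self):
        # Mean of the stored returns; 0.0 before any episode has finished.
        if not self.returns:
            return 0.0
        return sum(self.returns) / len(self.returns)


tracker = MeanRewardTracker(window=3)
for episode_return in [-200.0, -50.0, 100.0, 250.0]:
    tracker.add_episode(episode_return)

# Only the last three returns remain in the window: (-50 + 100 + 250) / 3
print(tracker.mean_reward)
```

A windowed mean smooths out the large episode-to-episode swings that are typical for reinforcement learning, which makes it easier to compare trials with different hyperparameters.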

Install

make install (tested with Python 3.8)

Usage

Watch a random, untrained agent via make random.
[optional] Configure the hyperparameters in train.py.
Start training via make train.
View the results in TensorBoard via make tensorboard.
After training has finished, make gif creates a .gif of the best model.
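The optional configuration step comes down to defining a search space and letting a tuner evaluate sampled configurations. The sketch below shows that idea as a plain random search in standard-library Python. It is a stand-in, not Tune's API and not the contents of train.py. The hyperparameter names (`lr`, `gamma`, `batch_size`), their ranges, and the toy objective `fake_mean_reward` are assumptions for illustration only.

```python
import math
import random

# Hypothetical search space: each name maps to a function that samples one value.
SEARCH_SPACE = {
    "lr": lambda rng: 10 ** rng.uniform(-5, -2),         # log-uniform learning rate
    "gamma": lambda rng: rng.uniform(0.9, 0.999),        # discount factor
    "batch_size": lambda rng: rng.choice([32, 64, 128]),
}


def sample_config(rng):
    """Draw one configuration from the search space."""
    return {name: sample(rng) for name, sample in SEARCH_SPACE.items()}


def fake_mean_reward(config):
    """Toy objective standing in for a real training run.

    It peaks at lr=1e-3 and gamma=0.99; batch_size is ignored.
    """
    lr_penalty = (math.log10(config["lr"]) + 3) ** 2
    gamma_penalty = ((config["gamma"] - 0.99) * 100) ** 2
    return 200.0 - 50.0 * lr_penalty - gamma_penalty


def random_search(num_samples, seed=0):
    """Evaluate num_samples random configurations and return the best one."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same result
    trials = []
    for _ in range(num_samples):
        config = sample_config(rng)
        trials.append((fake_mean_reward(config), config))
    # The metric is maximized, so the trial with the highest score wins.
    return max(trials, key=lambda trial: trial[0])


best_score, best_config = random_search(num_samples=20)
print(best_score, best_config)
```

In the real demo each trial is a full DQN training run, so the number of samples and the width of the search space directly determine the total training time.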

[1] https://docs.ray.io/en/latest/tune.html

[2] https://github.com/udacity/deep-reinforcement-learning/tree/master/dqn/solution

[3] https://gym.openai.com/

[4] https://gym.openai.com/envs/LunarLander-v2/
