Time Skip Reinforcement Learning

Prior work:

This paper does something very similar, however their model adds the dynamic duration by adding a second version of each action with a different duration. I would add a second decision (either within the same model or with a second, parallel model) which selects the duration over which to perform the chosen action.

ftp://ftp.cs.utexas.edu/pub/neural-nets/papers/braylan.aaai15.pdf

Explores use of very large (but static) frame-skip values and discovers that on some games they deliver very good results.

https://danieltakeshi.github.io/2016/11/25/frame-skipping-and-preprocessing-for-deep-q-networks-on-atari-2600-games/ Explanation of the motivation and mechanism behind skipping frames

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
README.md		README.md
actor_critic_network.py		actor_critic_network.py
basic_q_agent.py		basic_q_agent.py
deep_q_agent.py		deep_q_agent.py
option_critic_network.py		option_critic_network.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

README.md

README.md

actor_critic_network.py

actor_critic_network.py

basic_q_agent.py

basic_q_agent.py

deep_q_agent.py

deep_q_agent.py

option_critic_network.py

option_critic_network.py

Repository files navigation

Time Skip Reinforcement Learning

About

Releases

Packages

Languages

Nyrt/time_skip_RL

Folders and files

Latest commit

History

Repository files navigation

Time Skip Reinforcement Learning

About

Resources

Stars

Watchers

Forks

Languages