h-DQN

A Python reproduction of "Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation" by Kulkarni et al. (2016): https://arxiv.org/abs/1604.06057

Disclaimer

This is a work in progress. I haven't been able to replicate the results yet.

Also, I haven't started on Montezuma's Revenge yet. I intend to do this eventually, but I'm not sure when. Pull requests are welcome and encouraged!

Comments, criticisms, and suggestions are welcome, as always.

Progress

MDP Environment

Create MDP Environment [Done]
Create a non-hierarchical actor-critic agent as a baseline [Done]
Evaluate the non-hierarchical actor-critic by plotting which states it visits [Done]
Create an h-DQN agent [Done]
Evaluate the h-DQN agent by plotting which states it visits [Done]
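
Both evaluation steps above come down to counting how often each state is visited. The following is a minimal sketch of that tally, not the code used in this repository: the function name `average_visits` and the trajectory format (one list of integer state indices per episode) are assumptions made for illustration. The default of states 3 to 6 follows my reading of Figure 4 in the paper.

```python
from collections import Counter


def average_visits(episodes, states=(3, 4, 5, 6)):
    """Return the average number of visits per episode for each state.

    `episodes` is a list of trajectories, each a list of integer state
    indices (hypothetical format, chosen for this sketch).
    """
    totals = Counter()
    for trajectory in episodes:
        # Count only the states we want to report on.
        totals.update(s for s in trajectory if s in states)
    n_episodes = max(len(episodes), 1)  # avoid division by zero
    return {s: totals[s] / n_episodes for s in states}


# Two toy trajectories on the six-state chain.
episodes = [[2, 3, 2, 1], [2, 3, 4, 3, 2, 1]]
print(average_visits(episodes))  # {3: 1.5, 4: 0.5, 5: 0.0, 6: 0.0}
```

The resulting dictionary can be passed to any plotting routine, for example as the heights of a bar chart with one bar per state.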

Montezuma's Revenge

TODO (This might be a while. Pull requests welcome.)

Results

Stochastic MDP Environment
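
For context, here is a minimal sketch of the stochastic MDP as I understand it from the paper: six states in a chain, episodes start in s2, moving left always succeeds, moving right succeeds half of the time and otherwise moves left, s1 is terminal, and the reward at s1 is 1 if s6 was visited first and 0.01 otherwise. The class name `StochasticMDP`, its method signatures, and the clamping at s6 are assumptions for illustration; the environment in ./envs may differ.

```python
import random


class StochasticMDP:
    """Six-state chain s1..s6 (a sketch, not the repository's environment)."""

    LEFT, RIGHT = 0, 1

    def __init__(self, n_states=6, p_right=0.5, seed=None):
        self.n_states = n_states
        self.p_right = p_right  # probability that "right" succeeds
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        self.state = 2             # episodes start in s2
        self.reached_end = False   # has s6 been visited this episode?
        return self.state

    def step(self, action):
        # "right" succeeds with probability p_right; otherwise move left.
        if action == self.RIGHT and self.rng.random() < self.p_right:
            # Assumption: staying in place when already at the last state.
            self.state = min(self.state + 1, self.n_states)
        else:
            self.state = max(self.state - 1, 1)
        if self.state == self.n_states:
            self.reached_end = True
        done = self.state == 1     # s1 is terminal
        reward = 0.0
        if done:
            reward = 1.0 if self.reached_end else 0.01
        return self.state, reward, done


env = StochasticMDP(seed=0)
state = env.reset()
state, reward, done = env.step(StochasticMDP.LEFT)
print(state, reward, done)  # 1 0.01 True
```

The small reward for reaching s1 directly is what makes the task hard for a flat agent: it has to take the risky walk to s6 before it ever observes the larger reward.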

h-DQN

The h-DQN agent is located in ./agent/hDQN.py. Below is our replication of Figure 4 from the paper:

Requirements

numpy
tensorflow
keras
h5py
matplotlib
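
A quick way to check whether these packages are importable in the current environment is sketched below. The helper name `missing_packages` is hypothetical and not part of this repository; it uses only the standard library.

```python
from importlib.util import find_spec

# Package names as listed in the requirements above.
REQUIRED = ["numpy", "tensorflow", "keras", "h5py", "matplotlib"]


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found on the import path."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    print("missing:", ", ".join(missing) if missing else "none")
```

This only checks that each package can be located; it does not verify versions.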

