Code for the paper "Exploring Unknown States with Action Balance"

If you find this code useful, please reference in your paper:

@article{DBLP:journals/corr/abs-2003-04518,
  author    = {Yan Song and
               Yingfeng Chen and
               Yujing Hu and
               Changjie Fan},
  title     = {Exploring Unknown States with Action Balance},
  journal   = {CoRR},
  year      = {2020}
}

Usage

Finding unknown states (Grid world):

Run following command for one group experiment.

cd grid-experiments && mkdir logs
./run_no_ends.sh no_ends_test run1 100 100 128 1

Reaching goals (Grid world):

cd grid-experiments && mkdir logs
./run_reach_goal.sh reach_goals_test 128 1

Atari:

This implementation is mainly based on random-network-distillation. The following command should train an action balance RND with action channel on Montezuma's Revenge.

--abc: 0 or 1, whether use action balance exploration. 0 means only RND.
--array_action: 0 or 1, whether use action channel.

python3 -u run_atari.py --env=MontezumaRevengeNoFrameskip-v4 --num_env=32 --gamma_ext 0.999 --abc=1 --seed=0 --array_action=1 --logdir /tmp/action_balance_tmp_run

If you have any question, please contact yansong1024@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
grid-experiments		grid-experiments
policies		policies
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
atari_wrappers.py		atari_wrappers.py
cmd_util.py		cmd_util.py
console_util.py		console_util.py
load_log.py		load_log.py
monitor.py		monitor.py
mpi_util.py		mpi_util.py
ppo_agent.py		ppo_agent.py
recorder.py		recorder.py
replayer.py		replayer.py
run_atari.py		run_atari.py
stochastic_policy.py		stochastic_policy.py
tf_util.py		tf_util.py
utils.py		utils.py
vec_env.py		vec_env.py

License

NeteaseFuxiRL/action-balance-exploration

Folders and files

Latest commit

History

Repository files navigation

Code for the paper "Exploring Unknown States with Action Balance"

Usage

About

Resources

License

Stars

Watchers

Forks

Languages