Skip to content
master
Switch branches/tags
Code
This branch is even with NeteaseFuxiRL:master.
Contribute
Fetch upstream

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

fuxi_logo

Code for the paper "Exploring Unknown States with Action Balance"

Usage

Finding unknown states (Grid world):

Run following command for one group experiment.

cd grid-experiments && mkdir logs
./run_no_ends.sh no_ends_test run1 100 100 128 1

Reaching goals (Grid world):

cd grid-experiments && mkdir logs
./run_reach_goal.sh reach_goals_test 128 1

Atari:

This implementation is mainly based on random-network-distillation. The following command should train an action balance RND with action channel on Montezuma's Revenge.

  • --abc: 0 or 1, whether use action balance exploration. 0 means only RND.
  • --array_action: 0 or 1, whether use action channel.
python3 -u run_atari.py --env=MontezumaRevengeNoFrameskip-v4 --num_env=32 --gamma_ext 0.999 --abc=1 --seed=0 --array_action=1 --logdir /tmp/action_balance_tmp_run

About

No description, website, or topics provided.

Resources

License

Releases

No releases published

Packages

No packages published