Skip to content

ysaibhargav/mlsh-gpu

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

54 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Meta-Learning Shared Hierarchies

GPU implementation of Meta-Learning Shared Hierarchies, based on OpenAI's MPI based implementation

Also supports recurrent policies

Installation
Add the following to your .bash_profile (replace ... with path to directory) and source it:
export PYTHONPATH=$PYTHONPATH:/.../mlsh-gpu/rl-algs;
export PYTHONPATH=$PYTHONPATH:/.../mlsh-gpu/test_envs;
How to run
python3 main.py --task [task_name] --master_network [master_network] --subpol_network [subpol_network] --nlstm [nlstm] --num_subs [num_subs] --macro_duration [macro_duration] --num_rollouts [num_rollouts] --warmup_time [warmup_time] --train_time [train_time] --num_master_grp [num_master_grp] --num_sub_batches [num_sub_batches] --num_sub_in_grp [num_sub_in_grp] --vfcoeff [vfcoeff] --entcoeff [entcoeff] --master_lr [master_lr] --sub_lr [sub_lr] --replay [replay] [savename] 
Note

DEVELOPMENT IN PROGRESS - see code for pending TODOs