GPU implementation of Meta-Learning Shared Hierarchies, based on OpenAI's MPI based implementation
Also supports recurrent policies
Add the following to your .bash_profile (replace ... with path to directory) and source it:
export PYTHONPATH=$PYTHONPATH:/.../mlsh-gpu/rl-algs;
export PYTHONPATH=$PYTHONPATH:/.../mlsh-gpu/test_envs;
python3 main.py --task [task_name] --master_network [master_network] --subpol_network [subpol_network] --nlstm [nlstm] --num_subs [num_subs] --macro_duration [macro_duration] --num_rollouts [num_rollouts] --warmup_time [warmup_time] --train_time [train_time] --num_master_grp [num_master_grp] --num_sub_batches [num_sub_batches] --num_sub_in_grp [num_sub_in_grp] --vfcoeff [vfcoeff] --entcoeff [entcoeff] --master_lr [master_lr] --sub_lr [sub_lr] --replay [replay] [savename]
DEVELOPMENT IN PROGRESS - see code for pending TODOs