Self Paced Deep Reinforcement Learning

Installation

It is easiest to setup a virtual or conda environment in order to isolate the packages installed for this project from your global python installation. We used Python 3.6.10 on Ubuntu 18.04 LTS for the experiments. You can easily install the required dependencies by executing

pip install -r requirements.txt

This will install all packages required to run the point mass experiments. If you furthermore want to run the ball catching experiment, you need to also execute

pip install -r requirements_ext.txt

This will install a wrapper for the MuJoCo simulation library. For this to work, you need to have set up MuJoCo according to this guide.

There exist a convenience script for running the experiments: run_experiments.sh. The script takes one argument that specifies the seed with which the experiments will be run. So in order to run all experiments with seed 1, you need to execute

./run_experiments.sh 1

After running the experiments for the desired number of seeds, the results can be visualized using the following command

python visualize_results.py --env point_mass point_mass_2d --learner ppo ppo
python visualize_results.py --env point_mass point_mass_2d --learner trpo trpo
python visualize_results.py --env point_mass point_mass_2d --learner sac sac
python visualize_results.py --env ball_catching --learner ppo
python visualize_results.py --env ball_catching --learner trpo
python visualize_results.py --env ball_catching --learner sac

To visualize the context distributions for a set of seeds you can also execute the following commands.

python visualize_results.py --env point_mass point_mass_2d --learner ppo ppo --dist_vis point_mass_2d
python visualize_results.py --env point_mass point_mass_2d --learner trpo trpo --dist_vis point_mass_2d
python visualize_results.py --env point_mass point_mass_2d --learner sac sac --dist_vis point_mass_2d
python visualize_results.py --env ball_catching --learner ppo --dist_vis ball_catching
python visualize_results.py --env ball_catching --learner trpo --dist_vis ball_catching
python visualize_results.py --env ball_catching --learner sac --dist_vis ball_catching

Keep in mind that this requires a certain amount of seeds to be run (otherwise the script will return an error). You can also change the seeds that are visualized in the visualize_results.py script.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
deep_sprl		deep_sprl
misc		misc
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
alp_gmm_hp_search.sh		alp_gmm_hp_search.sh
goal_gan_hp_search.sh		goal_gan_hp_search.sh
requirements.txt		requirements.txt
requirements_ext.txt		requirements_ext.txt
run_ball_catching_experiments.sh		run_ball_catching_experiments.sh
run_experiment.py		run_experiment.py
run_experiments.sh		run_experiments.sh
run_point_mass_2d_experiments.sh		run_point_mass_2d_experiments.sh
run_point_mass_experiments.sh		run_point_mass_experiments.sh
select_best_hps.py		select_best_hps.py
visualize_results.py		visualize_results.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self Paced Deep Reinforcement Learning

Installation

About

Releases 1

Packages

Languages

License

psclklnk/spdl

Folders and files

Latest commit

History

Repository files navigation

Self Paced Deep Reinforcement Learning

Installation

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages