Continuous Control With Ensemble Deep Deterministic Policy Gradients

This repository is the official implementation of Continuous Control With Ensemble Deep Deterministic Policy Gradients.

Requirements

Before installation, please make sure you have MuJoCo engine set up on your machine. We use mujoco150 in order to be comparable with previous benchmarks on v2 environments. See this issue

To install requirements:

pip install -r requirements.txt

Training

To train the model(s) in the paper, run this command:

python run.py <experiment_specification path>

Logger automatically stops training and evaluates current policy every log_every environment interactions. The data is printed to standard output and stored on drive.

We include specifications for our most important experiments.

Path	Description
specs/ed2_on_mujoco.py	Benchmark of our method
specs/sac_on_mujoco.py	Benchmark of our implementation of SAC
specs/sunrise_on_mujoco.py	Benchmark of our implementation of SUNRISE
specc/sop_on_mujoco.py	Benchmark of our implementation of SOP

Results

Our model achieves the following performance on the MuJoCo suite:

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
envs		envs
images		images
specs		specs
spinup_bis		spinup_bis
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Continuous Control With Ensemble Deep Deterministic Policy Gradients

Requirements

Training

Results

About

Releases

Packages

Languages

License

ed2-paper/ED2

Folders and files

Latest commit

History

Repository files navigation

Continuous Control With Ensemble Deep Deterministic Policy Gradients

Requirements

Training

Results

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages