
Deep Deterministic Policy Gradient: Continuous Control

An actor-critic Deep Deterministic Policy Gradient (DDPG) agent for solving the continuous-control Reacher problem.
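
The sketch below illustrates the core DDPG update that an agent like this performs: the critic is regressed toward a bootstrapped TD target, the actor is updated to maximize the critic's value estimate, and the target networks are soft-updated. It is a minimal PyTorch sketch, not code from this repository; all names (actor, critic, the *_target networks, gamma, tau) are illustrative assumptions.

```python
# Minimal sketch of one DDPG update step (illustrative, not this repo's code).
import torch
import torch.nn.functional as F

def ddpg_update(actor, actor_target, critic, critic_target,
                actor_opt, critic_opt, batch, gamma=0.99, tau=1e-3):
    states, actions, rewards, next_states, dones = batch

    # Critic: regress Q(s, a) toward the bootstrapped TD target.
    with torch.no_grad():
        next_actions = actor_target(next_states)
        q_targets = rewards + gamma * (1 - dones) * critic_target(next_states, next_actions)
    critic_loss = F.mse_loss(critic(states, actions), q_targets)
    critic_opt.zero_grad()
    critic_loss.backward()
    critic_opt.step()

    # Actor: ascend the critic's estimate of Q(s, actor(s)).
    actor_loss = -critic(states, actor(states)).mean()
    actor_opt.zero_grad()
    actor_loss.backward()
    actor_opt.step()

    # Soft-update the target networks toward the local networks.
    for target, local in ((actor_target, actor), (critic_target, critic)):
        for t_param, l_param in zip(target.parameters(), local.parameters()):
            t_param.data.copy_(tau * l_param.data + (1.0 - tau) * t_param.data)
```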

Project Details

The Reacher environment is a Unity environment consisting of a double-jointed arm that can move to target locations. The goal is to keep the agent's hand in the target area for as long as possible.

  • State space: 33 dimensions corresponding to position, rotation, velocity, and angular velocities of the arm.
  • Action space: 4 dimensions corresponding to torque applicable to two joints (each with value in [-1,1]).
  • Rewards: +0.1 is provided for each step that the agent's hand is in the goal location.

The environment is considered solved when the agents achieve an average reward of +30 (over 100 consecutive episodes and over all agents).
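
For orientation, the following is a minimal interaction sketch for the 20-agent Reacher build, assuming the unityagents package used in the Udacity course; the file_name path is a placeholder for your local environment build, and the random actions are only there to show the observation/action/reward shapes described above.

```python
# Minimal interaction sketch (random policy), assuming the unityagents package.
import numpy as np
from unityagents import UnityEnvironment

env = UnityEnvironment(file_name="Reacher.app")       # placeholder path to your build
brain_name = env.brain_names[0]

env_info = env.reset(train_mode=False)[brain_name]
num_agents = len(env_info.agents)                     # 20 in the multi-agent build
states = env_info.vector_observations                 # shape (num_agents, 33)
scores = np.zeros(num_agents)

while True:
    actions = np.clip(np.random.randn(num_agents, 4), -1, 1)  # torques in [-1, 1]
    env_info = env.step(actions)[brain_name]
    scores += env_info.rewards                         # +0.1 per step in the goal location
    states = env_info.vector_observations
    if np.any(env_info.local_done):
        break

print("Mean score over agents:", scores.mean())
env.close()
```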

The code in this project is based heavily on the Udacity Deep Reinforcement Learning ddpg-bipedal example and tuned using discussion and code from Dmitry G. in the Udacity mentor chat.

Getting Started

See the Udacity Deep Reinforcement Learning repository for general instructions on setting up the environment. Specific instructions for installing and downloading the files required for this project are located in Project 2.

Instructions

Run control.ipynb to train the 20-agent model and visualize the scores over time. The agent logic and the neural network models are in ddpg_agent.py and model.py, respectively. The model weights for the successful agent are saved in checkpoint_actor.pth and checkpoint_critic.pth. Note that an alternative approach for the single-agent model is in the files with the _vanilla suffix.
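
A minimal sketch of loading the saved weights to run the trained agent is shown below. It assumes the Agent class in ddpg_agent.py exposes actor_local and critic_local networks as in the Udacity ddpg-bipedal reference code; the constructor arguments shown here are assumptions, so check ddpg_agent.py for the actual signature.

```python
# Sketch: restore the trained networks from the saved checkpoints (interface assumed).
import torch
from ddpg_agent import Agent

agent = Agent(state_size=33, action_size=4, random_seed=0)  # constructor arguments assumed
agent.actor_local.load_state_dict(torch.load("checkpoint_actor.pth", map_location="cpu"))
agent.critic_local.load_state_dict(torch.load("checkpoint_critic.pth", map_location="cpu"))

# agent.act(state) should then return the deterministic action for a 33-dim state.
```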
