GitHub - danielnbarbosa/soccer_twos: MADDPG agent with collaboration and competition

Introduction

This project uses Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to train four agents to play Soccer.

Environment Description

In this environment we are training four agents to both collaborate and compete in a game of 2v2 soccer. The Striker's goal is to get the ball into the opponent's goal. The Goalie's goal is to prevent the ball from entering its own goal.

Observation Space

The observation space for each agent consists of 112 variables corresponding to 14 local ray casts, each detecting 7 possible object types, along with the object's distance. Each agent perceives a 180-degree field of view in front of it. Observations over the last three time steps are stacked together for a total of 336 dimensions per agent. Putting the four agents together results in a final observation vector of 1344 dimensions.
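The dimension counts above follow from simple arithmetic. A minimal sketch, assuming each ray contributes one value per object type plus one distance value (which is what makes the totals add up); the constant names are illustrative and not taken from the repo:

```python
# Illustrative constants: the names are assumptions, the values come from the text above.
NUM_RAYS = 14                       # local ray casts per agent
OBJECT_TYPES = 7                    # detectable object types per ray
VALUES_PER_RAY = OBJECT_TYPES + 1   # assumed: one value per object type plus one distance
STACKED_STEPS = 3                   # observations from the last three time steps
NUM_AGENTS = 4                      # two strikers and two goalies

per_step = NUM_RAYS * VALUES_PER_RAY     # 112 variables per agent per time step
per_agent = per_step * STACKED_STEPS     # 336 dimensions per agent
joint = per_agent * NUM_AGENTS           # 1344 dimensions for all four agents

print(per_step, per_agent, joint)
```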

Action Space

Striker: 6 actions corresponding to forward, backward, sideways movement, as well as rotation.
Goalie: 4 actions corresponding to forward, backward, sideways movement.

Reward Structure

Striker:
- +1 When ball enters opponent's goal.
- -0.1 When ball enters own team's goal.
- -0.001 Existential penalty.
Goalie:
- -1 When ball enters own team's goal.
- +0.1 When ball enters opponent's goal.
- +0.001 Existential bonus.
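The reward values listed above can be written down as a small lookup. This is only a sketch: the function and dictionary names are hypothetical, and it assumes the per-step term is also applied on the step in which a goal is scored, which the description does not state.

```python
# Reward values from the list above; all names here are hypothetical.
STEP_REWARD = {"striker": -0.001, "goalie": 0.001}     # existential penalty / bonus
SCORED_REWARD = {"striker": 1.0, "goalie": 0.1}        # ball enters opponent's goal
CONCEDED_REWARD = {"striker": -0.1, "goalie": -1.0}    # ball enters own team's goal


def step_reward(role, scored=False, conceded=False):
    """Return the reward for one time step for an agent with the given role."""
    reward = STEP_REWARD[role]  # assumed to apply on every step
    if scored:
        reward += SCORED_REWARD[role]
    if conceded:
        reward += CONCEDED_REWARD[role]
    return reward
```

The asymmetry is visible in the numbers: a striker is rewarded mainly for scoring and loses a little each step, so it is pushed to score quickly, while a goalie is penalized mainly for conceding and gains a little each step the game continues.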

Solve Criteria

None specified.

Installation

Step 1: Clone the repo

Clone this repo using git clone https://github.com/danielnbarbosa/soccer_twos.git. Pre-compiled Unity environments for MacOS and Linux are included.

Step 2: Install Dependencies

Create an anaconda environment that contains all the required dependencies to run the project.

Mac:

conda create --name soccer_twos python=3.6
source activate soccer_twos
conda install -y pytorch -c pytorch
pip install torchsummary tensorboardX unityagents

Linux:

See separate instructions.

Step 3: Download Unity environment

Install the pre-compiled Unity environment. Select the appropriate file for your operating system:

Linux: click [here](https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Soccer/Soccer_Linux.zip)
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here

Download the file into the unity_envs directory of this repo and unzip it.
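The download-and-unzip step can also be scripted. The sketch below uses only the Python standard library; the helper names are hypothetical, and the only URL included is the Linux one given above.

```python
import urllib.request
import zipfile
from pathlib import Path

# URL for the Linux build, copied from the list above.
LINUX_URL = "https://s3-us-west-1.amazonaws.com/udacity-drlnd/P3/Soccer/Soccer_Linux.zip"


def unzip_into(archive, dest_dir="unity_envs"):
    """Extract a zip archive into dest_dir and return the archived file names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive) as zf:
        zf.extractall(dest)
        return zf.namelist()


def fetch_environment(url=LINUX_URL, dest_dir="unity_envs"):
    """Download the archive into dest_dir, then unzip it in place."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    archive = dest / url.rsplit("/", 1)[-1]   # keep the file name from the URL
    urllib.request.urlretrieve(url, str(archive))
    return unzip_into(archive, dest)
```

Calling `fetch_environment()` from the repo root would place the files under `unity_envs`. Python's `zipfile` does not restore Unix permission bits, so the extracted binary may need `chmod +x` before it can be launched.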

Train your agent

To train the agent run ./main.py. This will start the agent training inside the Unity environment. Statistics will be output to the command line as well as logged to the 'runs' directory for visualizing via tensorboard. To start tensorboard run tensorboard --logdir runs.

To load a saved model in evaluation mode run ./main.py --eval --load=<path to files>. This will load saved weights from checkpoint files. Evaluation mode disables training and exploration noise, which gives better performance.
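How main.py actually handles these flags is not shown here. As an illustration only, flags of this shape could be parsed with the standard `argparse` module; the function name, help texts, and the example checkpoint path are assumptions.

```python
import argparse


def parse_args(argv=None):
    """Parse --eval and --load flags (illustrative, not the repo's own parser)."""
    parser = argparse.ArgumentParser(description="Train or evaluate the soccer agents.")
    parser.add_argument("--eval", action="store_true",
                        help="evaluation mode: no training, no exploration noise")
    parser.add_argument("--load", default=None, metavar="PATH",
                        help="path to saved checkpoint files")
    return parser.parse_args(argv)


# Hypothetical checkpoint path, used only to demonstrate the parsing.
args = parse_args(["--eval", "--load=checkpoints/example"])
print(args.eval, args.load)  # True checkpoints/example
```

With no flags, `args.eval` is `False` and `args.load` is `None`, which corresponds to the plain training run described above.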
