A3C

This project is my attempt at implementing A3C (asynchronous advantage actor-critic), the algorithm from the paper "Asynchronous Methods for Deep Reinforcement Learning".

Currently, the PyTorch version is functional, and a TensorFlow version is in progress.

Environment

The model is trained on SpaceInvaders-v0 from OpenAI's gym library. At each step, the environment provides a 210x160 RGB screenshot as the observation, an integer reward matching the score shown on screen, and a boolean indicating whether the game is done. Here is an example of a full game cycle.
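As a rough illustration of that interaction loop (not taken from this repository), the sketch below plays one full game against a hypothetical stand-in environment. `FakeSpaceInvaders`, `run_episode`, and `random_policy` are made-up names, the rewards are invented, and the four-value `step` return is an assumption that matches the classic gym interface.

```python
import random


class FakeSpaceInvaders:
    """Hypothetical stand-in for a gym-style environment (not the real emulator).

    Mimics the classic gym interface: reset() returns an observation and
    step(action) returns (observation, reward, done, info).
    """

    HEIGHT, WIDTH, CHANNELS = 210, 160, 3
    N_ACTIONS = 6  # size of the discrete action set assumed for Space Invaders

    def __init__(self, episode_length=20, seed=0):
        self.episode_length = episode_length
        self.rng = random.Random(seed)
        self.t = 0

    def _frame(self):
        # A blank 210x160 RGB frame as nested lists of [R, G, B] pixels.
        return [[[0, 0, 0] for _ in range(self.WIDTH)] for _ in range(self.HEIGHT)]

    def reset(self):
        self.t = 0
        return self._frame()

    def step(self, action):
        assert 0 <= action < self.N_ACTIONS
        self.t += 1
        # Invented integer score increments, standing in for the on-screen score.
        reward = self.rng.choice([0, 0, 0, 5, 10])
        done = self.t >= self.episode_length
        return self._frame(), reward, done, {}


def run_episode(env, policy):
    """Play one full game and return (total_reward, number_of_steps)."""
    obs = env.reset()
    total_reward, steps, done = 0, 0, False
    while not done:
        action = policy(obs)
        obs, reward, done, _info = env.step(action)
        total_reward += reward
        steps += 1
    return total_reward, steps


def random_policy(obs, n_actions=FakeSpaceInvaders.N_ACTIONS):
    # Ignores the observation and picks a uniformly random action.
    return random.randrange(n_actions)


if __name__ == "__main__":
    env = FakeSpaceInvaders(episode_length=20)
    score, steps = run_episode(env, random_policy)
    print(f"episode finished after {steps} steps with score {score}")
```

With the real library, `run_episode` should work on an environment created by `gym.make("SpaceInvaders-v0")` in older gym releases, where `step` returns four values. Newer gym and gymnasium releases return five values from `step` and a tuple from `reset`, so the loop would need adjusting there.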

Asynchronous Design

Workers run as subprocesses created with Python's multiprocessing module (which forks and executes them) rather than as threads. The work is CPU bound, so contention for Python's Global Interpreter Lock (GIL) would make multithreading almost as inefficient as running on a single thread.
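A minimal sketch of this process-per-worker layout, using only the standard library, might look like the following. It is an illustration rather than this project's training code: `worker` and `run_workers` are hypothetical names, the inner sum is a placeholder for the real CPU-bound rollout, and a shared integer counter stands in for whatever state the workers share.

```python
import multiprocessing as mp


def worker(shared_steps, n_steps):
    """Simulate one worker: do CPU-bound work, then update shared state."""
    for _ in range(n_steps):
        # Placeholder for a CPU-bound rollout and gradient computation.
        _ = sum(i * i for i in range(1000))
        # The Value's built-in lock keeps the read-modify-write atomic.
        with shared_steps.get_lock():
            shared_steps.value += 1


def run_workers(n_workers, n_steps):
    """Start n_workers processes and return the total number of steps taken."""
    shared_steps = mp.Value("i", 0)  # integer stored in shared memory
    processes = [
        mp.Process(target=worker, args=(shared_steps, n_steps))
        for _ in range(n_workers)
    ]
    for p in processes:
        p.start()
    for p in processes:
        p.join()
    return shared_steps.value


if __name__ == "__main__":
    total = run_workers(n_workers=4, n_steps=100)
    print(total)  # 400: every increment from every process is counted
```

Each process has its own interpreter and its own GIL, so the placeholder work can run on separate cores in parallel. The `if __name__ == "__main__":` guard matters on platforms where child processes re-import the main module instead of forking.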

Training

The original paper's model parameters, preprocessing sequence, and training parameters were replicated as closely as they could be discerned. Training is currently in progress; the best result so far is:
