robmsylvester / Super-Mario-Bros-DQN Public

Notifications You must be signed in to change notification settings
Fork 2
Star 10

A Deep Q Network used for running experiments on reinforcement learning agents targeted at learning Super Mario Bros (NES)

10 stars 2 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
MarioDQNAgent.py		MarioDQNAgent.py
PrioritizedSumTree.py		PrioritizedSumTree.py
README.md		README.md
SarstReplayMemory.py		SarstReplayMemory.py
ops.py		ops.py
references.txt		references.txt
run_experiment.ipynb		run_experiment.ipynb
super_mario_bros.py		super_mario_bros.py

Repository files navigation

Super Mario Brothers Deep Q Network

Super Mario Brothers Deep Q Network is a Reinforcement Learning module that aims to make it easier to run experiments with the goal of beating levels in Super Mario Brothers (1984)

This project utilizes the wonderful work provided by ppaquette (https://github.com/ppaquette/gym-super-mario) which provides the lua files and an NES environment object which can work with openAI gym to allow you to interact with FCEUltra's emulator, sending controller commands and reading memory values from the game, which include things like screen pixels, score, level, etc. Follow his instructions for installing the package.

Note, however, that for this project you will want to heavily modify the file super_mario_bros.py, because it is here that you can define custom reward functions for things like Mario dying, eating mushrooms, etc. Reward design is up to you. I have provided a few examples of how to use the info object in the custom super_mario_bros.py file in the function _process_data_message().

The learner itself uses a Deep Q Network with a target network, and prioritized SARST replay memory as per the groundbreaking paper by DeepMind. The prioritized SARST memory is efficiently implemented using a SumTree to provide logarithmic probability access to samples that have larger rewards associated with them

The Deep Q Network uses a convolutional network to read screen pixels, converting the game from RGB to a single channel to save on computational resources.

Usage:

Install necessary packages below
Install ppaquette's Super Mario Bros package linked above to hook it into OpenAI Gym
Define custom super_mario_bros.py rewards and whatever else you feel you want access to at runtime and copy this file into the gym environment for Super Mario Bros. On my OS, this lives in /usr/local/lib/python2.7/dist-packages/gym/envs/ppaquette_gym_super_mario/
Open run_experiment.ipynb and execute the blocks. Note that the default is to run on GPU, so you might want to change that line.

Requirements:

Tensorflow >= 1.0
Numpy >= 1.12
OpenAI Gym >= 0.8
Matplotlib (for visualizations, not necessarily crucial)

About

A Deep Q Network used for running experiments on reinforcement learning agents targeted at learning Super Mario Bros (NES)

Report repository

Releases

No releases published

Packages

No packages published

Languages