Skip to content

juliuskunze/nevermind

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

nevermind

Implementation of the original Deep Q-network algorithm from Mnih et al. 2015.

Installation

Python 3.6+ and TensorFlow are required.

pip3 install git+git@github.com:JuliusKunze/nevermind.git

will install nevermind together with minimal requirements.

Run

Solve the cartpole gym environments in a few minutes on a CPU:

from nevermind import configurations
    
configurations.train_cartpole()

Train for the lunar lander environment - typically solved in a few hours on a CPU:

configurations.train_lunar_lander()

Training for the Atari 2600 game breakout takes days on a GPU:

configurations.train_atari(game='breakout')

Models and training summary plots (see below) are saved to the directory ./data by default.

Results

Learned value and advantage function for the cartpole task:

Cartpole Value Function Cartpole Advantage Function

Cartpole training summary of 10 runs: Cartpole Summary

Summary of a successful lunar lander training run: Lunar Lander Summary

10M timesteps of training for the breakout Atari environment: Atari Breakout Summary

Releases

No releases published

Packages

No packages published

Languages