OpenAI Gym CartPole solved by a neural network (DQN) in TensorFlow 2

CartPole-v0 defines "solving" as getting an average reward of at least 195.0 over 100 consecutive trials. Run python play.py to check.
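As a rough illustration of the criterion above, the check can be sketched as a small helper over a list of per-episode rewards. The function name `is_solved` and its parameters are hypothetical and are not taken from play.py, which may implement the check differently.

```python
# Hypothetical helper, not part of this repository: checks the CartPole-v0
# "solved" criterion against a list of total rewards, one entry per episode.

SOLVED_THRESHOLD = 195.0  # average reward required by CartPole-v0
WINDOW = 100              # number of consecutive trials to average over


def is_solved(episode_rewards, threshold=SOLVED_THRESHOLD, window=WINDOW):
    """Return True if the mean of the last `window` rewards reaches `threshold`."""
    if len(episode_rewards) < window:
        # Not enough episodes yet to evaluate the criterion.
        return False
    recent = episode_rewards[-window:]
    return sum(recent) / window >= threshold
```

For example, `is_solved([200.0] * 100)` evaluates to True, while the same call with only 99 episodes evaluates to False because the window is not yet full.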

Requirements

$ pipenv install
$ pipenv shell

$ pipenv run python train.py
$ pipenv run python play.py

$ tensorboard --logdir=logs
