dqn_cnn_mnist_gym
An OpenAI training gym environment for MNIST handwritten digit classification with a Deep-Q Agent and CNN Policy. Digits 0-9 are displayed on a upsampled 128x128px canvas. A correct discrete value action for a matching observation receives a reward.
This model achieves a 98% accuracy on the MNIST dataset.