# Table of Contents

1. [Loading the MNIST dataset](#Loading-the-MNIST-dataset)
2. [Introduction to convnets](#Introduction-to-convnets)
    1. [Instantiating a small convnet](#Instantiating-a-small-convnet)
    2. [Adding a classifier on top of the convnet](#Adding-a-classifier-on-top-of-the-convnet)

# Loading the MNIST dataset

In [1]:
from keras.datasets import mnist

Using TensorFlow backend.


In [2]:
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()

In [8]:
# Add a trailing channel axis (grayscale) and rescale pixels from [0, 255] to [0, 1].
train_images = train_images.reshape((60000, 28, 28, 1))
train_images = train_images.astype('float32') / 255
test_images = test_images.reshape((10000, 28, 28, 1))
test_images = test_images.astype('float32') / 255
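
As a rough sanity check on the preprocessing above, the sketch below works out the element count and memory footprint of the reshaped `float32` arrays, using only the shapes that appear in the cell. The helper `array_footprint` is a made-up name for illustration and is not part of Keras or NumPy.

```python
from functools import reduce
from operator import mul

def array_footprint(shape, bytes_per_element):
    """Return (number of elements, size in bytes) for a dense array."""
    n_elements = reduce(mul, shape, 1)
    return n_elements, n_elements * bytes_per_element

# Shapes taken from the reshape calls above; float32 uses 4 bytes per value.
train_elems, train_bytes = array_footprint((60000, 28, 28, 1), 4)
test_elems, test_bytes = array_footprint((10000, 28, 28, 1), 4)

# Adding the trailing channel axis of size 1 leaves the element count unchanged.
assert train_elems == 60000 * 28 * 28

print(train_elems, round(train_bytes / 2**20, 1), "MiB")
print(test_elems, round(test_bytes / 2**20, 1), "MiB")
```

The training array therefore holds about 47 million values, which is worth keeping in mind on machines with little memory.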

In [9]:
from keras.utils import to_categorical

In [10]:
train_labels = to_categorical(train_labels)
test_labels = to_categorical(test_labels)
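
`to_categorical` turns each integer label into a one-hot vector. The pure-Python function below is a minimal sketch of that idea, assuming labels are integers in the range `0` to `num_classes - 1`. The name `one_hot` is hypothetical and this is not the Keras implementation.

```python
def one_hot(labels, num_classes):
    """Encode integer class labels as one-hot rows (illustrative only)."""
    rows = []
    for label in labels:
        row = [0.0] * num_classes
        row[label] = 1.0  # mark the position of the true class
        rows.append(row)
    return rows

# Three example digit labels, encoded over the ten MNIST classes.
encoded = one_hot([5, 0, 4], num_classes=10)
for row in encoded:
    print(row)
```

Each row contains a single `1.0`, which is the form the `categorical_crossentropy` loss expects for its targets.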

# Introduction to convnets

## Instantiating a small convnet

In [3]:
from keras import layers
from keras import models

In [4]:
model = models.Sequential()
# Three 3x3 convolutions; each 2x2 max-pooling step halves the spatial resolution.
model.add(layers.Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)))
model.add(layers.MaxPooling2D((2, 2)))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))
model.add(layers.MaxPooling2D((2, 2)))
model.add(layers.Conv2D(64, (3, 3), activation='relu'))

In [5]:
model.summary()

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
conv2d_1 (Conv2D)            (None, 26, 26, 32)        320       
_________________________________________________________________
max_pooling2d_1 (MaxPooling2 (None, 13, 13, 32)        0         
_________________________________________________________________
conv2d_2 (Conv2D)            (None, 11, 11, 64)        18496     
_________________________________________________________________
max_pooling2d_2 (MaxPooling2 (None, 5, 5, 64)          0         
_________________________________________________________________
conv2d_3 (Conv2D)            (None, 3, 3, 64)          36928     
=================================================================
Total params: 55,744
Trainable params: 55,744
Non-trainable params: 0
_________________________________________________________________
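
The shapes and parameter counts in this summary can be reproduced by hand. The sketch below assumes the Keras defaults used in the model (stride-1 convolutions with `'valid'` padding, and non-overlapping 2x2 pooling). The helper names are illustrative and not part of Keras.

```python
def conv2d_output(size, kernel):
    """Spatial size after a stride-1 convolution with no padding ('valid')."""
    return size - kernel + 1

def pool_output(size, pool):
    """Spatial size after non-overlapping max pooling (stride equal to pool size)."""
    return size // pool

def conv2d_params(kernel, in_channels, filters):
    """kernel*kernel*in_channels weights plus one bias, for each filter."""
    return (kernel * kernel * in_channels + 1) * filters

size, channels = 28, 1   # MNIST input: 28x28 pixels, one grayscale channel
total = 0
shapes = []
for kind, value in [("conv", 32), ("pool", 2), ("conv", 64), ("pool", 2), ("conv", 64)]:
    if kind == "conv":
        total += conv2d_params(3, channels, value)
        size, channels = conv2d_output(size, 3), value
    else:
        size = pool_output(size, value)  # pooling layers have no parameters
    shapes.append((size, size, channels))

print(shapes)  # per-layer output shapes, without the batch dimension
print(total)   # total number of trainable parameters
```

Each convolution trims two pixels from the width and height, and each pooling step halves them (rounding down), which is how a 28x28 input ends up as a 3x3 feature map.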


## Adding a classifier on top of the convnet

The next step is to feed the last output tensor, of shape `(3, 3, 64)`, into a densely connected classifier network. Dense layers expect 1D vectors, so the 3D output is flattened first.

In [6]:
model.add(layers.Flatten())
model.add(layers.Dense(64, activation='relu'))
model.add(layers.Dense(10, activation='softmax'))
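
To see what these three layers add, the arithmetic can be written out directly. This is a sketch using the standard dense-layer parameter formula. `dense_params` is a hypothetical helper, and `conv_base` is the total reported by the first `model.summary()` call.

```python
def dense_params(inputs, units):
    """One weight per input-unit pair, plus one bias per unit."""
    return inputs * units + units

flattened = 3 * 3 * 64                 # Flatten turns (3, 3, 64) into one vector
hidden = dense_params(flattened, 64)   # Dense(64, activation='relu')
output = dense_params(64, 10)          # Dense(10, activation='softmax'), one unit per digit
conv_base = 55744                      # parameters in the convolutional layers

print(flattened, hidden, output, conv_base + hidden + output)
```

The first dense layer alone holds as many parameters as the largest convolution, because every one of the 576 flattened values connects to every hidden unit.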

In [7]:
model.summary()

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
conv2d_1 (Conv2D)            (None, 26, 26, 32)        320       
_________________________________________________________________
max_pooling2d_1 (MaxPooling2 (None, 13, 13, 32)        0         
_________________________________________________________________
conv2d_2 (Conv2D)            (None, 11, 11, 64)        18496     
_________________________________________________________________
max_pooling2d_2 (MaxPooling2 (None, 5, 5, 64)          0         
_________________________________________________________________
conv2d_3 (Conv2D)            (None, 3, 3, 64)          36928     
_________________________________________________________________
flatten_1 (Flatten)          (None, 576)               0         
_________________________________________________________________
dense_1 (Dense)              (None, 64)                36928     
_________________________________________________________________
dense_2 (Dense)              (None, 10)                650       
=================================================================
Total params: 93,322
Trainable params: 93,322
Non-trainable params: 0
_________________________________________________________________

In [11]:
model.compile(optimizer='rmsprop',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
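
For intuition about the loss chosen here, the following pure-Python sketch computes categorical cross-entropy for a single sample. It is not the Keras implementation. The function names and the `eps` guard against `log(0)` are assumptions made for illustration.

```python
import math

def softmax(scores):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]  # subtract the max for numerical stability
    total = sum(exps)
    return [e / total for e in exps]

def categorical_crossentropy(target, probs, eps=1e-7):
    """Negative log-probability assigned to the true class (one-hot target)."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(target, probs))

target = [0.0] * 10
target[3] = 1.0                                      # the true digit is 3

uniform = softmax([0.0] * 10)                        # undecided: every class gets 0.1
confident = softmax([0.0] * 3 + [5.0] + [0.0] * 6)   # most probability mass on class 3

loss_uniform = categorical_crossentropy(target, uniform)
loss_confident = categorical_crossentropy(target, confident)
print(round(loss_uniform, 4), round(loss_confident, 4))
```

A model that spreads its probability evenly over the ten digits scores `ln(10)`, roughly 2.30, so a training loss near that value suggests the network has not yet learned anything.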

In [12]:
%%time
model.fit(train_images, train_labels, epochs=5, batch_size=64)

Epoch 1/5
Epoch 2/5
Epoch 3/5
Epoch 4/5
Epoch 5/5
CPU times: user 1min 29s, sys: 13.5 s, total: 1min 42s
Wall time: 1min 58s


<keras.callbacks.History at 0x7f375b1e17f0>
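
The number of weight updates behind those five epochs follows from the arguments to `fit`. This is a small sketch, assuming the final batch of each epoch is simply smaller than the rest when the sample count is not a multiple of `batch_size`.

```python
import math

num_samples, batch_size, epochs = 60000, 64, 5   # values from the cells above

steps_per_epoch = math.ceil(num_samples / batch_size)
last_batch = num_samples - (steps_per_epoch - 1) * batch_size  # size of the final, partial batch
total_updates = steps_per_epoch * epochs

print(steps_per_epoch, last_batch, total_updates)
```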

In [13]:
test_loss, test_acc = model.evaluate(test_images, test_labels)



In [14]:
test_acc

0.99060000000000004
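
To put that accuracy in concrete terms, it can be converted into a count of misclassified test images. This is a sketch; `reported_acc` is just the value printed above, copied by hand.

```python
reported_acc = 0.9906   # test accuracy printed above
num_test = 10000        # number of test images, from the reshape call

misclassified = round(num_test * (1 - reported_acc))
error_rate = f"{1 - reported_acc:.2%}"
print(misclassified, error_rate)
```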

In [15]:
%load_ext version_information
%version_information keras, numpy, tensorflow

| Software | Version |
|---|---|
| Python | 3.6.3 64bit [GCC 4.8.2 20140120 (Red Hat 4.8.2-15)] |
| IPython | 6.2.1 |
| OS | Linux 4.4.0 53 generic x86_64 with debian stretch sid |
| keras | 2.0.9 |
| numpy | 1.12.1 |
| tensorflow | 1.3.0 |

Fri Dec 22 13:02:56 2017 CST
