# **Import Keras and Packages**

Let's start by importing Keras and the modules we need to build a neural network.


In [1]:
import keras
from keras.models import Sequential
from keras.layers import Dense
from keras.utils import to_categorical

When working with convolutional neural networks in particular, we will need additional packages.

In [2]:
from keras.layers import Conv2D # to add convolutional layers
from keras.layers import MaxPooling2D # to add pooling layers
from keras.layers import Flatten # to flatten data for fully connected layers

# **Convolutional Network with One Set of Convolutional and Pooling Layers**

In [3]:
# import data
from keras.datasets import mnist

# load data
(X_train, y_train), (X_test, y_test) = mnist.load_data()

# reshape to be [samples][height][width][channels]
X_train = X_train.reshape(X_train.shape[0], 28, 28, 1).astype('float32')
X_test = X_test.reshape(X_test.shape[0], 28, 28, 1).astype('float32')
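
To see what the reshape does without downloading MNIST, here is a minimal sketch on a synthetic array. The name `fake_images` and its random contents are made up for illustration; only the shapes are meant to match the real data.

```python
import numpy as np

# Hypothetical stand-in for X_train: 5 grayscale images of 28 x 28 pixels.
rng = np.random.default_rng(0)
fake_images = rng.integers(0, 256, size=(5, 28, 28), dtype=np.uint8)

# Add a trailing channel axis and convert to float32, as in the cell above.
reshaped = fake_images.reshape(fake_images.shape[0], 28, 28, 1).astype('float32')

print(fake_images.shape)  # (5, 28, 28)
print(reshaped.shape)     # (5, 28, 28, 1)
print(reshaped.dtype)     # float32
```

The trailing axis of length 1 is the single grayscale channel, which matches the `input_shape=(28, 28, 1)` passed to the first `Conv2D` layer later on.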

Let's normalize the pixel values to be between 0 and 1.

In [4]:
X_train = X_train / 255 # normalize training data
X_test = X_test / 255 # normalize test data
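
As a quick check of the scaling, the sketch below applies the same division to a few hand-picked pixel values. The array `pixels` is hypothetical; it only assumes that the raw values are 8-bit intensities in the range 0 to 255.

```python
import numpy as np

# Hypothetical pixel intensities spanning the full 8-bit range.
pixels = np.array([0, 64, 128, 255], dtype='float32')

scaled = pixels / 255  # same operation as in the cell above

print(scaled.min(), scaled.max())  # 0.0 1.0
```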

Next, let's one-hot encode the target variable, so that each label becomes a binary vector with one entry per class.

In [5]:
y_train = to_categorical(y_train)
y_test = to_categorical(y_test)

num_classes = y_test.shape[1] # number of categories
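
The encoding itself can be illustrated with plain NumPy. This is a sketch of the idea, not how Keras implements `to_categorical`; the labels and the name `n_classes` are made up for the example.

```python
import numpy as np

# Hypothetical integer labels for four samples drawn from 10 digit classes.
labels = np.array([3, 0, 9, 3])
n_classes = 10  # assumed here; the notebook derives it from y_test.shape[1]

# Row i of the identity matrix is the one-hot vector for class i.
one_hot = np.eye(n_classes, dtype='float32')[labels]

print(one_hot.shape)           # (4, 10)
print(one_hot.argmax(axis=1))  # [3 0 9 3]
```

Each row contains a single 1 at the index of its class, so `argmax` along axis 1 recovers the original labels.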

Next, let's define a function that creates our model. Let's start with one set of convolutional and pooling layers.

In [6]:
def convolutional_model():
    
    # create model
    model = Sequential()
    model.add(Conv2D(16, (5, 5), strides=(1, 1), activation='relu', input_shape=(28, 28, 1)))
    model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
    
    model.add(Flatten())
    model.add(Dense(100, activation='relu'))
    model.add(Dense(num_classes, activation='softmax'))
    
    # compile model
    model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
    return model
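
It can help to trace how the spatial size of the data changes through these layers. The sketch below does the arithmetic by hand, assuming no padding is applied (neither layer above passes a `padding` argument, and Keras defaults to `'valid'`). The helper `output_size` is a hypothetical name introduced for this example.

```python
def output_size(size, window, stride):
    """Output length along one axis for a window applied without padding."""
    return (size - window) // stride + 1

side = 28                                     # input images are 28 x 28
side = output_size(side, window=5, stride=1)  # 5 x 5 convolution -> 24
side = output_size(side, window=2, stride=2)  # 2 x 2 max pooling -> 12

filters = 16                       # number of filters in the Conv2D layer
flattened = side * side * filters  # length of the vector after Flatten

print(side)       # 12
print(flattened)  # 2304
```

Under these assumptions, the first `Dense` layer receives a vector of 2304 values per image.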

Finally, let's call the function to create the model, and then let's train it and evaluate it.

In [7]:
# build the model
model = convolutional_model()

# fit the model
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=10, batch_size=200, verbose=2)

# evaluate the model
scores = model.evaluate(X_test, y_test, verbose=0)
print("Accuracy: {:.2f}% \n Error: {:.2f}%".format(scores[1]*100, 100-scores[1]*100))

Epoch 1/10
300/300 - 19s - loss: 0.2933 - accuracy: 0.9202 - val_loss: 0.0888 - val_accuracy: 0.9745 - 19s/epoch - 63ms/step
Epoch 2/10
300/300 - 14s - loss: 0.0766 - accuracy: 0.9776 - val_loss: 0.0655 - val_accuracy: 0.9802 - 14s/epoch - 47ms/step
Epoch 3/10
300/300 - 14s - loss: 0.0534 - accuracy: 0.9840 - val_loss: 0.0468 - val_accuracy: 0.9846 - 14s/epoch - 47ms/step
Epoch 4/10
300/300 - 14s - loss: 0.0431 - accuracy: 0.9868 - val_loss: 0.0429 - val_accuracy: 0.9860 - 14s/epoch - 47ms/step
Epoch 5/10
300/300 - 14s - loss: 0.0333 - accuracy: 0.9897 - val_loss: 0.0394 - val_accuracy: 0.9868 - 14s/epoch - 47ms/step
Epoch 6/10
300/300 - 14s - loss: 0.0273 - accuracy: 0.9920 - val_loss: 0.0384 - val_accuracy: 0.9867 - 14s/epoch - 47ms/step
Epoch 7/10
300/300 - 14s - loss: 0.0245 - accuracy: 0.9921 - val_loss: 0.0395 - val_accuracy: 0.9880 - 14s/epoch - 47ms/step
Epoch 8/10
300/300 - 15s - loss: 0.0201 - accuracy: 0.9937 - val_loss: 0.0354 - val_accuracy: 0.9884 - 15s/epoch - 49ms/step
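
Two numbers in this output can be reproduced by hand. The sketch below assumes the standard MNIST training split of 60,000 images and uses the epoch 8 `val_accuracy` from the log purely as a sample input; it is not the final score returned by `model.evaluate`.

```python
import math

train_samples = 60000  # size of the standard MNIST training split
batch_size = 200       # as passed to model.fit above

# Number of batches per epoch, shown as "300/300" in the log.
steps_per_epoch = math.ceil(train_samples / batch_size)
print(steps_per_epoch)  # 300

# Sample input only: val_accuracy reported for epoch 8 in the log.
accuracy = 0.9884
error_percent = 100 - accuracy * 100
print(round(error_percent, 2))  # 1.16
```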


# **Convolutional Network with Two Sets of Convolutional and Pooling Layers**

Let's redefine our convolutional model so that it has two sets of convolutional and pooling layers instead of one.


In [8]:
def convolutional_model():
    
    # create model
    model = Sequential()
    model.add(Conv2D(16, (5, 5), activation='relu', input_shape=(28, 28, 1)))
    model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
    
    model.add(Conv2D(8, (2, 2), activation='relu'))
    model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))
    
    model.add(Flatten())
    model.add(Dense(100, activation='relu'))
    model.add(Dense(num_classes, activation='softmax'))
    
    # Compile model
    model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
    return model
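
One effect of the second convolution and pooling stage is a much smaller flattened vector, and therefore far fewer weights in the first `Dense` layer. The sketch below counts trainable parameters for both architectures by hand, assuming `'valid'` padding (the Keras default), one bias per filter or unit, and 10 output classes. The helper names are hypothetical.

```python
def output_size(size, window, stride):
    """Output length along one axis for a window applied without padding."""
    return (size - window) // stride + 1

def conv_params(kernel, in_channels, filters):
    """Weights plus one bias per filter for a square-kernel Conv2D layer."""
    return (kernel * kernel * in_channels + 1) * filters

def dense_params(inputs, units):
    """Weights plus one bias per unit for a Dense layer."""
    return (inputs + 1) * units

# First stage, shared by both models: 28 -> 24 (conv 5x5) -> 12 (pool 2x2).
side = output_size(output_size(28, 5, 1), 2, 2)
one_set_flat = side * side * 16   # 2304

# Second stage: 12 -> 11 (conv 2x2, stride 1) -> 5 (pool 2x2, stride 2).
side = output_size(output_size(side, 2, 1), 2, 2)
two_set_flat = side * side * 8    # 200

one_set_total = (conv_params(5, 1, 16)
                 + dense_params(one_set_flat, 100)
                 + dense_params(100, 10))
two_set_total = (conv_params(5, 1, 16)
                 + conv_params(2, 16, 8)
                 + dense_params(two_set_flat, 100)
                 + dense_params(100, 10))

print(one_set_total)  # 231926
print(two_set_total)  # 22046
```

Under these assumptions the two-stage model has roughly a tenth of the parameters of the one-stage model; calling `model.summary()` on each model is the direct way to confirm the counts.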

Now, let's call the function to create our new convolutional neural network, and then let's train it and evaluate it.

In [9]:
# build the model
model = convolutional_model()

# fit the model
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=10, batch_size=200, verbose=2)

# evaluate the model
scores = model.evaluate(X_test, y_test, verbose=0)
print("Accuracy: {:.2f}% \n Error: {:.2f}%".format(scores[1]*100, 100-scores[1]*100))

Epoch 1/10
300/300 - 18s - loss: 0.4592 - accuracy: 0.8665 - val_loss: 0.1529 - val_accuracy: 0.9561 - 18s/epoch - 59ms/step
Epoch 2/10
300/300 - 16s - loss: 0.1328 - accuracy: 0.9603 - val_loss: 0.0938 - val_accuracy: 0.9715 - 16s/epoch - 52ms/step
Epoch 3/10
300/300 - 16s - loss: 0.0949 - accuracy: 0.9716 - val_loss: 0.0750 - val_accuracy: 0.9757 - 16s/epoch - 52ms/step
Epoch 4/10
300/300 - 16s - loss: 0.0728 - accuracy: 0.9786 - val_loss: 0.0639 - val_accuracy: 0.9797 - 16s/epoch - 53ms/step
Epoch 5/10
300/300 - 16s - loss: 0.0625 - accuracy: 0.9818 - val_loss: 0.0542 - val_accuracy: 0.9825 - 16s/epoch - 53ms/step
Epoch 6/10
300/300 - 16s - loss: 0.0539 - accuracy: 0.9838 - val_loss: 0.0491 - val_accuracy: 0.9848 - 16s/epoch - 53ms/step
Epoch 7/10
300/300 - 17s - loss: 0.0489 - accuracy: 0.9854 - val_loss: 0.0396 - val_accuracy: 0.9867 - 17s/epoch - 56ms/step
Epoch 8/10
300/300 - 16s - loss: 0.0441 - accuracy: 0.9869 - val_loss: 0.0401 - val_accuracy: 0.9875 - 16s/epoch - 53ms/step
