# Convolutional Neural Networks with Keras

In this notebook, I show how to use the Keras library to build convolutional neural networks (CNNs). I use the popular MNIST dataset of handwritten digits, so the results can be compared with those of a conventional (fully connected) neural network.


Objectives:

1. Use the Keras library to build convolutional neural networks.
2. Build a convolutional neural network with one set of convolutional and pooling layers.
3. Build a convolutional neural network with two sets of convolutional and pooling layers.



## Table of Contents

<div class="alert alert-block alert-info" style="margin-top: 20px">

<font size = 3>
      
1. <a href="#item41">Import Keras and Packages</a>   
2. <a href="#item42">Convolutional Neural Network with One Set of Convolutional and Pooling Layers</a>  
3. <a href="#item43">Convolutional Neural Network with Two Sets of Convolutional and Pooling Layers</a>  

</font>
</div>


<a id='item41'></a>


## Import Keras and Packages


Let's start by importing the Keras library and the packages needed to build a neural network.


In [None]:
#pip install numpy==1.21.4
#pip install pandas==1.3.4
#pip install keras==2.1.6

In [1]:
import keras
from keras.models import Sequential
from keras.layers import Dense
from keras.utils import to_categorical

When working with convolutional neural networks in particular, we will need additional packages.


In [4]:
from keras.layers import Conv2D # to add convolutional layers
from keras.layers import MaxPooling2D # to add pooling layers
from keras.layers import Flatten # to flatten data for fully connected layers

<a id='item42'></a>


## Convolutional Neural Network with One Set of Convolutional and Pooling Layers


In [8]:
# import data
from keras.datasets import mnist

# load data
(X_train, y_train), (X_test, y_test) = mnist.load_data()

# reshape to be [samples][height][width][channels]
X_train = X_train.reshape(X_train.shape[0], 28, 28, 1).astype('float32')
X_test = X_test.reshape(X_test.shape[0], 28, 28, 1).astype('float32')
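
The reshape above adds a trailing channel axis of size 1, which is the layout `Conv2D` expects for grayscale images. A minimal sketch on synthetic data shows the effect on the array shape; `fake_images` is a hypothetical stand-in for the MNIST arrays, so nothing is downloaded:

```python
import numpy as np

# Hypothetical stand-in for MNIST: 5 grayscale images of 28x28 pixels.
fake_images = np.zeros((5, 28, 28), dtype=np.uint8)

# Add a channel axis of size 1 and convert to float32, as in the cell above.
reshaped = fake_images.reshape(fake_images.shape[0], 28, 28, 1).astype('float32')

print(fake_images.shape)  # (5, 28, 28)
print(reshaped.shape)     # (5, 28, 28, 1)
```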

Next, normalize the pixel values so that they lie between 0 and 1.


In [9]:
X_train = X_train / 255 # normalize training data
X_test = X_test / 255 # normalize test data
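
MNIST pixels are stored as 8-bit intensities from 0 to 255, so dividing by 255 maps them onto the range [0, 1]. A small sketch with made-up pixel values (the `pixels` array is an assumption for illustration):

```python
import numpy as np

# Made-up pixel intensities covering the full 8-bit range.
pixels = np.array([0, 64, 128, 255], dtype=np.uint8).astype('float32')

# Same scaling as applied to X_train and X_test.
scaled = pixels / 255

print(scaled.min(), scaled.max())  # 0.0 1.0
```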

Next, I'll convert the target variable into one-hot encoded categories.


In [10]:
y_train = to_categorical(y_train)
y_test = to_categorical(y_test)

num_classes = y_test.shape[1] # number of categories
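
`to_categorical` turns each integer label into a one-hot row vector. The NumPy sketch below reproduces the idea without Keras; the `labels` array and the name `n_classes` are invented for illustration, and indexing `np.eye` is a stand-in rather than how Keras implements it:

```python
import numpy as np

labels = np.array([3, 0, 9])   # invented digit labels
n_classes = 10                 # MNIST has ten digit classes

# Row i of the identity matrix is the one-hot vector for class i.
one_hot = np.eye(n_classes)[labels]

print(one_hot.shape)   # (3, 10)
print(one_hot[0])      # 1.0 at index 3, zeros elsewhere
```

The second dimension of the encoded array is the number of classes, which is why `num_classes` can be read from `y_test.shape[1]`.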

Next, I'll define a function that creates our model. Let's start with one set of convolutional and pooling layers.


In [11]:
def convolutional_model():

    # create model
    model = Sequential()
    model.add(Conv2D(16, (5, 5), strides=(1, 1), activation='relu', input_shape=(28, 28, 1)))
    model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))

    model.add(Flatten())
    model.add(Dense(100, activation='relu'))
    model.add(Dense(num_classes, activation='softmax'))

    # compile model
    model.compile(optimizer='adam', loss='categorical_crossentropy',  metrics=['accuracy'])
    return model
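
To see what this architecture does to the data, the output size and parameter count of each layer can be worked out by hand. The sketch below assumes the Keras default of `'valid'` (no) padding for `Conv2D` and `MaxPooling2D`; the helper `conv_out` is a hypothetical name introduced here, and the totals should agree with what `model.summary()` reports:

```python
def conv_out(size, kernel, stride=1):
    """Output length of a 'valid' (unpadded) convolution or pooling window."""
    return (size - kernel) // stride + 1

# Conv2D(16, (5, 5)) on a 28x28x1 input
conv_side = conv_out(28, 5)              # 24
conv_params = (5 * 5 * 1 + 1) * 16       # kernel weights + one bias per filter = 416

# MaxPooling2D((2, 2), strides=(2, 2)) has no trainable parameters
pool_side = conv_out(conv_side, 2, 2)    # 12
flat = pool_side * pool_side * 16        # 2304 features after Flatten

dense1_params = flat * 100 + 100         # 230500
dense2_params = 100 * 10 + 10            # 1010

total = conv_params + dense1_params + dense2_params
print(conv_side, pool_side, flat, total)  # 24 12 2304 231926
```

Almost all of the parameters sit in the first `Dense` layer, because it is connected to every one of the 2304 flattened features.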

Finally, let's call the function to create the model, and then let's train it and evaluate it.


In [12]:
# build the model
model = convolutional_model()

# fit the model
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=10, batch_size=200, verbose=2)

# evaluate the model
scores = model.evaluate(X_test, y_test, verbose=0)
print("Accuracy: {} \n Error: {}".format(scores[1], 100-scores[1]*100))

Epoch 1/10
300/300 - 2s - 7ms/step - accuracy: 0.9188 - loss: 0.2938 - val_accuracy: 0.9696 - val_loss: 0.1042
Epoch 2/10
300/300 - 1s - 4ms/step - accuracy: 0.9732 - loss: 0.0920 - val_accuracy: 0.9766 - val_loss: 0.0727
Epoch 3/10
300/300 - 1s - 4ms/step - accuracy: 0.9814 - loss: 0.0617 - val_accuracy: 0.9841 - val_loss: 0.0511
Epoch 4/10
300/300 - 1s - 4ms/step - accuracy: 0.9857 - loss: 0.0473 - val_accuracy: 0.9851 - val_loss: 0.0443
Epoch 5/10
300/300 - 2s - 6ms/step - accuracy: 0.9890 - loss: 0.0374 - val_accuracy: 0.9868 - val_loss: 0.0404
Epoch 6/10
300/300 - 2s - 7ms/step - accuracy: 0.9908 - loss: 0.0308 - val_accuracy: 0.9858 - val_loss: 0.0401
Epoch 7/10
300/300 - 2s - 7ms/step - accuracy: 0.9924 - loss: 0.0258 - val_accuracy: 0.9870 - val_loss: 0.0376
Epoch 8/10
300/300 - 2s - 6ms/step - accuracy: 0.9930 - loss: 0.0229 - val_accuracy: 0.9890 - val_loss: 0.0355
Epoch 9/10
300/300 - 2s - 6ms/step - accuracy: 0.9940 - loss: 0.0196 - val_accuracy: 0.9865 - val_loss: 0.0403
Epoch 10/10
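
The `300/300` in each log line is the number of batches per epoch: the 60,000 MNIST training images divided by the batch size of 200. The sketch below checks that arithmetic and shows how the printed error relates to the accuracy; the `accuracy` value is only an illustrative number taken from the epoch 9 `val_accuracy` above, not a final result:

```python
import math

train_samples = 60_000   # size of the MNIST training set
batch_size = 200         # value passed to model.fit above

steps_per_epoch = math.ceil(train_samples / batch_size)
print(steps_per_epoch)   # 300

# With metrics=['accuracy'], scores[1] is the accuracy as a fraction,
# while the error in the print statement is expressed as a percentage.
accuracy = 0.9865        # illustrative value (epoch 9 val_accuracy)
error_pct = 100 - accuracy * 100
print(round(error_pct, 2))  # 1.35
```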


------------------------------------------


<a id='item43'></a>


## Convolutional Neural Network with Two Sets of Convolutional and Pooling Layers


Let's redefine our convolutional model so that it has two sets of convolutional and pooling layers instead of just one.


In [13]:
def convolutional_model():

    # create model
    model = Sequential()
    model.add(Conv2D(16, (5, 5), activation='relu', input_shape=(28, 28, 1)))
    model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))

    model.add(Conv2D(8, (2, 2), activation='relu'))
    model.add(MaxPooling2D(pool_size=(2, 2), strides=(2, 2)))

    model.add(Flatten())
    model.add(Dense(100, activation='relu'))
    model.add(Dense(num_classes, activation='softmax'))

    # Compile model
    model.compile(optimizer='adam', loss='categorical_crossentropy',  metrics=['accuracy'])
    return model
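
The same hand calculation can be repeated for the two-stage model. As before, this is a sketch that assumes the default `'valid'` padding, and `conv_out` is a hypothetical helper rather than a Keras function:

```python
def conv_out(size, kernel, stride=1):
    """Output length of a 'valid' (unpadded) convolution or pooling window."""
    return (size - kernel) // stride + 1

side = conv_out(28, 5)                  # 24 after Conv2D(16, (5, 5))
conv1_params = (5 * 5 * 1 + 1) * 16     # 416
side = conv_out(side, 2, 2)             # 12 after the first pooling layer

side = conv_out(side, 2)                # 11 after Conv2D(8, (2, 2))
conv2_params = (2 * 2 * 16 + 1) * 8     # 520 (each filter spans all 16 input channels)
side = conv_out(side, 2, 2)             # 5 after the second pooling layer (odd row/column dropped)

flat = side * side * 8                  # 200 features after Flatten
dense1_params = flat * 100 + 100        # 20100
dense2_params = 100 * 10 + 10           # 1010

total = conv1_params + conv2_params + dense1_params + dense2_params
print(side, flat, total)                # 5 200 22046
```

Under these assumptions the second stage shrinks the flattened vector from 2304 to 200 features, so this model has roughly a tenth of the parameters of the single-stage one.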

Now, let's call the function to create our new convolutional neural network, and then let's train it and evaluate it.


In [14]:
# build the model
model = convolutional_model()

# fit the model
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=10, batch_size=200, verbose=2)

# evaluate the model
scores = model.evaluate(X_test, y_test, verbose=0)
print("Accuracy: {} \n Error: {}".format(scores[1], 100-scores[1]*100))

Epoch 1/10
300/300 - 3s - 9ms/step - accuracy: 0.8605 - loss: 0.4849 - val_accuracy: 0.9542 - val_loss: 0.1554
Epoch 2/10
300/300 - 2s - 6ms/step - accuracy: 0.9625 - loss: 0.1275 - val_accuracy: 0.9679 - val_loss: 0.1025
Epoch 3/10
300/300 - 2s - 5ms/step - accuracy: 0.9732 - loss: 0.0891 - val_accuracy: 0.9799 - val_loss: 0.0645
Epoch 4/10
300/300 - 2s - 5ms/step - accuracy: 0.9788 - loss: 0.0708 - val_accuracy: 0.9825 - val_loss: 0.0552
Epoch 5/10
300/300 - 2s - 5ms/step - accuracy: 0.9814 - loss: 0.0607 - val_accuracy: 0.9854 - val_loss: 0.0457
Epoch 6/10
300/300 - 2s - 6ms/step - accuracy: 0.9841 - loss: 0.0521 - val_accuracy: 0.9847 - val_loss: 0.0463
Epoch 7/10
300/300 - 2s - 5ms/step - accuracy: 0.9858 - loss: 0.0457 - val_accuracy: 0.9858 - val_loss: 0.0475
Epoch 8/10
300/300 - 2s - 6ms/step - accuracy: 0.9870 - loss: 0.0411 - val_accuracy: 0.9867 - val_loss: 0.0404
Epoch 9/10
300/300 - 2s - 5ms/step - accuracy: 0.9887 - loss: 0.0364 - val_accuracy: 0.9888 - val_loss: 0.0343
E
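
Log lines in this format are easy to compare programmatically. The sketch below parses the epoch 9 line of each model, copied from the outputs above, and reports the gap between training and validation accuracy; `parse_metrics` is a hypothetical helper written for this example:

```python
import re

# Epoch 9 log lines copied from the two training runs above.
log_one_conv = ("300/300 - 2s - 6ms/step - accuracy: 0.9940 - loss: 0.0196 "
                "- val_accuracy: 0.9865 - val_loss: 0.0403")
log_two_conv = ("300/300 - 2s - 5ms/step - accuracy: 0.9887 - loss: 0.0364 "
                "- val_accuracy: 0.9888 - val_loss: 0.0343")

def parse_metrics(line):
    """Return {metric_name: value} for every 'name: number' pair in a log line."""
    return {name: float(value)
            for name, value in re.findall(r"(\w+): ([0-9.]+)", line)}

one = parse_metrics(log_one_conv)
two = parse_metrics(log_two_conv)

# Gap between training and validation accuracy at epoch 9.
print(round(one["accuracy"] - one["val_accuracy"], 4))  # 0.0075
print(round(two["accuracy"] - two["val_accuracy"], 4))  # -0.0001
```

At epoch 9 the single-stage model scores higher on the training data than on the validation data, while the two-stage model's training and validation accuracy are nearly identical.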

## Conclusion

In this notebook, we explored Convolutional Neural Networks (CNNs) using Keras to classify the MNIST handwritten digit dataset. We covered several key concepts and processes:

1. **Data Preparation**:
    - Loaded and reshaped the MNIST data to fit the CNN input requirements
    - Normalized pixel values to a 0-1 range
    - Converted target variables to categorical format using one-hot encoding

2. **CNN Architectures**:
    - Built a simple CNN with one convolutional layer and one pooling layer
    - Created a more complex CNN with two sets of convolutional and pooling layers
    - Used the Sequential model API from Keras for model construction

3. **Key Components**:
    - Convolutional layers (Conv2D) to extract features from images
    - MaxPooling2D layers to reduce dimensionality
    - Flatten layer to convert 2D feature maps to 1D feature vectors
    - Dense layers for classification

4. **Training and Evaluation**:
    - Trained models for 10 epochs using categorical cross-entropy loss
    - Used the Adam optimizer to update the weights
    - Evaluated model performance using accuracy metrics
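
As a concrete illustration of the `MaxPooling2D` step listed under Key Components, the NumPy sketch below applies 2x2 max pooling with stride 2 to a single feature map. It is a simplified stand-in for one channel, not the Keras implementation, and `max_pool_2x2` is a hypothetical name:

```python
import numpy as np

def max_pool_2x2(feature_map):
    """2x2 max pooling with stride 2 on one 2D feature map (no padding)."""
    h, w = feature_map.shape
    # Drop a trailing odd row/column, since no padding is applied.
    cropped = feature_map[: h // 2 * 2, : w // 2 * 2]
    # Group pixels into 2x2 blocks and keep the maximum of each block.
    blocks = cropped.reshape(h // 2, 2, w // 2, 2)
    return blocks.max(axis=(1, 3))

x = np.arange(16).reshape(4, 4)
print(max_pool_2x2(x))
# [[ 5  7]
#  [13 15]]
```

Each output value summarizes a 2x2 patch, so the height and width of the feature map are halved while the strongest activations are kept.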

This approach has numerous business applications:
- Document processing and optical character recognition (OCR)
- Quality control in manufacturing (defect detection)
- Medical image analysis and diagnosis
- Facial recognition systems
- Self-driving vehicles and object detection
- Retail inventory management through image recognition

The techniques demonstrated here can be extended to more complex image classification problems by adjusting the network architecture, adding more layers, or using transfer learning with pre-trained models.