<a href="https://colab.research.google.com/github/Aboubacar2012/Deep-Learning-Training/blob/main/Project_CNN_Handwritten_Digit_Recognition_Larger_Convolutional_Neural_Network_for_MNIST.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

**Larger Convolutional Neural Network for MNIST**

Now that we have seen how to create a simple CNN, let’s take a look at a model capable of close
to state-of-the-art results. We import the classes and functions then load and prepare the data
the same as in the previous CNN example. This time we define a larger CNN architecture with
additional convolutional, max pooling layers and fully connected layers. The network topology
can be summarized as follows.

- Convolutional layer with 30 features maps of size 5x5

- Pooling layer taking the max over 2X2 patches

- Convolutional layer with 15 features maps of 3x3

- Pooling layer taking the max over 2x2 patches

- Dropout layer with a probability of 20%

- Flatten layer.

- Fully connected layer with 128 neurons and rectifier activation 

- Fully connected layer with 50 neurons and rectifier activation. 

- Output layer. 

In [11]:
# Larger CNN for the MNIST Dataset 
# Import classes and functions
import numpy 
from keras.datasets import mnist 
from keras.models import Sequential 
from keras.layers import Dense 
from keras.layers import Dropout 
from keras.layers import Flatten
from keras.layers.convolutional import Convolution2D
from keras.layers.convolutional import MaxPooling2D
from keras.utils import np_utils
from keras import backend as K
K.set_image_data_format('channels_first')

# fix random seed for reproducibility
seed=7
numpy.random.seed(seed)

# load data 
(X_train, y_train), (X_test, y_test)=mnist.load_data()

# reshape to be [samples][pixels][width][height]
X_train=X_train.reshape(X_train.shape[0], 1, 28, 28).astype('float32')
X_test=X_test.reshape(X_test.shape[0], 1, 28, 28).astype('float32')

#Normalize input from 0-255 to 0-1
X_train=X_train/255
X_test=X_test/255

#One hot encode outputs
y_train=np_utils.to_categorical(y_train)
y_test=np_utils.to_categorical(y_test)
num_classes=y_test.shape[1]

# define the larger model
def larger_model():
  # create model
  model = Sequential()
  model.add(Convolution2D(30, (5, 5), input_shape=(1, 28, 28), activation='relu', data_format='channels_first'))
  model.add(MaxPooling2D(pool_size=(2, 2)))
  model.add(Convolution2D(15, 3, 3,  activation='relu'))
  model.add(MaxPooling2D(pool_size=(2, 2)))
  model.add(Dropout(0.2))
  model.add(Flatten())
  model.add(Dense(128, activation='relu'))
  model.add(Dense(50, activation='relu'))
  model.add(Dense(num_classes, activation='softmax'))
  # Compile model
  model.compile(loss='categorical_crossentropy' , optimizer='adam' , metrics=['accuracy'])
  return model

# Build the model
model=larger_model()

# Fit the model 
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=10, batch_size=200, verbose=2)

# Final evaluation of the model 
scores=model.evaluate(X_test, y_test, verbose=0)
print("Large CNN Error: %.2f%%" % (100-scores[1]*100))

Epoch 1/10
300/300 - 3s - loss: 0.7032 - accuracy: 0.7746 - val_loss: 0.1948 - val_accuracy: 0.9431 - 3s/epoch - 9ms/step
Epoch 2/10
300/300 - 2s - loss: 0.2624 - accuracy: 0.9185 - val_loss: 0.1345 - val_accuracy: 0.9586 - 2s/epoch - 6ms/step
Epoch 3/10
300/300 - 2s - loss: 0.2062 - accuracy: 0.9359 - val_loss: 0.1064 - val_accuracy: 0.9680 - 2s/epoch - 6ms/step
Epoch 4/10
300/300 - 2s - loss: 0.1767 - accuracy: 0.9442 - val_loss: 0.0934 - val_accuracy: 0.9701 - 2s/epoch - 6ms/step
Epoch 5/10
300/300 - 2s - loss: 0.1538 - accuracy: 0.9517 - val_loss: 0.0794 - val_accuracy: 0.9755 - 2s/epoch - 7ms/step
Epoch 6/10
300/300 - 2s - loss: 0.1390 - accuracy: 0.9567 - val_loss: 0.0782 - val_accuracy: 0.9760 - 2s/epoch - 6ms/step
Epoch 7/10
300/300 - 2s - loss: 0.1266 - accuracy: 0.9599 - val_loss: 0.0674 - val_accuracy: 0.9782 - 2s/epoch - 6ms/step
Epoch 8/10
300/300 - 2s - loss: 0.1158 - accuracy: 0.9635 - val_loss: 0.0619 - val_accuracy: 0.9816 - 2s/epoch - 6ms/step
Epoch 9/10
300/300 - 2s 