# Lecture 12 Machine Learning Tutorial

## Simple MNIST CNN Model

**Modified from:** [fchollet](https://twitter.com/fchollet)<br>
**Last modified:** 2021/01/01<br>
**Description:** A simple convolutional neural network that achieves ~99% test accuracy on MNIST.

---
#### Workflow of the tutorial

In this tutorial, we construct a simple convolutional neural network (CNN) with multiple layers and train it on `MNIST`, a dataset of grayscale images of handwritten digits. The goal is for the model to recognize the digit shown in each image.


1) Obtain the data

- 28×28-pixel images of handwritten digits 0 to 9
- Loaded with `keras.datasets.mnist.load_data`

2) Preprocess the data
- Normalize the data (scale the pixel values to the [0, 1] range)
- Reshape each image to the shape the model expects: (28, 28, 1)
- Convert the class labels 0, 1, 2, ..., 9 to a binary class matrix (one-hot encoding)
    - 0 : [1, 0, 0, 0, 0, ..., 0]
    - 1 : [0, 1, 0, 0, 0, ..., 0]
    - 2 : [0, 0, 1, 0, 0, ..., 0]
    - 9 : ?
 
3) Construct the CNN model
- Using `keras.Sequential([ Layers ])`

4) Train the model with the data

5) Evaluate the model
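
The label conversion in step 2 can be sketched with plain NumPy. This is only an illustration of the idea: `one_hot` is a hypothetical helper written for this example, not a Keras function. The tutorial itself uses `keras.utils.to_categorical` for the same purpose.

```python
import numpy as np


def one_hot(labels, num_classes):
    """Return a (len(labels), num_classes) matrix with a single 1 per row."""
    labels = np.asarray(labels, dtype=int)
    # Row i of the identity matrix is the one-hot vector for class i,
    # so indexing the identity matrix by the labels picks the right rows.
    return np.eye(num_classes, dtype="float32")[labels]


encoded = one_hot([0, 1, 2], num_classes=10)
print(encoded.shape)  # one row per label, one column per class
print(encoded[2])     # the single 1 sits at index 2
```

Each row sums to 1, and the position of the 1 is the original class label.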

## Setup

In [None]:
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

## Prepare the data

In [None]:
# Model / data parameters
num_classes = 10
input_shape = (28, 28, 1)

# Load the data, split between train and test sets
(x_train, y_train), (x_test, y_test) = keras.datasets.mnist.load_data()

# Scale images to the [0, 1] range
x_train = x_train.astype("float32") / 255
x_test = x_test.astype("float32") / 255
# Make sure images have shape (28, 28, 1)
x_train = np.expand_dims(x_train, -1)
x_test = np.expand_dims(x_test, -1)
print("x_train shape:", x_train.shape)
print(x_train.shape[0], "train samples")
print(x_test.shape[0], "test samples")


# Convert class vectors to binary class matrices (one-hot encoding)
y_train = keras.utils.to_categorical(y_train, num_classes)
y_test = keras.utils.to_categorical(y_test, num_classes)
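
To see what the scaling and reshaping steps do without downloading MNIST, the same two operations can be tried on a small synthetic batch. This is a minimal sketch: `fake_images` is made-up random data standing in for real digits, and only the shapes and value ranges are meaningful.

```python
import numpy as np

# Four random 28x28 "images" with integer pixel values in [0, 255],
# the same dtype and layout as the raw MNIST arrays.
rng = np.random.default_rng(0)
fake_images = rng.integers(0, 256, size=(4, 28, 28), dtype=np.uint8)

# Same preprocessing as in the tutorial: scale to [0, 1], then add a
# trailing channel axis so each image has shape (28, 28, 1).
scaled = fake_images.astype("float32") / 255
batch = np.expand_dims(scaled, -1)

print(batch.shape, batch.dtype)
print(batch.min() >= 0.0, batch.max() <= 1.0)
```

The trailing axis of size 1 is the channel dimension that `Conv2D` expects for grayscale input.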

## Build the model

In [None]:
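model = keras.Sequential(
    [
        keras.Input(shape=input_shape),
        # 32 filters of size 3x3: (28, 28, 1) -> (26, 26, 32)
        layers.Conv2D(32, kernel_size=(3, 3), activation="relu"),
        # Downsample by 2: (26, 26, 32) -> (13, 13, 32)
        layers.MaxPooling2D(pool_size=(2, 2)),
        # 64 filters of size 3x3: (13, 13, 32) -> (11, 11, 64)
        layers.Conv2D(64, kernel_size=(3, 3), activation="relu"),
        # Downsample by 2: (11, 11, 64) -> (5, 5, 64)
        layers.MaxPooling2D(pool_size=(2, 2)),
        # Flatten to a vector of 5 * 5 * 64 = 1600 features
        layers.Flatten(),
        # Randomly zero 50% of the features during training to reduce overfitting
        layers.Dropout(0.5),
        # One output probability per digit class
        layers.Dense(num_classes, activation="softmax"),
    ]
)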

model.summary()
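
The shapes and parameter counts reported by `model.summary()` can be checked by hand. The sketch below assumes the Keras defaults used above (no padding and stride 1 for `Conv2D`, non-overlapping windows for `MaxPooling2D`). `conv2d` and `max_pool` are hypothetical helpers written for this example; they do arithmetic only and do not touch Keras.

```python
def conv2d(h, w, c_in, filters, k=3):
    """Output shape and parameter count of an unpadded k x k convolution, stride 1."""
    params = (k * k * c_in + 1) * filters  # kernel weights plus one bias per filter
    return (h - k + 1, w - k + 1, filters), params


def max_pool(h, w, c, pool=2):
    """Output shape of non-overlapping pool x pool max pooling (no parameters)."""
    return (h // pool, w // pool, c)


shape = (28, 28, 1)
total_params = 0

shape, params = conv2d(*shape, filters=32)
total_params += params
print("conv 1:", shape, params)

shape = max_pool(*shape)
print("pool 1:", shape)

shape, params = conv2d(*shape, filters=64)
total_params += params
print("conv 2:", shape, params)

shape = max_pool(*shape)
print("pool 2:", shape)

flat = shape[0] * shape[1] * shape[2]
dense_params = (flat + 1) * 10  # weights plus one bias per output class
total_params += dense_params
print("flatten:", flat, "dense params:", dense_params)
print("total params:", total_params)
```

`Flatten` and `Dropout` add no trainable parameters, so the total comes from the two convolutions and the final dense layer.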

## Train the model

In [None]:
batch_size = 128
epochs = 15

model.compile(loss="categorical_crossentropy", optimizer="adam", metrics=["accuracy"])

model.fit(x_train, y_train, batch_size=batch_size, epochs=epochs, validation_split=0.1)
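
The progress bar printed by `model.fit` counts batches per epoch, and that number follows from the settings above. A minimal sketch of the arithmetic, assuming the standard 60,000-image MNIST training set and that `validation_split=0.1` holds out the last 10% of the samples for validation:

```python
import math

num_samples = 60000      # size of the MNIST training set
validation_split = 0.1
batch_size = 128
epochs = 15

num_val = round(num_samples * validation_split)  # samples held out for validation
num_train = num_samples - num_val                # samples actually used for training
steps_per_epoch = math.ceil(num_train / batch_size)  # the last batch may be smaller

print("training samples:", num_train)
print("validation samples:", num_val)
print("steps per epoch:", steps_per_epoch)
print("total weight updates:", steps_per_epoch * epochs)
```

The validation samples are never used for weight updates; they only provide the `val_loss` and `val_accuracy` values reported after each epoch.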

## Evaluate the trained model

In [None]:
score = model.evaluate(x_test, y_test, verbose=0)
print("Test loss:", score[0])
print("Test accuracy:", score[1])
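
For one-hot labels, the accuracy reported here is conceptually the fraction of samples whose highest-probability class matches the true class. The sketch below shows that computation on a tiny hand-written example with 3 classes; `probs` and `y_true` are made-up values, not model output.

```python
import numpy as np

# Made-up softmax outputs for four samples and three classes.
probs = np.array([
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.3, 0.4, 0.3],
    [0.2, 0.2, 0.6],
])

# One-hot encoded true labels for the same four samples.
y_true = np.array([
    [1, 0, 0],
    [0, 1, 0],
    [1, 0, 0],
    [0, 0, 1],
])

predicted = probs.argmax(axis=1)  # class with the highest probability
actual = y_true.argmax(axis=1)    # position of the 1 in each one-hot row
accuracy = float((predicted == actual).mean())

print("predicted:", predicted)
print("actual:   ", actual)
print("accuracy: ", accuracy)
```

With the trained model, the same `argmax` step applied to `model.predict(x_test)` would give the predicted digit for each test image.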