# CNN on MNIST Dataset

In this notebook, we build a **Convolutional Neural Network (CNN)** to classify handwritten digits from the **MNIST dataset**.

### MNIST
- 70,000 grayscale images (28x28 pixels): 60,000 for training and 10,000 for testing
- 10 classes: digits 0–9

In [1]:
import tensorflow as tf
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense, Dropout
from tensorflow.keras.datasets import mnist
from tensorflow.keras.utils import to_categorical

# Load MNIST dataset
(x_train, y_train), (x_test, y_test) = mnist.load_data()

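# Add a trailing channel axis: Conv2D expects inputs of shape (height, width, channels)
x_train = x_train.reshape((-1, 28, 28, 1))
x_test = x_test.reshape((-1, 28, 28, 1))

# Scale pixel values from [0, 255] to [0, 1]; float32 avoids the float64 default
x_train = x_train.astype("float32") / 255.0
x_test = x_test.astype("float32") / 255.0

# One-hot encode labels, as required by the categorical_crossentropy loss
y_train = to_categorical(y_train, 10)
y_test = to_categorical(y_test, 10)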

print("Training data shape:", x_train.shape)
print("Test data shape:", x_test.shape)
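
To make the two preprocessing steps above concrete, here is a minimal plain-Python sketch of pixel scaling and one-hot encoding on toy values. The helper names `scale_pixels` and `one_hot` are hypothetical and used only for illustration; the notebook itself relies on NumPy division and Keras' `to_categorical`.

```python
def scale_pixels(pixels):
    """Map 8-bit intensities (0-255) to floats in [0, 1]."""
    return [p / 255.0 for p in pixels]


def one_hot(label, num_classes=10):
    """Return a vector with 1.0 at index `label` and 0.0 elsewhere."""
    if not 0 <= label < num_classes:
        raise ValueError(f"label {label} is outside 0..{num_classes - 1}")
    vec = [0.0] * num_classes
    vec[label] = 1.0
    return vec


print(scale_pixels([0, 255]))  # [0.0, 1.0]
print(one_hot(3))              # 1.0 in position 3, 0.0 everywhere else
```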

In [2]:
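# Build the CNN: two conv/pool stages extract features, then a dense classifier
model = Sequential([
    Conv2D(32, (3,3), activation='relu', input_shape=(28,28,1)),  # 32 filters of size 3x3
    MaxPooling2D((2,2)),                                          # halve height and width
    Conv2D(64, (3,3), activation='relu'),                         # 64 filters of size 3x3
    MaxPooling2D((2,2)),
    Flatten(),                                                    # feature maps -> 1D vector
    Dense(128, activation='relu'),
    Dropout(0.5),                                                 # drop 50% of units during training
    Dense(10, activation='softmax')                               # one probability per digit
])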

model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
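
The final `softmax` layer and the `categorical_crossentropy` loss work as a pair. The following is a simplified plain-Python sketch for a single sample, not Keras' actual implementation. The function names and the `eps` clipping value are assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def categorical_crossentropy(y_true, y_pred, eps=1e-7):
    """Cross-entropy between a one-hot target and predicted probabilities."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(y_true, y_pred))


probs = softmax([1.0, 2.0, 3.0])
target = [0.0, 0.0, 1.0]  # one-hot label for class 2
print(probs)
print(categorical_crossentropy(target, probs))
```

With a one-hot target, only the probability assigned to the true class contributes to the loss, so a confident correct prediction gives a loss near zero.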

In [3]:
model.summary()

Model: "sequential"
_________________________________________________________________
 Layer (type)                    Output Shape              Param #
=================================================================
 conv2d (Conv2D)                 (None, 26, 26, 32)        320
 max_pooling2d (MaxPooling2D)    (None, 13, 13, 32)        0
 conv2d_1 (Conv2D)               (None, 11, 11, 64)        18496
 max_pooling2d_1 (MaxPooling2D)  (None, 5, 5, 64)          0
 flatten (Flatten)               (None, 1600)              0
 dense (Dense)                   (None, 128)               204928
 dropout (Dropout)               (None, 128)               0
 dense_1 (Dense)                 (None, 10)                1290
=================================================================
Total params: 225,034
Trainable params: 225,034
Non-trainable params: 0
_________________________________________________________________
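
The output shapes and parameter counts in the summary can be checked by hand. The sketch below assumes the Keras defaults used in this model: `'valid'` padding with stride 1 for `Conv2D`, and a pooling stride equal to the pool size for `MaxPooling2D`. The helper function names are hypothetical.

```python
def conv_out(size, kernel):
    """Spatial size after a 'valid' convolution with stride 1."""
    return size - kernel + 1


def pool_out(size, pool):
    """Spatial size after max pooling with stride equal to the pool size."""
    return size // pool


def conv_params(kernel, in_channels, filters):
    """Weights per filter (kernel * kernel * in_channels) plus one bias each."""
    return (kernel * kernel * in_channels + 1) * filters


def dense_params(inputs, units):
    """One weight per input plus one bias, for every unit."""
    return (inputs + 1) * units


side = 28
side = conv_out(side, 3)   # 26
side = pool_out(side, 2)   # 13
side = conv_out(side, 3)   # 11
side = pool_out(side, 2)   # 5
flat = side * side * 64    # 1600 values after Flatten

params = [
    conv_params(3, 1, 32),    # conv2d
    conv_params(3, 32, 64),   # conv2d_1
    dense_params(flat, 128),  # dense
    dense_params(128, 10),    # dense_1
]
total = sum(params)
print(flat, params, total)  # 1600 [320, 18496, 204928, 1290] 225034
```

Most of the parameters sit in the first dense layer, not in the convolutions.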


In [4]:
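# Train the model for 5 epochs with mini-batches of 64 images.
# The test set is also passed as validation data, so the per-epoch val_* metrics are test-set metrics.
print("Training CNN on MNIST...")
history = model.fit(x_train, y_train, epochs=5, batch_size=64,
                    validation_data=(x_test, y_test), verbose=1)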

Training CNN on MNIST...
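
To give a sense of how much work the `fit` call does, this short sketch counts the weight updates implied by `batch_size=64` and `epochs=5`. It assumes the standard 60,000-image MNIST training split, and `steps_per_epoch` here is a hypothetical helper, not the Keras argument of the same name.

```python
import math


def steps_per_epoch(num_samples, batch_size):
    """Number of mini-batches needed to cover the data once (last batch may be smaller)."""
    return math.ceil(num_samples / batch_size)


train_samples = 60_000  # assumed size of the MNIST training split
steps = steps_per_epoch(train_samples, 64)
print(steps)      # 938
print(5 * steps)  # 4690 updates over 5 epochs
```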


In [5]:
# Evaluate model
test_loss, test_acc = model.evaluate(x_test, y_test, verbose=0)
print("Test accuracy:", round(test_acc, 2))

Test accuracy: 0.99
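
For one-hot labels, the reported accuracy is the fraction of samples whose highest-probability class matches the true class. The sketch below reproduces that calculation in plain Python on made-up toy predictions. The values and helper names are illustrative only and are not outputs of the model above.

```python
def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=values.__getitem__)


def accuracy(y_true, y_prob):
    """Fraction of samples where the predicted class equals the true class."""
    correct = sum(argmax(t) == argmax(p) for t, p in zip(y_true, y_prob))
    return correct / len(y_true)


# Toy 3-class example: the last prediction is wrong
y_true = [[1, 0, 0], [0, 1, 0], [0, 0, 1], [0, 1, 0]]
y_prob = [[0.8, 0.1, 0.1], [0.2, 0.7, 0.1], [0.1, 0.2, 0.7], [0.6, 0.3, 0.1]]
print(accuracy(y_true, y_prob))  # 0.75
```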


✅ **Summary:**
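- The CNN learns spatial features of handwritten digits directly from the MNIST pixel data.
- This small model (about 225,000 parameters) reaches ~99% test accuracy after 5 epochs.
- Accuracy could likely be improved further with **data augmentation** or **deeper networks**.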
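
As a rough illustration of the data augmentation idea, the sketch below shifts a tiny image by a pixel offset and pads the vacated area with zeros, which yields a new training example with the same label. It is a toy stand-in written in plain Python, and `shift_image` is a hypothetical helper. In practice, Keras preprocessing layers such as `RandomTranslation` would be a more natural choice.

```python
def shift_image(image, dx, dy, fill=0):
    """Translate a 2D list-of-lists image by (dx, dy) pixels, padding with `fill`."""
    height, width = len(image), len(image[0])
    shifted = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            ny, nx = y + dy, x + dx
            if 0 <= ny < height and 0 <= nx < width:
                shifted[ny][nx] = image[y][x]
    return shifted


toy = [
    [0, 1, 0],
    [0, 1, 0],
    [0, 0, 0],
]
print(shift_image(toy, dx=1, dy=0))  # [[0, 0, 1], [0, 0, 1], [0, 0, 0]]
```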