
# **Baseline MLP for the MNIST dataset**

**Importing Packages**

In [1]:
import numpy
from keras.datasets import mnist
from keras.models import Sequential
from keras.layers import Dense
# from keras.layers import Dropout
from keras.utils import np_utils

In [2]:
seed = 7
numpy.random.seed(seed)

**Load data**

In [11]:
(X_train, y_train), (X_test, y_test) = mnist.load_data()

**Flatten each 28x28 image to a 784-element vector**

Each image has 28 x 28 = 784 pixels. A dense layer takes each sample as a flat, one-dimensional vector, so every 2-D image is reshaped into a single row of 784 values.


In [6]:
X_train.shape

(60000, 28, 28)

In [9]:
y_test.shape

(10000,)

In [12]:
num_pixels = X_train.shape[1] * X_train.shape[2]  # 28 * 28 = 784
print(X_train.shape)
# flatten each image into one row and convert to float32 for training
X_train = X_train.reshape(X_train.shape[0], num_pixels).astype('float32')
X_test = X_test.reshape(X_test.shape[0], num_pixels).astype('float32')
print(X_train.shape)

(60000, 28, 28)
(60000, 784)
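
The reshape above keeps the pixels in row-major order: the first 28 values of each flattened vector are the first image row, the next 28 are the second row, and so on. A minimal sketch of that behaviour, using a small made-up array (`images` is a hypothetical stand-in, not the MNIST data):

```python
import numpy as np

# Hypothetical stand-in for a batch of images: 3 "images" of 28x28 values.
images = np.arange(3 * 28 * 28).reshape(3, 28, 28)

# Flatten each image to one row; -1 lets NumPy infer 28 * 28 = 784.
flat = images.reshape(images.shape[0], -1)

print(flat.shape)
# Element 28 of the flat vector is the first pixel of the second image row.
print(flat[0, 28] == images[0, 1, 0])
```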


**Normalize inputs from 0-255 to 0-1**

In [None]:
X_train = X_train / 255  # scale pixels to [0, 1]; scaled inputs generally train faster and more stably
X_test = X_test / 255
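
To see what the division does to individual values, here is a small sketch on a made-up row of pixel intensities (`pixels` is a hypothetical example, not taken from the dataset):

```python
import numpy as np

# Hypothetical pixel intensities covering the full 8-bit range.
pixels = np.array([[0, 128, 255]], dtype=np.uint8)

# Convert to float32 first, then divide, as the notebook does.
scaled = pixels.astype("float32") / 255

print(scaled.dtype)                 # float32
print(scaled.min(), scaled.max())   # 0.0 1.0
```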

**One-hot encode the outputs**

In [13]:
y_train = np_utils.to_categorical(y_train)
y_test = np_utils.to_categorical(y_test)
num_classes = y_test.shape[1]
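
One-hot encoding turns each integer label into a vector with a single 1 at the label's index. The sketch below reproduces the idea with plain NumPy on three made-up labels; it is an illustration of the encoding, not the Keras implementation, and `labels` and `n_classes` are hypothetical names:

```python
import numpy as np

# Hypothetical digit labels and the number of classes (digits 0-9).
labels = np.array([3, 0, 9])
n_classes = 10

# Row i of the identity matrix is the one-hot vector for class i.
one_hot = np.eye(n_classes, dtype="float32")[labels]

print(one_hot.shape)           # (3, 10)
print(one_hot[0])              # 1.0 at index 3, zeros elsewhere
print(one_hot.argmax(axis=1))  # recovers the original labels
```

Taking `argmax` along the class axis inverts the encoding, which is handy when comparing predictions with true labels.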

**Define baseline model**

In [14]:
def baseline_model():
	# create model
	model = Sequential()
	model.add(Dense(num_pixels, input_dim=num_pixels, activation='relu')) # hidden layer: 784 neurons
	model.add(Dense(num_classes,  activation='softmax')) # output layer: 10 neurons
	# Compile model
	model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
	return model
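
For a sense of the model's size, the number of trainable parameters can be worked out by hand: a dense layer has one weight per input-unit pair plus one bias per unit. A short sketch of that arithmetic, assuming 784 inputs, 784 hidden units, and 10 output classes as above (`dense_params` is a hypothetical helper, not a Keras function):

```python
def dense_params(n_inputs, n_units):
    """Weights (n_inputs * n_units) plus one bias per unit."""
    return n_inputs * n_units + n_units

hidden = dense_params(784, 784)  # hidden layer
output = dense_params(784, 10)   # softmax output layer
total = hidden + output

print(hidden, output, total)  # 615440 7850 623290
```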

**Build and Fit the model**

In [15]:
model = baseline_model()
model.fit(X_train, y_train, validation_data=(X_test, y_test), epochs=10, batch_size=200, verbose=2)

Epoch 1/10
300/300 - 6s - loss: 5.2249 - accuracy: 0.9092 - val_loss: 1.2321 - val_accuracy: 0.9452 - 6s/epoch - 22ms/step
Epoch 2/10
300/300 - 6s - loss: 0.7021 - accuracy: 0.9593 - val_loss: 0.6777 - val_accuracy: 0.9558 - 6s/epoch - 21ms/step
Epoch 3/10
300/300 - 5s - loss: 0.3241 - accuracy: 0.9722 - val_loss: 0.5401 - val_accuracy: 0.9633 - 5s/epoch - 18ms/step
Epoch 4/10
300/300 - 6s - loss: 0.2098 - accuracy: 0.9791 - val_loss: 0.5212 - val_accuracy: 0.9664 - 6s/epoch - 21ms/step
Epoch 5/10
300/300 - 5s - loss: 0.1845 - accuracy: 0.9814 - val_loss: 0.4660 - val_accuracy: 0.9674 - 5s/epoch - 18ms/step
Epoch 6/10
300/300 - 6s - loss: 0.1519 - accuracy: 0.9833 - val_loss: 0.4687 - val_accuracy: 0.9701 - 6s/epoch - 21ms/step
Epoch 7/10
300/300 - 5s - loss: 0.1379 - accuracy: 0.9850 - val_loss: 0.5862 - val_accuracy: 0.9660 - 5s/epoch - 18ms/step
Epoch 8/10
300/300 - 6s - loss: 0.1503 - accuracy: 0.9836 - val_loss: 0.5304 - val_accuracy: 0.9664 - 6s/epoch - 21ms/step
Epoch 9/10
300/3

<keras.callbacks.History at 0x7f0d38a16890>
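
The `300/300` at the start of each epoch line is the number of batches processed per epoch. A quick check of that figure, assuming the 60000 training samples and `batch_size=200` used above:

```python
import math

n_samples = 60000   # training images
batch_size = 200    # as passed to model.fit

# The last batch may be smaller, so round up.
steps_per_epoch = math.ceil(n_samples / batch_size)

print(steps_per_epoch)  # 300
```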

**Final evaluation of the model**

In [16]:
scores = model.evaluate(X_test, y_test, verbose=0)
print("Baseline Error: %.2f%%" % (100-scores[1]*100))

Baseline Error: 3.13%
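
The error reported here is simply 100 minus the accuracy in percent. As a rough sketch of how such a figure arises from predictions, the example below computes it by hand on four made-up samples with two classes (`probs` and `true_one_hot` are hypothetical arrays, not model output):

```python
import numpy as np

# Hypothetical predicted class probabilities and one-hot true labels.
probs = np.array([[0.1, 0.9], [0.8, 0.2], [0.3, 0.7], [0.6, 0.4]])
true_one_hot = np.array([[0, 1], [1, 0], [1, 0], [1, 0]])

# The predicted class is the one with the highest probability.
pred = probs.argmax(axis=1)
true = true_one_hot.argmax(axis=1)

accuracy = (pred == true).mean()
error_pct = 100 - accuracy * 100

print("Error: %.2f%%" % error_pct)  # Error: 25.00%
```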
