**First Network**
---

In [1]:
from tensorflow.keras.datasets import mnist
from tensorflow.keras import models
from tensorflow.keras import layers
from tensorflow.keras.utils import to_categorical

In [2]:
# Load MNIST, which comes already separated into training and test sets
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()

In [3]:
# Check the shape of the training images and labels
print(train_images.shape)
print(train_labels.shape)

(60000, 28, 28)
(60000,)


`The training set contains 60,000 images of 28x28 pixels each, and 60,000 labels, one per image, each a digit from 0 to 9.`

In [4]:
# Build our first network, which will predict which digit is in the picture.
# Only the first layer needs input_shape; later layers infer it from the previous layer.
network = models.Sequential()
network.add(layers.Dense(784, activation='relu', input_shape=(28 * 28,)))
network.add(layers.Dense(784, activation='relu'))
network.add(layers.Dense(10, activation='softmax'))
network.compile(optimizer='adam',
                loss='categorical_crossentropy',
                metrics=['accuracy'])


Before we can feed our data into the newly created model, we need to reshape the input into a format \
that the model can read. The original shape of the training input was (60000, 28, 28), \
that is, 60,000 images, each 28 pixels high and 28 pixels wide. \
We will reshape it so that all pixels of each image sit in one row of a 2D array. \
We can think of this as a dataset with 60,000 rows and 28*28 = 784 columns. \
We also convert the pixel values from integers in the range 0 to 255 into floats between 0 and 1 by dividing by 255.
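
The reshaping described above can be tried on a tiny array first. The following is a minimal sketch, assuming NumPy is installed; the 3 images of 2x2 pixels are made-up stand-ins for the MNIST data, not real samples.

```python
import numpy as np

# Hypothetical stand-in for MNIST: 3 "images" of 2x2 pixels, values 0-255
images = np.array(
    [[[0, 255], [128, 64]],
     [[10, 20], [30, 40]],
     [[255, 255], [0, 0]]],
    dtype=np.uint8,
)

# Flatten every image into a single row: (3, 2, 2) -> (3, 4)
flat = images.reshape((images.shape[0], -1))

# Convert to float32 and scale the pixel values into [0, 1]
scaled = flat.astype("float32") / 255

print(images.shape)  # shape before flattening
print(flat.shape)    # shape after flattening
print(flat[0])       # pixels of the first image, row by row
```

Passing `-1` lets NumPy work out the number of columns itself; writing `28 * 28` explicitly, as in the next cell, does the same thing for the real data.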

In [9]:
train_images = train_images.reshape((60000, 28 * 28))
train_images = train_images.astype('float32') / 255
test_images = test_images.reshape((10000, 28 * 28))
test_images = test_images.astype('float32') / 255

`We also have to make sure the network treats this as a classification problem, because the labels 0 to 9 \
could otherwise be read as numeric values in a regression. The categorical crossentropy loss we chose expects one-hot encoded labels, so we encode our target as categories:`
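
To see what encoding the target as categories means, here is a minimal sketch, assuming NumPy is installed. It illustrates the idea of one-hot encoding on a few made-up labels; it is not Keras's actual implementation of `to_categorical`.

```python
import numpy as np

# Hypothetical sample of digit labels (not taken from the real dataset)
labels = np.array([5, 0, 4, 1, 9])
num_classes = 10

# One row per label, one column per class, all zeros to start with
one_hot = np.zeros((labels.size, num_classes), dtype="float32")

# Put a 1 in the column that matches each label
one_hot[np.arange(labels.size), labels] = 1.0

print(one_hot.shape)
print(one_hot[0])  # the row for label 5

# argmax recovers the original integer labels
print(np.argmax(one_hot, axis=1))
```

Each row contains exactly one 1, so no digit is treated as "larger" than another, which is the point of moving away from plain integers.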

In [6]:
train_labels = to_categorical(train_labels)
test_labels = to_categorical(test_labels)

In [7]:
# We are now ready to train our network. To do this, we call fit with the
# training data, the number of epochs, and the batch size:
network.fit(train_images, train_labels, epochs=10, batch_size=128)

Epoch 1/10


Epoch 2/10
Epoch 3/10
Epoch 4/10
Epoch 5/10
Epoch 6/10
Epoch 7/10
Epoch 8/10
Epoch 9/10
Epoch 10/10


<keras.callbacks.History at 0x29c717250>

In [8]:
test_loss, test_acc = network.evaluate(test_images, test_labels)
print('test_acc:', test_acc, 'test_loss', test_loss)

test_acc: 0.9824000597000122 test_loss 0.07870979607105255


**We have just taken the first step on our deep learning journey. <br>
We have seen that creating a network and using it as a black box is not all that complex. <br>
However, to get the most out of deep learning networks, it is essential to also understand what happens at each step.**