# Neural Networks with Keras

In [8]:
# Seed NumPy's random number generator so the results are more reproducible
# (TensorFlow keeps its own random state, which this call does not seed)
from numpy.random import seed
seed(42)

In [9]:
# Generate some fake data with 3 features

from sklearn.datasets import make_classification

X, y = make_classification(n_features=3, n_redundant=0, n_informative=3,
                           random_state=42, n_classes=2, n_clusters_per_class=1)

y = y.reshape(-1, 1)

print(X.shape)
print(y.shape)

(100, 3)
(100, 1)


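Use `train_test_split` to create training and testing data. With the default settings it holds out 25% of the rows, leaving 75 samples for training and 25 for testing.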

In [10]:
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

## Data Preprocessing

Scale the input features before training a multilayer perceptron model.

Without scaling, features with large ranges can dominate the weight updates, and training often converges slowly or not at all.

In [11]:
from sklearn.preprocessing import StandardScaler

X_scaler = StandardScaler().fit(X_train)

Apply the scaler that was fitted on the training data to both the training and testing data.

In [12]:
X_train_scaled = X_scaler.transform(X_train)
X_test_scaled = X_scaler.transform(X_test)
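
The fit-then-transform pattern above can be sketched in plain NumPy. This is only an illustration of the idea, not the scikit-learn implementation; the arrays `train` and `test` are hypothetical toy data, not the notebook's dataset.

```python
import numpy as np

# Hypothetical toy data: 4 training rows and 2 test rows, 2 features each
train = np.array([[1.0, 10.0], [2.0, 20.0], [3.0, 30.0], [4.0, 40.0]])
test = np.array([[2.5, 25.0], [5.0, 50.0]])

# "fit": learn the per-feature mean and standard deviation from the training rows only
mean = train.mean(axis=0)
std = train.std(axis=0)  # population standard deviation (ddof=0)

# "transform": apply the same training statistics to both sets
train_scaled = (train - mean) / std
test_scaled = (test - mean) / std

print(train_scaled.mean(axis=0))  # approximately 0 for each feature
print(train_scaled.std(axis=0))   # approximately 1 for each feature
print(test_scaled)
```

Reusing the training statistics for the test rows keeps information about the test set out of the preprocessing step.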

One-hot encode the labels

In [13]:
from tensorflow.keras.utils import to_categorical

In [14]:
# One-hot encoding
y_train_categorical = to_categorical(y_train)
y_test_categorical = to_categorical(y_test)
y_train_categorical

array([[0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [0., 1.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.],
       [0., 1.],
       [1., 0.
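
As a rough illustration of what one-hot encoding produces, the same layout can be built with NumPy alone. The `labels` array here is hypothetical and simply mimics the `(n, 1)` shape of `y_train`; this is a sketch, not how `to_categorical` is implemented.

```python
import numpy as np

# Hypothetical integer labels, shaped (n, 1) like y_train
labels = np.array([[1], [0], [1], [0]])
num_classes = 2

# Row i of the identity matrix is the one-hot vector for class i
one_hot = np.eye(num_classes)[labels.ravel()]
print(one_hot)

# argmax along the class axis recovers the original integer labels
recovered = one_hot.argmax(axis=1)
print(recovered)
```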

## Creating our Model

We must first decide what kind of model to apply to our data.

To predict a numerical (continuous) target, we use a regression model.

To predict a categorical target, we use a classifier model.

In this example, we will use a classifier to build the following network:

![nnet.png](../Images/nnet.png)

## Defining our Model Architecture (the layers)

We first need to create a sequential model, which is a linear stack of layers.

In [15]:
from tensorflow.keras.models import Sequential

model = Sequential()

Next, we add our first layer. This layer requires you to specify both the number of inputs and the number of nodes that you want in the hidden layer.

In [16]:
from tensorflow.keras.layers import Dense
number_inputs = 3
number_hidden_nodes = 4
model.add(Dense(units=number_hidden_nodes,
                activation='relu', input_dim=number_inputs))

![first_layer](../Images/nnet_first_layer.png)
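
To make the hidden layer concrete, here is a minimal NumPy sketch of what a dense layer with a ReLU activation computes for one sample. The weights `W` and biases `b` are random placeholders assumed for illustration; Keras initializes and learns its own values.

```python
import numpy as np

rng = np.random.default_rng(0)

number_inputs = 3
number_hidden_nodes = 4

# Hypothetical weights and biases; Keras learns these during training
W = rng.normal(size=(number_inputs, number_hidden_nodes))
b = np.zeros(number_hidden_nodes)

x = np.array([[0.2, 0.3, 0.4]])  # one sample with 3 features

# Dense layer: affine transform followed by ReLU, which clips negatives to zero
hidden = np.maximum(0.0, x @ W + b)

print(hidden.shape)  # (1, 4)
```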

Our final layer is the output layer. Here, we need to specify the activation function (typically `softmax` for classification) and the number of classes (labels) that we are trying to predict (2 in this example).

In [17]:
number_classes = 2
model.add(Dense(units=number_classes, activation='softmax'))

![output_layer](../Images/nnet_output_layer.png)
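
The `softmax` activation turns the raw scores of the output layer into probabilities that sum to 1. A small NumPy sketch follows, using hypothetical `logits` rather than values from the trained model:

```python
import numpy as np

def softmax(logits):
    """Turn raw scores into probabilities that sum to 1 along the last axis."""
    # Subtracting the row maximum avoids overflow without changing the result
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)

# Hypothetical raw outputs for 2 samples and 2 classes
logits = np.array([[2.0, 0.5], [0.0, 0.0]])
probs = softmax(logits)
print(probs)
```

Equal scores give equal probabilities, and a larger score always maps to a larger probability.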

## Model Summary

In [18]:
model.summary()

Model: "sequential"
_________________________________________________________________
Layer (type)                 Output Shape              Param #   
dense (Dense)                (None, 4)                 16        
_________________________________________________________________
dense_1 (Dense)              (None, 2)                 10        
Total params: 26
Trainable params: 26
Non-trainable params: 0
_________________________________________________________________
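
The parameter counts in the summary can be checked by hand: a dense layer has one weight per input-unit pair plus one bias per unit. The helper `dense_params` below is a hypothetical name introduced only for this check.

```python
def dense_params(n_inputs, n_units):
    """Weights (one per input-unit pair) plus one bias per unit."""
    return n_inputs * n_units + n_units

hidden = dense_params(3, 4)   # 3 * 4 + 4 = 16
output = dense_params(4, 2)   # 4 * 2 + 2 = 10
total = hidden + output
print(hidden, output, total)  # 16 10 26
```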


## Compile the Model

Now that we have our model architecture defined, we must compile the model using a loss function and optimizer. We can also specify additional training metrics such as accuracy.

In [19]:
# Use categorical crossentropy for categorical data and mean squared error for regression
# Hint: the output layer in this example uses softmax for classification (categorical)
# If your output layer activation were `linear`, you might want to use `mse` for the loss
model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
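
For intuition about the loss, categorical crossentropy can be sketched in NumPy as the mean negative log-probability assigned to the true class. This is a simplified illustration with hypothetical inputs (`confident`, `unsure`) and an assumed clipping constant `eps`, not the Keras implementation.

```python
import numpy as np

def categorical_crossentropy(y_true, y_pred, eps=1e-7):
    """Mean over samples of -sum(y_true * log(y_pred))."""
    y_pred = np.clip(y_pred, eps, 1.0 - eps)  # guard against log(0)
    return float(np.mean(-np.sum(y_true * np.log(y_pred), axis=-1)))

y_true = np.array([[0.0, 1.0], [1.0, 0.0]])      # one-hot labels
confident = np.array([[0.1, 0.9], [0.8, 0.2]])   # mostly correct predictions
unsure = np.array([[0.5, 0.5], [0.5, 0.5]])      # no preference for either class

loss_confident = categorical_crossentropy(y_true, confident)
loss_unsure = categorical_crossentropy(y_true, unsure)
print(loss_confident)
print(loss_unsure)  # log(2), about 0.693
```

Predictions that put more probability on the correct class yield a lower loss, which is what the optimizer works toward.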

## Training the Model
Finally, we train our model using our training data.

Training consists of repeatedly updating the weights with the optimizer so that the loss decreases. In this example, we train for 1000 epochs, where one epoch is a full pass over the training data.

We also choose to shuffle our training data and increase the detail printed out during each training cycle.
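
The `3/3` prefix in the training log is the number of mini-batches per epoch. The arithmetic below reproduces it, assuming the defaults that apply when nothing is passed explicitly: a 25% test split in `train_test_split` and a batch size of 32 in `fit`.

```python
import math

n_samples = 100        # rows generated by make_classification
test_fraction = 0.25   # train_test_split default when test_size is not given
batch_size = 32        # Keras default when batch_size is not passed to fit
epochs = 1000

n_test = math.ceil(n_samples * test_fraction)
n_train = n_samples - n_test
steps_per_epoch = math.ceil(n_train / batch_size)
total_updates = steps_per_epoch * epochs

print(n_train, steps_per_epoch, total_updates)  # 75 3 3000
```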

In [20]:
# Fit (train) the model
model.fit(
    X_train_scaled,
    y_train_categorical,
    epochs=1000,
    shuffle=True,
    verbose=2
)

Epoch 1/1000
3/3 - 0s - loss: 0.6588 - accuracy: 0.6400
Epoch 2/1000
3/3 - 0s - loss: 0.6538 - accuracy: 0.6400
Epoch 3/1000
3/3 - 0s - loss: 0.6489 - accuracy: 0.5867
Epoch 4/1000
3/3 - 0s - loss: 0.6445 - accuracy: 0.6000
Epoch 5/1000
3/3 - 0s - loss: 0.6397 - accuracy: 0.6133
Epoch 6/1000
3/3 - 0s - loss: 0.6350 - accuracy: 0.6267
Epoch 7/1000
3/3 - 0s - loss: 0.6305 - accuracy: 0.6533
Epoch 8/1000
3/3 - 0s - loss: 0.6262 - accuracy: 0.6667
Epoch 9/1000
3/3 - 0s - loss: 0.6216 - accuracy: 0.6667
Epoch 10/1000
3/3 - 0s - loss: 0.6171 - accuracy: 0.6800
Epoch 11/1000
3/3 - 0s - loss: 0.6127 - accuracy: 0.7067
Epoch 12/1000
3/3 - 0s - loss: 0.6083 - accuracy: 0.7067
Epoch 13/1000
3/3 - 0s - loss: 0.6039 - accuracy: 0.7600
Epoch 14/1000
3/3 - 0s - loss: 0.5997 - accuracy: 0.7733
Epoch 15/1000
3/3 - 0s - loss: 0.5954 - accuracy: 0.7733
Epoch 16/1000
3/3 - 0s - loss: 0.5911 - accuracy: 0.7733
Epoch 17/1000
3/3 - 0s - loss: 0.5869 - accuracy: 0.7733
Epoch 18/1000
3/3 - 0s - loss: 0.5824 - 

Epoch 145/1000
3/3 - 0s - loss: 0.1972 - accuracy: 0.9733
Epoch 146/1000
3/3 - 0s - loss: 0.1956 - accuracy: 0.9733
Epoch 147/1000
3/3 - 0s - loss: 0.1939 - accuracy: 0.9733
Epoch 148/1000
3/3 - 0s - loss: 0.1922 - accuracy: 0.9733
Epoch 149/1000
3/3 - 0s - loss: 0.1907 - accuracy: 0.9733
Epoch 150/1000
3/3 - 0s - loss: 0.1891 - accuracy: 0.9733
Epoch 151/1000
3/3 - 0s - loss: 0.1875 - accuracy: 0.9733
Epoch 152/1000
3/3 - 0s - loss: 0.1860 - accuracy: 0.9733
Epoch 153/1000
3/3 - 0s - loss: 0.1845 - accuracy: 0.9733
Epoch 154/1000
3/3 - 0s - loss: 0.1829 - accuracy: 0.9733
Epoch 155/1000
3/3 - 0s - loss: 0.1815 - accuracy: 0.9733
Epoch 156/1000
3/3 - 0s - loss: 0.1801 - accuracy: 0.9867
Epoch 157/1000
3/3 - 0s - loss: 0.1786 - accuracy: 0.9867
Epoch 158/1000
3/3 - 0s - loss: 0.1771 - accuracy: 0.9867
Epoch 159/1000
3/3 - 0s - loss: 0.1759 - accuracy: 0.9867
Epoch 160/1000
3/3 - 0s - loss: 0.1744 - accuracy: 0.9867
Epoch 161/1000
3/3 - 0s - loss: 0.1731 - accuracy: 0.9867
Epoch 162/1000

Epoch 287/1000
3/3 - 0s - loss: 0.0888 - accuracy: 0.9867
Epoch 288/1000
3/3 - 0s - loss: 0.0883 - accuracy: 0.9867
Epoch 289/1000
3/3 - 0s - loss: 0.0880 - accuracy: 0.9867
Epoch 290/1000
3/3 - 0s - loss: 0.0877 - accuracy: 0.9867
Epoch 291/1000
3/3 - 0s - loss: 0.0874 - accuracy: 0.9867
Epoch 292/1000
3/3 - 0s - loss: 0.0871 - accuracy: 0.9867
Epoch 293/1000
3/3 - 0s - loss: 0.0867 - accuracy: 0.9867
Epoch 294/1000
3/3 - 0s - loss: 0.0864 - accuracy: 0.9867
Epoch 295/1000
3/3 - 0s - loss: 0.0861 - accuracy: 0.9733
Epoch 296/1000
3/3 - 0s - loss: 0.0860 - accuracy: 0.9733
Epoch 297/1000
3/3 - 0s - loss: 0.0856 - accuracy: 0.9733
Epoch 298/1000
3/3 - 0s - loss: 0.0853 - accuracy: 0.9733
Epoch 299/1000
3/3 - 0s - loss: 0.0850 - accuracy: 0.9733
Epoch 300/1000
3/3 - 0s - loss: 0.0846 - accuracy: 0.9733
Epoch 301/1000
3/3 - 0s - loss: 0.0844 - accuracy: 0.9733
Epoch 302/1000
3/3 - 0s - loss: 0.0841 - accuracy: 0.9733
Epoch 303/1000
3/3 - 0s - loss: 0.0838 - accuracy: 0.9733
Epoch 304/1000

Epoch 429/1000
3/3 - 0s - loss: 0.0600 - accuracy: 0.9867
Epoch 430/1000
3/3 - 0s - loss: 0.0599 - accuracy: 0.9867
Epoch 431/1000
3/3 - 0s - loss: 0.0598 - accuracy: 0.9867
Epoch 432/1000
3/3 - 0s - loss: 0.0596 - accuracy: 0.9867
Epoch 433/1000
3/3 - 0s - loss: 0.0595 - accuracy: 0.9867
Epoch 434/1000
3/3 - 0s - loss: 0.0594 - accuracy: 0.9867
Epoch 435/1000
3/3 - 0s - loss: 0.0593 - accuracy: 0.9867
Epoch 436/1000
3/3 - 0s - loss: 0.0591 - accuracy: 0.9867
Epoch 437/1000
3/3 - 0s - loss: 0.0590 - accuracy: 0.9867
Epoch 438/1000
3/3 - 0s - loss: 0.0589 - accuracy: 0.9867
Epoch 439/1000
3/3 - 0s - loss: 0.0587 - accuracy: 0.9867
Epoch 440/1000
3/3 - 0s - loss: 0.0586 - accuracy: 0.9867
Epoch 441/1000
3/3 - 0s - loss: 0.0585 - accuracy: 0.9867
Epoch 442/1000
3/3 - 0s - loss: 0.0585 - accuracy: 0.9867
Epoch 443/1000
3/3 - 0s - loss: 0.0583 - accuracy: 0.9867
Epoch 444/1000
3/3 - 0s - loss: 0.0582 - accuracy: 0.9867
Epoch 445/1000
3/3 - 0s - loss: 0.0581 - accuracy: 0.9867
Epoch 446/1000

Epoch 571/1000
3/3 - 0s - loss: 0.0468 - accuracy: 0.9733
Epoch 572/1000
3/3 - 0s - loss: 0.0467 - accuracy: 0.9733
Epoch 573/1000
3/3 - 0s - loss: 0.0467 - accuracy: 0.9733
Epoch 574/1000
3/3 - 0s - loss: 0.0467 - accuracy: 0.9733
Epoch 575/1000
3/3 - 0s - loss: 0.0465 - accuracy: 0.9733
Epoch 576/1000
3/3 - 0s - loss: 0.0464 - accuracy: 0.9733
Epoch 577/1000
3/3 - 0s - loss: 0.0464 - accuracy: 0.9733
Epoch 578/1000
3/3 - 0s - loss: 0.0463 - accuracy: 0.9733
Epoch 579/1000
3/3 - 0s - loss: 0.0462 - accuracy: 0.9733
Epoch 580/1000
3/3 - 0s - loss: 0.0461 - accuracy: 0.9733
Epoch 581/1000
3/3 - 0s - loss: 0.0461 - accuracy: 0.9733
Epoch 582/1000
3/3 - 0s - loss: 0.0460 - accuracy: 0.9733
Epoch 583/1000
3/3 - 0s - loss: 0.0460 - accuracy: 0.9733
Epoch 584/1000
3/3 - 0s - loss: 0.0459 - accuracy: 0.9733
Epoch 585/1000
3/3 - 0s - loss: 0.0459 - accuracy: 0.9733
Epoch 586/1000
3/3 - 0s - loss: 0.0458 - accuracy: 0.9733
Epoch 587/1000
3/3 - 0s - loss: 0.0458 - accuracy: 0.9733
Epoch 588/1000

Epoch 713/1000
3/3 - 0s - loss: 0.0392 - accuracy: 0.9733
Epoch 714/1000
3/3 - 0s - loss: 0.0391 - accuracy: 0.9733
Epoch 715/1000
3/3 - 0s - loss: 0.0391 - accuracy: 0.9733
Epoch 716/1000
3/3 - 0s - loss: 0.0391 - accuracy: 0.9733
Epoch 717/1000
3/3 - 0s - loss: 0.0391 - accuracy: 0.9733
Epoch 718/1000
3/3 - 0s - loss: 0.0391 - accuracy: 0.9733
Epoch 719/1000
3/3 - 0s - loss: 0.0390 - accuracy: 0.9733
Epoch 720/1000
3/3 - 0s - loss: 0.0390 - accuracy: 0.9733
Epoch 721/1000
3/3 - 0s - loss: 0.0390 - accuracy: 0.9733
Epoch 722/1000
3/3 - 0s - loss: 0.0389 - accuracy: 0.9733
Epoch 723/1000
3/3 - 0s - loss: 0.0389 - accuracy: 0.9733
Epoch 724/1000
3/3 - 0s - loss: 0.0388 - accuracy: 0.9733
Epoch 725/1000
3/3 - 0s - loss: 0.0388 - accuracy: 0.9733
Epoch 726/1000
3/3 - 0s - loss: 0.0388 - accuracy: 0.9733
Epoch 727/1000
3/3 - 0s - loss: 0.0388 - accuracy: 0.9733
Epoch 728/1000
3/3 - 0s - loss: 0.0388 - accuracy: 0.9733
Epoch 729/1000
3/3 - 0s - loss: 0.0387 - accuracy: 0.9733
Epoch 730/1000

Epoch 855/1000
3/3 - 0s - loss: 0.0344 - accuracy: 0.9733
Epoch 856/1000
3/3 - 0s - loss: 0.0344 - accuracy: 0.9733
Epoch 857/1000
3/3 - 0s - loss: 0.0344 - accuracy: 0.9733
Epoch 858/1000
3/3 - 0s - loss: 0.0343 - accuracy: 0.9733
Epoch 859/1000
3/3 - 0s - loss: 0.0343 - accuracy: 0.9733
Epoch 860/1000
3/3 - 0s - loss: 0.0343 - accuracy: 0.9733
Epoch 861/1000
3/3 - 0s - loss: 0.0342 - accuracy: 0.9733
Epoch 862/1000
3/3 - 0s - loss: 0.0342 - accuracy: 0.9733
Epoch 863/1000
3/3 - 0s - loss: 0.0342 - accuracy: 0.9733
Epoch 864/1000
3/3 - 0s - loss: 0.0342 - accuracy: 0.9733
Epoch 865/1000
3/3 - 0s - loss: 0.0341 - accuracy: 0.9733
Epoch 866/1000
3/3 - 0s - loss: 0.0341 - accuracy: 0.9733
Epoch 867/1000
3/3 - 0s - loss: 0.0341 - accuracy: 0.9733
Epoch 868/1000
3/3 - 0s - loss: 0.0341 - accuracy: 0.9733
Epoch 869/1000
3/3 - 0s - loss: 0.0340 - accuracy: 0.9733
Epoch 870/1000
3/3 - 0s - loss: 0.0340 - accuracy: 0.9733
Epoch 871/1000
3/3 - 0s - loss: 0.0340 - accuracy: 0.9733
Epoch 872/1000

Epoch 997/1000
3/3 - 0s - loss: 0.0312 - accuracy: 0.9733
Epoch 998/1000
3/3 - 0s - loss: 0.0312 - accuracy: 0.9733
Epoch 999/1000
3/3 - 0s - loss: 0.0311 - accuracy: 0.9733
Epoch 1000/1000
3/3 - 0s - loss: 0.0311 - accuracy: 0.9733


<tensorflow.python.keras.callbacks.History at 0x2d872314940>

## Quantifying the Model
We use our testing data to evaluate our model. This measures how well the model generalizes, i.e. its ability to predict new and previously unseen data points.

In [21]:
# Evaluate the model using the testing data
model_loss, model_accuracy = model.evaluate(
    X_test_scaled, y_test_categorical, verbose=2)
print(f"Loss: {model_loss}, Accuracy: {model_accuracy}")

1/1 - 0s - loss: 0.5312 - accuracy: 0.8400
Loss: 0.5312415361404419, Accuracy: 0.8399999737739563
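
The accuracy metric is the fraction of samples whose most probable class matches the true label; with 25 test rows, 0.84 corresponds to 21 correct predictions. A minimal sketch of that calculation, using hypothetical `probs` and `y_true` arrays rather than the model's actual outputs:

```python
import numpy as np

# Hypothetical softmax outputs and one-hot labels for 5 samples
probs = np.array([[0.9, 0.1], [0.2, 0.8], [0.6, 0.4], [0.3, 0.7], [0.55, 0.45]])
y_true = np.array([[1, 0], [0, 1], [0, 1], [0, 1], [1, 0]])

predicted = probs.argmax(axis=1)   # most probable class per sample
actual = y_true.argmax(axis=1)     # undo the one-hot encoding
accuracy = float(np.mean(predicted == actual))

print(accuracy)  # 0.8
```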


## Making Predictions with new data

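We can use our trained model to make predictions with `model.predict`, which returns one probability per class; the predicted class is the index of the largest probability. New inputs should normally be passed through `X_scaler.transform` first; the sample below is fed to the model as-is.

In [22]:
import numpy as np
new_data = np.array([[0.2, 0.3, 0.4]])
# `predict_classes` is gone from newer TensorFlow releases; the argmax of the
# softmax probabilities returned by `model.predict` gives the same class index
predicted_class = np.argmax(model.predict(new_data), axis=-1)
print(f"Predicted class: {predicted_class}")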

Predicted class: [1]


