# Artificial Neural Network
* ### Regression
* ### Classification

###### Source : https://www.youtube.com/watch?v=OTTOglLJxLU&list=PLZoTAELRMXVPGU70ZGsckrMdr0FteeRUi&index=18 

## Main focus is to get started with Keras
> #### Classification problem:
This data set contains details of a bank's customers. The target is a binary variable indicating whether the customer left the bank (closed the account) or remains a customer.
If you have yet to start with deep learning and are wondering how and when to begin, this could be a starter for you.
<h3 style='color:#1d057d'>Just start, folks. Believe me, it's not tough.</h3>

In [10]:
import pandas as pd
import matplotlib.pyplot as plt
import numpy as np

### Data Preparation

In [11]:
## the last column, Exited, goes into y; the first three columns are not required (iloc[:,3:13] starts at column index 3)
data = pd.read_csv('Churn_Modelling.csv')
X = data.iloc[:,3:13]
y = data.iloc[:,-1]

### Categorical features need one-hot encoding, so we use get_dummies
* *Note*: drop_first=True -> keeps k-1 dummy columns instead of k for a feature with k categories
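
To make the k-1 idea concrete, here is a minimal pure-Python sketch of what dropping the first category means. It is not the pandas implementation; the helper `one_hot_drop_first` and the sample values are hypothetical, and it assumes categories are ordered alphabetically so that the first one becomes the implicit baseline.

```python
def one_hot_drop_first(values):
    """Encode a list of category labels as k-1 indicator columns."""
    # Sort the distinct labels; the first one becomes the implicit baseline.
    categories = sorted(set(values))
    kept = categories[1:]
    # One row per input value, one 0/1 column per kept category.
    rows = [[1 if value == cat else 0 for cat in kept] for value in values]
    return kept, rows


# Hypothetical sample values, not rows taken from Churn_Modelling.csv.
geography = ["France", "Spain", "Germany", "France"]
columns, encoded = one_hot_drop_first(geography)
print(columns)   # ['Germany', 'Spain']
print(encoded)   # [[0, 0], [0, 1], [1, 0], [0, 0]]
```

A row of all zeros stands for the dropped category, so no information is lost while one redundant column is avoided.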

In [12]:
dummies = pd.get_dummies(X[['Geography', 'Gender']],drop_first=True)
X = pd.concat([X,dummies],axis=1)
X.drop(['Geography', 'Gender'],axis=1,inplace=True)

In [13]:
## train test split
from sklearn.model_selection import train_test_split
X_train,X_test,y_train,y_test = train_test_split(X,y,train_size=0.8,random_state=0)

### Feature scaling
> ##### Reasons to do feature scaling 
>* Each neuron computes W*X + B before applying its activation 
>* W -> weights, X -> input features, B -> bias
>* When features have large or very different ranges, gradient descent needs more steps to converge to a minimum; scaling the features speeds up training
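
As a rough illustration of the standardization applied in the next cell, the sketch below computes z = (x - mean) / std by hand. The helper names and the balance values are hypothetical; the key point it shows is that the mean and standard deviation are learned from the training data only and then reused on the test data.

```python
import statistics


def fit_scaler(column):
    """Return the mean and population standard deviation of one feature."""
    return statistics.mean(column), statistics.pstdev(column)


def transform(column, mean, std):
    """Standardize values with statistics learned from the training data."""
    return [(x - mean) / std for x in column]


# Hypothetical balances, chosen only to show the change in scale.
train_balance = [0.0, 50000.0, 100000.0]
test_balance = [75000.0]

mean, std = fit_scaler(train_balance)              # learned from train only
train_scaled = transform(train_balance, mean, std)
test_scaled = transform(test_balance, mean, std)   # reuses the train statistics
print(train_scaled)
print(test_scaled)
```

After scaling, the training feature has mean 0 and standard deviation 1, so no single feature dominates W*X + B just because of its units.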


In [14]:
from sklearn.preprocessing import StandardScaler
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

In [15]:
import keras
from keras.models import Sequential
from keras.layers import Dense, LeakyReLU,ReLU,ELU
from keras.layers import Dropout
import random

Using TensorFlow backend.


In [20]:
def annClassifier(no_hidden_layers,neuron_list,input_dim,activation='relu',kernel_initializer='he_normal',optimizer='adam',loss='binary_crossentropy',metrics=['accuracy']):
    ## neuron_list needs exactly one entry per layer
    if no_hidden_layers != len(neuron_list):
        raise ValueError('[no_hidden_layers] and length of [neuron_list] do not match.')
    classifier = Sequential()
    for layers_indx in range(no_hidden_layers):
        if layers_indx ==0:
            ## only the first layer is told how many input features to expect
            classifier.add(Dense(units=neuron_list[layers_indx],kernel_initializer=kernel_initializer,activation=activation,input_dim=input_dim))
        else:
            classifier.add(Dense(units=neuron_list[layers_indx],kernel_initializer=kernel_initializer,activation=activation))
    ## note: the last entry of neuron_list acts as the output layer and gets the same activation
    classifier.compile(optimizer=optimizer, loss=loss,metrics=metrics)
    return classifier

#### Don't panic: the above is a simple function for a binary classification problem.
##### Let me break down the above code
1. Parameters 
> * *no_hidden_layers*: Number of layers to add, as an integer. Example: 4. It must equal the length of neuron_list, so the output layer is counted too.
> * *neuron_list*: Number of neurons in each layer, in order. Example: [4,5,10] means 4, 5 and 10 neurons in layers 1, 2 and 3.
> * *input_dim*: Number of features in X (the independent features). Example: in our data it is 11.
> * *activation='relu'*: Activation function applied in every layer.
> * *kernel_initializer='he_normal'*: How the layer weights are initialized.
> * *optimizer='adam'*: Optimizer used for training.
> * *loss='binary_crossentropy'*: Loss function to minimize.
> * *metrics=['accuracy']*: Metrics reported during training and evaluation.

2. Sequential() - Most simple models start from a Sequential object, to which you then add your layers one after another.
3. Dense() - A fully connected layer; units sets its number of neurons.
> ##### Notes:   
> * To see a summary of the classifier at any point, call classifier.summary().
> * adam is a good default optimizer to start with.
> * If the output is binary, loss -> binary_crossentropy; for multiple classes, loss -> categorical_crossentropy.


###### Note: Everything is put into a Python function here; we could also define custom layers instead.
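
To show what the binary_crossentropy loss measures, here is a small hand-written sketch of the formula. It is not Keras code: the helpers `sigmoid` and `binary_crossentropy`, the `eps` clipping value, and the sample scores are assumptions made for illustration. The sigmoid is included because this loss expects a probability between 0 and 1, whereas the function above passes the same `activation` to every layer, including the last one.

```python
import math


def sigmoid(z):
    """Squash a raw score into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def binary_crossentropy(y_true, p_pred, eps=1e-7):
    """Mean binary cross-entropy; eps keeps log() away from 0."""
    total = 0.0
    for y, p in zip(y_true, p_pred):
        p = min(max(p, eps), 1.0 - eps)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
    return total / len(y_true)


# Hypothetical raw scores (W*X + B of the last layer) and true labels.
scores = [2.0, -1.0, 0.0]
labels = [1, 0, 1]
probs = [sigmoid(z) for z in scores]
print(probs)
print(binary_crossentropy(labels, probs))
```

Confident correct predictions give a loss close to 0, while confident wrong ones are penalized heavily.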

#### The function below is the same as the one above except for Dropout: a Dropout layer is added after every layer except the last (output) one.


In [18]:
def annClassifier(no_hidden_layers,neuron_list,input_dim,activation='relu',kernel_initializer='he_normal',optimizer='adam',loss='binary_crossentropy',metrics=['accuracy']):
    dropOuts = [0.2,0.4,0.3,0.6]   ## candidate dropout rates; one is picked at random per layer
    if no_hidden_layers != len(neuron_list):
        raise ValueError('[no_hidden_layers] and length of [neuron_list] do not match.')
    classifier = Sequential()
    for layers_indx in range(no_hidden_layers):
        if layers_indx ==0:
            classifier.add(Dense(units=neuron_list[layers_indx],kernel_initializer=kernel_initializer,activation=activation,input_dim=input_dim))
        else:
            classifier.add(Dense(units=neuron_list[layers_indx],kernel_initializer=kernel_initializer,activation=activation))
        ## skip Dropout after the last layer so the prediction itself is never zeroed out
        if layers_indx < no_hidden_layers-1:
            classifier.add(Dropout(random.choice(dropOuts)))
    classifier.compile(optimizer=optimizer, loss=loss,metrics=metrics)
    return classifier
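
For intuition about what a Dropout layer does during training, here is a pure-Python sketch of the usual "inverted dropout" idea: each activation is zeroed with probability `rate`, and the survivors are scaled up by 1 / (1 - rate) so the expected sum stays the same. The `dropout` helper and the sample activations are hypothetical, and this is only an illustration, not the Keras implementation. At prediction time dropout is switched off.

```python
import random


def dropout(activations, rate, rng):
    """Zero each activation with probability `rate`, rescale the survivors."""
    keep_scale = 1.0 / (1.0 - rate)
    return [0.0 if rng.random() < rate else a * keep_scale for a in activations]


rng = random.Random(0)                     # fixed seed so the run is repeatable
layer_output = [1.0, 2.0, 3.0, 4.0, 5.0]   # hypothetical activations
print(dropout(layer_output, rate=0.4, rng=rng))
```

Because a different random subset of neurons is silenced on every batch, the network cannot rely too heavily on any single neuron, which tends to reduce overfitting.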

##### Here we go... Try it out with different combinations of neurons and hidden layers. Soon we will look into hyperparameter optimization.
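
When trying different layouts, it helps to know how many trainable parameters each choice creates. The sketch below counts them for a stack of Dense layers, using the fact that a Dense layer has one weight per input-output pair plus one bias per neuron. The helper `dense_param_counts` is hypothetical; the example layout of 11 inputs and layers of 10, 20, 15 and 1 neurons is the kind of configuration this notebook uses.

```python
def dense_param_counts(input_dim, neuron_list):
    """Return the number of trainable parameters of each Dense layer."""
    counts = []
    fan_in = input_dim
    for units in neuron_list:
        counts.append(fan_in * units + units)   # weights + biases
        fan_in = units                          # this layer feeds the next
    return counts


counts = dense_param_counts(input_dim=11, neuron_list=[10, 20, 15, 1])
print(counts)        # [120, 220, 315, 16]
print(sum(counts))   # 671
```

These per-layer numbers are what you would expect to see in the parameter column of classifier.summary() for the same layout.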

In [21]:
annoptimizer = annClassifier(no_hidden_layers=4,neuron_list=[10,20,15,1],input_dim=X.shape[-1])
annclassifier_history = annoptimizer.fit(X_train,y_train,validation_split=0.33, batch_size=10,epochs=100)

Train on 5359 samples, validate on 2641 samples
Epoch 1/100
...
Epoch 100/100
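
The "Train on 5359 samples, validate on 2641 samples" line may look odd, since 33% of 8000 is 2640. The sketch below reproduces the numbers under an assumption about how this Keras version picks the split point: the first int(n * (1 - validation_split)) rows are used for training and the remaining rows for validation. The variable names are mine, and the rule itself is an assumption that happens to match the log.

```python
n_samples = 8000            # rows passed to fit(): 5359 + 2641 in the log
validation_split = 0.33

# Assumed rule: the first int(n * (1 - split)) rows train, the rest validate.
split_at = int(n_samples * (1.0 - validation_split))
n_train, n_val = split_at, n_samples - split_at
print(n_train, n_val)       # 5359 2641
```

The off-by-one comes from floating-point rounding: 1.0 - 0.33 is stored as a value slightly below 0.67, so the product lands just under 5360 and int() truncates it.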


In [22]:
## prediction
y_pred = annoptimizer.predict(X_test)
y_pred = (y_pred>0.5)  ## turn predicted scores into True/False labels
from sklearn.metrics import confusion_matrix,accuracy_score
## confusion matrix
cm = confusion_matrix(y_test,y_pred)
## accuracy score
acc_score = accuracy_score(y_test,y_pred) ## 0.853 in this run
print(f"Confusion matrix:\n{cm}\n\nAccuracy: {acc_score}")


Confusion matrix:
[[1537   58]
 [ 236  169]]

Accuracy: 0.853
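
Accuracy alone can hide how the model does on the customers who actually leave. As a small follow-up, the sketch below derives accuracy, precision and recall from the confusion matrix printed above. It assumes the scikit-learn layout (rows are true labels, columns are predicted labels) and that Exited = 1 marks a customer who left; the variable names are mine.

```python
# Confusion matrix printed above; rows are true labels, columns predictions.
cm = [[1537, 58],
      [236, 169]]
(tn, fp), (fn, tp) = cm

accuracy = (tp + tn) / (tp + tn + fp + fn)
precision = tp / (tp + fp)   # of predicted leavers, how many really left
recall = tp / (tp + fn)      # of actual leavers, how many were caught
print(f"accuracy={accuracy:.3f} precision={precision:.3f} recall={recall:.3f}")
# accuracy=0.853 precision=0.744 recall=0.417
```

So although about 85% of test customers are classified correctly, fewer than half of the customers who left are identified, which is worth keeping in mind when tuning the network.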
