# Neural Networks

Neural networks are a broad family of machine learning models that, through various modifications, have been applied to many tasks.

### Background

Neural networks are loosely inspired by the way biological neurons work. A network defines a pathway between inputs and outputs made up of layers of "neurons". Each layer applies weights and functions (convolutions, recurrences, and activations) to the values it receives before passing the result on to the next layer.
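
To make the idea of weights and activations applied layer by layer concrete, here is a minimal NumPy sketch of a forward pass through two fully connected layers. The layer sizes, the random weights, and the helper names `relu` and `forward` are assumptions chosen for illustration; they are not part of the Keras example in this notebook.

```python
import numpy as np

def relu(x):
    # Rectified linear unit: negative values become zero
    return np.maximum(0.0, x)

def forward(x, layers):
    """Pass input x through a list of (weights, bias, activation) layers."""
    for W, b, activation in layers:
        # Weighted sum of the inputs, then a nonlinearity
        x = activation(x @ W + b)
    return x

rng = np.random.default_rng(0)

# Hypothetical sizes: 4 inputs -> 3 hidden units -> 2 outputs
layers = [
    (rng.normal(size=(4, 3)), np.zeros(3), relu),
    (rng.normal(size=(3, 2)), np.zeros(2), lambda z: z),  # identity on the output layer
]

x = rng.normal(size=(5, 4))  # a batch of 5 examples
out = forward(x, layers)
print(out.shape)  # (5, 2)
```

Training a network amounts to adjusting the entries of each `W` and `b` so that the outputs move closer to the desired targets.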

### NN Architecture
Here is an overview of the different architectures used in NNs:
![Chart of neural network architectures](http://www.asimovinstitute.org/wp-content/uploads/2016/09/neuralnetworks.png)
[Asimov Institute Neural Network Zoo](http://www.asimovinstitute.org/neural-network-zoo/)

### Other Resources
- https://github.com/leriomaggio/deep-learning-keras-tensorflow - A much better and more in-depth tutorial than we have time for in an hour

- https://hackernoon.com/deep-learning-cheat-sheet-25421411e460 - A nice little cheat sheet for NN concepts

### Papers to Read
[Deep Learning Papers Roadmap](https://github.com/songrotek/Deep-Learning-Papers-Reading-Roadmap) 

[NN Paper Genealogy](https://coggle.it/diagram/Wf5mYoJbsgABUF9P)
![Neural network architecture genealogy](https://github.com/hunkim/deep_architecture_genealogy/raw/master/Neural_Net_Arch_Genealogy.png)



In [36]:
# 1. Import libraries and modules
import numpy as np
np.random.seed(123)  # for reproducibility
 
from keras.models import Sequential
from keras.layers import Dense, Dropout, Activation, Flatten
from keras.layers import Convolution2D, MaxPooling2D
from keras.utils import np_utils
from keras.datasets import mnist
 
# 2. Load pre-shuffled MNIST data into train and test sets
(X_train, y_train), (X_test, y_test) = mnist.load_data()
X_train.shape

(60000, 28, 28)

In [37]:
# 3. Preprocess input data
X_train = X_train.reshape(X_train.shape[0], 28, 28, 1)
X_test = X_test.reshape(X_test.shape[0], 28, 28, 1)
X_train = X_train.astype('float32')
X_test = X_test.astype('float32')
X_train /= 255
X_test /= 255

In [38]:
# 4. Preprocess class labels
Y_train = np_utils.to_categorical(y_train, 10)
Y_test = np_utils.to_categorical(y_test, 10)

X_train.shape

(60000, 28, 28, 1)
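
Step 4 turns each integer label (0 to 9) into a 10-element one-hot vector, which is the target format that a 10-way softmax output expects. The following is a rough NumPy sketch of that conversion, assuming integer class labels; the helper name `one_hot` is hypothetical and this is not Keras's own implementation.

```python
import numpy as np

def one_hot(labels, num_classes):
    """Return a (len(labels), num_classes) array with a single 1.0 per row."""
    labels = np.asarray(labels, dtype=int)
    encoded = np.zeros((labels.shape[0], num_classes), dtype='float32')
    # Row i gets a 1.0 in the column given by labels[i]
    encoded[np.arange(labels.shape[0]), labels] = 1.0
    return encoded

example = one_hot([5, 0, 4], 10)
print(example.shape)  # (3, 10)
print(example[0])     # 1.0 at index 5, zeros elsewhere
```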

In [42]:
# 5. Define model architecture
model = Sequential()

model.add(Convolution2D(32, (3, 3), activation='relu', input_shape=(28,28,1)))
model.add(Convolution2D(32, (3, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2,2)))
model.add(Dropout(0.25))
 
model.add(Flatten())
model.add(Dense(128, activation='relu'))
model.add(Dropout(0.5))
model.add(Dense(10, activation='softmax'))
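
It can help to check by hand how the image shrinks through this stack and how many trainable parameters each layer contributes. The sketch below does that arithmetic in plain Python, assuming the Keras defaults for these layers ('valid' padding and stride 1 for the convolutions, and a pooling stride equal to the pool size). The helper names are made up for illustration.

```python
def conv_out(size, kernel):
    # Output side length for 'valid' padding with stride 1
    return size - kernel + 1

def conv_params(kernel, in_channels, out_channels):
    # One kernel per (input channel, output channel) pair, plus one bias per output channel
    return kernel * kernel * in_channels * out_channels + out_channels

def dense_params(n_in, n_out):
    # Full weight matrix plus one bias per output unit
    return n_in * n_out + n_out

side = conv_out(28, 3)    # first 3x3 convolution: 28 -> 26
side = conv_out(side, 3)  # second 3x3 convolution: 26 -> 24
side = side // 2          # 2x2 max pooling: 24 -> 12
flat = side * side * 32   # Flatten: 12 * 12 * 32 feature values

total = (conv_params(3, 1, 32) + conv_params(3, 32, 32)
         + dense_params(flat, 128) + dense_params(128, 10))
print(flat, total)  # 4608 600810
```

Almost all of the parameters sit in the first `Dense` layer, which connects the 4608 flattened features to 128 units. Calling `model.summary()` reports the same per-layer shapes and counts.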
 

In [43]:
# 6. Compile model
model.compile(loss='categorical_crossentropy',
              optimizer='adam',
              metrics=['accuracy'])
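
The `categorical_crossentropy` loss compares the one-hot target with the predicted class probabilities: for a single example it is the negative log of the probability assigned to the true class. Here is a small plain-Python sketch of that formula; the example probabilities and the clipping constant `eps` are assumptions for illustration, not values taken from Keras.

```python
import math

def categorical_crossentropy(y_true, y_pred, eps=1e-7):
    """Cross-entropy for one example: -sum(t * log(p)) over the classes."""
    # Clip predictions away from zero so that log() stays finite
    return -sum(t * math.log(max(p, eps)) for t, p in zip(y_true, y_pred))

y_true = [0.0, 1.0, 0.0]        # the true class is index 1
confident = [0.05, 0.90, 0.05]  # most probability on the true class
unsure = [0.40, 0.30, 0.30]     # probability spread across classes

loss_confident = categorical_crossentropy(y_true, confident)
loss_unsure = categorical_crossentropy(y_true, unsure)
print(round(loss_confident, 3), round(loss_unsure, 3))  # 0.105 1.204
```

The loss is small when the model puts most of its probability on the correct class and grows as that probability shrinks, which is the quantity the optimizer drives down during training.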
 

In [44]:
# 7. Fit model on training data
model.fit(X_train, Y_train,
          batch_size=32, epochs=10, verbose=1)  # Keras 2 renamed nb_epoch to epochs
 



Epoch 1/10
Epoch 2/10
Epoch 3/10
Epoch 4/10

KeyboardInterrupt: 

In [47]:
# 8. Evaluate model on test data
score = model.evaluate(X_test, Y_test, verbose=0)
print(score)

[0.034415331403303114, 0.98919999999999997]
