## Dataset
- You will be using a modified version of the FairFace dataset (https://github.com/joojs/fairface), which consists of 86,744 training face images and 10,954 validation face images.
- To decrease training time, I converted all images to grayscale and resized them to 32 × 32. Each face has three attributes that can be used for a classification task: race, gender, and age. All files can be found in the zip file on Canvas. The train folder contains the training images, and the fairface_label_train.csv file contains all the labels. There is a similar folder and file for the validation set.
- Because the three attributes have different numbers of possible values, the final layer of each classifier will vary. For each of the networks below, please attempt to classify two of the attributes (you can choose which).
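Because the size of each output layer depends on how many distinct values the chosen attribute takes, one way to find n and build softmax targets is sketched below. This is a minimal sketch using only NumPy; the helper names `encode_labels` and `encode_with` and the example label strings are hypothetical and are not part of the assignment files.

```python
import numpy as np

def encode_labels(values):
    """Map label strings to integer indices and one-hot rows.

    Returns (classes, indices, one_hot); len(classes) is the n to use
    for the output layer of that attribute's classifier.
    """
    classes, indices = np.unique(np.asarray(values), return_inverse=True)
    one_hot = np.eye(len(classes), dtype="float32")[indices]
    return classes, indices, one_hot

def encode_with(classes, values):
    """Encode new labels (e.g. validation) with the classes found on training.

    Assumes every value already appears in `classes`, which np.unique
    returns in sorted order.
    """
    indices = np.searchsorted(classes, np.asarray(values))
    return np.eye(len(classes), dtype="float32")[indices]

# Hypothetical label column, for illustration only
train_labels = ["Male", "Female", "Male", "Female", "Female"]
classes, indices, one_hot = encode_labels(train_labels)
val_one_hot = encode_with(classes, ["Female", "Male"])
```

Deriving the class list from the training labels and reusing it for the validation labels keeps the column order of the one-hot targets identical across both sets.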

## Imports

In [10]:
# TensorFlow imports
import tensorflow as tf
from tensorflow import optimizers
from tensorflow.keras.models import Sequential
from tensorflow.keras import layers

# Pillow imports
from PIL import Image

## Task 1: Fully Connected Neural Network
1. Build a feed-forward neural network with the following specifications (test it on two different tasks):
    - Hidden layer 1: 1024 neurons, with the hyperbolic tangent activation function in each neuron.
    - Hidden layer 2: 512 neurons, with the sigmoid activation function in each neuron.
    - Hidden layer 3: 100 neurons, with the rectified linear (ReLU) activation function in each neuron.
    - Output layer: n (depending on the task) neurons representing the n classes, using the softmax activation function.
2. Use Min-Max scaling to scale the training dataset, then scale the test dataset using the same Min and Max values from the training set: $X' = \frac{X - X_{\min}}{X_{\max} - X_{\min}}$.
3. Use mini-batch gradient descent to optimize the “categorical cross-entropy” loss function on the training dataset. Record the loss value for each epoch and create an epoch-loss plot and an accuracy-loss plot for both the training and validation sets.
4. Report the following:
    - Final classification accuracy.
    - The n-class confusion matrix.
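The scaling step in item 2 can be sketched as follows. This is a minimal NumPy sketch that assumes a single global minimum and maximum over all training pixels (the assignment does not say whether scaling is global or per pixel); the function names `fit_min_max` and `apply_min_max` and the tiny arrays are made up for illustration.

```python
import numpy as np

def fit_min_max(x_train):
    """Return the global min and max of the training data only."""
    return float(x_train.min()), float(x_train.max())

def apply_min_max(x, x_min, x_max):
    """Scale x with (x - x_min) / (x_max - x_min) using the given statistics."""
    return (x.astype("float32") - x_min) / (x_max - x_min)

# Tiny stand-in arrays; real inputs would be the flattened 32 x 32 images
x_train = np.array([[0, 128], [255, 64]], dtype=np.uint8)
x_val = np.array([[51, 255]], dtype=np.uint8)

x_min, x_max = fit_min_max(x_train)                # statistics from training data
x_train_scaled = apply_min_max(x_train, x_min, x_max)
x_val_scaled = apply_min_max(x_val, x_min, x_max)  # reuse the training statistics
```

Because the statistics come from the training set alone, validation values are not guaranteed to land in [0, 1] if they fall outside the training range.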

In [18]:
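# Open one training image and print its grayscale pixel values as a flat list
img = Image.open('project3_COSC525/train/1.jpg')
print(list(img.getdata()))

# Each 32 x 32 image is flattened to a 1024-element vector for the dense network
inputShape = (32*32,)
# Placeholder: set this to the number of classes of the attribute being classified
outputSize = 10

# Layer sizes and activations follow the Task 1 specification
model = Sequential()
model.add(layers.Dense(1024, input_shape=inputShape, activation='tanh'))
model.add(layers.Dense(512, activation='sigmoid'))
model.add(layers.Dense(100, activation='relu'))
model.add(layers.Dense(outputSize, activation='softmax'))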


[1, 12, 16, 11, 10, 18, 25, 25, 28, 32, 33, 31, 36, 43, 44, 39, 43, 40, 36, 33, 32, 33, 33, 33, 29, 30, 18, 9, 18, 22, 19, 21, 6, 13, 14, 13, 22, 39, 50, 50, 47, 49, 47, 42, 43, 47, 46, 40, 41, 41, 41, 42, 42, 41, 38, 37, 35, 34, 19, 9, 17, 22, 18, 20, 5, 10, 13, 19, 36, 58, 67, 65, 48, 49, 47, 46, 48, 53, 54, 51, 47, 46, 44, 42, 39, 36, 33, 32, 44, 40, 21, 8, 16, 21, 17, 18, 0, 7, 15, 22, 39, 58, 64, 59, 45, 43, 42, 42, 46, 51, 54, 54, 52, 49, 44, 38, 34, 33, 35, 37, 49, 44, 23, 8, 14, 19, 16, 17, 1, 10, 16, 19, 30, 49, 59, 59, 58, 52, 47, 45, 44, 45, 46, 47, 48, 49, 47, 44, 41, 40, 42, 44, 47, 44, 24, 9, 13, 16, 14, 17, 10, 15, 14, 12, 22, 44, 61, 66, 60, 52, 47, 47, 48, 48, 51, 55, 50, 53, 54, 50, 41, 33, 28, 27, 37, 39, 25, 11, 12, 13, 12, 17, 10, 12, 9, 8, 21, 43, 55, 56, 39, 32, 30, 36, 43, 49, 57, 65, 57, 58, 54, 44, 31, 22, 18, 18, 25, 33, 26, 13, 11, 10, 10, 18, 3, 4, 5, 9, 26, 43, 44, 34, 21, 14, 14, 23, 32, 40, 50, 60, 61, 57, 47, 35, 25, 24, 32, 39, 16, 29, 26, 15, 11, 8, 9
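Item 3 of Task 1 asks for mini-batch gradient descent on categorical cross-entropy with the per-epoch loss recorded. One possible way to do this with Keras is sketched below. The data here is random stand-in data, and `num_classes = 7`, the learning rate, the batch size, and the epoch count are all assumptions chosen only to keep the example small, not values prescribed by the assignment.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

def build_fc_model(num_classes):
    """Fully connected network with the layer sizes listed in Task 1."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=(32 * 32,)),
        layers.Dense(1024, activation="tanh"),
        layers.Dense(512, activation="sigmoid"),
        layers.Dense(100, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),
    ])

# Stand-in data: random "images" already in [0, 1] and random labels
rng = np.random.default_rng(0)
num_classes = 7  # assumption: replace with the class count of the chosen attribute
x_train = rng.random((64, 32 * 32)).astype("float32")
y_train = tf.keras.utils.to_categorical(rng.integers(0, num_classes, 64), num_classes)
x_val = rng.random((32, 32 * 32)).astype("float32")
y_val = tf.keras.utils.to_categorical(rng.integers(0, num_classes, 32), num_classes)

model = build_fc_model(num_classes)
model.compile(
    optimizer=tf.keras.optimizers.SGD(learning_rate=0.01),  # plain gradient descent
    loss="categorical_crossentropy",
    metrics=["accuracy"],
)

# batch_size < number of samples makes each update a mini-batch step
history = model.fit(
    x_train, y_train,
    batch_size=16,
    epochs=2,
    validation_data=(x_val, y_val),
    verbose=0,
)

# One value per epoch, ready to be plotted against the epoch index
train_loss = history.history["loss"]
val_loss = history.history["val_loss"]
train_acc = history.history["accuracy"]
val_acc = history.history["val_accuracy"]
```

The four lists in `history.history` hold one entry per epoch, so they can be passed directly to a plotting library to produce the requested curves for the training and validation sets.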

## Task 2: Small Convolutional Neural Network
1. Build a convolutional neural network with the following specifications (test it on two different tasks):
    - Convolution layer with 40 feature detectors, kernel size 5 × 5, stride 1, no padding, and ReLU as the activation function.
    - A max-pooling layer with pool size 2 × 2.
    - Fully connected layer with 100 neurons and ReLU as the activation function.
    - Output layer: n (depending on the task) neurons representing the n classes, using the softmax activation function.
2. Use Min-Max scaling to scale the training dataset, then scale the test dataset using the same Min and Max values from the training set: $X' = \frac{X - X_{\min}}{X_{\max} - X_{\min}}$.
3. Use mini-batch gradient descent to optimize the “categorical cross-entropy” loss function on the training dataset. Record the loss value for each epoch and create an epoch-loss plot and an accuracy-loss plot for both the training and validation sets.
4. Report the following:
    - Final classification accuracy.
    - The n-class confusion matrix.
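The layer list for Task 2 maps onto Keras layers roughly as shown below. This is a sketch under stated assumptions rather than a reference solution: `build_small_cnn` is a hypothetical helper name, `num_classes=7` is an assumed class count, and a `Flatten` layer is added between pooling and the dense layer because the fully connected layer needs a vector input.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

def build_small_cnn(num_classes):
    """Small CNN following the Task 2 layer list for 32 x 32 grayscale input."""
    return tf.keras.Sequential([
        tf.keras.Input(shape=(32, 32, 1)),
        # 40 filters, 5 x 5 kernel, stride 1, no padding: 32x32x1 -> 28x28x40
        layers.Conv2D(40, (5, 5), strides=1, padding="valid", activation="relu"),
        # 2 x 2 max pooling: 28x28x40 -> 14x14x40
        layers.MaxPooling2D(pool_size=(2, 2)),
        # 14 * 14 * 40 = 7840 features per image
        layers.Flatten(),
        layers.Dense(100, activation="relu"),
        layers.Dense(num_classes, activation="softmax"),
    ])

model = build_small_cnn(num_classes=7)  # assumption: 7 classes for illustration

# Images need an explicit channel axis: (batch, height, width, 1)
dummy_batch = np.zeros((2, 32, 32, 1), dtype="float32")
probabilities = model(dummy_batch)
```

Unlike the fully connected network, this model takes images with their spatial layout intact, so the flattened 1024-element vectors used in Task 1 would need to be reshaped to (32, 32, 1).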

## Task 3: Your own Convolutional Neural Network
1. Build another convolutional neural network, where you choose all the parameters to see if you can get a higher accuracy.
2. Use Min-Max scaling to scale the training dataset, then scale the test dataset using the same Min and Max values from the training set: $X' = \frac{X - X_{\min}}{X_{\max} - X_{\min}}$.
3. Use mini-batch gradient descent to optimize the “categorical cross-entropy” loss function on the training dataset. Record the loss value for each epoch and create an epoch-loss plot and an accuracy-loss plot for both the training and validation sets.
4. Report the following:
    - Final classification accuracy.
    - The n-class confusion matrix.
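The two reported quantities can be computed from the softmax outputs as sketched below. This is a minimal NumPy sketch; `confusion_matrix` and `accuracy_from_confusion` are hypothetical helper names, the convention of rows for true classes and columns for predicted classes is a choice made here, and the probability values are invented for illustration.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Count (true class, predicted class) pairs; rows are true classes."""
    cm = np.zeros((num_classes, num_classes), dtype=np.int64)
    # np.add.at accumulates correctly even when an index pair repeats
    np.add.at(cm, (np.asarray(y_true), np.asarray(y_pred)), 1)
    return cm

def accuracy_from_confusion(cm):
    """Correct predictions lie on the diagonal."""
    return np.trace(cm) / cm.sum()

# Invented softmax outputs for four samples and three classes
probabilities = np.array([
    [0.7, 0.2, 0.1],
    [0.1, 0.3, 0.6],
    [0.2, 0.2, 0.6],
    [0.1, 0.8, 0.1],
])
y_true = np.array([0, 1, 2, 2])
y_pred = probabilities.argmax(axis=1)  # most probable class per sample

cm = confusion_matrix(y_true, y_pred, num_classes=3)
acc = accuracy_from_confusion(cm)
```

If the labels are stored one-hot, applying `argmax(axis=1)` to them recovers the integer class indices that this helper expects.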

## Task 4: Your own Convolutional Neural Network on both Tasks Simultaneously
1. Build another convolutional neural network that tries to classify both tasks with a single network. After your flatten layer, add two more fully connected layers for each “branch”. Note that you will not be able to use the Sequential model to do this.
2. Use Min-Max scaling to scale the training dataset, then scale the test dataset using the same Min and Max values from the training set: $X' = \frac{X - X_{\min}}{X_{\max} - X_{\min}}$.
3. Use mini-batch gradient descent to optimize the “categorical cross-entropy” loss function on the training dataset. Record the loss value for each epoch and create an epoch-loss plot and an accuracy-loss plot for both the training and validation sets.
4. Report the following:
    - Final classification accuracy.
    - The n-class confusion matrix.
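A branching network of the kind described in item 1 can be expressed with the Keras functional API. The sketch below is one possible layout, not a prescribed one: the convolution settings, the 64-unit hidden layers, the class counts 7 and 2, and the layer names are all assumptions, and "two fully connected layers per branch" is read here as one hidden layer plus the softmax output layer.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

def build_two_branch_cnn(n_first, n_second):
    """Shared convolutional trunk followed by one dense branch per attribute."""
    inputs = tf.keras.Input(shape=(32, 32, 1))
    x = layers.Conv2D(32, (3, 3), activation="relu")(inputs)
    x = layers.MaxPooling2D((2, 2))(x)
    shared = layers.Flatten()(x)

    # Branch for the first attribute: hidden dense layer + softmax output
    a = layers.Dense(64, activation="relu")(shared)
    first_out = layers.Dense(n_first, activation="softmax", name="first_output")(a)

    # Branch for the second attribute: hidden dense layer + softmax output
    b = layers.Dense(64, activation="relu")(shared)
    second_out = layers.Dense(n_second, activation="softmax", name="second_output")(b)

    return tf.keras.Model(inputs=inputs, outputs=[first_out, second_out])

model = build_two_branch_cnn(n_first=7, n_second=2)  # assumed class counts

# One loss and one metric list per output, in the same order as `outputs`
model.compile(
    optimizer=tf.keras.optimizers.SGD(learning_rate=0.01),
    loss=["categorical_crossentropy", "categorical_crossentropy"],
    metrics=[["accuracy"], ["accuracy"]],
)

dummy_batch = np.zeros((2, 32, 32, 1), dtype="float32")
first_probs, second_probs = model(dummy_batch)
```

When training such a model, `fit` would receive a list of two one-hot target arrays in the same order as the outputs, and the total loss is the sum of the two branch losses unless loss weights are specified.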

## Task 5: Variational Autoencoder (COSC 525 only)
1. Build a variational autoencoder with the following specifications (in this one you have a little more flexibility):
    - It should have at least two convolution layers in the encoder and two deconvolution layers in the decoder.
    - The latent dimension should be at least 5.
    - The loss should be either MSE or binary cross-entropy.
2. Use Min-Max scaling to scale the training dataset, then scale the test dataset using the same Min and Max values from the training set: $X' = \frac{X - X_{\min}}{X_{\max} - X_{\min}}$.
3. Use mini-batch gradient descent to optimize the loss function on the training dataset. Record the loss value for each epoch and create an epoch-loss plot and an accuracy-loss plot for both the training and validation sets.
4. Qualitatively evaluate your model by generating a set of faces: randomly choose 10 latent vectors and present the resulting images.
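The pieces named in Task 5 could be laid out as in the sketch below, which covers the encoder, the decoder, the sampling step, the KL term usually added to the reconstruction loss, and the generation of 10 images from random latent vectors. It is an untrained skeleton under stated assumptions: the filter counts, strides, `LATENT_DIM = 8`, and all function names are illustrative choices, and the training loop is left out.

```python
import numpy as np
import tensorflow as tf
from tensorflow.keras import layers

LATENT_DIM = 8  # assumption: any value of at least 5 meets the specification

def build_encoder(latent_dim=LATENT_DIM):
    """Two strided convolutions, then the mean and log-variance of z."""
    inputs = tf.keras.Input(shape=(32, 32, 1))
    x = layers.Conv2D(16, 3, strides=2, padding="same", activation="relu")(inputs)  # 16x16x16
    x = layers.Conv2D(32, 3, strides=2, padding="same", activation="relu")(x)       # 8x8x32
    x = layers.Flatten()(x)
    z_mean = layers.Dense(latent_dim, name="z_mean")(x)
    z_log_var = layers.Dense(latent_dim, name="z_log_var")(x)
    return tf.keras.Model(inputs, [z_mean, z_log_var], name="encoder")

def build_decoder(latent_dim=LATENT_DIM):
    """Dense projection followed by two transposed convolutions."""
    inputs = tf.keras.Input(shape=(latent_dim,))
    x = layers.Dense(8 * 8 * 32, activation="relu")(inputs)
    x = layers.Reshape((8, 8, 32))(x)
    x = layers.Conv2DTranspose(16, 3, strides=2, padding="same", activation="relu")(x)  # 16x16x16
    # Sigmoid keeps outputs in [0, 1], matching Min-Max scaled images
    outputs = layers.Conv2DTranspose(1, 3, strides=2, padding="same", activation="sigmoid")(x)
    return tf.keras.Model(inputs, outputs, name="decoder")

def sample_z(z_mean, z_log_var):
    """Reparameterization: z = mean + std * eps with eps ~ N(0, I)."""
    eps = tf.random.normal(tf.shape(z_mean))
    return z_mean + tf.exp(0.5 * z_log_var) * eps

def kl_divergence(z_mean, z_log_var):
    """Per-sample KL divergence between N(mean, var) and N(0, I)."""
    return -0.5 * tf.reduce_sum(
        1.0 + z_log_var - tf.square(z_mean) - tf.exp(z_log_var), axis=1
    )

def generate_faces(decoder, num_faces=10, latent_dim=LATENT_DIM):
    """Decode latent vectors drawn at random from the standard normal prior."""
    z = tf.random.normal((num_faces, latent_dim))
    return decoder(z).numpy()

encoder = build_encoder()
decoder = build_decoder()

# Forward pass on a dummy batch to show how the parts connect
z_mean, z_log_var = encoder(np.zeros((4, 32, 32, 1), dtype="float32"))
z = sample_z(z_mean, z_log_var)
reconstruction = decoder(z)
faces = generate_faces(decoder)  # 10 images of shape 32 x 32 x 1
```

With untrained weights the generated images are noise; after training with a reconstruction loss (MSE or binary cross-entropy) plus the KL term, the same `generate_faces` call would produce the samples requested in item 4.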