# Preface
If you have some free time, the TensorFlow team at Google and Udacity offer an excellent free course that covers everything we discuss here and more. The AI team managers are still learning these topics, just as you are, and want to help everyone learn. The course can be found here: https://www.udacity.com/course/intro-to-tensorflow-for-deep-learning--ud187 .


# Artificial Intelligence

## What is it?
A field of computer science that aims to make computers achieve human-style intelligence. There are many approaches to reaching this goal, including machine learning and deep learning.

## Machine Learning
A set of related techniques in which computers are trained to perform a particular task rather than being explicitly programmed to do it.

## Neural Network
A construct in Machine Learning inspired by the network of neurons (nerve cells) in the biological brain. Neural networks are a fundamental part of deep learning.

## Deep Learning
A subfield of machine learning that uses multi-layered neural networks. Often, “machine learning” and “deep learning” are used interchangeably.

![ai-diagram.png](img/ai-diagram.png)


## So how can we use it?
![title](img/comparison.jpg)

### Celsius to Fahrenheit Example:
The function for converting Celsius to Fahrenheit is:

In [None]:
Fahrenheit = Celsius * 1.8 + 32

In Python, for example, it would traditionally look like this:

In [None]:
def function(C):
    F = C * 1.8 + 32
    return F

![traditional](img/traditional.jpg)

Using artificial intelligence, we assume we do not know the algorithm; we only have example inputs and their expected outputs. The problem therefore becomes this:

![ai_approach](img/ai_approach.png)
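
As a minimal sketch of what "finding the algorithm from examples" means, the snippet below recovers a slope and an intercept from a handful of input/output pairs. It uses NumPy's `np.polyfit` (ordinary least squares), which is an assumption made for illustration and not the neural-network approach used later in this tutorial. The sample values are the same ones used as training data further down.

```python
import numpy as np

# Example inputs (Celsius) and outputs (Fahrenheit, rounded to whole degrees).
celsius = np.array([-40, -10, 0, 8, 15, 22, 38], dtype=float)
fahrenheit = np.array([-40, 14, 32, 46, 59, 72, 100], dtype=float)

# Fit a straight line F = slope * C + intercept to the examples.
slope, intercept = np.polyfit(celsius, fahrenheit, deg=1)

print("slope ~ {:.2f}, intercept ~ {:.2f}".format(slope, intercept))
```

The fitted values should land close to 1.8 and 32, without the formula ever being written into the program.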



# TensorFlow Introduction
## What is TensorFlow?
TensorFlow takes the complicated mathematics behind machine learning and makes it easy to use from code. It combines years of research and development into a free-to-use API that is trusted by millions around the world.

# TensorFlow Installation
Open your Anaconda prompt by searching for `Anaconda Prompt (Anaconda3)` in Windows search, or open a terminal via Spotlight on macOS.

If you don't have Anaconda installed, please follow our other tutorial first.

### Create a new Conda virtual environment
- Open a new Anaconda/Command Prompt window
- Type the following command:

In [None]:
conda create -n tensorflow_env tensorflow

The above will create a new virtual environment named `tensorflow_env` and install the `tensorflow` package into it.

### Activate your new Conda virtual environment

Now let's activate the newly created virtual environment by running the following in the *Anaconda Prompt* window:

In [None]:
conda activate tensorflow_env

Once you have activated your virtual environment, the name of the environment should be displayed within brackets at the beginning of your command prompt, e.g.:

![conda_activate](img/conda_activate.jpg)
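
Once the environment is active, a quick check from Python can confirm that TensorFlow is importable. This is a sketch using only the standard library; the helper name `is_installed` is made up for this example.

```python
import importlib.util

def is_installed(package_name):
    """Return True if the package can be found in the active environment."""
    return importlib.util.find_spec(package_name) is not None

if is_installed("tensorflow"):
    print("TensorFlow is available in this environment")
else:
    print("TensorFlow was not found; check that tensorflow_env is active")
```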

# Using TensorFlow: A Basic Demonstration
The problem we will solve is to convert from Celsius to Fahrenheit, where the approximate formula is:

$$ f = c \times 1.8 + 32 $$


Of course, it would be simple enough to create a conventional Python function that directly performs this calculation, but that wouldn't be machine learning.


Instead, we will give TensorFlow some sample Celsius values (-40, -10, 0, 8, 15, 22, 38) and their corresponding Fahrenheit values, rounded to whole degrees (-40, 14, 32, 46, 59, 72, 100).
Then, we will train a model that figures out the above formula through the training process.

## Import dependencies

First, import TensorFlow. Here, we're calling it `tf` for ease of use. We also tell it to only display errors.

Next, import [NumPy](http://www.numpy.org/) as `np`. NumPy lets us represent our data as high-performance arrays.

In [None]:
from __future__ import absolute_import, division, print_function, unicode_literals
import tensorflow as tf

import numpy as np

In [None]:
import logging
logger = tf.get_logger()
logger.setLevel(logging.ERROR)

## Set up training data

As we saw before, supervised machine learning is all about figuring out an algorithm given a set of inputs and outputs. Since the task in this tutorial is to create a model that can give the temperature in Fahrenheit when given the degrees in Celsius, we create two NumPy arrays, `celsius_q` and `fahrenheit_a`, that we can use to train our model.

In [None]:
celsius_q    = np.array([-40, -10,  0,  8, 15, 22,  38],  dtype=float)
fahrenheit_a = np.array([-40,  14, 32, 46, 59, 72, 100],  dtype=float)

# Walk through the two arrays in parallel and print each training pair.
for c, f in zip(celsius_q, fahrenheit_a):
  print("{} degrees Celsius = {} degrees Fahrenheit".format(c, f))

### Some Machine Learning terminology

 - **Feature** — The input(s) to our model. In this case, a single value — the degrees in Celsius.

 - **Labels** — The output our model predicts. In this case, a single value — the degrees in Fahrenheit.

 - **Example** — A pair of inputs/outputs used during training. In our case a pair of values from `celsius_q` and `fahrenheit_a` at a specific index, such as `(22,72)`.
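
In code, the three terms map onto the training arrays roughly as follows. The variable names `features`, `labels`, and `examples` are chosen here for illustration only.

```python
import numpy as np

features = np.array([-40, -10, 0, 8, 15, 22, 38], dtype=float)   # Celsius inputs
labels = np.array([-40, 14, 32, 46, 59, 72, 100], dtype=float)   # Fahrenheit outputs

# Each example is one (feature, label) pair taken from the same index.
examples = list(zip(features, labels))

print(len(examples), "examples; the one at index 5 is", examples[5])
```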
 
## Create the model

Next, create the model. We will use the simplest possible model we can, a Dense network. Since the problem is straightforward, this network will require only a single layer, with a single neuron.

### Build a layer

We'll call the layer `l0` and create it by instantiating `tf.keras.layers.Dense` with the following configuration:

*   `input_shape=[1]` — This specifies that the input to this layer is a single value. That is, the shape is a one-dimensional array with one member. Since this is the first (and only) layer, that input shape is the input shape of the entire model. The single value is a floating point number, representing degrees Celsius.

*   `units=1` — This specifies the number of neurons in the layer. The number of neurons defines how many internal variables the layer has to try to learn how to solve the problem (more later). Since this is the final layer, it is also the size of the model's output — a single float value representing degrees Fahrenheit. (In a multi-layered network, the size and shape of the layer would need to match the `input_shape` of the next layer.)


In [None]:
l0 = tf.keras.layers.Dense(units=1, input_shape=[1])
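
As a rough check on how many internal variables such a layer has: a Dense layer holds one weight per input-to-unit connection plus one bias per unit. The helper below is a hypothetical illustration, not part of the Keras API.

```python
def dense_param_count(n_inputs, units):
    """Number of trainable variables in a fully connected layer with biases."""
    weights = n_inputs * units   # one weight for each input-to-unit connection
    biases = units               # one bias per unit
    return weights + biases

# The layer above has one input and one unit: one weight and one bias.
print(dense_param_count(n_inputs=1, units=1))
```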

### Assemble layers into the model

Once layers are defined, they need to be assembled into a model. The Sequential model definition takes a list of layers as an argument, specifying the calculation order from the input to the output.

This model has just a single layer, `l0`.

In [None]:
model = tf.keras.Sequential([l0])

**Note**

You will often see the layers defined inside the model definition, rather than beforehand:

```python
model = tf.keras.Sequential([
  tf.keras.layers.Dense(units=1, input_shape=[1])
])
```

## Compile the model, with loss and optimizer functions

Before training, the model has to be compiled. When compiled for training, the model is given:

- **Loss function** — A way of measuring how far off predictions are from the desired outcome. (The measured difference is called the "loss".)

- **Optimizer function** — A way of adjusting internal values in order to reduce the loss.


In [None]:
model.compile(loss='mean_squared_error',
              optimizer=tf.keras.optimizers.Adam(0.1))

These are used during training (`model.fit()`, below) to first calculate the loss at each point, and then improve it. In fact, the act of calculating the current loss of a model and then improving it is precisely what training is.
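
To make "loss" concrete, here is a sketch of what mean squared error computes, written in plain NumPy. The prediction values are made up for illustration and do not come from the model.

```python
import numpy as np

def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between targets and predictions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return np.mean((y_true - y_pred) ** 2)

targets = [32.0, 46.0, 59.0]
made_up_predictions = [30.0, 47.0, 59.0]   # hypothetical model outputs

# Squared errors are 4, 1 and 0, so the mean is 5/3.
print(mean_squared_error(targets, made_up_predictions))
```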

During training, the optimizer function is used to calculate adjustments to the model's internal variables. The goal is to adjust the internal variables until the model (which is really a math function) mirrors the actual equation for converting Celsius to Fahrenheit.

TensorFlow uses numerical analysis to perform this tuning, and all this complexity is hidden from you, so we will not go into the details here. What is useful to know about these parameters is the following:

The loss function ([mean squared error](https://en.wikipedia.org/wiki/Mean_squared_error)) and the optimizer ([Adam](https://machinelearningmastery.com/adam-optimization-algorithm-for-deep-learning/)) used here are standard for simple models like this one, but many others are available. It is not important to know how these specific functions work at this point.

One part of the optimizer you may need to think about when building your own models is the learning rate (`0.1` in the code above). This is the step size taken when adjusting values in the model. If the value is too small, it will take too many iterations to train the model. If it is too large, accuracy goes down. Finding a good value often involves some trial and error, but it usually falls between 0.001 (the default) and 0.1.
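
The effect of the step size can be seen on a toy problem. The sketch below uses plain gradient descent (not Adam) to minimize the made-up function $(w - 3)^2$. The function, the starting point, and the three learning rates are assumptions chosen only to show the slow, good, and unstable cases, so the specific numbers do not transfer to the model above.

```python
def minimize(learning_rate, steps=50, w=0.0):
    """Run plain gradient descent on f(w) = (w - 3)**2 and return the final w."""
    for _ in range(steps):
        gradient = 2 * (w - 3)          # derivative of (w - 3)**2
        w = w - learning_rate * gradient
    return w

for lr in (0.001, 0.1, 1.1):
    print("learning rate {}: w after 50 steps = {:.4f}".format(lr, minimize(lr)))
```

With the smallest rate, `w` has barely moved toward the minimum at 3 after 50 steps; with the largest, each step overshoots and `w` ends up further away than where it started.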

## Train the model

Train the model by calling the `fit` method.

During training, the model takes in Celsius values, performs a calculation using the current internal variables (called "weights") and outputs values which are meant to be the Fahrenheit equivalent. Since the weights are initially set randomly, the output will not be close to the correct value. The difference between the actual output and the desired output is calculated using the loss function, and the optimizer function directs how the weights should be adjusted.

This cycle of calculate, compare, adjust is controlled by the `fit` method. The first argument is the inputs, the second argument is the desired outputs. The `epochs` argument specifies how many times this cycle should be run, and the `verbose` argument controls how much output the method produces.

In [None]:
history = model.fit(celsius_q, fahrenheit_a, epochs=500, verbose=False)
print("Finished training the model")

## Display training statistics

The `fit` method returns a history object. We can use this object to plot how the loss of our model goes down after each training epoch. A high loss means that the Fahrenheit degrees the model predicts are far from the corresponding values in `fahrenheit_a`.

We'll use [Matplotlib](https://matplotlib.org/) to visualize this (you could use another tool). As you can see, our model improves very quickly at first, and then has a steady, slow improvement until it is very near "perfect" towards the end.

In [None]:
import matplotlib.pyplot as plt
plt.xlabel('Epoch Number')
plt.ylabel("Loss Magnitude")
plt.plot(history.history['loss'])

## Use the model to predict values

Now you have a model that has been trained to learn the relationship between `celsius_q` and `fahrenheit_a`. You can use the `predict` method to have it calculate the Fahrenheit degrees for a previously unseen Celsius value.

So, for example, if the Celsius value is 100, what do you think the Fahrenheit result will be? Take a guess before you run this code.

In [None]:
# Pass a NumPy array: recent Keras versions reject a plain Python list here.
print(model.predict(np.array([100.0])))

The correct answer is $100 \times 1.8 + 32 = 212$, so our model is doing really well.

### To review


*   We created a model with a Dense layer
*   We trained it with 3500 examples (7 pairs, over 500 epochs).

Our model tuned the variables (weights) in the Dense layer until it was able to return a close approximation of the correct Fahrenheit value for any Celsius value. (Remember, 100 Celsius was not part of our training data.)



## Looking at the layer weights

Finally, let's print the internal variables of the Dense layer. 

In [None]:
print("These are the layer variables: {}".format(l0.get_weights()))

The first variable is close to 1.8 and the second is close to 32. These values (1.8 and 32) are the constants in the real conversion formula.

For a single neuron with a single input and a single output, the internal math looks the same as [the equation for a line](https://en.wikipedia.org/wiki/Linear_equation#Slope%E2%80%93intercept_form), $y = mx + b$, which has the same form as the conversion equation, $f = 1.8c + 32$.

Since the form is the same, the variables should converge on the standard values of 1.8 and 32, which is exactly what happened.

With additional neurons, additional inputs, and additional outputs, the formula becomes much more complex, but the idea is the same.
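
The single-neuron case can be checked by hand. The sketch below applies the formula directly, using the idealized values 1.8 and 32 in place of the learned ones. The array shapes follow what `get_weights()` returns for a Dense layer with one input and one unit (a kernel of shape `(1, 1)` and a bias of shape `(1,)`); the numbers themselves are the formula's constants, not actual training output.

```python
import numpy as np

kernel = np.array([[1.8]])   # weight matrix, shape (inputs, units) = (1, 1)
bias = np.array([32.0])      # one bias per unit, shape (1,)

def dense_forward(x, kernel, bias):
    """Compute y = x . kernel + bias, the math of a Dense layer without activation."""
    return np.dot(x, kernel) + bias

celsius = np.array([[100.0]])                 # one example with one feature
print(dense_forward(celsius, kernel, bias))   # same as 1.8 * 100 + 32
```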

### A little experiment

Just for fun, what if we created more Dense layers with different numbers of units, which therefore also have more variables?

In [None]:
l0 = tf.keras.layers.Dense(units=4, input_shape=[1])
l1 = tf.keras.layers.Dense(units=4)
l2 = tf.keras.layers.Dense(units=1)
model = tf.keras.Sequential([l0, l1, l2])
model.compile(loss='mean_squared_error', optimizer=tf.keras.optimizers.Adam(0.1))
model.fit(celsius_q, fahrenheit_a, epochs=500, verbose=False)
print("Finished training the model")
# Pass NumPy arrays: recent Keras versions reject a plain Python list here.
print(model.predict(np.array([100.0])))
print("Model predicts that 100 degrees Celsius is: {} degrees Fahrenheit".format(model.predict(np.array([100.0]))))
print("These are the l0 variables: {}".format(l0.get_weights()))
print("These are the l1 variables: {}".format(l1.get_weights()))
print("These are the l2 variables: {}".format(l2.get_weights()))

As you can see, this model is also able to predict the corresponding Fahrenheit value really well. But when you look at the variables (weights) in the `l0` and `l1` layers, they are nothing even close to ~1.8 and ~32. The added complexity hides the "simple" form of the conversion equation.
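
One way to see why the simple form is hidden rather than lost: Dense layers without an activation function (the Keras default, as in the cell above) each compute a linear map plus a bias, and a stack of them still collapses into a single slope and intercept. The sketch below demonstrates this in NumPy with random weights of the same shapes as `l0`, `l1`, and `l2`; the weights are random placeholders, not values from a trained model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Random stand-ins with the same shapes as the three layers: 1 -> 4 -> 4 -> 1.
w0, b0 = rng.normal(size=(1, 4)), rng.normal(size=4)
w1, b1 = rng.normal(size=(4, 4)), rng.normal(size=4)
w2, b2 = rng.normal(size=(4, 1)), rng.normal(size=1)

def forward(x):
    """Pass x through the three linear layers in order."""
    h0 = x @ w0 + b0
    h1 = h0 @ w1 + b1
    return h1 @ w2 + b2

# Multiply the layers out into one effective slope and intercept.
effective_slope = (w0 @ w1 @ w2).item()
effective_intercept = ((b0 @ w1 + b1) @ w2 + b2).item()

x = np.array([[100.0]])
print(forward(x).item(), effective_slope * 100.0 + effective_intercept)
```

The two printed numbers agree. In a well-trained model, the effective slope and intercept would be expected to land near 1.8 and 32 even though no individual weight does.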


## Neural Nets
An artificial neural network is a system of 'neurons', each of which is a mathematical function. These functions can take multiple inputs and produce one or more outputs. Each input to a neuron has an associated weight, and the set of weights is what makes each neuron unique. A weight can be thought of as the strength of a connection: how much the previous neuron affects the neuron in question. The bias of a neuron is a numerical offset that shifts how readily that neuron activates. The neuron computes a linear combination of its inputs and weights, adds the bias, and then applies a nonlinear function (the activation function) to the result. The most common choice is the Rectified Linear Unit (ReLU): if its input is above 0, that number is passed through unchanged; however, if its input is less than zero, the output is 0.
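
The computation of a single neuron can be sketched in a few lines of NumPy. The input values, weights, and bias below are arbitrary numbers chosen for illustration.

```python
import numpy as np

def relu(z):
    """Rectified Linear Unit: pass positive values through, clamp the rest to 0."""
    return np.maximum(0.0, z)

def neuron(inputs, weights, bias):
    """Weighted sum of the inputs plus the bias, followed by ReLU."""
    return relu(np.dot(weights, inputs) + bias)

inputs = np.array([1.0, 2.0, 3.0])
weights = np.array([0.5, -1.0, 0.25])   # one weight per input

print(neuron(inputs, weights, bias=1.0))    # 0.5 - 2.0 + 0.75 + 1.0 = 0.25
print(neuron(inputs, weights, bias=-1.0))   # sum is -1.75, so ReLU outputs 0.0
```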

The network itself can be broken down into three sections: the input layer, the 'black box', and the output layer. The input layer is the first layer of neurons, which receives the data directly. For example, in a 500 by 500 pixel picture of a handwritten number, each pixel might correspond to a neuron, and the degree to which that pixel is 'coloured in' by the number is the neuron's initial value. The 'black box' is the middle section of neuron layers (usually called hidden layers) where the magic happens. The connections between layers are adjusted during training so that when the 'signal' reaches the output layer, it is correct.
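
To illustrate the layered structure, the sketch below flattens a 500 by 500 image into an input layer and passes it through one hidden layer to an output layer. The hidden-layer size (8), the ten outputs (one per digit), and the random weights are assumptions for illustration; a real network would learn its weights through training.

```python
import numpy as np

rng = np.random.default_rng(0)

image = rng.random((500, 500))      # stand-in for a grayscale picture, values in [0, 1)
input_layer = image.reshape(-1)     # one input neuron per pixel: 250000 values

# Random, untrained weights: 250000 inputs -> 8 hidden neurons -> 10 outputs.
w_hidden, b_hidden = rng.normal(size=(input_layer.size, 8)) * 0.01, np.zeros(8)
w_out, b_out = rng.normal(size=(8, 10)) * 0.01, np.zeros(10)

hidden = np.maximum(0.0, input_layer @ w_hidden + b_hidden)   # hidden layer with ReLU
output = hidden @ w_out + b_out                               # one score per digit 0-9

print(input_layer.shape, hidden.shape, output.shape)
```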

Training was briefly mentioned above, so how exactly does it happen? First we must define a cost function for the neural net. It measures how far the output the network produced is from what the output was supposed to be. The average cost over all the available training data is a measure of how well the neural net performs its job. Since we want to minimize the value of the cost function, we repeatedly adjust the weights and biases a small step in the direction of the negative gradient of the function, which is the direction in which the cost decreases fastest.
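
The idea of following the negative gradient can be sketched for the Celsius-to-Fahrenheit data used earlier. This is plain gradient descent on the mean squared error of a one-weight, one-bias model. The learning rate (0.001) and step count (5000) are assumptions picked so that this particular example converges, and this is not the Adam optimizer used in the TensorFlow code above.

```python
import numpy as np

celsius = np.array([-40, -10, 0, 8, 15, 22, 38], dtype=float)
fahrenheit = np.array([-40, 14, 32, 46, 59, 72, 100], dtype=float)

def cost(w, b):
    """Average squared error over all training examples."""
    return np.mean((w * celsius + b - fahrenheit) ** 2)

w, b = 0.0, 0.0
learning_rate = 0.001
initial_cost = cost(w, b)

for _ in range(5000):
    error = w * celsius + b - fahrenheit
    grad_w = 2 * np.mean(error * celsius)   # partial derivative of the cost w.r.t. w
    grad_b = 2 * np.mean(error)             # partial derivative of the cost w.r.t. b
    w -= learning_rate * grad_w             # step against the gradient
    b -= learning_rate * grad_b

print("w = {:.3f}, b = {:.3f}, cost {:.3f} -> {:.3f}".format(w, b, initial_cost, cost(w, b)))
```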


## Installing TensorFlow GPU (Work in progress)
Before we begin the installation process, note that your computer must have a discrete NVIDIA graphics card compatible with the software. A list of these cards can be found here: https://developer.nvidia.com/cuda-gpus.

The following software is required as well:

- NVIDIA GPU driver: https://www.nvidia.com/Download/index.aspx?lang=en-us
- CUDA Toolkit: https://developer.nvidia.com/cuda-toolkit-archive
- cuDNN: https://developer.nvidia.com/cudnn
- TensorRT (optional): https://docs.nvidia.com/deeplearning/sdk/tensorrt-install-guide/index.html


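Once all this software is successfully installed, use pip to install the GPU build of TensorFlow. The command below applies to older releases; since TensorFlow 2.1 the standard `tensorflow` package includes GPU support, and the separate `tensorflow-gpu` package has been deprecated.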

In [None]:
pip install tensorflow-gpu
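
After installation, one way to check whether TensorFlow can see the GPU is `tf.config.list_physical_devices('GPU')`, which is available in recent TensorFlow 2 releases (older releases may not have it). The wrapper function below is a hypothetical helper that returns `None` when TensorFlow is not installed.

```python
def visible_gpus():
    """Return the GPUs TensorFlow can see, or None if TensorFlow is not installed."""
    try:
        import tensorflow as tf
    except ImportError:
        return None
    return tf.config.list_physical_devices('GPU')

gpus = visible_gpus()
if gpus is None:
    print("TensorFlow is not installed in this environment")
else:
    print("GPUs visible to TensorFlow: {}".format(len(gpus)))
```

An empty list means TensorFlow is installed but did not detect a usable GPU.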