# HW 3

## Problem 1
We're going to briefly look at truth tables. As an example, let's assume we have 100 photos, some of birds and some of people. In a truth table you assume you can categorize the true answer; let's say you pay an undergraduate to sort the photos into birds and people. Then you want an automated sorting algorithm. However it works, whether by looking at the color at the center of the photo or by using a neural network (NN), it will also sort the photos, but imperfectly. Given the two sorts, you can then arrange a truth table:

|           | True Bird | True Person |
|-----------|-----------|-------------|
| NN Bird   | 45        | 5           |
| NN Person | 3         | 47          |

There are 48 birds and 52 people (the column totals), and there are 8 miscategorizations. The NN called 3 of the birds people and 5 of the people birds.

This is useful because it shows not only the number of errors but also their type, and not all errors are created equal. Let's say our identification system is being used to keep birds from escaping the aviary at the zoo by locking the aviary door when a bird is trying to escape. We really don't want to let birds escape, but locking a person in for 30 extra seconds is not a big deal. In this case a false bird identification is not so bad, but a false person identification lets a bird escape.

Let's say the above truth table is the current system, and you've developed a new algorithm (NA) with the following truth table:

|           | True Bird | True Person |
|-----------|-----------|-------------|
| NA Bird   | 47        | 11          |
| NA Person | 1         | 41          |


## a) Which algorithm makes the fewest mistakes?
*Answer:* NN makes fewer mistakes, if we are just summing them. NN made 8 total mistakes, while NA made 12.
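The totals quoted in this answer can be reproduced by summing the off-diagonal cells of each table. The following is a minimal sketch; the dictionary layout and the `count_mistakes` helper are illustrative names, not part of the original notebook.

```python
# Rows are the algorithm's label, columns are the true label,
# matching the layout of the truth tables above.
nn = {"bird": {"bird": 45, "person": 5}, "person": {"bird": 3, "person": 47}}
na = {"bird": {"bird": 47, "person": 11}, "person": {"bird": 1, "person": 41}}


def count_mistakes(table):
    """Sum the off-diagonal cells: predicted label differs from true label."""
    return sum(
        count
        for predicted, row in table.items()
        for actual, count in row.items()
        if predicted != actual
    )


nn_mistakes = count_mistakes(nn)
na_mistakes = count_mistakes(na)
print("NN mistakes:", nn_mistakes)
print("NA mistakes:", na_mistakes)
```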

## b) Which algorithm is better for the zoo? Explain.
*Answer:* NA is better, because it miscategorized fewer birds as people (1 versus 3), meaning fewer birds would get away. Holding up a person for a few seconds costs far less than letting a bird fly away.
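One way to make "not all errors are equal" concrete is to weight each error type by a cost. The sketch below uses hypothetical relative costs (`COST_ESCAPE` and `COST_DELAY` are assumptions chosen only for illustration; the problem does not specify them) together with the error counts from the two tables.

```python
# Counts copied from the two truth tables above.
TRUE_BIRDS = 48
errors = {
    "NN": {"bird_called_person": 3, "person_called_bird": 5},
    "NA": {"bird_called_person": 1, "person_called_bird": 11},
}

# Hypothetical relative costs (assumptions, not from the problem statement):
# an escaped bird is taken to be 100 times worse than a delayed visitor.
COST_ESCAPE = 100.0
COST_DELAY = 1.0


def weighted_cost(e):
    """Total cost of one algorithm's errors under the assumed weights."""
    return COST_ESCAPE * e["bird_called_person"] + COST_DELAY * e["person_called_bird"]


costs = {name: weighted_cost(e) for name, e in errors.items()}
# Fraction of true birds that would be let out of the aviary.
escape_rates = {name: e["bird_called_person"] / TRUE_BIRDS for name, e in errors.items()}

for name in errors:
    print(f"{name}: cost={costs[name]:.0f}, escape rate={escape_rates[name]:.4f}")
```

With these particular weights NA comes out cheaper even though it makes more mistakes in total; a different cost ratio could change the ranking.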

## c) During the pandemic the number of visitors plummets, and it is only the zoo keeper visiting. So instead of 52% of the photos taken at the aviary door being people, it is now only 1%. Make new truth tables for both algorithms.

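*Answer:* The code blocks below generate the new truth tables. They assume each algorithm keeps the same per-class rates as in the original tables (for example, NN still labels 45/48 of true birds as birds) and rescale those rates to 99 birds and 1 person per 100 photos, rounding to whole photos.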

In [1]:
from pandas import DataFrame as df

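# NN
# New class mix per 100 photos: 99% birds, 1% people
bird = 99
person = 1

# Per-class rates from the original NN table (48 true birds, 52 true people)
prob_b_if_b = 45/48  # true bird labeled bird
prob_p_if_b = 3/48   # true bird labeled person (bird escapes)
prob_b_if_p = 5/52   # true person labeled bird (person delayed)
prob_p_if_p = 47/52  # true person labeled person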


BB = round(bird * prob_b_if_b)
PB = round(bird * prob_p_if_b)
BP = round(person * prob_b_if_p)
PP = round(person * prob_p_if_p)

NN = df({"True Bird":(BB,PB),"True Person":(BP,PP)})
NN = NN.rename(index={0: "NN Bird", 1: "NN Person"})
NN = NN.style.set_caption('NN with Covid Numbers')
NN

|           | True Bird | True Person |
|-----------|-----------|-------------|
| NN Bird   | 93        | 0           |
| NN Person | 6         | 1           |


In [2]:
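# NA
# New class mix per 100 photos: 99% birds, 1% people
bird = 99
person = 1

# Per-class rates from the original NA table (48 true birds, 52 true people)
prob_b_if_b = 47/48  # true bird labeled bird
prob_p_if_b = 1/48   # true bird labeled person (bird escapes)
prob_b_if_p = 11/52  # true person labeled bird (person delayed)
prob_p_if_p = 41/52  # true person labeled person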


BB = round(bird * prob_b_if_b)
PB = round(bird * prob_p_if_b)
BP = round(person * prob_b_if_p)
PP = round(person * prob_p_if_p)

NA = df({"True Bird":(BB,PB),"True Person":(BP,PP)})
NA = NA.rename(index={0: "NA Bird", 1: "NA Person"})
NA = NA.style.set_caption('NA with Covid Numbers')
NA

|           | True Bird | True Person |
|-----------|-----------|-------------|
| NA Bird   | 97        | 0           |
| NA Person | 2         | 1           |
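Rounding to whole photos hides the small expected counts in the "True Person" column. As a complementary sketch, the same rescaling can be done without rounding; it relies on the same assumption that per-class rates are unchanged, and the `rescale` helper and dictionary layout are illustrative names rather than part of the original notebook.

```python
# table[predicted][actual], copied from the original truth tables.
nn = {"bird": {"bird": 45, "person": 5}, "person": {"bird": 3, "person": 47}}
na = {"bird": {"bird": 47, "person": 11}, "person": {"bird": 1, "person": 41}}


def rescale(table, n_bird, n_person):
    """Expected counts when the true-class totals change but the
    per-class labeling rates stay the same (the key assumption)."""
    bird_total = table["bird"]["bird"] + table["person"]["bird"]
    person_total = table["bird"]["person"] + table["person"]["person"]
    return {
        predicted: {
            "bird": n_bird * row["bird"] / bird_total,
            "person": n_person * row["person"] / person_total,
        }
        for predicted, row in table.items()
    }


expected_nn = rescale(nn, n_bird=99, n_person=1)
expected_na = rescale(na, n_bird=99, n_person=1)

for name, table in (("NN", expected_nn), ("NA", expected_na)):
    escapes = table["person"]["bird"]   # birds labeled as people
    delays = table["bird"]["person"]    # people labeled as birds
    print(f"{name}: escapes={escapes:.4f}, delays={delays:.4f}, total={escapes + delays:.4f}")
```

Under these assumptions NA's expected total error count per 100 photos falls below NN's, the reverse of part a), because almost every photo is now a bird.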


## Problem 2
In the last lab we explored how to numerically calculate the pdf of a summed or averaged observation through repeated convolutions. But sometimes the convolution has an analytical solution. We could have found this out by either using a sharp pencil and doing the convolution integral by hand, or by looking it up in a table (much easier).

An analytic answer is much nicer when it exists, so it is always good to check whether one does. Further, sums and averages are only some of the mathematical operations that we can perform. In this section we will do an internet scavenger hunt to find the analytic pdf for some interesting distributions.

### Example 1
What is the sum of two Gaussian distributions?

We did this numerically in the last lab, but we can find it analytically. One might start with this page on the normal distribution which would refer you to this page on the sum, which would give you the same answer you figured out last week.

### Example 2
Let's say we have a variable with a Rayleigh distribution, and we're going to square it. What is the distribution?

First I'll look up and read about the Rayleigh distribution, such as this Wikipedia page (Mathworld and other sources, such as the CRC books, are great too). Down near the bottom are listed a number of related distributions. Note that the square of the Rayleigh is listed as a gamma distribution with N = 1. Looking up the gamma distribution we see that a gamma with N = 1 is an exponential distribution, and just to check we can see that the square root of an exponential distribution is a Rayleigh distribution, which brings us full circle.
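The Rayleigh-to-exponential step can also be checked with a short change of variables. This is a sketch, assuming the Rayleigh scale parameter is written $\sigma$ (a symbol introduced here, not used above) and writing $Y = R^2$:

$$\begin{aligned}
f_R(r) &= \frac{r}{\sigma^2}\, e^{-r^2/(2\sigma^2)}, \quad r \ge 0 \\
f_Y(y) &= f_R(\sqrt{y})\,\left|\frac{d\sqrt{y}}{dy}\right| = \frac{\sqrt{y}}{\sigma^2}\, e^{-y/(2\sigma^2)} \cdot \frac{1}{2\sqrt{y}} = \frac{1}{2\sigma^2}\, e^{-y/(2\sigma^2)}, \quad y \ge 0
\end{aligned}$$

The result is an exponential distribution with rate $1/(2\sigma^2)$, consistent with what the tables say.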

Now it is your turn!

## a) What is the pdf of the sum of two identical exponential distributions?

*Answer:* From the Wikipedia page linked [here](https://en.wikipedia.org/wiki/Exponential_distribution), we find the pdf of the sum of two independent exponential random variables to be the following:

$${{\begin{aligned}f_{Z}(z)&=\int _{-\infty }^{\infty }f_{X_{1}}(x_{1})f_{X_{2}}(z-x_{1})\,dx_{1}\\&=\int _{0}^{z}\lambda _{1}e^{-\lambda _{1}x_{1}}\lambda _{2}e^{-\lambda _{2}(z-x_{1})}\,dx_{1}\\&=\lambda _{1}\lambda _{2}e^{-\lambda _{2}z}\int _{0}^{z}e^{(\lambda _{2}-\lambda _{1})x_{1}}\,dx_{1}\\&={\begin{cases}{\dfrac {\lambda _{1}\lambda _{2}}{\lambda _{2}-\lambda _{1}}}\left(e^{-\lambda _{1}z}-e^{-\lambda _{2}z}\right)&{\text{ if }}\lambda _{1}\neq \lambda _{2}\\[4pt]\lambda ^{2}ze^{-\lambda z}&{\text{ if }}\lambda _{1}=\lambda _{2}=\lambda \end{cases}}\end{aligned}}}$$

As the last case of that equation shows, if the two exponential distributions are identical ($\lambda_1 = \lambda_2 = \lambda$), the sum follows an Erlang distribution with shape 2 and rate $\lambda$, which is a special case of the gamma distribution: $f_Z(z) = \lambda^2 z e^{-\lambda z}$ for $z \ge 0$.
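A quick numerical sanity check of this result is to draw many sums of two exponential samples and compare the empirical CDF at one point with the Erlang CDF, $1 - e^{-\lambda z}(1 + \lambda z)$. The sketch below uses only the standard library; the rate `RATE`, the comparison point `Z`, and the sample size are arbitrary choices for illustration.

```python
import math
import random

random.seed(0)  # fixed seed so the check is repeatable

RATE = 1.5           # hypothetical lambda, chosen only for illustration
N_SAMPLES = 200_000
Z = 1.0              # point at which the two CDFs are compared


def erlang2_cdf(z, rate):
    """CDF of Erlang(shape=2, rate): integral of rate^2 * t * exp(-rate * t) from 0 to z."""
    return 1.0 - math.exp(-rate * z) * (1.0 + rate * z)


# Each sample is the sum of two independent exponential draws with the same rate.
sums = [random.expovariate(RATE) + random.expovariate(RATE) for _ in range(N_SAMPLES)]

empirical = sum(s <= Z for s in sums) / N_SAMPLES
analytic = erlang2_cdf(Z, RATE)
print(f"empirical CDF at z={Z}: {empirical:.4f}")
print(f"Erlang CDF at z={Z}:    {analytic:.4f}")
```

The two numbers should agree to within sampling noise, which is roughly 0.001 for this sample size.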

## b) What is the pdf of the ratio of two zero-mean unity variance normal distributions $X_1/X_2$?

*Answer:* According to Wikipedia's article on [Normal distributions](https://en.wikipedia.org/wiki/Normal_distribution), if $X_1$ and $X_2$ are independent standard normal variables (mean 0, variance 1), their ratio follows a standard Cauchy distribution, $X_1/X_2 \sim \operatorname{Cauchy}(0,1)$, whose pdf is $f(x) = \frac{1}{\pi (1 + x^2)}$.
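The Cauchy result can be checked numerically in the same spirit as the last lab. The sketch below draws ratios of independent standard normal samples and compares the fraction falling in $[-1, 1]$ with the standard Cauchy CDF, $\tfrac{1}{2} + \arctan(x)/\pi$. The sample size and the interval are arbitrary illustrative choices. Comparing a probability rather than a mean is deliberate, since the Cauchy distribution has no finite mean.

```python
import math
import random

random.seed(1)  # fixed seed so the check is repeatable

N_SAMPLES = 200_000


def cauchy_cdf(x):
    """CDF of the standard Cauchy(0, 1) distribution."""
    return 0.5 + math.atan(x) / math.pi


# Ratio of two independent zero-mean, unit-variance normal draws.
ratios = [random.gauss(0.0, 1.0) / random.gauss(0.0, 1.0) for _ in range(N_SAMPLES)]

empirical = sum(-1.0 <= r <= 1.0 for r in ratios) / N_SAMPLES
analytic = cauchy_cdf(1.0) - cauchy_cdf(-1.0)
print(f"empirical P(-1 <= X1/X2 <= 1): {empirical:.4f}")
print(f"Cauchy    P(-1 <= X <= 1):     {analytic:.4f}")
```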

## c) So far we have looked at 1D probability distributions, but it is possible to have a multi-dimensional vector distribution. A simple first introduction is the 2D Gaussian; it looks like a smeared spot. Mathematically this is given by $X\hat{i} + Y\hat{j}$ where both $X$ and $Y$ are drawn from 1D Gaussian distributions. If I measure the amplitude of this vector, what is its pdf? (Hint, the amplitude is always positive.)

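*Answer:* Looking at [this link](https://en.wikipedia.org/wiki/Multivariate_normal_distribution#See_also), we can see that the amplitude follows a (scaled) chi distribution with two degrees of freedom. Assuming $X$ and $Y$ are independent, zero-mean, and share the same standard deviation $\sigma$, this is the Rayleigh distribution, $f(r) = \frac{r}{\sigma^2} e^{-r^2/(2\sigma^2)}$ for $r \ge 0$.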
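As a rough numerical check, the amplitude of simulated 2D Gaussian vectors can be compared with the Rayleigh CDF, $1 - e^{-r^2/(2\sigma^2)}$, which is the two-degree-of-freedom chi distribution scaled by $\sigma$. The sketch assumes independent, zero-mean components with a common standard deviation; `SIGMA`, the comparison radius `R`, and the sample size are hypothetical values chosen for illustration.

```python
import math
import random

random.seed(2)  # fixed seed so the check is repeatable

SIGMA = 2.0          # hypothetical common standard deviation of X and Y
N_SAMPLES = 200_000
R = 2.0              # radius at which the two CDFs are compared


def rayleigh_cdf(r, sigma):
    """CDF of the Rayleigh distribution with scale sigma."""
    return 1.0 - math.exp(-(r * r) / (2.0 * sigma * sigma))


# Amplitude of the vector X i + Y j with independent Gaussian components.
amplitudes = [
    math.hypot(random.gauss(0.0, SIGMA), random.gauss(0.0, SIGMA))
    for _ in range(N_SAMPLES)
]

empirical = sum(a <= R for a in amplitudes) / N_SAMPLES
analytic = rayleigh_cdf(R, SIGMA)
print(f"empirical P(amplitude <= {R}): {empirical:.4f}")
print(f"Rayleigh  P(amplitude <= {R}): {analytic:.4f}")
```

If the two components had different variances or nonzero means, the amplitude would no longer be Rayleigh, so the agreement here depends on the stated assumptions.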