# Inferential Statistics 
1. **WHAT** is it, and how is it different from descriptive statistics?
2. **WHY** do we need it? 
3. **HOW** do we use it?

![](http://www.z-table.com/uploads/2/1/7/9/21795380/205552900.jpg)

![](https://miro.medium.com/max/1164/1*WD8uN3s7_eo1peVehPZAVA.jpeg)

Earlier we looked at descriptive statistics: starting with a dataset and making various observations (overall shape, histogram, outliers, etc.) as well as calculations of quantities that can characterize the dataset as a whole (mean, median, mode, variance, standard deviation, quartiles, percentiles, etc.).

To make the move into inferential statistics, we need to imagine now that we don't have (or anyway cannot measure) all the data of interest.

And this is, of course, the typical situation. Consider:

- A zoologist wanting to know the typical lifespan of a Siberian tiger
- A cosmologist wanting to know the mass of a normal white dwarf star
- A businesswoman wanting to know how many M&M's her customers should expect to find in their Party Size bags
- A botanist wanting to know how tall California redwoods usually grow

The zoologist could, in principle:

- keep track of every currently existing Siberian tiger;
- record their (more or less) exact ages at their moments of death;
- add up those ages and divide by the number of tigers to calculate an average lifespan.

But only in principle. In all of these situations, there is no realistic or practical opportunity to check each relevant data point.

![](https://pictures-of-cats.org/wp-content/uploads/2012/09/bengal-tiger-size-weight-1.jpg)

What we can do, however, is to check some of the data points we want to check. That is, we'll draw a sample of data from our population of interest. We can then use the techniques of descriptive statistics to characterize our sample.

Does this help? The hope, of course, is that our sample will be representative of the population as a whole, which would justify our using facts about the sample to infer things about the population as a whole. But naturally we'll expect a certain amount of error: If I take the mean of a sample, $\bar{x}$, and project it as an estimate of the mean of the whole population, $\mu$, the estimate is bound to be imperfect.

Inferential statistics makes all this precise. Some of this gets fairly technical, hence the need for a whole unit on it!

**Quick discussion question:**

<details>
<summary>Can anyone give an example of how inferential statistics is being used in the news today?</summary>
<br>
1. emergency preparedness<br/>
2. predicting illness/spread of infection <br/>
3. political campaigns
</details>

## Review Set Theory 
<details>
<summary>Be sure to review the terminology of set theory. When we talk about events in probability, we are talking about sets of outcomes.</summary>
<br>
What is set theory?<br/>
    
A set is a collection of some items (elements). We often use capital letters to denote a set. To define a set we can simply list all the elements in curly brackets. 

**For example:** 
- to define a set **A** that consists of the two elements ♣ and ♢, we write **A**={♣,♢}. To say that ♢ belongs to **A**, we write **♢∈A**, where "∈" is pronounced "belongs to." To say that an element does not belong to a set, we use ∉. To express this, we may write **♡∉A**.
<br>
The symbols "|" and ":" are pronounced "such that."

A={x|x satisfies some property} <br/>
**or**<br/>
A={x:x satisfies some property}

**If the set C is defined as C={x|x∈ℤ,−2≤x<10}, then C={−2,−1,0,⋯,9}.**
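
The notation above maps closely onto Python's built-in sets. The following is a minimal sketch, assuming the card symbols are stored as plain strings (a choice made here only for illustration); the names `A` and `C` mirror the text.

```python
# Listing the elements explicitly, as in A = {♣, ♢}
A = {"♣", "♢"}
print("♢" in A)       # membership: ♢ ∈ A
print("♡" not in A)   # non-membership: ♡ ∉ A

# Set-builder notation, C = {x | x ∈ ℤ, −2 ≤ x < 10},
# written as a set comprehension over the integers −2..9
C = {x for x in range(-2, 10)}
print(sorted(C))
```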

**Inclusion-exclusion principle:**
Note that if you want to know how many elements are in set *A* or set *B* (that is, in their union), you can't simply add the sizes of the two sets, because any elements they have in common would be counted twice.

In combinatorics, the inclusion-exclusion principle is a counting technique that solves this problem.

For two finite sets, the number of elements in the union is given by:

|A∪B|=|A|+|B|−|A∩B|

For three finite sets, the formula extends to:

|A∪B∪C|=|A|+|B|+|C|−|A∩B|−|A∩C|−|B∩C|+|A∩B∩C|
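
Both counting formulas can be checked on small examples. This is only a sketch; the three sets below are made-up values chosen so that the sets overlap.

```python
# Hypothetical example sets with some shared elements
A = {1, 2, 3, 4}
B = {3, 4, 5}
C = {4, 5, 6, 7}

# Two sets: |A ∪ B| = |A| + |B| − |A ∩ B|
two_set_count = len(A) + len(B) - len(A & B)
print(two_set_count, len(A | B))

# Three sets: add the singles, subtract the pairs, add back the triple
three_set_count = (len(A) + len(B) + len(C)
                   - len(A & B) - len(A & C) - len(B & C)
                   + len(A & B & C))
print(three_set_count, len(A | B | C))
```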

### Terminology

**Outcome:** A result of a random experiment.<br/>
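**Universal Set:** The set of all elements under consideration, denoted $\Omega$. In probability, the universal set is the sample space. <br/>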
**Sample Space:** The set of all possible outcomes. <br/>
**Event:** A subset of the sample space. 

</details>

## Classical Probability 
**When all outcomes are equally likely, the probability (P) that an event will happen is:**<br/>
$P = \dfrac{\text{number of desired outcomes}}{\text{total number of possible outcomes}}$

**Independent vs Dependent Events** 
- Two events A and B are said to be independent if the fact that one event has occurred does not affect the probability that the other event will occur. The probability of two independent events both occurring is the product of their respective probabilities.<br/>
- If whether or not one event occurs does affect the probability that the other event will occur, then the two events are said to be dependent.
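
One way to test a pair of events for independence is to check whether the probability of both occurring equals the product of their individual probabilities. The sketch below assumes two fair six-sided dice; the helper `prob` and the three events are illustrative names, not part of the lesson's own code.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely (first die, second die) pairs
space = set(product(range(1, 7), repeat=2))

def prob(event):
    """Classical probability of an event within the two-dice sample space."""
    return Fraction(len(event & space), len(space))

first_even = {(a, b) for a, b in space if a % 2 == 0}
second_six = {(a, b) for a, b in space if b == 6}
high_sum   = {(a, b) for a, b in space if a + b >= 10}

# Independent: the first die tells us nothing about the second die
independent = prob(first_even & second_six) == prob(first_even) * prob(second_six)

# Dependent: an even first die changes the chance of a sum of 10 or more
dependent = prob(first_even & high_sum) != prob(first_even) * prob(high_sum)

print(independent, dependent)
```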

**EXAMPLES:**

1. Is the probability that it will rain today independent or dependent? <br/>

2. Is the probability that you will crash your car today or next year independent or dependent? <br/>

3. How about if you crash your car because you were drunk driving?

In [None]:
from fractions import Fraction

def P(event, space): 
    "The probability of an event, given a sample space."
    return Fraction(cases(favorable(event, space)), 
                    cases(space))

favorable = set.intersection # Outcomes that are in the event and in the sample space
cases     = len              # The number of cases is the length, or size, of a set

### Warm-up Problem: Die Roll
What's the probability of rolling an even number with a single six-sided fair die? Mathematicians traditionally use a single capital letter to denote a sample space; I'll use D for the die:

In [None]:
D     = {1, 2, 3, 4, 5, 6} # a sample space
even  = {   2,    4,    6} # an event

P(even, D) # The probability of an even die roll

### Good. Let's do a few more

In [None]:
prime = {2, 3, 5, 7, 11, 13}
odd   = {1, 3, 5, 7, 9, 11, 13}


In [None]:
P(odd, D)

In [None]:
P(even | prime, D) # The probability of an even or prime die roll

In [None]:
P(odd & prime, D) # The probability of an odd prime die roll

**This is classical probability: the number of desired outcomes divided by the total number of possible outcomes. When working with classical probability, keep in mind whether your events are independent or dependent. We calculate the probability that 2 or more independent events all occur by multiplying the probabilities of each event. For dependent events we also multiply, but each probability must be conditioned on the events that have already happened.**

### Card Example - Conditional Probability 

**What is the probability of drawing 2 Kings in a row from a standard 52-card deck, without putting the first card back? This is a conditional probability problem. We use the following formula for conditional probability problems.**

![](https://www.mathsisfun.com/data/images/probability-independent-formula1.svg)

P(Both cards are Kings) = P(1st card is a King) × P(2nd card is a King, given that the 1st card was a King)

$P(\text{both Kings}) = \dfrac{4}{52} \times \dfrac{3}{51} = \dfrac{1}{221}$
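
The same product can be computed with exact fractions and cross-checked by counting pairs of cards directly. This is a minimal sketch assuming a standard 52-card deck with 4 Kings; the variable names are illustrative.

```python
from fractions import Fraction
from math import comb

# Multiplication rule for dependent events
p_first_king = Fraction(4, 52)               # 4 Kings among 52 cards
p_second_king_given_first = Fraction(3, 51)  # 3 Kings left among 51 cards
p_both_kings = p_first_king * p_second_king_given_first

# Cross-check by counting: pairs of Kings out of all possible pairs of cards
p_by_counting = Fraction(comb(4, 2), comb(52, 2))

print(p_both_kings, p_by_counting)
```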

**Rearranging the formula above gives the following:**
![](https://www.mathsisfun.com/data/images/probability-independent-formula2.gif)

**Use the formula above to solve the following problem. 70% of your friends like Chocolate, and 35% like Chocolate AND like vanilla.**

**What percent of those who like Chocolate also like vanilla?**
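
One way to set this up is to divide the probability of liking both flavors by the probability of liking chocolate. The sketch below uses exact fractions; the variable names are illustrative.

```python
from fractions import Fraction

p_chocolate = Fraction(70, 100)              # P(likes chocolate)
p_chocolate_and_vanilla = Fraction(35, 100)  # P(likes chocolate AND vanilla)

# P(vanilla | chocolate) = P(chocolate and vanilla) / P(chocolate)
p_vanilla_given_chocolate = p_chocolate_and_vanilla / p_chocolate

print(f"{float(p_vanilla_given_chocolate):.0%}")
```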

### Another example: Use a tree diagram 
Tree diagrams are helpful for calculating the probability of independent and dependent events.

![](https://www.mathsisfun.com/data/images/probability-tree-coin3.svg)


#### Example: 
You are in a free throw competition and you get $100 if you can make at least 1 basket. There are 2 options. With the first option, you take 2 shots, each with a 2/3 probability of going in. With the second option, you take 3 shots, each with a 1/2 probability of going in. Which option do you choose?
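
A tree diagram for this problem has one branch where every shot misses, so it is often easiest to compute 1 minus the probability of that branch. The sketch below assumes the shots are independent and that winning means making at least one basket; `p_at_least_one` is a hypothetical helper name.

```python
from fractions import Fraction

def p_at_least_one(p_make, shots):
    """Probability of making at least one basket in `shots` independent attempts."""
    p_miss_all = (1 - p_make) ** shots
    return 1 - p_miss_all

option_1 = p_at_least_one(Fraction(2, 3), 2)  # 2 shots at 2/3 each
option_2 = p_at_least_one(Fraction(1, 2), 3)  # 3 shots at 1/2 each

print(option_1, option_2)
print("Option 1" if option_1 > option_2 else "Option 2")
```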

## Permutations
**An ordering of objects.** $P_{k}^{n}= \dfrac{n!}{(n-k)!}$

n =	total number of objects<br/>
k =	number of objects selected


_Remember factorials:_ The factorial of n, written n!, is the product of the integers from 1 to n.

For n>0,

n! = 1×2×3×4×...×n

By convention, 0! = 1.
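
The permutation formula can be written directly in terms of factorials and compared with the standard library's `math.perm` (available in Python 3.8 and later). This is a sketch; `permutations_count` is a name introduced here for illustration.

```python
from math import factorial, perm

def permutations_count(n, k):
    """Number of ordered selections of k objects out of n: n! / (n - k)!"""
    return factorial(n) // factorial(n - k)

print(factorial(5))               # 1×2×3×4×5
print(permutations_count(5, 2))   # ordered pairs chosen from 5 objects
print(permutations_count(5, 2) == perm(5, 2))
```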



### Examples
1. How many different ways can you order the letters CAT?

2. How many different ways can you order only 2 of the letters in CAT? 

3. What about repeats? How many different ways could we rearrange the letters in Mississippi? Divide out the duplicates.
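
These three counts can be checked by brute force for the short word and by formula for the long one. The sketch below is one possible approach: it enumerates orderings with `itertools.permutations` and, for repeated letters, divides the total number of orderings by the factorial of each letter's count.

```python
from collections import Counter
from itertools import permutations
from math import factorial

# 1. All orderings of the three letters
all_orderings = {"".join(p) for p in permutations("CAT")}

# 2. Orderings that use only 2 of the 3 letters
two_letter_orderings = {"".join(p) for p in permutations("CAT", 2)}

# 3. Repeated letters: n! divided by the factorial of each letter's count
word = "MISSISSIPPI"
arrangements = factorial(len(word))
for count in Counter(word).values():
    arrangements //= factorial(count)

print(len(all_orderings), len(two_letter_orderings), arrangements)
```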

## Combinations
**A selection of objects in which order is not important.** $C_{k}^{n}= \dfrac{n!}{(n-k)!\,k!}$
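
The combination formula is the permutation count divided by k!, since each unordered selection of k objects can be arranged in k! orders. A small sketch comparing a hand-written version with `math.comb` (Python 3.8 and later); `combinations_count` is an illustrative name.

```python
from math import comb, factorial

def combinations_count(n, k):
    """Number of unordered selections of k objects out of n: n! / ((n - k)! k!)"""
    return factorial(n) // (factorial(n - k) * factorial(k))

print(combinations_count(5, 2))   # unordered pairs chosen from 5 objects
print(combinations_count(5, 2) == comb(5, 2))
```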

#### Examples 
**1.** There are 10 people on a committee. How many ways are there of choosing a President, Vice President, and Secretary?

**2.** If there are 30 people on your tennis team, how many ways could you choose 2 co-captains? 

**3.** If a board has 10 women and 7 men, how many ways can you form a committee containing 4 members such that:<br/> 
    **a.** All 4 are men? <br/>
    **b.** 2 are men and 2 are women?<br/>
    **c.** At least 1 is a woman? 
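
One possible way to set these problems up is sketched below. It treats the three officer roles as distinct (so order matters there) and the co-captains as interchangeable, which is an interpretation of the wording rather than something the problems state outright.

```python
from math import comb, perm

# 1. President, Vice President, Secretary from 10 people: roles differ, so order matters
officers = perm(10, 3)

# 2. Two co-captains from 30 players: the two roles are the same, so order does not matter
co_captains = comb(30, 2)

# 3. Committee of 4 from 10 women and 7 men
all_men = comb(7, 4)                             # a. choose all 4 from the 7 men
two_and_two = comb(7, 2) * comb(10, 2)           # b. 2 men and 2 women
at_least_one_woman = comb(17, 4) - comb(7, 4)    # c. all committees minus the all-male ones

print(officers, co_captains, all_men, two_and_two, at_least_one_woman)
```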

#### Card Problems

1. Count the number of possible five-card hands that can be dealt from a standard deck of 52 cards
2. Count the number of ways that a particular type of poker hand can occur
3. The probability of being dealt any particular type of hand is equal to the number of ways it can occur divided by the total number of possible five-card hands.

Consider dealing a hand of five playing cards. An individual card has a rank and suit, like 'J♥' for the Jack of Hearts, and a deck has 52 cards.
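
A deck like the one described can be built as a list of two-character strings. This is a minimal sketch; the names `ranks`, `suits`, and `deck`, and the use of 'T' for the ten, are assumptions made for illustration.

```python
from itertools import product
from math import comb

ranks = "A23456789TJQK"   # 13 ranks, with 'T' standing for 10
suits = "♠♥♦♣"            # 4 suits

# Every rank paired with every suit, e.g. 'J♥' for the Jack of Hearts
deck = [rank + suit for rank, suit in product(ranks, suits)]

print(len(deck))           # number of cards in the deck
print(comb(len(deck), 5))  # number of possible five-card hands
```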

Now determine: 
- the probability of getting dealt 3 of a kind in 5-card poker 
- the probability of getting a pair in 5-card poker 
- the probability of getting a full house in 5-card poker 
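
Following the three steps listed above, each probability is the number of ways the hand can occur divided by the number of five-card hands. The sketch below assumes "a pair" means exactly one pair and that "3 of a kind" excludes full houses and four of a kind; the variable names are illustrative.

```python
from fractions import Fraction
from math import comb

total_hands = comb(52, 5)

# Three of a kind: rank of the triple, 3 of its 4 suits,
# then 2 other distinct ranks with any suit for each
three_of_a_kind = 13 * comb(4, 3) * comb(12, 2) * 4**2

# Exactly one pair: rank of the pair, 2 of its 4 suits,
# then 3 other distinct ranks with any suit for each
one_pair = 13 * comb(4, 2) * comb(12, 3) * 4**3

# Full house: rank and suits of the triple, then rank and suits of the pair
full_house = 13 * comb(4, 3) * 12 * comb(4, 2)

for name, ways in [("three of a kind", three_of_a_kind),
                   ("one pair", one_pair),
                   ("full house", full_house)]:
    probability = Fraction(ways, total_hands)
    print(f"{name}: {ways} ways, probability {float(probability):.5f}")
```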
