# Probability

- `Probability` is a measure that is associated with how certain we are of outcomes of a particular experiment or activity.
- An `experiment` is a planned operation carried out under controlled conditions.
- If the result is not predetermined, then the experiment is said to be a chance experiment. 
- Flipping one fair coin twice is an example of an experiment.

- A result of an experiment is called an outcome.
- The `sample space` of an experiment is the set of all possible outcomes.
- Three ways to represent a sample space are: 
1. to list the possible outcomes
2. to create a tree diagram, 
3. to create a Venn diagram. 
- The uppercase letter `S` is used to denote the `sample space`.
- For example, if you flip one fair coin, 
     S = {H, T} 
     where 
     H = heads and 
     T = tails are the outcomes.

- An `event` is any `combination of outcomes`. Upper case letters like A and B represent events.
- For example, if the experiment is to flip one fair coin, 
- event A might be getting at most one head. 
- The probability of an event A is written P(A).

-  The probability of any outcome is the long-term relative frequency of that outcome. 
- Probabilities are between zero and one, inclusive (that is, zero and one and all numbers between these values).
- P(A) = 0 means the event A can never happen. 
- P(A) = 1 means the event A always happens. 
- P(A) = 0.5 means the event A is equally likely to occur or not to occur.
- For example, if you flip one fair coin repeatedly (from 20 to 2,000 to 20,000 times), the relative frequency of heads approaches 0.5 (the probability of heads).

- Equally likely means that each outcome of an experiment occurs with equal probability. 
- For example, if you toss a fair, six-sided die, each face (1, 2, 3, 4, 5, or 6) is as likely to occur as any other face. 
- If you toss a fair coin, a Head (H) and a Tail (T) are equally likely to occur. 
- If you randomly guess the answer to a true/false question on an exam, you are equally likely to select a correct answer or an incorrect answer.

- To calculate the probability of an event A when all outcomes in the sample space are equally likely, count the number of outcomes for event A and divide by the total number of outcomes in the sample space. 
- For example, 
- if you toss a fair dime and a fair nickel, 
    the sample space is {HH, TH, HT, TT} 
    
    where T = tails and H = heads. 
    
    The sample space has four outcomes. 
    A = getting one head. 
    
    - There are two outcomes that meet this condition 
    {HT, TH}, 
    - so P(A) =  2/4  = 0.5.
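
The counting rule for equally likely outcomes can be checked by enumerating the sample space. This is a minimal Python sketch of the dime-and-nickel example; the names `sample_space`, `event_a`, and `p_a` are illustrative and not part of the original notes.

```python
from fractions import Fraction
from itertools import product

# Sample space for tossing a dime and a nickel: every ordered pair of H/T.
sample_space = ["".join(pair) for pair in product("HT", repeat=2)]

# Event A: exactly one head among the two coins.
event_a = [outcome for outcome in sample_space if outcome.count("H") == 1]

# With equally likely outcomes, P(A) = (outcomes in A) / (outcomes in S).
p_a = Fraction(len(event_a), len(sample_space))
print(sample_space, event_a, p_a)
```

`Fraction` keeps the result exact (1/2) instead of a rounded float.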

- Suppose you roll one fair six-sided die, 
- with the numbers {1, 2, 3, 4, 5, 6} on its faces.
    - Let event E = rolling a number that is at least five.
    - There are two outcomes {5, 6}. 
    -  P(E) =  2/6 . 
    - If you were to roll the die only a few times, you would not be surprised if your observed results did not match the probability. 
    - If you were to roll the die a very large number of times, you would expect that, overall,  2/6  of the rolls would result in an outcome of "at least five".
    - You would not expect exactly  2/6 . The long-term relative frequency of obtaining this result would approach the theoretical probability of  2/6  as the number of repetitions grows larger and larger.

- This important characteristic of probability experiments is known as the law of large numbers which states that `as the number of repetitions of an experiment is increased, the relative frequency obtained in the experiment tends to become closer and closer to the theoretical probability.` Even though the outcomes do not happen according to any set pattern or order, overall, the long-term observed relative frequency will approach the theoretical probability. `(The word empirical is often used instead of the word observed.)`
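
The law of large numbers can be illustrated with a small simulation of the "at least five" event. The sketch below assumes Python's `random` module is an adequate stand-in for a fair die; the function name `relative_frequency`, the fixed seed, and the chosen roll counts are illustrative assumptions.

```python
import random

def relative_frequency(num_rolls, seed=0):
    """Fraction of simulated fair-die rolls that show at least five."""
    rng = random.Random(seed)  # fixed seed (an assumption) for repeatability
    hits = sum(1 for _ in range(num_rolls) if rng.randint(1, 6) >= 5)
    return hits / num_rolls

theoretical = 2 / 6
for n in (20, 2_000, 200_000):
    freq = relative_frequency(n)
    print(f"{n:>7} rolls: relative frequency {freq:.4f} (theoretical {theoretical:.4f})")
```

With few rolls the observed frequency can be far from 2/6; with many rolls it should settle close to it, though rarely exactly on it.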

- It is important to realize that in many situations, the outcomes are not equally likely.
- A coin or die may be unfair, or biased. 
- Two math professors in Europe had their statistics students test the Belgian one Euro coin and discovered that in 250 trials, a head was obtained 56% of the time and a tail was obtained 44% of the time.
- The data seem to show that the coin is not a fair coin; more repetitions would be helpful to draw a more accurate conclusion about such bias. 
- Some dice may be biased. Look at the dice in a game you have at home; the spots on each face are usually small holes carved out and then painted to make the spots visible.
- Your dice may or may not be biased; it is possible that the outcomes may be affected by the slight weight differences due to the different numbers of holes in the faces. 
- Gambling casinos make a lot of money depending on outcomes from rolling dice, so casino dice are made differently to eliminate bias.
- Casino dice have flat faces; the holes are completely filled with paint having the same density as the material that the dice are made out of so that each face is equally likely to occur. 


### "OR" Event:
- An outcome is in the event A OR B if the outcome is in A or is in B or is in both A and B. 
- For example, let 
    - A = {1, 2, 3, 4, 5} and 
    - B = {4, 5, 6, 7, 8}. 
    - A OR B = {1, 2, 3, 4, 5, 6, 7, 8}. 
    - Notice that 4 and 5 are NOT listed twice.

### "AND" Event:
- An outcome is in the event A AND B if the outcome is in both A and B at the same time. For example, let 
    - A and B be {1, 2, 3, 4, 5} and {4, 5, 6, 7, 8}, respectively. 
    - Then A AND B = {4, 5}.

### The complement of event A is denoted A′ 
- (read "A prime"). 
- A′ consists of all outcomes that are NOT in A. 
- Notice that P(A) + P(A′) = 1. 
    - For example, let 
    - S = {1, 2, 3, 4, 5, 6} and let 
    - A = {1, 2, 3, 4}. Then, 
    - A′ = {5, 6}. 
    - P(A) =  4/6 , 
    - P(A′) =  2/6 , and 
    - P(A) + P(A′) =  4/6+2/6  = 1
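
The OR, AND, and complement events map directly onto set operations. This is a minimal sketch using Python's built-in `set`; the variable names are illustrative, and `C` stands in for the event called A in the complement example so it does not clash with the A used for OR and AND.

```python
from fractions import Fraction

A = {1, 2, 3, 4, 5}
B = {4, 5, 6, 7, 8}

a_or_b = A | B    # union: outcomes in A, in B, or in both
a_and_b = A & B   # intersection: outcomes in both A and B

# Complement example from the text: S = {1, ..., 6}, event {1, 2, 3, 4}.
S = {1, 2, 3, 4, 5, 6}
C = {1, 2, 3, 4}
c_prime = S - C   # complement: outcomes in S that are not in C

p_c = Fraction(len(C), len(S))
p_c_prime = Fraction(len(c_prime), len(S))
print(sorted(a_or_b), sorted(a_and_b), sorted(c_prime), p_c + p_c_prime)
```

Because sets never store duplicates, 4 and 5 appear only once in the union.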

## `The conditional probability`
- The conditional probability of `A given B` is written `P(A|B)`.
- P(A|B) is the probability that event A will occur given that the event B has already occurred. 
- A conditional reduces the sample space. 
- We calculate the probability of A from the reduced sample space B. 
- The formula to calculate P(A|B) is 

    - P(A|B) =  P(A AND B)/P(B)
    -  where P(B) is greater than zero.

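```python
# A quotient of the form P(A AND B) / P(B), evaluated in floating point.
0.44999999999999996 / 0.50  # 0.8999999999999999, that is, about 0.9
```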
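
The conditional probability formula and the "reduced sample space" view give the same answer. The sketch below uses one roll of a fair die with two hypothetical events that are not from the original notes (A = even, B = at least four); the helper `prob` is also illustrative.

```python
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}   # one roll of a fair die (illustrative example)
A = {2, 4, 6}            # hypothetical event: roll is even
B = {4, 5, 6}            # hypothetical event: roll is at least four

def prob(event, space):
    """Probability of an event when all outcomes in `space` are equally likely."""
    return Fraction(len(event & space), len(space))

# Formula: P(A|B) = P(A AND B) / P(B), valid because P(B) > 0.
p_a_given_b = prob(A & B, S) / prob(B, S)

# Same value from the reduced sample space: treat B as the new sample space.
p_reduced = prob(A, B)
print(p_a_given_b, p_reduced)
```

Exact fractions also avoid the floating-point rounding that turns 0.9 into 0.8999999999999999.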

# `Independent Events: `

Two events A and B are `independent if any one of the following is true` (when P(A) and P(B) are nonzero, each condition implies the other two):

`P(A|B) = P(A)` 

`P(B|A) = P(B)`
 
`P(A AND B) = P(A)P(B)`

#### Two events A and B are independent if the knowledge that one occurred does not affect the chance the other occurs.

- For example, the `outcomes of two rolls of a fair die are independent` events. 
- The `outcome of the first roll does not change the probability for the outcome of the second` roll. 
- To show two events are independent, you must show only one of the above conditions. 
- If `two events are NOT independent`, then we say that they are `dependent.`
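
The product condition can be tested by enumerating all outcomes of two die rolls. This is a sketch under the assumption of a fair die; the three events and the helpers `prob` and `independent` are hypothetical examples chosen for illustration.

```python
from fractions import Fraction
from itertools import product

rolls = list(product(range(1, 7), repeat=2))   # 36 equally likely (first, second) pairs

def prob(predicate):
    """P(event) where the event is given as a true/false test on an outcome."""
    return Fraction(sum(1 for r in rolls if predicate(r)), len(rolls))

first_is_six = lambda r: r[0] == 6          # hypothetical event A
second_is_even = lambda r: r[1] % 2 == 0    # hypothetical event B
sum_at_least_ten = lambda r: sum(r) >= 10   # hypothetical event C

def independent(e1, e2):
    """Check the product condition P(E1 AND E2) = P(E1) * P(E2)."""
    return prob(lambda r: e1(r) and e2(r)) == prob(e1) * prob(e2)

print(independent(first_is_six, second_is_even))    # True
print(independent(first_is_six, sum_at_least_ten))  # False
```

The first pair involves different rolls, so the events are independent; the second pair is dependent because a six on the first roll makes a large sum more likely.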

## Sampling may be done with replacement or without replacement.

### `With replacement: `
- If each member of a population is replaced after it is picked, then that member has the possibility of being chosen more than once. 
- When sampling is done with replacement, then events are considered to be `independent`, meaning the result of the first pick will not change the probabilities for the second pick.


### `Without replacement:`
- When sampling is done without replacement, each member of a population may be chosen only once. In this case, the probabilities for the second pick are affected by the result of the first pick.
- The events are considered to be `dependent or not independent`.

If it is not known whether A and B are independent or dependent, assume they are dependent until you can show otherwise.

#### Example : 

- You have a fair, well-shuffled deck of 52 cards.
- It consists of four suits. The suits are clubs, diamonds, hearts and spades.
- There are 13 cards in each suit consisting of 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, J (jack), Q (queen), K (king) of that suit.

a. `Sampling with replacement:`

- Suppose you pick three cards with replacement. 
- The first card you pick out of the 52 cards is the Q of spades.
- You put this card back, reshuffle the cards and pick a second card from the 52-card deck.
- It is the ten of clubs. 
- You put this card back, reshuffle the cards and pick a third card from the 52-card deck.
- This time, the card is the Q of spades again.
- Your picks are {Q of spades, ten of clubs, Q of spades}.
- You have picked the Q of spades twice.
- You pick each card from the 52-card deck.

b. `Sampling without replacement:`

- Suppose you pick three cards without replacement.
- The first card you pick out of the 52 cards is the K of hearts.
- You put this card aside and pick the second card from the 51 cards remaining in the deck. 
- It is the three of diamonds.
- You put this card aside and pick the third card from the remaining 50 cards in the deck.
- The third card is the J of spades.
- Your picks are {K of hearts, three of diamonds, J of spades}.
- Because you have picked the cards without replacement, you cannot pick the same card twice.
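
The two sampling schemes can be sketched with Python's standard `random` module: `choices` draws with replacement and `sample` draws without. The card labels and the fixed seed are illustrative assumptions, and the cards printed will generally differ from the ones named in the example above.

```python
import random

suits = ["clubs", "diamonds", "hearts", "spades"]
ranks = ["1", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
deck = [f"{rank} of {suit}" for suit in suits for rank in ranks]  # 52 cards

rng = random.Random(1)  # fixed seed (an assumption) so the run is repeatable

# With replacement: each pick is from the full deck, so repeats are possible.
with_replacement = rng.choices(deck, k=3)

# Without replacement: a picked card is set aside, so repeats are impossible.
without_replacement = rng.sample(deck, k=3)

print(with_replacement)
print(without_replacement)
```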

# ` Mutually Exclusive Events`

##### A and B are mutually exclusive events `if they cannot occur at the same time.` 
##### This means that `A and B do not share any outcomes` and `P(A AND B) = 0.`



- For example, 
- suppose the sample space 
    S = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. 
    Let
    A = {1, 2, 3, 4, 5}, 
    
    B = {4, 5, 6, 7, 8}, and 
    
    C = {7, 9}. 
    
    A AND B = {4, 5}. 
    
    P(A AND B) =  2/10 
    
    and is not equal to zero. Therefore, A and B are not mutually exclusive. 
    
    A and C do not have any numbers in common so 
    
    P(A AND C) = 0. Therefore, A and C are mutually exclusive.
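
The same check can be written with sets. This is a minimal sketch assuming all ten outcomes in S are equally likely; the helper `p_and` is an illustrative name.

```python
from fractions import Fraction

S = set(range(1, 11))
A = {1, 2, 3, 4, 5}
B = {4, 5, 6, 7, 8}
C = {7, 9}

def p_and(x, y):
    """P(X AND Y) for equally likely outcomes in S."""
    return Fraction(len(x & y), len(S))

print(p_and(A, B), A.isdisjoint(B))  # 1/5 False
print(p_and(A, C), A.isdisjoint(C))  # 0 True
```

`Fraction` reduces 2/10 to 1/5; the point is only that it is nonzero for A and B and zero for A and C.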
    

If it is not known whether A and B are mutually exclusive, assume they are not until you can show otherwise. The following examples illustrate these definitions and terms.

### Flip two fair coins.

- The sample space is {HH, HT, TH, TT} where T = tails and H = heads. 
- The outcomes are 
    - HH, HT, TH, and TT. 
    - The outcomes HT and TH are different. 
    - The HT means that the first coin showed heads and the second coin showed tails. 
    - The TH means that the first coin showed tails and the second coin showed heads.
    
    
- Let A = the event of getting at most one tail. 
    - (At most one tail means zero or one tail.) 
    - Then A can be written as 
        - {HH, HT, TH}. 
        - The outcome HH shows zero tails. 
        - HT and TH each show one tail.
        
        
- Let B = the event of getting all tails. 
    - B can be written as 
    - {TT}.
    - B is the complement of A, 
    - so B = A′. Also, 
        - P(A) + P(B) = P(A) + P(A′) = 1.
        
        
- Let C = the event of getting all heads.
    - C = {HH}. 
    - Since B = {TT}, 
    - P(B AND C) = 0.
    - B and C are mutually exclusive. (B and C have no members in common because you cannot have all tails and all heads at the same time.)
    
   
- Let D = event of getting more than one tail. 
    - D = {TT}. 
        - P(D) =  1/4 
        
        
- Let E = event of getting a head on the first flip. 
    - (This implies you can get either a head or tail on the second flip.) 
        - E = {HT, HH}. 
        - P(E) =  2/4 
        
        
- Find the probability of getting at least one (one or two) tail in two flips. 

- Let F = event of getting at least one tail in two flips. 
    - F = {HT, TH, TT}. 
    - P(F) =  3/4
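
The events A through F for two coin flips can be generated by filtering the sample space. This is a sketch assuming equally likely outcomes; the helper `prob` is an illustrative name.

```python
from fractions import Fraction
from itertools import product

S = {"".join(p) for p in product("HT", repeat=2)}   # {HH, HT, TH, TT}

def prob(event):
    """P(event) for equally likely outcomes in S."""
    return Fraction(len(event), len(S))

A = {o for o in S if o.count("T") <= 1}   # at most one tail
B = {o for o in S if o.count("T") == 2}   # all tails
C = {o for o in S if o.count("H") == 2}   # all heads
D = {o for o in S if o.count("T") > 1}    # more than one tail
E = {o for o in S if o[0] == "H"}         # head on the first flip
F = {o for o in S if o.count("T") >= 1}   # at least one tail

print(B == S - A)        # B is the complement of A
print(B.isdisjoint(C))   # B and C are mutually exclusive
print(prob(D), prob(E), prob(F))
```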

##### When calculating probability, there are two rules to consider when determining if two events are independent or dependent and if they are mutually exclusive or not.

### The Multiplication Rule


- If A and B are two events defined on a sample space, 
    - `P(A AND B) = P(B) * P(A|B)`
   

- This rule may also be written as: 
    - `P(A|B) =  P(A AND B) / P(B) `


- (The probability of A given B equals the probability of A and B divided by the probability of B.)


- If A and B are independent, then
     - `P(A|B) = P(A). `
     - Then `P(A AND B) = P(A|B) * P(B) `
     
         becomes `P(A AND B) = P(A)*P(B).`
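
The multiplication rule can be checked against brute-force counting. The sketch below uses a hypothetical example that is not in the original notes: drawing two cards without replacement, with B = first card is a king and A = second card is a king.

```python
from fractions import Fraction
from itertools import permutations

ranks = ["1", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
deck = [(rank, suit) for suit in "CDHS" for rank in ranks]  # 52 cards

# Rule: P(A AND B) = P(B) * P(A|B)
p_b = Fraction(4, 52)          # four kings among 52 cards
p_a_given_b = Fraction(3, 51)  # three kings left among the 51 remaining cards
by_rule = p_b * p_a_given_b

# Brute-force check over all ordered two-card draws without replacement.
draws = list(permutations(deck, 2))
both_kings = sum(1 for first, second in draws if first[0] == "K" and second[0] == "K")
by_counting = Fraction(both_kings, len(draws))

print(by_rule, by_counting)
```

Because the draws are without replacement, the events are dependent, so P(A|B) = 3/51 differs from P(A) = 4/52 and the simplified form P(A)*P(B) would not apply.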

### The Addition Rule

- If A and B are defined on a sample space, then:
    
    - `P(A OR B) = P(A) + P(B) - P(A AND B).`

- If `A and B are mutually exclusive`, then 
    
    - `P(A AND B) = 0.`
    
    Then 
        - P(A OR B) = P(A) + P(B) - P(A AND B) 
        
        becomes P(A OR B) = P(A) + P(B).
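
Both forms of the addition rule can be verified on small sets. This sketch reuses the sets S, A, B, and C from the mutually exclusive example and assumes equally likely outcomes; the helper `prob` is an illustrative name.

```python
from fractions import Fraction

S = set(range(1, 11))
A = {1, 2, 3, 4, 5}
B = {4, 5, 6, 7, 8}
C = {7, 9}

def prob(event):
    """P(event) for equally likely outcomes in S."""
    return Fraction(len(event), len(S))

# General rule: subtract P(A AND B) so shared outcomes are not counted twice.
general_lhs = prob(A | B)
general_rhs = prob(A) + prob(B) - prob(A & B)

# A and C are mutually exclusive, so P(A AND C) = 0 and the rule simplifies.
exclusive_lhs = prob(A | C)
exclusive_rhs = prob(A) + prob(C)

print(general_lhs, general_rhs, exclusive_lhs, exclusive_rhs)
```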

### Conditional Probability
- the likelihood that an event will occur given that another event has already occurred


### Contingency Table
- the method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; the table provides an easy way to calculate conditional probabilities.


### Dependent Events

- If two events are NOT independent, then we say that they are dependent.

### Equally Likely
- Each outcome of an experiment has the same probability.

### Event
- a subset of the set of all outcomes of an experiment; the set of all outcomes of an experiment is called a sample space and is usually denoted by S.
- An event is an arbitrary subset in S.
- It can contain one outcome, two outcomes, no outcomes (empty subset), the entire sample space, and the like.
- Standard notations for events are capital letters such as A, B, C, and so on.


### Experiment
- a planned activity carried out under controlled conditions

### Independent Events
- The occurrence of one event has no effect on the probability of the occurrence of another event. Events A and B are independent if one of the following is true:

        P(A|B) = P(A)
        P(B|A) = P(B)
        P(A AND B) = P(A)P(B)


### Mutually Exclusive
- Two events are mutually exclusive if the probability that they both happen at the same time is zero. If events A and B are mutually exclusive, then P(A AND B) = 0.


### Outcome
- a particular result of an experiment


### Probability
- a number between zero and one, inclusive, that gives the likelihood that a specific event will occur; the foundation of statistics is given by the following three axioms (A. N. Kolmogorov, 1930s). Let S denote the sample space and let A and B be two events in S. Then:
    - 0 ≤ P(A) ≤ 1
    - If A and B are any two mutually exclusive events, then P(A OR B) = P(A) + P(B).
    - P(S) = 1
        
### Sample Space

- the set of all possible outcomes of an experiment


### Sampling with Replacement
- If each member of a population is replaced after it is picked, then that member has the possibility of being chosen more than once.

### Sampling without Replacement

- When sampling is done without replacement, each member of a population may be chosen only once.


### The AND Event

- An outcome is in the event A AND B if the outcome is in both A AND B at the same time.


### The Complement Event

- The complement of event A consists of all outcomes that are NOT in A.


### The Conditional Probability of A GIVEN B
- P(A|B) is the probability that event A will occur given that the event B has already occurred.

### The Or Event
- An outcome is in the event A OR B if the outcome is in A or is in B or is in both A and B.

### Tree Diagram
- the useful visual representation of a sample space and events in the form of a “tree” with branches marked by possible outcomes together with associated probabilities (frequencies, relative frequencies)


### Venn Diagram
- the visual representation of a sample space and events in the form of circles or ovals showing their intersections

![image.png](attachment:image.png)