# Probability

Probability is the chance of something happening. A more formal definition is the likelihood of an event occurring.

An event is a specific outcome or a combination of several outcomes. These outcomes can be pretty much anything.

Having a probability of one expresses absolute certainty of the event occurring and a probability of zero expresses absolute certainty of the event not occurring. You probably figured this out, but higher probability values indicate a higher likelihood.

$ A = event $

$ P(A) = probability $
              
$$P(A) = \frac{Preferred\;(favourable)}{All\;(sample\;space)}$$


**The probability of two independent events occurring at the same time is equal to the product of all the probabilities of the individual events.**

$$ P(A\;and\;B) = P(A) . P(B)$$

eg: The likelihood of getting the ace of spades = (the probability of getting an ace) **times** (the probability of getting a spade.)
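The ace-of-spades example can be checked by counting cards directly. The sketch below is only an illustration: it assumes a standard 52-card deck, and the helper name `probability` is made up for this note.

```python
from fractions import Fraction
from itertools import product

# Assumption: a standard 52-card deck (13 ranks x 4 suits).
ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["spades", "hearts", "diamonds", "clubs"]
deck = list(product(ranks, suits))


def probability(event):
    """Favourable outcomes over all outcomes, as an exact fraction."""
    favourable = [card for card in deck if event(card)]
    return Fraction(len(favourable), len(deck))


p_ace = probability(lambda card: card[0] == "A")          # 4/52
p_spade = probability(lambda card: card[1] == "spades")   # 13/52
p_ace_of_spades = probability(lambda card: card == ("A", "spades"))

# Rank and suit are independent, so the product matches the direct count.
print(p_ace * p_spade == p_ace_of_spades)
```

Using `Fraction` keeps the arithmetic exact, so the comparison does not suffer from floating-point rounding.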

**Q. Why do we express probabilities numerically?**

Our goal is to be able to compare the probabilities of events and determine which is the relatively more likely outcome. 

**Experimental Probabilities :** The probabilities we get after conducting experiments, whereas the ones we introduced earlier were theoretical or true probabilities.

The experimental probabilities we get are not always equal to the theoretical ones, but are a good approximation.
       
$$P(A) = \frac{Successful\;trials}{All\;trials}$$

**Q. Why do we use experimental probabilities?**

Because they are easy to compute and serve as good predictors for theoretical ones.
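As a rough illustration of successful trials over all trials, the sketch below simulates fair coin flips. The function name, the seed and the number of trials are arbitrary choices made for this example.

```python
import random


def experimental_probability(trials, seed=0):
    """Estimate P(heads) for a fair coin as successful trials / all trials."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    heads = sum(rng.random() < 0.5 for _ in range(trials))
    return heads / trials


estimate = experimental_probability(10_000)
print(estimate)  # close to, but usually not exactly, the theoretical 0.5
```

With more trials the estimate tends to settle nearer the theoretical value, which is why experimental probabilities work as approximations.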

**Expected value E(A) :** The average outcome we expect if we run an experiment many times. It is used to predict future events.

The expected value of an event A, denoted **E(A)**, is the outcome we expect to occur when we run an experiment.

We can use expected values to make predictions about the future based on past data. 

eg: *We frequently make predictions using intervals instead of specific values due to the uncertainty the future brings.*
Meteorologists often use these when forecasting the weather. They do not know exactly how much snow, rain or wind there is going to be, so they provide us with likely intervals. That is why we often hear statements like "expect between three and five feet of snow tomorrow morning" or "temperatures rising up to 90 degrees on Wednesday".

**For Categorical Outcome :**

For an event with categorical outcomes like suits, we calculate the expected value by multiplying the theoretical probability of the event P(A) by the number of trials we carried out (n).

**E(A) = P(A) * n**

**Numerical outcome :**

**E(X) = P(A) . A + P(B) . B + P(C) . C**

We take the value for every element in the sample space and multiply it by its probability, then we add all of those up to get the expected value.
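Both expected-value formulas can be sketched in a few lines. The die and the 20 card draws below are illustrative assumptions, and the function names are hypothetical.

```python
from fractions import Fraction


def expected_value(distribution):
    """Numerical outcomes: sum of value * probability over the sample space."""
    return sum(value * p for value, p in distribution.items())


def expected_count(p_event, n_trials):
    """Categorical outcomes: E(A) = P(A) * n."""
    return p_event * n_trials


# A fair six-sided die: every face has probability 1/6.
die = {face: Fraction(1, 6) for face in range(1, 7)}
ev_die = expected_value(die)

# Drawing a spade (P = 1/4) in 20 draws with replacement.
ev_spades = expected_count(Fraction(1, 4), 20)

print(ev_die, ev_spades)  # 7/2 5
```

Note that 7/2 is not a face of the die, which shows that an expected value need not be an attainable outcome.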

**Q. Why do we use intervals when forecasting future events?**

1. Because the expected value might have a low probability of occurring.
2. Because we want to increase the likelihood of our predictions being accurate.
3. Because the expected value might not be attainable.

#### Frequency
**What does the “frequency” of a value within the sample space represent?**

The number of times the value features in a sample space.

**Probability frequency distribution**

A collection of the probabilities for each possible outcome and how often they occur. It is usually presented through *Graphs or Tables*.

*Transforming the frequency of each outcome into a probability:* <br>Knowing the size of the sample space, we can determine the true probabilities for each outcome. We simply divide the frequency for each possible outcome by the size of the sample space.

**Note :** When making predictions, we generally want our interval to have the highest probability. We can see that the individual outcomes with the highest probability are the ones with the highest bars in the graph. Usually the highest bars will form around the expected value, so the values around it are also the values with the highest probability.

This suggests that if we want the interval with the highest probability, we should construct it around the expected value.

**Q. With 2 standard six-sided dice, why is the probability of rolling a sum of 7 equal to one sixth?**

There are 6 favourable outcomes out of 36 outcomes in the sample space, so the favourable-over-all formula results in 6/36 or simply 1/6.
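The frequency and probability distribution for the sum of two dice can be built by enumerating the sample space. This is a minimal sketch, and the variable names are chosen just for this note.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Sample space: all 36 ordered pairs of faces.
rolls = list(product(range(1, 7), repeat=2))

# Frequency of each sum within the sample space.
frequency = Counter(a + b for a, b in rolls)

# Divide each frequency by the size of the sample space to get probabilities.
distribution = {
    total: Fraction(count, len(rolls))
    for total, count in sorted(frequency.items())
}

for total, p in distribution.items():
    print(total, frequency[total], p)
```

The sum 7 has the highest frequency (6 of 36), and it is also the expected value of the sum, which is consistent with building intervals around the expected value.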

**Complement P(A') :**
The complement of an event is everything the event is not: all outcomes in the sample space that are not part of the event.

$P(A') = 1-P(A)$

The complement of a complement is the event itself: $(A')' = A$, so $P((A')') = P(A)$.
<br>eg: What is the complement of NOT rolling a 3?</br>
Ans - Rolling a 3.




Sum of the probabilities of A & B = P(A) + P(B)

If A, B and C are mutually exclusive and $P(A) + P(B) + P(C) = 1$, then we are 100% sure that one of these events will occur.

# Combinatorics
## Permutation

Permutations represent the number of different possible ways we can arrange an **entire** set of elements. These elements can be digits, letters, objects or even people. **Here we are just arranging, not picking. Here n = p**

Permutation refers to the different ways of arranging a set of objects in a sequential order.

**In permutations the order in which choices are made is of utmost importance.**

The formula for permutations is **n! or p!**

$$P_n = n * (n-1) * (n-2) * .........* 1 = n! \;\;\;or\;\;\; p!$$ 

### Factorials

$$n!\;=\;n\;*\;(n-1)!$$
n! = product of the natural numbers from 1 to n.

>Negative numbers don't have factorials.

> 0! = 1


$$(n+1)! = n!*(n+1)$$
$$eg: 7! = 6!*7$$

### $(n + k)!\;and\;(n - k)!$

$A/Q\;\;(n+k)!$ 
$$(n+k)! = n! * (n+1) * (n+2) * (n+3) ......... * (n+k)$$
 
<br>$$So\;if\;\; n = 5 , k = 2$$</br>

 
$$7! = 5!* 6 * 7$$

***

$A/Q\;\;(n-k)!$ 
$$(n-k)! = \frac{n!}{(n-k+1) * (n-k+2) *.......* (n)} $$
 
<br>$$ So\;if\;\; n = 5 , k = 2 $$</br>

 
$$3! \;= \frac{5!}{(4*5)}$$
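The two identities above can be spot-checked numerically for n = 5, k = 2. This is only a sanity check under those values, not a proof.

```python
from math import factorial, prod

n, k = 5, 2

# (n+k)! = n! * (n+1) * ... * (n+k)
lhs_plus = factorial(n + k)
rhs_plus = factorial(n) * prod(range(n + 1, n + k + 1))

# (n-k)! = n! / ((n-k+1) * ... * n)
lhs_minus = factorial(n - k)
rhs_minus = factorial(n) // prod(range(n - k + 1, n + 1))

print(lhs_plus, rhs_plus)    # 5040 5040
print(lhs_minus, rhs_minus)  # 6 6
```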


## Variations 
Variations express the total number of ways we can pick and arrange some elements of a given set.
<br>**In Variations the order in which choices are made is of utmost importance.**</br>

n = the total number of elements, we have available

r = the number of positions we need to fill


<br>$$Variation \; with \; repetition = n^r$$</br>

The number of variations with repetition when picking r-many elements out of n elements, is equal to **n** to the power of **r**


$$Variation \; without \;repetition = \frac{n!}{(n-r)!}$$ 

$$OR$$

$$Variation\;(^nP_r)\;=\;^nC_r * r!$$

$$\Longrightarrow\;^nP_r\; = \frac{n!}{r!*(n-r)!}\;*\; r!$$

$$\Longrightarrow\;^nP_r\; = \frac{n!}{(n-r)!}\;$$

<br>This is the number of variations without repetition when arranging **p** (= **r**) elements out of **n**.</br>
<br></br>
**Q. Out of 7 runners, we need to choose 4 and arrange the order in which they run. In how many ways can we accomplish that?** 

In this case, we can plug in 7 for n and 4 for p into the formula. 

Then, $\frac{n!}{(n-p)!} = \frac{7!}{(7-4)!}$, which is just $\frac{7!}{3!} = 840$.
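The count for picking and arranging 4 out of 7 can be cross-checked three ways: with the formula, with `math.perm` (Python 3.8+) and by brute-force enumeration. The integers standing in for the 7 candidates are placeholders.

```python
from itertools import permutations
from math import factorial, perm

n, r = 7, 4

by_formula = factorial(n) // factorial(n - r)  # n! / (n-r)!
by_library = perm(n, r)                        # same quantity from the standard library
by_enumeration = sum(1 for _ in permutations(range(n), r))

# For comparison: variations with repetition, n to the power of r.
with_repetition = n ** r

print(by_formula, by_library, by_enumeration, with_repetition)  # 840 840 840 2401
```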

**Q. When do we use Variations instead of Permutations?**

We use Variations when we have to first pick and then arrange some (but not all) elements of the sample space. In Variations we are not arranging all elements in the sample space.

## Combinations

Combinations are the number of different ways we can pick certain elements of a set. 
<br>Basically selecting 4 people out of 7 people. i.e $^7C_4$ where 7 is n and 4 is p.</br>

**Here order is not important.** We are just concerned with choosing any number of things from the lot without giving any precedence to the order.

### Combination =  $ \frac{Variation}{Permutation} $
$$Combination = \frac{n!}{r!(n-r)!}$$ 


eg: All possible combinations chosen from the letters m, n, o. When three out of three letters are to be selected, the only combination is mno. When two out of three letters are to be selected, the possible combinations are mn, no, om. **It does not count different orderings of the same elements separately.**

**eg: Selecting 3 people out of 10:**

$Variation =\frac{n!}{(n-p)!} = 720$

<br>$i.e. (10*9*8 = 720)$

but here we are counting every group of 3 people several times i.e Choosing Sarah, Alex and Dave is exactly the same as choosing Sarah, Dave and Alex, Dave, Sarah and Alex, Dave, Alex and Sarah, Alex, Dave and Sarah or Alex, Sarah and Dave.
Any of the six permutations we wrote is a different variation, but not a different combination. That is what we meant when we said that combinations take into account double counting.

Variations don't take into account double counting elements.
<br>That is where combinations step in.

We can say that all the different permutations of a single combination are different variations.

So when choosing 3 people we have $p!$ permutations, i.e. $3! = 6$ permutations for Alex, Dave and Sarah.
<br> We have 6 variations for any combination.

$Combination = 720/6 = 120$ ways of choosing who represents the company. 
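The relationship combination = variation / permutation for this example can be verified as follows. The integers 0 to 9 are stand-ins for the ten employees.

```python
from itertools import combinations
from math import comb, factorial, perm

n, p = 10, 3

variations = perm(n, p)             # ordered selections: 10 * 9 * 8
orderings_per_group = factorial(p)  # each group of 3 is counted 3! times
combos = variations // orderings_per_group

# Cross-check against the library function and a brute-force count.
enumerated = sum(1 for _ in combinations(range(n), p))

print(variations, orderings_per_group, combos)  # 720 6 120
```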



To summarise: all the different permutations of a single combination are different variations. Here there are six permutations per combination, 120 combinations and 720 variations.

**Combination with repetition**

$$\frac{(n+r-1)!}{r!(n-1)!}$$


### Difference between Variation and Combination: 

Variation is the total number of ways we can **pick & arrange some elements of a given set. It can be with (or) without Repetitions.**

Combination is the number of different ways we can **pick certain elements of a set.**
The order in which the elements are selected is not important.

**Q. Imagine you are on a trip to Paris and decide to try some of their famous macaroons. The bakery you go to offers different-size “variety” packs, where you get to choose 3, 5 or 8 macaroons. The only requirement is that they all be different flavors.**

**How many different 3-macaroon packs can you get, considering there are 8 distinct flavors.**


Since it makes no difference to us what order they put the macarons in the box, we are dealing with combinations instead of variations. There are 8 distinct flavours, but we are only getting 3 macarons. Then n is 8 and p is 3, so we plug these values into the formula from this lecture to get 
$$\frac{n!}{r!(n-r)!}= \frac{8!}{3!*5!} = 56$$


**Q. Now imagine you want to get the medium pack which contains 5 macaroons instead of 3. How many different possible packs can you make?**

Just like the last question, it makes no difference to us what order they put the macaroons in the box, we are dealing with combinations. There are 8 distinct flavors, but we are only getting 5 macaroons. Then n is 8 and p is 5, so we plug these values into the formula from this lecture to get 
$$\frac{n!}{p!(n-p)!}= \frac{8!}{5!*3!} = 56$$

**Q. Now imagine the same scenario but this time you want the large box of 8 macaroons. How many different variety packs can you get?**

If we plug in 8 for both n and p, we get that there are $\frac{n!}{p!*(n-p)!}\;=\; \frac{8!}{(8!*0!)}$ different variety packs of 8 macaroons. Since 0!=1, 8!/(8!0!)=8!/8!, which is just 1. We have only 1 way of getting 8 macaroons with different flavors, considering we only have 8 distinct fillings.

### Symmetry

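Symmetry is used to avoid calculating factorials of large numbers: picking p elements out of n gives the same count as picking the (n-p) elements to leave out, so $^nC_p = \;^nC_{n-p}$.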

$$ When\;\;p> \frac{n}{2} > (n-p)$$

We had to select three of our 10 employees to represent the company at a conference. There are 120 possible selections.

We would also have 120 different ways of picking seven employees. That's because picking seven out of 10 employees to take to the conference is the same as choosing three out of 10 to leave behind.
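A quick check of the symmetry for n = 10, using `math.comb` (Python 3.8+). This is an illustrative check, not a proof.

```python
from math import comb

n = 10

pick_three = comb(n, 3)  # choose who goes
pick_seven = comb(n, 7)  # equivalently, choose who stays behind

# The same symmetry holds for every p from 0 to n.
symmetric = all(comb(n, p) == comb(n, n - p) for p in range(n + 1))

print(pick_three, pick_seven, symmetric)  # 120 120 True
```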

**Q.Imagine your company is trying to gain customers by running an online ad campaign. The idea is to focus on a certain demographic which frequently uses social media. Your campaign will run ads on Facebook, Messenger, Instagram, Twitter and Reddit. Your graphical designers have created 8 different versions of the banner you can use. Based on this information:**


**Q1.Calculate how many different options you have for the entire campaign, assuming you want to use a different one for each platform.**

Using different banners for each platform means we can think of each social media platform as a different position. Hence, we are going to be dealing with Variations = $8*7*6*5*4 = 6720$


**Q2. Calculate how many different options you have for the entire campaign, assuming you can use the same banner for some or all the platforms.**

$8^5 = 32768$

**Q3. Calculate how many ways we can pick which of the 8 banners to use, assuming we use different ones for each platform.**

We need to select 5 out of the 8 banners to use.
$^8C_5$

We use different ones for each platform, so repetition is not allowed. Hence, we use the formula for Combinations without repetition.

$$\frac{8*7*6*5!}{5!*3!} \Longrightarrow\;\frac{8*7*6}{3*2} = 56$$


**Q4. Calculate how many ways we can pick which of the 8 banners to use, assuming we can use each one multiple times.**

Now, we need to select 5 out of the 8 banners to use. However, we can choose some multiple times. Therefore, we need to use Combinations with repetition

$$\frac{(n+r-1)!}{r!(n-1)!} = \frac{12!}{5!*7!} = \frac{8*9*10*11*12}{1*2*3*4*5} = 792$$

In this case, it is vital to not only know which banners we are using, but also to know how many times we are using each one, so we can assign them accordingly.
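The four answers to the ad-campaign questions can be reproduced with the standard library. The sketch assumes 8 banners and 5 platforms as stated in the question, and the variable names are made up for this note.

```python
from itertools import combinations_with_replacement
from math import comb, perm

banners, platforms = 8, 5

q1 = perm(banners, platforms)                  # variations without repetition
q2 = banners ** platforms                      # variations with repetition
q3 = comb(banners, platforms)                  # combinations without repetition
q4 = comb(banners + platforms - 1, platforms)  # combinations with repetition

# Brute-force cross-check of the combinations-with-repetition count.
q4_enumerated = sum(
    1 for _ in combinations_with_replacement(range(banners), platforms)
)

print(q1, q2, q3, q4, q4_enumerated)  # 6720 32768 56 792 792
```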

### Sets and Events

Every *event* has a *set of outcomes* that satisfy it. These are the favorable outcomes.

* Sets are denoted by Uppercase $\Longrightarrow$ "X"
* Elements are denoted by Lower-case $\Longrightarrow$ "x"
<br></br>
* **Empty set or Null set:** An empty set denoted by $\emptyset$ 
* **Non- Empty sets** can be finite or infinite

<br></br>
* An element being part of a set is denoted by  $"x\; \in \;A"$  where x is the element and A is the entire set.

**As for all or for any $"\forall"$**

* **for all x in A** $\rightarrow "\forall\; x \in A "$

**' : '** is used for such that.

* **for all x in A such that x is even** $\rightarrow "\forall\; x \in A : x\;is\;even "$

**Subset:** 
<br>A subset is a set that is fully contained in another set. If every element of A is also an element of B, then A is a subset of B. We denote that with A subset B.
$$A\subset B$$

**Every non-empty set has at least two subsets: itself and the null set.**

$A\subset A\;\; and\;\;  \emptyset \subset A$

>**Note :** There always exists a set of outcomes that satisfy a given event. Because the null set exists, even the outcomes of an impossible event can be described with a set.

#### Types of Venn Diagram:

* **If the two circles never touch :** then the two events can never happen simultaneously, essentially event A occurring guarantees that Event B is not occurring and vice versa.
<br>eg: If we get a heart, we can't get a diamond and if we get a diamond, we can't get a heart since each card has exactly one suit.
<br></br>
* **If two circles are intersecting :** if these circles intersect, it means that the two events can occur at the same time.
<br>eg: Event A is drawing a diamond and Event B is drawing a queen. The area where they intersect will be represented solely by the queen of diamonds.
<br></br>
* **If one circle completely overlaps another :** One event can only ever occur if the other one does as well.
<br>eg: Event A could be drawing a red card and event B could be drawing a diamond. The circle of B is completely contained inside A, so we can only ever get a diamond if we get a red card. Notice that if the card we drew is black, it cannot be a diamond. Thus, if Event A does not occur, then neither does event B.
<br> However, because we can draw a heart, it is possible to get a red card that isn't a diamond. *Therefore, event B not occurring does not guarantee event A not occurring.*
    
    **Note:** If Event A does not occur, then neither does event B. In short, if an outcome is not part of a set, it cannot be part of any of its subsets. However, an outcome not being part of some subset does not exclude it from the entirety of the greater set.</br>



#### INTERSECTION

When we want both A and B to happen at the same time, we are talking about their intersection. Graphically, the intersection is exactly as the name suggests: the area where these events intersect. It consists of all the outcomes that are favorable for both Event A and Event B simultaneously, and we denote it as A intersect B.

$$A \cap B $$

eg: With red cards and diamonds, the intersection of the two would simply be all diamonds. That is because any diamond is simultaneously red and a diamond.

We would write this as A intersect B equals B. ie. $A \cap B \; = \; B $


We use intersections only when we want to denote instances where both events A and B happen simultaneously.

#### Diagram is different

![intersection.PNG](attachment:intersection.PNG)

#### UNION

The union of two sets is a combination of **all** outcomes preferred for either A or B.

$$A \cup B\;\;or\;\;B \cup A$$

*Types of Union:*
*  **If the sets A and B do not touch at all :** Then their intersection would be the empty set. $A\cap B = \emptyset$ Therefore, their union will be their sum $A \cup B = A\; + \; B$ 
<br>Intuitively this makes sense. No element is in both sets simultaneously, so we do not need to worry about double counting.</br>


* **If the events intersect:**  The area of the union is represented by the sum of the two sets minus their intersection, 
<br>$A \cup B = A+B\;\;-\;A\cap B$</br>
<br>That is because if we simply add up the area of the two sets, we would be double counting every element that is part of the intersection.</br>
<br>eg: Five people in the office have blond hair, four people have blue eyes, and only Kate has both. Therefore, there are only three non-blond people with blue eyes and four blond people without blue eyes. Thus, the union of blond and blue-eyed colleagues is the sum of people who have precisely one of the two features, as well as Kate, who has both: 3 + 4 + 1 = 8, which matches 5 + 4 - 1.</br>


* **If one event is inside another :** If B is a subset of A, the union is simply the entire set A. $A \cup B = A$

![union.PNG](attachment:union.PNG)

**Q. There are 8 blond and 10 brown-eyed people in the office. If only Jason and Eve have both features, how many people represent the union of blond and brown-eyed people?**

**Ans - 18-2 = 16**

<br>$A \cup B = A+B\;\;-\;A\cap B$</br>
$A \cup B = 16$ 
<br>$A+B = 18$</br>
$A \cap B = 2$
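The same count can be reproduced with Python sets. Jason and Eve come from the question. All other member names below are placeholders invented for this sketch.

```python
# Jason and Eve have both features; the remaining names are hypothetical.
both = {"Jason", "Eve"}
blond = both | {f"blond_{i}" for i in range(6)}       # 8 blond people
brown_eyed = both | {f"brown_{i}" for i in range(8)}  # 10 brown-eyed people

# Direct union versus the formula |A| + |B| - |A intersect B|.
union_size = len(blond | brown_eyed)
by_formula = len(blond) + len(brown_eyed) - len(blond & brown_eyed)

print(union_size, by_formula)  # 16 16
```

Subtracting the intersection removes the double count of the two people who appear in both sets.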

#### MUTUALLY EXCLUSIVE SETS 

Mutually exclusive sets are sets which are not allowed to have any overlapping elements. Graphically, their circles never intersect. i.e **They cannot both occur at the same time**

eg: When tossing a coin, heads and tails cannot both come up at the same time.


Mutually exclusive sets have the empty set as their intersection. Therefore, if the intersection of any number of sets is the empty set, then they must be mutually exclusive and vice versa.

About their Union : If some sets are mutually exclusive, their union is simply the sum of all separate individual sets.
$$A \cup B = A+B$$
<br></br>

#### Complements of sets

All values that are part of the sample space but not part of the set.

**NOTE:** **Complements are always mutually exclusive. However, not all mutually exclusive sets are complements.**
<br>eg: For instance, imagine A is the set of all even numbers, and B is the set of all numbers ending in five. We know that any number ending with five is odd, so these two sets are definitely mutually exclusive. However, the complement of all even is all odd and not just the ones ending with five. Therefore, a number like 13 would be part of the complement, but not the set B.</br>



**Example for mutually exclusive, but not complements:**

A : Winning a Game.
<br>B : Drawing a Game.</br>

These are mutually exclusive because you cannot simultaneously win and draw the same game. However, you can also lose this game, so the two are not complements.


#### DEPENDENT & INDEPENDENT EVENTS
* <b>Independent Events : </b> The theoretical probability remains unaffected by other events. eg: Flipping a coin or rolling a die
* <b>Dependent Events : </b> Probabilities of dependent events vary as conditions change. eg: Taking out a marble from a bag of 5 different marbles.
<br></br>
##### CONDITIONAL PROBABILITY : 
The probability of getting A, if we are given that B has already occurred, $P(A|B)$.
<br>We use it to distinguish dependent from independent events.</br>
<br>eg: A $\rightarrow$ drawing Queen of spades.
    <br>B $\rightarrow$ is drawing a spade.
<br>Therefore, $P(A|B)$ would represent the probability of drawing the queen of spades if we know the card is a spade. We already calculated this earlier, so P(A) given B equals $\frac{1}{13}$.

>**NOTE:** When A and B are independent, then $P(A|B) = P(A)\;and\;P(B|A) = P(B)$

>If two events are **Independent** then the probability of their intersection i.e. Probability(A and B) is the product of their individual probability.  $$\longrightarrow P(A\cap B) = P(A)* P(B)$$ 

eg: Queen of spades 

* A$\rightarrow$ drawing the exact card i.e. Queen of Spades 
* B$\rightarrow$ drawing the correct suit i.e. Spades
* C$\rightarrow$ getting a Queen

Therefore, $P(A|B)$ represents the probability of drawing the queen of spades if we know the card is a spade, which is $\frac{1}{13}$.


Event C represents getting a queen, then $P(A|C)$ expresses the likelihood of getting the queen of spades, assuming we drew a queen. Thus $P(A|C) = \frac{1}{4}$.

We call this probability the conditional probability, and we use it to distinguish dependent from independent events.


<br>Normally, the probability of drawing the queen of spades is $P(A) = \frac{1}{52}$. However, it increases if we know it's a spade, $P(A|B) = \frac{1}{13}$, so we can say the two events are **dependent, as the two probabilities are not equal.**

Similarly, because the probability of drawing our desired card changes if we know it is a queen ($P(A|C) = \frac{1}{4} \ne \frac{1}{52}$), we can say A and C are also dependent.

#### Conditional probability formula:
$$P(A|B) = \frac{P(A\cap B)}{P(B)}$$  

<i>here to satisfy the conditional probability, we need both events B and A to occur simultaneously.</i>

$P(A|B)\rightarrow$ Probability of A occurring given B has already occurred.

$P(A\cap B)\rightarrow$This suggests that the intersection of A and B would consist of all favorable outcomes for this probability.

$P(B) \rightarrow$ conditional probability requires that Event B occurs, so the sample space would simply be all outcomes where event B is satisfied.
$$only\;if\;P(B)> 0$$ 

* If $P(B) = 0$, event B would never occur, thus $A|B$ would not be interpretable.
<br></br>
* If $P(A) = P(A|B)$ then events A and B are independent. $So\;P(A\cap B) = P(A)*P(B) \;\;[product\;of\;individual\;probabilities]$
#### Favourable all Formula:

$$P(A) = \frac{favourable}{All}$$

**Importance of the Conditional Probability Formula**

* The order in which we write the elements is crucial. In general, $P(A|B)\ne P(B|A)$

**Q. Applying the Conditional Probability Formula, what is the probability of event A occurring, given event B has occurred if the likelihood of getting their intersection is 0.15 and the likelihood of event B is 0.6?**

According to the formula, $P(A|B) = \frac{0.15}{0.6} = 0.25.$
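The conditional probability formula can be applied to the queen-of-spades events from earlier by treating events as sets of cards. This is a sketch under the assumption of a standard 52-card deck, and the helper names `p` and `conditional` are invented for this note.

```python
from fractions import Fraction
from itertools import product

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["spades", "hearts", "diamonds", "clubs"]
deck = set(product(ranks, suits))


def p(event):
    """Probability of an event given as a set of equally likely cards."""
    return Fraction(len(event), len(deck))


def conditional(a, b):
    """P(A|B) = P(A and B) / P(B), defined only when P(B) > 0."""
    if not b:
        raise ValueError("P(B) must be greater than 0")
    return p(a & b) / p(b)


A = {("Q", "spades")}                           # the queen of spades
B = {card for card in deck if card[1] == "spades"}
C = {card for card in deck if card[0] == "Q"}

print(conditional(A, B), conditional(A, C))  # 1/13 1/4
print(conditional(B, A))                     # 1, so P(A|B) != P(B|A)
print(conditional(C, B) == p(C))             # True: queen and spade are independent
```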

### ADDITIVE LAW  


#### For Mutually Exclusive Events

Probability of A or B occurring.

$$ P(A\;or\;B) = P(A) + P(B) $$

#### For Non-Mutually Exclusive Events
The additive law states something very similar: the probability of the union of two sets is equal to the sum of the individual probabilities of each event, minus the probability of their intersection.
$$P(A \cup B) = P(A)\;+\;P(B)\;\;-\;P(A\cap B)$$

Example:

**Probability of getting a "King" or "Heart" card.** 
<br></br>
$$P(getting\;King) + P(getting\;Heart) - P(K\cap Heart)$$
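Working the King-or-Heart example through with exact fractions, assuming a standard 52-card deck (the helper `p` is a made-up name):

```python
from fractions import Fraction
from itertools import product

ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["spades", "hearts", "diamonds", "clubs"]
deck = set(product(ranks, suits))


def p(event):
    """Probability of an event given as a set of equally likely cards."""
    return Fraction(len(event), len(deck))


kings = {card for card in deck if card[0] == "K"}
hearts = {card for card in deck if card[1] == "hearts"}

# Additive law: subtract the intersection (the king of hearts) once.
p_union = p(kings) + p(hearts) - p(kings & hearts)

print(p(kings), p(hearts), p(kings & hearts), p_union)  # 1/13 1/4 1/52 4/13
```

The result agrees with counting the union directly: 16 of the 52 cards are a king or a heart.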

### MULTIPLICATION RULE




#### For Dependent Events
<br></br>
The probability of both events happening equals the product of the likelihood of B occurring and the conditional probability that A occurs, given B has already occurred.
$$P(A\cap B) = P(A|B)*P(B)$$

This we get from the conditional probability formula.

eg: If event B occurs *40%* of the time, **P(B) = 0.4**, and event A occurs 50% of the time B occurs,
$P(A|B) = 0.5$, then they would simultaneously occur 20% of the time: $P(A|B)* P(B) = 0.5*0.4 = 0.2$

### Bayes' Law

The conditional probability of A given B equals the conditional probability of B given A, times the probability of A, divided by the probability of B. This equation is known as Bayes' Theorem.

$$P(A|B)= \frac{P(B|A)*P(A)}{P(B)}$$

It is crucial because it allows us to find a relationship between the different conditional probabilities of two events.
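As a small worked instance, Bayes' formula can recover the queen-of-spades result from the reverse conditional. The function name `bayes` is hypothetical, and the inputs assume a standard 52-card deck with A = drawing the queen of spades and B = drawing a spade.

```python
from fractions import Fraction


def bayes(p_b_given_a, p_a, p_b):
    """P(A|B) = P(B|A) * P(A) / P(B), defined only when P(B) > 0."""
    if p_b == 0:
        raise ValueError("P(B) must be greater than 0")
    return p_b_given_a * p_a / p_b


p_a = Fraction(1, 52)     # P(queen of spades)
p_b = Fraction(13, 52)    # P(spade)
p_b_given_a = Fraction(1) # the queen of spades is always a spade

p_a_given_b = bayes(p_b_given_a, p_a, p_b)
print(p_a_given_b)  # 1/13
```

The output matches the value of $P(A|B)$ obtained directly from the conditional probability formula.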

