# <center>PROBABILITY</center>

## Introduction to Probability

<b>What is Probability</b>
- Probability is a measure of how likely an event is to occur.
- Whenever we're unsure about the outcome of an event, we can talk about how likely each possible outcome is.
- It is an important skill for data scientists, who routinely work with data affected by chance.
  
For equally likely outcomes, P(A) = $\frac{n}{N}$ = $\frac{outcomes\ in\ A}{outcomes\ in\ Sample\ Space}$

<br>

<b>Why Probability?</b>  
- Randomness exists everywhere, and probability theory allows us to analyze chance events.
- Our aim is to determine the likelihood of an event occurring, using a numerical scale from 0 to 1, where 0 indicates impossibility and 1 indicates certainty.
- Example: Tossing a fair coin has a 50% probability of landing heads.

<br>

<b>Examples:</b>

- There are 6 balls, 3 are red, 2 are yellow and 1 is blue.  
    What is the probability of picking a yellow ball?  
    <u>solution</u>  
    P(Yellow) = $\frac{no.\ of\ Yellow\ balls}{total\ no.\ of\ balls}$
                = $\frac{2}{6}$ ≈ 0.33
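
The calculation above can be sketched in Python. This is only a minimal illustration: the helper name `classical_probability` is made up for this sketch, and it assumes every ball is equally likely to be picked.

```python
from fractions import Fraction

# Ball counts taken from the example above.
balls = {"red": 3, "yellow": 2, "blue": 1}

def classical_probability(colour, counts):
    """P(colour) = favourable outcomes / total outcomes, kept exact as a Fraction."""
    total = sum(counts.values())
    return Fraction(counts[colour], total)

p_yellow = classical_probability("yellow", balls)
print(p_yellow, round(float(p_yellow), 2))  # 1/3 0.33
```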
                
<br>
 
- There is a container full of coloured bottles: red, blue, green and orange.  
    A bottle is picked out, its colour is noted, and it is put back. A person did this 1000 times and got the following results:
    - No. of blue bottles picked: 300
    - No. of red bottles: 200
    - No. of green bottles: 450
    - No. of orange bottles: 50  
    What is the probability that he will pick a green bottle?  
    <u>solution</u>  
    P(Green) = $\frac{frequency\ of\ Green\ bottles}{total\ frequency\ of\ bottles}$
                = $\frac{450}{1000}$ = 0.45  
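
When probabilities come from repeated trials like this, they are relative frequencies. A small sketch, using only the counts listed above (the variable names are illustrative):

```python
# Observed counts from the 1000 draws described above.
observed = {"blue": 300, "red": 200, "green": 450, "orange": 50}

total_draws = sum(observed.values())

# Relative frequency of each colour = count / total number of draws.
relative_freq = {colour: count / total_draws for colour, count in observed.items()}

print(relative_freq["green"])  # 0.45
```

The relative frequencies of all four colours add up to 1, as expected for a complete set of outcomes.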
                               
<br>

<b>Probability Theory: Case Study</b>  
- Probability theory is a tool employed by researchers, businesses, investment analysts and countless others for risk management and scenario analysis.
- Small Business:
    - If a business expects to receive between 500k and 700k in revenue each month, a graph of the possible outcomes will begin at 500k on the low end and end at 700k on the high end. For a typical probability distribution, the graph will resemble a bell curve, where the least likely outcomes fall near the extreme ends of the range and the most likely outcomes fall near the midpoint.  
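
To make the bell-curve idea concrete, here is a rough sketch using `statistics.NormalDist` from the Python standard library. The mean (600k) and standard deviation are assumptions chosen for illustration so that 500k to 700k covers roughly three standard deviations on each side; they do not come from the case study.

```python
from statistics import NormalDist

# Assumed parameters (not from the text): mean at the midpoint of the range,
# and a standard deviation chosen so 500k-700k spans about +/- 3 sigma.
revenue = NormalDist(mu=600_000, sigma=100_000 / 3)

def prob_between(dist, low, high):
    """Probability that the value falls in [low, high]."""
    return dist.cdf(high) - dist.cdf(low)

p_middle = prob_between(revenue, 590_000, 610_000)  # band around the midpoint
p_edge = prob_between(revenue, 500_000, 520_000)    # band at the low extreme

print(p_middle > p_edge)  # True
```

Both bands are 20k wide, yet the one around the midpoint is far more likely than the one at the edge, which is what the bell shape expresses.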
    
<br>

<b>Usage of Probability in Data Science:</b>
- Confidence intervals in statistics, which express how confident we can be that an interval contains the true value of a parameter.
- Probability distribution:
    - Not every type of data can be analyzed in the same way. Some data follow a Normal, Poisson, Bernoulli or Binomial distribution, while others follow an Exponential distribution.
- Bayes Theorem - Naive Bayes Algorithm
- Conditional Probability
- Central Limit Theorem
- Markov Chains and Hidden Markov Models.

##  Key Terminologies

- Random Experiment:
    - In ML, we mostly deal with uncertain events. A random experiment is an experiment whose outcome is not known with certainty.

- Sample Space: 
    - It is the universal set that consists of all possible outcomes of an experiment.
    - Example: Outcomes of College Application. S = {admitted, not admitted}

- Event:
    - A subset of the sample space. Probability is usually calculated with respect to an event.
    - Example: Getting heads on a coin toss.
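
These terms map naturally onto Python sets. The sketch below uses a hypothetical experiment (tossing a fair coin twice) that is not in the notes above, and assumes all outcomes are equally likely.

```python
from itertools import product

# Sample space for tossing a coin twice: every ordered pair of outcomes.
sample_space = set(product(["H", "T"], repeat=2))

# Event: "at least one head", expressed as a subset of the sample space.
at_least_one_head = {outcome for outcome in sample_space if "H" in outcome}

assert at_least_one_head <= sample_space  # an event is a subset of S

p_event = len(at_least_one_head) / len(sample_space)
print(p_event)  # 0.75
```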

<br>

<b>Random Variable:</b>
- A function that maps every outcome in the sample space to a real number.
- It can be classified as discrete or continuous depending on the values it can take.
- If a random variable X can assume only a finite or countably infinite set of values, it is called a discrete random variable.
- If a random variable X can take any value from an uncountably infinite set, such as an interval of real numbers, it is called a continuous random variable.
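
As a small illustration of "a function that maps every outcome to a real number", the sketch below defines a discrete random variable X as the number of heads in two tosses of a fair coin. The experiment and the name `number_of_heads` are assumptions made for this example only.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Sample space for two tosses of a fair coin (equally likely outcomes assumed).
sample_space = list(product("HT", repeat=2))

def number_of_heads(outcome):
    """Random variable X: maps an outcome such as ('H', 'T') to a real number."""
    return outcome.count("H")

# Distribution of X: P(X = x) = (outcomes mapped to x) / (all outcomes).
counts = Counter(number_of_heads(o) for o in sample_space)
distribution = {x: Fraction(c, len(sample_space)) for x, c in sorted(counts.items())}

for x, p in distribution.items():
    print(f"P(X = {x}) = {p}")
```

X takes only the values 0, 1 and 2, so it is discrete, and its probabilities sum to 1.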

<br>

<b>Examples of Discrete Random Variable:</b>
- Credit Rating (finite category)
- No. of orders received.
- Customer churn: whether a customer stopped using your company's product. (Yes or No)
- Fraud Detection (Yes or No)

<br>

<b>Examples of Continuous Random Variable:</b>
- Market share of a company (any value from an infinite set between 0 and 100).
- Percentage of attrition (reduction) of employees.
- Time to failure of an Engineering System.
- Time taken to complete the order.

## Rules of Probability

<b>Rule 1:</b>   
The probability of an impossible event is 0. The probability of a certain event is 1.  
Therefore, for any event A, the probability lies between 0 and 1, i.e. 0 ≤ P(A) ≤ 1.

<br>

<b>Rule 2:</b>  
For S, the sample space of all possibilities, P(S) = 1.  
i.e. the probabilities of all possible outcomes sum to 1.

<br>

<b>Rule 3:</b>  
For any event A, P(A') = 1 - P(A).  
Similarly, P(A) = 1 - P(A').
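
Rules 1 to 3 can be checked on a small example. The sketch below assumes one roll of a fair six-sided die, which is an illustrative choice and not part of the rules themselves; `prob` is a helper defined only for this sketch.

```python
from fractions import Fraction

# Sample space for one roll of a fair six-sided die (illustrative assumption).
S = {1, 2, 3, 4, 5, 6}

def prob(event):
    """Classical probability of an event, counting only outcomes that lie in S."""
    return Fraction(len(event & S), len(S))

A = {2, 4, 6}          # event: roll an even number
A_complement = S - A   # event: roll an odd number

print(prob(set()))                        # 0 -> impossible event (Rule 1)
print(prob(S))                            # 1 -> certain event, P(S) = 1 (Rules 1 and 2)
print(prob(A_complement) == 1 - prob(A))  # True (Rule 3)
```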

<br>

<b>Rule 4 (Addition Rule OR):</b>  
This is the probability that either one or both events occur.
- If two events, say A and B, are mutually exclusive,  
    i.e. A and B have no outcomes in common then,  
    P(A ∪ B) = P(A) + P(B)
    
- If two events are NOT mutually exclusive then,   
    P(A ∪ B) = P(A) + P(B) - P(A ∩ B)
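
Both forms of the addition rule can be verified with sets. This is a sketch under the assumption of one roll of a fair die; the events A, B and C are chosen here purely for illustration.

```python
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}  # one roll of a fair die (illustrative assumption)

def prob(event):
    """Classical probability of an event that is a subset of S."""
    return Fraction(len(event), len(S))

A = {2, 4, 6}  # roll an even number
B = {5, 6}     # roll a number greater than 4 (overlaps A at 6)
C = {1, 3}     # shares no outcome with A

# General rule: subtract the overlap so it is not counted twice.
assert prob(A | B) == prob(A) + prob(B) - prob(A & B)

# Mutually exclusive events: the overlap is empty, so the rule reduces to a sum.
assert A & C == set()
assert prob(A | C) == prob(A) + prob(C)

print(prob(A | B), prob(A | C))  # 2/3 5/6
```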

<br>

<b>Rule 5 (Multiplication Rule AND):</b>  
This is the probability that both events occur.
- P(A ∩ B) = P(A)*P(B|A) = P(B)*P(A|B)
- If A and B are independent, i.e. neither event affects the probability that the other event occurs, then  
    P(A ∩ B) = P(A)*P(B).  
    This particular rule extends to more than two independent events.  
    For example, P(A ∩ B ∩ C) = P(A)*P(B)*P(C)
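
The difference between the general rule and the independent case can be seen by reusing the ball counts from the earlier example (2 yellow balls out of 6). Drawing two balls in a row is a scenario added here for illustration; it is not part of the original example.

```python
from fractions import Fraction

yellow, total = 2, 6  # ball counts reused from the earlier example

# Dependent events: draw twice WITHOUT replacement.
# P(Y1 and Y2) = P(Y1) * P(Y2 | Y1)
p_first = Fraction(yellow, total)
p_second_given_first = Fraction(yellow - 1, total - 1)
p_both_without_replacement = p_first * p_second_given_first

# Independent events: draw twice WITH replacement.
# P(Y1 and Y2) = P(Y1) * P(Y2)
p_both_with_replacement = p_first * p_first

print(p_both_without_replacement, p_both_with_replacement)  # 1/15 1/9
```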
    
<br>

<b>Rule 6 (Conditional Probability):</b>  
- P(A|B) = $\frac{P(A\ \cap\ B)}{P(B)}$ or P(B|A) = $\frac{P(A\ \cap\ B)}{P(A)}$  
    Note: this straight line symbol, |, does not mean divide!  
    This symbol means "conditional" or "given".  
    For instance, P(A|B) means the probability that event A occurs given that event B has occurred.
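
A short sketch of the conditional probability formula, again assuming one roll of a fair die. The events and the helper `conditional` are illustrative; the point is that P(A|B) and P(B|A) are generally different.

```python
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}  # one roll of a fair die (illustrative assumption)

def prob(event):
    """Classical probability of an event that is a subset of S."""
    return Fraction(len(event), len(S))

def conditional(a, b):
    """P(a | b) = P(a and b) / P(b); undefined when P(b) is 0."""
    if prob(b) == 0:
        raise ValueError("cannot condition on an event with probability 0")
    return prob(a & b) / prob(b)

A = {2, 4, 6}  # roll an even number
B = {5, 6}     # roll a number greater than 4

print(conditional(A, B), conditional(B, A))  # 1/2 1/3
```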

## Marginal, Joint & Conditional Probability

<b>Case Study: Netflix:</b>  
Let us look at the frequency table below.  
The table shows the number of male and female subscribers who watch the listed TV shows.  
<img src='assets/Netflix - Case Study.png' width=400>  

<br>

Since the total sample population is 500, we divide every value in the table by 500.  
All values now lie between 0 and 1. The resulting table is called a Probability Distribution Table.  
<img src='assets/Netflix - Probability Distribution Table.png' width=400>  

<br>

Here, the values highlighted in green are called <u>Marginal Probabilities</u> and the values highlighted in blue are called <u>Joint Probabilities</u>.  
For example, the value 0.16 represents the chance of two events occurring together: a subscriber being Male and liking Money Heist. It is therefore a Joint Probability.  
The value 0.46 is a Marginal Probability because it is the probability of selecting a Male, regardless of show preference. It describes the probability of occurrence of a single event.

<img src='assets/Netflix - Joint & Marginal Probability.png' width=400>

<br>

<b>Joint Probability vs Marginal Probability vs Conditional Probability:</b>
- Joint Probability:
    - It is a statistical measure of the likelihood of two events occurring together at the same point in time.

- Marginal Probability:
    - It is the probability of an event irrespective of the outcome of another variable.

- Conditional Probability:
    - It is the probability of occurrence of an event given that another event has already occurred.

<br>

<u>Question 1:</u>  
What is the probability of a Netflix Subscriber being Male?
- P(Male) = 0.46


<u>Question 2:</u>  
What is the probability of a Netflix Subscriber preferring Money Heist?
- P(Money Heist) = 0.4

<u>Question 3:</u>  
What is the probability of a Netflix Subscriber being a Male and preferring Breaking Bad?
- P(Male ∩ Breaking Bad) = 0.2

<u>Question 4:</u>  
What is the probability of a Netflix Subscriber being a Female or preferring Breaking Bad?
- P(Female ∪ Breaking Bad) = P(Female) + P(Breaking Bad) - P(Female ∩ Breaking Bad)  
   = 0.54 + 0.25 - 0.05 = 0.74
   
<u>Question 5:</u>  
Samrat is a new Netflix Subscriber. What is the chance that he would like Breaking Bad?
- We know Samrat is a Male, and we need the probability of Samrat preferring Breaking Bad.  
    Here, we use Conditional Probability.  
    P(Breaking Bad | Male) = $\frac{P(Breaking\ Bad\ ∩\ Male)}{P(Male)}$  = $\frac{0.2}{0.46}$ = 0.43

<u>Question 6:</u>  
Hinata is a new Netflix Subscriber. What is the chance that she would like Money Heist?
- Using Conditional Probability,  
    P(Money Heist | Female) = $\frac{P(Money\ Heist\ ∩\ Female)}{P(Female)}$  = $\frac{0.24}{0.54}$ = 0.44
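
The six answers can be reproduced from a single joint table. The sketch below is a reconstruction under stated assumptions: the counts are back-computed as probability × 500 from the values quoted in the questions, and the `"Other"` column lumps together the remaining shows, inferred from the marginals 0.46 and 0.54. It is not a copy of the original table image. Question 4 is computed with the addition rule, which is what produces 0.74.

```python
from fractions import Fraction

TOTAL = 500  # total sample population stated in the text

# Counts back-computed as probability * 500 from the values quoted in the text.
# "Other" is an inferred bucket for the remaining shows (an assumption).
counts = {
    ("Male", "Money Heist"): 80,
    ("Male", "Breaking Bad"): 100,
    ("Male", "Other"): 50,
    ("Female", "Money Heist"): 120,
    ("Female", "Breaking Bad"): 25,
    ("Female", "Other"): 125,
}

# Joint probabilities: divide every cell by the total.
joint = {cell: Fraction(n, TOTAL) for cell, n in counts.items()}

def marginal_gender(gender):
    """Sum the joint probabilities across all shows for one gender."""
    return sum(p for (g, _), p in joint.items() if g == gender)

def marginal_show(show):
    """Sum the joint probabilities across both genders for one show."""
    return sum(p for (_, s), p in joint.items() if s == show)

def show_given_gender(show, gender):
    """Conditional probability P(show | gender) = joint / marginal."""
    return joint[(gender, show)] / marginal_gender(gender)

answers = {
    "Q1 P(Male)": marginal_gender("Male"),
    "Q2 P(Money Heist)": marginal_show("Money Heist"),
    "Q3 P(Male and Breaking Bad)": joint[("Male", "Breaking Bad")],
    "Q4 P(Female or Breaking Bad)": marginal_gender("Female")
    + marginal_show("Breaking Bad")
    - joint[("Female", "Breaking Bad")],
    "Q5 P(Breaking Bad | Male)": show_given_gender("Breaking Bad", "Male"),
    "Q6 P(Money Heist | Female)": show_given_gender("Money Heist", "Female"),
}

for label, p in answers.items():
    print(label, "=", round(float(p), 2))
```

Rounded to two decimals, the results are 0.46, 0.4, 0.2, 0.74, 0.43 and 0.44, in line with the answers worked out by hand.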