# Probability
Probability is a measure of how likely an event is to occur. It quantifies uncertainty, ranging from 0 (impossible event) to 1 (certain event).
![image.png](attachment:image.png)
![image-2.png](attachment:image-2.png)

**Mutually Exclusive Events**

Two events are mutually exclusive if they cannot occur at the same time. If one event happens, the other cannot.

If 𝐴 and 𝐵 are mutually exclusive events, then:
- **P(𝐴∩𝐵)=0**
- **𝑃(𝐴∪𝐵)=𝑃(𝐴)+𝑃(𝐵)**

(Since they cannot happen together, the probability of both happening is zero, so the individual probabilities simply add.)

*Example:*
1. Tossing a coin: **P(H or T)=P(H)+P(T)** {Additive rule for mutually exclusive events}
2. Rolling a die: **P(1 or 5)=P(1)+P(5)**
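
As a minimal sketch of the additive rule for the die example, the snippet below enumerates the outcomes of one fair six-sided die and checks that P(1 or 5) equals P(1)+P(5). The helper `prob` and the variable names are illustrative assumptions, not part of the original notes.

```python
from fractions import Fraction

# Sample space of one six-sided die, assumed fair (all outcomes equally likely).
sample_space = {1, 2, 3, 4, 5, 6}

def prob(event):
    """Probability of an event (a set of outcomes) under equally likely outcomes."""
    return Fraction(len(event & sample_space), len(sample_space))

event_one = {1}   # rolling a 1
event_five = {5}  # rolling a 5

p_both = prob(event_one & event_five)          # empty intersection, so 0
p_either = prob(event_one | event_five)        # P(1 or 5)
p_added = prob(event_one) + prob(event_five)   # P(1) + P(5)

print(p_both, p_either, p_added)  # 0 1/3 1/3
```

Using `Fraction` keeps the arithmetic exact, so the two sides of the rule can be compared with `==` rather than a floating-point tolerance.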

**Non-Mutually Exclusive Events**

Two events are non-mutually exclusive if they can occur at the same time. They may overlap, meaning both events can happen together.

If 𝐴 and 𝐵 are non-mutually exclusive events, then:
- **𝑃(𝐴∪𝐵)=𝑃(𝐴)+𝑃(𝐵)-P(𝐴∩𝐵)**

(the overlap P(𝐴∩𝐵) is subtracted to avoid double-counting)

*Example:*
1. Drawing a card from a deck: **P(King or Red)=P(King)+P(Red)-P(King and Red)** {Additive rule for non-mutually exclusive events}
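
The card example can be checked by brute-force counting. This is a sketch that assumes a standard 52-card deck with hearts and diamonds as the red suits; the names `deck`, `kings`, `reds`, and `prob_card` are hypothetical helpers introduced for illustration.

```python
from fractions import Fraction
from itertools import product

# Build a standard 52-card deck as (rank, suit) pairs.
ranks = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
suits = ["hearts", "diamonds", "clubs", "spades"]
deck = set(product(ranks, suits))

kings = {card for card in deck if card[0] == "K"}
reds = {card for card in deck if card[1] in ("hearts", "diamonds")}

def prob_card(event):
    """Probability of drawing a card in `event` from a well-shuffled deck."""
    return Fraction(len(event), len(deck))

# Direct count of the union versus the addition rule with the overlap subtracted.
p_king_or_red = prob_card(kings | reds)
p_by_rule = prob_card(kings) + prob_card(reds) - prob_card(kings & reds)

print(p_king_or_red, p_by_rule)  # 7/13 7/13
```

The overlap `kings & reds` holds the two red kings; without subtracting it, those two cards would be counted twice.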

**Multiplication Rule**

It helps calculate the probability of two events happening together (joint probability), depending on whether the events are independent or dependent.

**(i)  Independent Events**

Two events are independent if the occurrence of one event does not affect the occurrence of the other.

If 𝐴 and 𝐵 are independent events, then:
- **𝑃(𝐴∩𝐵)=𝑃(𝐴)×𝑃(𝐵)**

(The probability of both events 𝐴 and 𝐵 occurring is the product of their individual probabilities.)

*Example:*
1. Tossing a coin and rolling a die: **P(H and 4)=P(H)×P(4)**
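
A small sketch of the coin-and-die example, assuming both are fair: it enumerates all 12 joint outcomes and compares the counted joint probability with the product of the individual probabilities. The variable names are illustrative.

```python
from fractions import Fraction
from itertools import product

coin_faces = ["H", "T"]
die_faces = [1, 2, 3, 4, 5, 6]

# Every (coin, die) pair is equally likely when both are fair.
joint_outcomes = list(product(coin_faces, die_faces))

p_heads = Fraction(1, len(coin_faces))
p_four = Fraction(1, len(die_faces))

# Count the joint outcomes where the coin shows H and the die shows 4.
favourable = sum(1 for c, d in joint_outcomes if c == "H" and d == 4)
p_heads_and_four = Fraction(favourable, len(joint_outcomes))

print(p_heads_and_four, p_heads * p_four)  # 1/12 1/12
```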

**(ii)  Dependent Events** (Conditional Probability; used in the **Naive Bayes** algorithm in ML)

Two events are dependent if the occurrence of one event affects the occurrence of the other.

If 𝐴 and 𝐵 are dependent events, then:
- **𝑃(𝐴∩𝐵)=𝑃(𝐴)×𝑃(𝐵|𝐴)**

(The probability of both events 𝐴 and 𝐵 occurring is the probability of 𝐴 occurring, multiplied by the conditional probability of 𝐵 occurring given that 𝐴 has occurred.)

*Example:*
1. Drawing a king and then a queen from a deck without replacement: **P(King and Queen)=P(King)×P(Queen|King)**
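
To illustrate the rule for dependent events, the sketch below works through the king-then-queen example for a standard 52-card deck (an assumption) and cross-checks the formula by enumerating every ordered pair of distinct cards. The simplified deck representation and all names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

# Simplified deck: only the rank matters here (4 kings, 4 queens, 44 others).
card_ranks = ["K"] * 4 + ["Q"] * 4 + ["other"] * 44

# Multiplication rule for dependent events.
p_king = Fraction(4, 52)              # P(King) on the first draw
p_queen_given_king = Fraction(4, 51)  # P(Queen | King): 51 cards remain, 4 are queens
p_rule = p_king * p_queen_given_king

# Cross-check: enumerate all ordered draws of two distinct cards.
ordered_draws = list(permutations(range(len(card_ranks)), 2))
hits = sum(1 for first, second in ordered_draws
           if card_ranks[first] == "K" and card_ranks[second] == "Q")
p_counted = Fraction(hits, len(ordered_draws))

print(p_rule, p_counted)  # 4/663 4/663
```

With replacement, the second factor would be 4/52 instead of 4/51, and the two draws would be independent.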

### Probability Distribution Function
It describes how probabilities are distributed over the possible values of a random variable. It is a mathematical function that gives the probability of different outcomes in a probabilistic experiment.

NOTE: **when we smooth the histogram, we get the probability density** and **the probability density is defined as the gradient (derivative) of the Cumulative Distribution Function**

**Types of Probability Distribution Functions**
1. **Probability Mass Function (PMF):** For discrete random variables, the function is called a Probability Mass Function. It gives the probability of each possible value the random variable can take.

    - Formula: **P(𝑋=𝑥𝑖)=𝑝𝑖**    where, 𝑋 is the random variable, 𝑥𝑖 are the possible values of 𝑋, 𝑝𝑖 is the probability of 𝑋 taking the value 𝑥𝑖

    ![image.png](attachment:image.png)

    - Example: When rolling a fair 6-sided die, the possible values are {1,2,3,4,5,6}. Hence, the PMF is: P(𝑋=𝑥𝑖)=1/6 for each 𝑥𝑖 ∈ {1,2,3,4,5,6}, since each face has an equal probability.
    
2. **Probability Density Function (PDF):** For continuous random variables, the function is called a Probability Density Function (PDF). It describes the relative likelihood of the random variable taking a value near a given point; the probability of falling in an interval is the area under the curve over that interval.
    - Formula: ![image-2.png](attachment:image-2.png)   where, f(x) is the probability density at value 𝑥, 𝐹(𝑥) is the cumulative distribution function (CDF), the probability that 𝑋 is less than or equal to 𝑥

    ![image-3.png](attachment:image-3.png)

    - Properties: (1) non-negativity, f(x)≥0 for all 𝑥; (2) the total area under the curve of the PDF equals 1, as ![image-4.png](attachment:image-4.png)
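
The two function types can be compared with a short sketch. It assumes a fair die for the PMF and, purely as an example of a continuous variable, a standard normal distribution from Python's `statistics.NormalDist` for the PDF. The helper `riemann_area` is a hypothetical midpoint-rule integrator, not a library function.

```python
from fractions import Fraction
from statistics import NormalDist

# PMF of a fair die: each face maps to its probability, and the values sum to 1.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}
pmf_total = sum(die_pmf.values())

# PDF of a standard normal distribution (chosen only for illustration).
std_normal = NormalDist(mu=0.0, sigma=1.0)

def riemann_area(pdf, lo, hi, steps=20_000):
    """Approximate the area under `pdf` on [lo, hi] with the midpoint rule."""
    width = (hi - lo) / steps
    return sum(pdf(lo + (i + 0.5) * width) for i in range(steps)) * width

# The density is non-negative and its total area is (numerically) 1.
pdf_area = riemann_area(std_normal.pdf, -8.0, 8.0)
density_at_zero = std_normal.pdf(0.0)  # a density value, not a probability

print(pmf_total, round(pdf_area, 6), round(density_at_zero, 4))
```

The integration range of -8 to 8 is an assumption that covers essentially all of the standard normal's mass; a wider-spread distribution would need wider limits.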

**Cumulative Distribution Function (CDF)**

The Cumulative Distribution Function (CDF) of a random variable gives the cumulative probability that the variable takes a value less than or equal to a certain point. It essentially sums up the probabilities from the start of the distribution up to the point of interest.
- For discrete variables, the CDF sums the probabilities of all values less than or equal to 𝑥.
- For continuous variables, the CDF is the integral of the Probability Density Function (PDF). It accumulates the probabilities over a continuous range.
- Formula: **F(x)=P(X≤x)**   where F(x) is the cumulative probability up to 𝑥 and 𝑋 is the random variable. 𝑃(𝑋≤𝑥) is the probability that 𝑋 takes a value less than or equal to 𝑥.
- **Probability Density: Gradient (derivative) of the Cumulative Distribution Function**
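
Both views of the CDF can be sketched in a few lines: a running sum of the PMF for a fair die, and a finite-difference slope of the CDF compared against the PDF for a standard normal distribution. The choice of `statistics.NormalDist`, the evaluation point `x0 = 1.0`, and the step size `h` are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import accumulate
from statistics import NormalDist

# Discrete case: the CDF is the running sum of the PMF.
die_faces = range(1, 7)
pmf_values = [Fraction(1, 6)] * 6
die_cdf = dict(zip(die_faces, accumulate(pmf_values)))
print(die_cdf[3], die_cdf[6])  # 1/2 1

# Continuous case: the slope (gradient) of the CDF approximates the PDF.
normal = NormalDist()  # standard normal, mean 0 and standard deviation 1
x0, h = 1.0, 1e-5
cdf_slope = (normal.cdf(x0 + h) - normal.cdf(x0 - h)) / (2 * h)  # central difference
pdf_value = normal.pdf(x0)
print(abs(cdf_slope - pdf_value) < 1e-6)  # True
```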

NOTE: **the form of f(x) depends on the distribution**

**Different datasets usually follow different distributions**

**Types of Distributions**
- Bernoulli Distribution (PMF)
- Binomial Distribution (PMF)
- Normal/Gaussian Distribution (PDF)
- Poisson Distribution (PMF)
- Log Normal Distribution (PDF)
- Uniform Distribution (PMF when discrete, PDF when continuous)