### Q1: What are the Probability Mass Function (PMF) and Probability Density Function (PDF)? Explain with an example.

Ans:

### Probability Mass Function (PMF) and Probability Density Function (PDF)

Probability distributions are used to describe how probabilities are assigned to different possible values of a **random variable**.

There are two main types:

* **PMF** → Used for **Discrete Random Variables**
* **PDF** → Used for **Continuous Random Variables**


## 1. Probability Mass Function (PMF)

A **Probability Mass Function (PMF)** gives the probability that a **discrete random variable** takes a specific value.

### Definition:

If $X$ is a discrete random variable, then

$$P(X = x)$$

is given by the PMF.

### Properties:

1. $0 \le P(X = x) \le 1$ for every value $x$

2. $\sum_x P(X = x) = 1$

#### Example: Rolling a Fair Die

Let $X$ be the number that appears when a fair die is rolled.

Possible values: 1, 2, 3, 4, 5, 6

Each outcome has probability

$$P(X = x) = \frac{1}{6}$$

for $x = 1, 2, 3, 4, 5, 6$.
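The die example can be checked with a few lines of Python. This is a minimal sketch: the names `faces`, `pmf`, `total`, and `p_even` are illustrative choices, and exact fractions are used so that the two PMF properties can be verified without rounding.

```python
from fractions import Fraction

# PMF of a fair six-sided die: each face has probability 1/6.
faces = range(1, 7)
pmf = {x: Fraction(1, 6) for x in faces}

# Property 1: every probability lies in [0, 1].
assert all(0 <= p <= 1 for p in pmf.values())

# Property 2: the probabilities sum to exactly 1.
total = sum(pmf.values())

# Probability of an event: add the PMF over its outcomes, here {2, 4, 6}.
p_even = sum(pmf[x] for x in (2, 4, 6))

print(total)   # 1
print(p_even)  # 1/2
```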


## 2. Probability Density Function (PDF)

A **Probability Density Function (PDF)** describes the distribution of a **continuous random variable**. Its value $f(x)$ is a density (probability per unit of $x$), not a probability.

⚠ Important:
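For a continuous variable, the probability of any single exact value is zero:

$$P(X = x) = 0$$

Instead, we find the probability over an interval,

$$P(a \le X \le b) = \int_a^b f(x)\,dx$$

which is the **area under the curve** between $a$ and $b$.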

### Properties:

1. $f(x) \ge 0$ for all $x$
2. $\int_{-\infty}^{\infty} f(x)\,dx = 1$

#### Example: Normal Distribution

Suppose $X$ represents the height of a student in a class.

Heights are continuous: they can take any value, such as 165.2 cm or 170.45 cm.

Such data is often modeled using a **normal distribution**.
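To make the "area under the curve" idea concrete, here is a small sketch using `scipy`. The mean of 168 cm and standard deviation of 8 cm are hypothetical values chosen only for illustration; they do not come from any real class. The interval probability is computed twice, once by numerically integrating the PDF and once from the CDF, and the two should agree.

```python
from scipy.stats import norm
from scipy.integrate import quad

# Hypothetical parameters, chosen only for illustration.
mu, sigma = 168.0, 8.0
heights = norm(loc=mu, scale=sigma)

# The PDF value at a point is a density, not a probability.
density_at_170 = heights.pdf(170.0)

# P(165 <= X <= 175) as the area under the PDF between 165 and 175.
area, _ = quad(heights.pdf, 165.0, 175.0)

# The same probability obtained from the CDF, as a cross-check.
via_cdf = heights.cdf(175.0) - heights.cdf(165.0)

print(density_at_170, area, via_cdf)
```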



### Q2: What is the Cumulative Distribution Function (CDF)? Explain with an example. Why is the CDF used?

Ans:

## Cumulative Distribution Function (CDF)

The **Cumulative Distribution Function (CDF)** gives the probability that a random variable $X$ takes a value **less than or equal to** a given value $x$.

#### Definition

For a random variable $X$:

$$F(x) = P(X \le x)$$

It works for **both discrete and continuous** random variables.

### 1. CDF for a Discrete Random Variable

#### Example: Rolling a Fair Die

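Let $X$ be the outcome of a fair die.

Each value has probability

$$P(X = x) = \frac{1}{6}$$

The CDF accumulates these probabilities:

* $F(1) = P(X \le 1) = 1/6$
* $F(2) = P(X \le 2) = 2/6$
* $F(3) = P(X \le 3) = 3/6$
* $F(4) = P(X \le 4) = 4/6$
* $F(5) = P(X \le 5) = 5/6$
* $F(6) = P(X \le 6) = 6/6 = 1$

So the CDF is a step function that jumps by 1/6 at each face value.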
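The step-by-step accumulation can be reproduced as a running sum of the PMF. This is a minimal sketch; the variable names are illustrative, and `Fraction` prints values in lowest terms (for example 2/6 appears as 1/3).

```python
from fractions import Fraction
from itertools import accumulate

faces = [1, 2, 3, 4, 5, 6]
pmf = [Fraction(1, 6)] * 6

# A running total of the PMF gives the CDF at each face value.
cdf = dict(zip(faces, accumulate(pmf)))

for x, F in cdf.items():
    print(f"F({x}) = {F}")
```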


### 2. CDF for a Continuous Random Variable

For a continuous variable:

$$F(x) = \int_{-\infty}^{x} f(t)\,dt$$

where $f(t)$ is the PDF.


#### Example: Normal Distribution

Suppose students' heights follow a normal distribution.

If $F(170) = 0.70$, it means that 70% of students have a height **≤ 170 cm**.


### Why is CDF Used?

CDF is very important because:

###### 1. It gives cumulative probability directly

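Instead of summing (or integrating) probabilities manually, we subtract two CDF values:

$$P(a < X \le b) = F(b) - F(a)$$

For a continuous variable this also equals $P(a \le X \le b)$, because single points have probability zero. For a discrete variable, add $P(X = a)$ if the left endpoint should be included.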
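A short sketch of interval probabilities from the CDF, for one continuous and one discrete variable. The normal parameters (mean 50, standard deviation 10) are hypothetical. Note that for the discrete die, including the left endpoint 2 means subtracting $F(1)$, not $F(2)$.

```python
from scipy.stats import norm, randint

# Continuous case: hypothetical normal variable with mean 50 and SD 10.
X = norm(loc=50, scale=10)
p_cont = X.cdf(60) - X.cdf(40)   # P(40 <= X <= 60)

# Discrete case: fair die with values 1..6 (randint's upper bound is exclusive).
D = randint(1, 7)
p_disc = D.cdf(4) - D.cdf(1)     # P(2 <= D <= 4) = F(4) - F(1)

print(round(p_cont, 4))  # 0.6827
print(round(p_disc, 4))  # 0.5
```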

###### 2. It works for both discrete & continuous variables

PMF → only discrete

PDF → only continuous

CDF → works for **both**

###### 3. Used in Statistics & Data Science

* Finding percentiles (e.g., 90th percentile)
* Hypothesis testing
* p-value calculation
* Risk analysis
* Machine Learning probability models
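As one example from this list, a percentile is found by inverting the CDF. The sketch below uses `scipy.stats.norm.ppf` (the inverse CDF) on the standard normal distribution; the choice of the standard normal is an assumption made for illustration.

```python
from scipy.stats import norm

# Standard normal distribution: mean 0, standard deviation 1.
p90 = norm.ppf(0.90)   # value below which 90% of the probability lies
back = norm.cdf(p90)   # applying the CDF recovers 0.90

print(round(p90, 4))   # 1.2816
print(round(back, 2))  # 0.9
```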

### Simple Intuition

* **PMF/PDF** → Probability at a value (PMF) or density at a value (PDF)
* **CDF** → Probability up to and including that value

Think of the CDF as a **running total of probability**.


### Q3: What are some examples of situations where the normal distribution might be used as a model? Explain how the parameters of the normal distribution relate to the shape of the distribution.

Ans:

### Normal Distribution as a Model

The **Normal Distribution** (also called the Gaussian distribution) is one of the most widely used probability models in statistics and data science because many real-world phenomena approximately follow a bell-shaped pattern.

### Common Real-Life Examples

#### 1. Human Heights & Weights

* Heights of adults in a population
* Most people cluster around the average
* Very short and very tall people are rare

#### 2. Exam/Test Scores

* In large classes, marks often form a bell-shaped curve
* Most students score near the mean
* Very high and very low scores are less frequent

#### 3. Measurement Errors in Experiments

* Small errors occur more frequently
* Large errors are rare
* Common in physics, engineering, and data science

#### 4. IQ Scores

* IQ scores are scaled to follow a normal distribution
* Mean = 100
* Standard deviation = 15
* About 68% of people score between 85 and 115 (within one standard deviation of the mean)

#### 5. Financial Returns (Approximately)

* Daily stock returns are often modeled as normal (though real markets can deviate)
* Used in risk analysis and portfolio theory


### Parameters of the Normal Distribution

A normal distribution is defined by two parameters:

$$X \sim N(\mu, \sigma^2)$$

Where:

* $\mu$ = Mean
* $\sigma$ = Standard Deviation
* $\sigma^2$ = Variance

#### 1. Mean (μ) – Controls the Center

* Determines the **location** of the peak
* Shifts the curve left or right
* Does NOT change the shape

If $\mu$ increases → curve moves right

If $\mu$ decreases → curve moves left

#### 2. Standard Deviation (σ) – Controls Spread

* Determines the **width** of the curve
* Larger $\sigma$ → wider & flatter curve
* Smaller $\sigma$ → narrower & taller curve
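The effect of each parameter can be seen numerically by evaluating the PDF at its peak. The sketch below relies on the standard fact that the normal PDF at its mean equals $1/(\sigma\sqrt{2\pi})$; the helper `peak_height` and the parameter values are illustrative assumptions.

```python
import math
from scipy.stats import norm

def peak_height(sigma):
    """Height of the normal PDF at its mean: 1 / (sigma * sqrt(2*pi))."""
    return 1.0 / (sigma * math.sqrt(2.0 * math.pi))

# Changing mu moves the peak but keeps its height (sigma fixed at 1).
for mu in (0.0, 5.0):
    print(mu, norm.pdf(mu, loc=mu, scale=1.0))

# Changing sigma changes the peak height (mu fixed at 0):
# a larger sigma gives a lower, wider curve.
for sigma in (0.5, 1.0, 2.0):
    print(sigma, norm.pdf(0.0, loc=0.0, scale=sigma), peak_height(sigma))
```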


### Q4: Explain the importance of Normal Distribution. Give a few real-life examples of Normal Distribution.

Ans:

### Importance of Normal Distribution

The **Normal Distribution** (bell-shaped curve) is one of the most important concepts in statistics because many real-world variables approximately follow this pattern, and many statistical methods are based on it.

### Why is Normal Distribution Important?

#### 1. Foundation of Statistical Methods

* Hypothesis testing
* Confidence intervals
* Regression analysis
* ANOVA

Many classical statistical techniques assume that the data, or the model errors, are normally distributed.


#### 2. Central Limit Theorem (CLT)

Even if the data is not normally distributed, the **sampling distribution of the mean** becomes approximately normal when the sample size is large, provided the population variance is finite.

 This makes the normal distribution extremely powerful in inferential statistics.

#### 3. Easy Probability Calculation

Using Z-scores, we can easily compute probabilities.

$$Z = \frac{X - \mu}{\sigma}$$

This standardization allows comparison between different datasets.

#### 4. Symmetry & Predictability

* Mean = Median = Mode
* 68–95–99.7 Rule (approximate):

  * 68% of values lie within 1 standard deviation of the mean
  * 95% within 2 standard deviations
  * 99.7% within 3 standard deviations

This makes interpretation simple and practical.
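The 68–95–99.7 percentages are rounded values of the standard normal CDF. A minimal sketch that recomputes them (the dictionary name `within` is an illustrative choice):

```python
from scipy.stats import norm

# P(-k <= Z <= k) for a standard normal Z, for k = 1, 2, 3 standard deviations.
within = {k: norm.cdf(k) - norm.cdf(-k) for k in (1, 2, 3)}

for k, p in within.items():
    print(k, round(p, 4))
# 1 0.6827
# 2 0.9545
# 3 0.9973
```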

#### 5. Widely Used in Data Science & ML

* Gaussian Naive Bayes
* Error modeling
* Feature normalization
* Outlier detection
* Risk modeling

### Real-Life Examples of Normal Distribution

#### 1. Human Height

* Most people have average height
* Very short and very tall individuals are rare

#### 2. Exam Scores


* Most students score near average
* Very high and very low scores are less frequent

#### 3. Measurement Errors

* Small errors occur frequently
* Large errors occur rarely

#### 4. IQ Scores

* Mean = 100
* Standard deviation = 15
* About 68% of people score between 85 and 115

#### 5. Blood Pressure / Biological Measurements

* Many medical measurements follow an approximately normal pattern
* Useful in medical research and diagnosis

### Summary

The normal distribution is important because:

✔ Many natural phenomena follow it

✔ It simplifies probability calculations

✔ It forms the base of inferential statistics

✔ It supports hypothesis testing and ML algorithms



### Q5: What is the Bernoulli Distribution? Give an example. What is the difference between the Bernoulli Distribution and the Binomial Distribution?

Ans:

### Bernoulli Distribution

The **Bernoulli Distribution** is the simplest discrete probability distribution.
It models a random experiment that has **only two possible outcomes**:

* Success (1)
* Failure (0)

---

#### Definition

A random variable $X$ follows a Bernoulli distribution if:

$$P(X = 1) = p$$

$$P(X = 0) = 1 - p$$

Where:

* $p$ = probability of success, with $0 \le p \le 1$
* $1 - p$ = probability of failure

It is written as:

$$X \sim \text{Bernoulli}(p)$$

#### Example: Tossing a Coin

Let:

* 1 = Head (Success)
* 0 = Tail (Failure)

If the coin is fair:

$$p = 0.5$$

So:

* $P(X = 1) = 0.5$
* $P(X = 0) = 0.5$
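The fair-coin example can be reproduced with `scipy.stats.bernoulli`. This is a small sketch; the name `coin` is illustrative. It also prints the mean and variance, which for a Bernoulli variable are $p$ and $p(1 - p)$.

```python
from scipy.stats import bernoulli

p = 0.5                 # probability of success (Head) for a fair coin
coin = bernoulli(p)

print(coin.pmf(1), coin.pmf(0))  # 0.5 0.5
print(coin.mean(), coin.var())   # 0.5 0.25
```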

---

### Binomial Distribution

The **Binomial Distribution** models the number of successes in **n independent Bernoulli trials**.

It is written as:

$$X \sim \text{Binomial}(n, p)$$

Where:

* $n$ = number of trials
* $p$ = probability of success in each trial

#### Example

Suppose we toss a coin **5 times** and count the number of heads.

Here:

* $n = 5$
* $p = 0.5$

Possible values of $X$: 0, 1, 2, 3, 4, 5
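The probability of each count of heads follows the standard binomial formula $P(X = k) = \binom{n}{k} p^k (1 - p)^{n - k}$, which is not stated above but is the usual definition. The sketch below evaluates it by hand and compares it with `scipy.stats.binom`; the list name `probs` is illustrative.

```python
from math import comb
from scipy.stats import binom

n, p = 5, 0.5

# Binomial PMF by the formula C(n, k) * p^k * (1 - p)^(n - k).
probs = [comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)]

for k, by_formula in enumerate(probs):
    # The hand-computed value and scipy's value should match.
    print(k, by_formula, binom.pmf(k, n, p))
```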

---
### Difference Between Bernoulli and Binomial Distribution

| Feature          | Bernoulli Distribution | Binomial Distribution                  |
| ---------------- | ---------------------- | -------------------------------------- |
| Number of Trials | 1                      | n (multiple)                           |
| Possible Values  | 0 or 1                 | 0 to n                                 |
| Parameters       | p                      | n and p                                |
| Example          | One coin toss          | 5 coin tosses                          |
| Special Case     | Binomial with n = 1    | Reduces to Bernoulli when n = 1        |

---

### Key Relationship

**The Bernoulli Distribution is a special case of the Binomial Distribution when $n = 1$:**

$$\text{Binomial}(1, p) = \text{Bernoulli}(p)$$

### Simple Intuition

* Bernoulli → One yes/no experiment
* Binomial → Counting the number of "yes" results across multiple independent experiments



### Q6. Consider a dataset with a mean of 50 and a standard deviation of 10. If we assume that the dataset is normally distributed, what is the probability that a randomly selected observation will be greater than 60? Use the appropriate formula and show your calculations.

Ans:

```python
from scipy.stats import norm

# Given values
mean = 50
std_dev = 10
x = 60

# P(X > 60) = 1 - P(X <= 60) = 1 - F(60)
probability = 1 - norm.cdf(x, loc=mean, scale=std_dev)

print("Probability that X > 60:", probability)
```

```
Probability that X > 60: 0.15865525393145707
```
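The same result can be reached by hand with the standardization formula $Z = (X - \mu)/\sigma$. The derivation below assumes the standard normal table value $\Phi(1) \approx 0.8413$, where $\Phi$ denotes the standard normal CDF (a symbol introduced here for convenience):

$$Z = \frac{60 - 50}{10} = 1$$

$$P(X > 60) = P(Z > 1) = 1 - \Phi(1) \approx 1 - 0.8413 = 0.1587$$

So roughly 15.9% of observations are expected to exceed 60.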


### Q7: Explain uniform Distribution with an example.
Ans:

### Uniform Distribution

A Uniform Distribution is a probability distribution in which all outcomes are equally likely.

It means the probability is distributed evenly across all possible values.

#### There are two types:

1) Discrete Uniform Distribution

Used when outcomes are countable and equally likely.

-> Example: Rolling a Fair Die

Let $X$ be the number obtained when rolling a fair die.

Possible values: 1, 2, 3, 4, 5, 6

Each value has equal probability:

$$P(X = x) = \frac{1}{6}$$

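2) Continuous Uniform Distribution

Used when the variable can take any value in an interval $[a, b]$ and every value in that interval is equally likely. The density is constant: $f(x) = \frac{1}{b - a}$ for $a \le x \le b$.

-> Example: Waiting for a Bus

Suppose a bus arrives at a random time between 10:00 AM and 10:30 AM, and let $X$ be the arrival time in minutes after 10:00 AM.

Let:
* $a = 0$ minutes
* $b = 30$ minutes

Then:

$$f(x) = \frac{1}{b - a} = \frac{1}{30} \quad \text{for } 0 \le x \le 30$$

Probability that the bus arrives within the first 10 minutes:

$$P(0 \le X \le 10) = \frac{10}{30} = \frac{1}{3}$$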
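The bus example can be checked with `scipy.stats.uniform`, which is parameterized by `loc` (the left endpoint) and `scale` (the interval width), so the interval is `[loc, loc + scale]`. The variable names below are illustrative.

```python
from scipy.stats import uniform

# Arrival time in minutes after 10:00 AM, uniform on [0, 30].
a, b = 0, 30
arrival = uniform(loc=a, scale=b - a)

density = arrival.pdf(5)                       # constant 1/30 inside [0, 30]
p_first_10 = arrival.cdf(10) - arrival.cdf(0)  # P(0 <= X <= 10)

print(density, p_first_10)
```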



### Q8: What is the z score? State the importance of the z score.
Ans:

### What is a Z-Score?

A **Z-score** (also called a **standard score**) tells us **how many standard deviations a data point is away from the mean** of a distribution.

#### -> Formula:

$$Z = \frac{X - \mu}{\sigma}$$

Where:

* $X$ = observed value
* $\mu$ = mean
* $\sigma$ = standard deviation

---

### Interpretation

* **Z = 0** → value is exactly at the mean
* **Z > 0** → value is above the mean
* **Z < 0** → value is below the mean
* Larger |Z| → farther from the mean

---

### Importance of Z-Score

#### 1. Standardization

It converts different datasets into a **common scale**, allowing comparison.

Example:
Comparing marks from two different exams with different means and standard deviations.
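A minimal sketch of such a comparison. The marks, means, and standard deviations below are hypothetical numbers invented for illustration, and `z_score` is an illustrative helper.

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / sd

# Hypothetical exam results for one student.
z_math = z_score(80, mean=70, sd=10)    # 1.0
z_physics = z_score(75, mean=60, sd=5)  # 3.0

# The raw mark is higher in math, but the z-score shows the physics
# result is further above its class average.
print(z_math, z_physics)
```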

#### 2. Probability Calculation

Using Z-scores and the **standard normal table**, we can calculate probabilities:

P(X > a), P(a < X < b)

#### 3. Outlier Detection

* If |Z| > 2 → unusual (about 5% of values in a normal distribution)
* If |Z| > 3 → possible outlier (about 0.3% of values in a normal distribution)

Used heavily in:

* Data cleaning
* Anomaly detection
* Fraud detection
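A simple z-score outlier filter might look like the following sketch. The data values are made up for illustration, and the threshold of 3 is the rule of thumb mentioned above, not a universal cutoff.

```python
import numpy as np

# Hypothetical measurements: 19 values near 50 and one suspicious value.
data = np.array([48, 50, 52, 49, 51, 50, 47, 53, 50, 49,
                 51, 50, 48, 52, 50, 49, 51, 50, 50, 90], dtype=float)

# Z-scores using the mean and (population) standard deviation of the data.
z = (data - data.mean()) / data.std()

# Flag points more than 3 standard deviations from the mean.
outliers = data[np.abs(z) > 3]

print(outliers)
```

In small samples a single extreme value inflates the standard deviation, so this rule can miss outliers; it works best with a reasonably large dataset.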

#### 4. Basis of Hypothesis Testing

Z-scores are used in:

* Z-tests
* Confidence intervals
* Statistical inference

#### 5. Machine Learning & Data Science

* Feature scaling (Standardization)
* Gaussian Naive Bayes
* Statistical modeling


### Q9: What is Central Limit Theorem? State the significance of the Central Limit Theorem.
Ans:

### Central Limit Theorem (CLT)

The CLT says:

> If we take sufficiently large random samples from any population with finite mean $\mu$ and finite standard deviation $\sigma$, the **sampling distribution of the sample mean** will be approximately **normally distributed**, regardless of the shape of the original population distribution.

The approximation improves as the sample size $n$ grows; $n \ge 30$ is a common rule of thumb, though heavily skewed populations may need larger samples.

---

### Mathematical Form

If:

* Population mean = $\mu$
* Population standard deviation = $\sigma$
* Sample size = $n$

Then, for large $n$, approximately:

$$\bar{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$$

Where:

* Mean of sample means = $\mu$
* Variance of sample means = $\sigma^2 / n$
* Standard deviation of sample means = $\sigma / \sqrt{n}$ (called the **Standard Error**)
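The theorem can be illustrated by simulation. The sketch below draws repeated samples from a skewed exponential population (an assumed example population with mean 1 and standard deviation 1) and checks that the sample means centre on $\mu$ with spread close to $\sigma / \sqrt{n}$. The seed, sample size, and number of repetitions are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Skewed population: exponential with mean 1 and SD 1 (hypothetical choice).
mu, sigma = 1.0, 1.0
n = 50                 # size of each sample
num_samples = 20_000   # number of repeated samples

samples = rng.exponential(scale=1.0, size=(num_samples, n))
sample_means = samples.mean(axis=1)

# Fraction of sample means within one standard error of mu;
# for a normal distribution this would be about 0.68.
se = sigma / np.sqrt(n)
within_1se = np.mean(np.abs(sample_means - mu) < se)

print(sample_means.mean())       # close to mu = 1
print(sample_means.std(ddof=1))  # close to sigma / sqrt(n), about 0.141
print(within_1se)                # roughly 0.68
```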

---

### Significance of Central Limit Theorem

#### 1. Makes Normal Distribution Universal

Even if the population is:

* Skewed
* Uniform
* Bimodal or otherwise irregular

the **sample mean** is approximately normally distributed for large $n$.

#### 2. Foundation of Inferential Statistics

CLT allows us to:

* Construct confidence intervals
* Perform hypothesis testing
* Calculate probabilities of sample means

Without the CLT, many large-sample tests and confidence intervals would lack their theoretical justification.

#### 3. Introduces Standard Error

$$SE = \frac{\sigma}{\sqrt{n}}$$

As sample size increases:

* Standard error decreases
* Estimates become more accurate
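A quick sketch of how the standard error shrinks with sample size. The population standard deviation of 10 is a hypothetical value, and `standard_error` is an illustrative helper. Because of the square root, quadrupling $n$ only halves the standard error.

```python
import math

def standard_error(sigma, n):
    """Standard error of the sample mean: sigma / sqrt(n)."""
    return sigma / math.sqrt(n)

sigma = 10.0  # hypothetical population standard deviation
for n in (25, 100, 400):
    print(n, standard_error(sigma, n))
# 25 2.0
# 100 1.0
# 400 0.5
```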

#### 4. Used in Data Science & ML

* Estimating model performance
* Bootstrapping methods
* A/B testing
* Sampling techniques




### Q10: State the assumptions of the Central Limit Theorem.

Ans:

### Assumptions of the Central Limit Theorem (CLT)

The Central Limit Theorem (CLT) states that the sampling distribution of the sample mean becomes approximately normal as the sample size increases.
For this theorem to hold properly, certain assumptions must be satisfied.

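1. **Random Sampling:** observations are drawn at random from the population.
2. **Independence of Observations:** one observation does not influence another.
3. **Sufficiently Large Sample Size:** commonly $n \ge 30$ as a rule of thumb.
4. **Finite Mean and Variance:** the population mean and variance must exist.
5. **Identically Distributed Data:** all observations come from the same distribution.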