### Q1: What are the Probability Mass Function (PMF) and Probability Density Function (PDF)? Explain with an example.

#### Probability Mass Function (PMF) and Probability Density Function (PDF)

Both PMF and PDF are functions used to describe probability distributions, but they apply to different types of random variables:

#### Probability Mass Function (PMF)

- **Definition**: The PMF is used for discrete random variables. It gives the probability that a discrete random variable is exactly equal to a specific value.
- **Formula**: For a discrete random variable \( X \), the PMF is \( P(X = x) \), where \( x \) is a possible value of \( X \).
- **Properties**:
  - The sum of PMF over all possible values of the random variable is 1.
  - \( 0 \leq P(X = x) \leq 1 \) for all possible values of \( x \).

- **Example**: For a fair six-sided die, the PMF can be represented as:
  \[
  P(X = x) = \frac{1}{6} \text{ for } x = 1, 2, 3, 4, 5, 6
  \]
  Here, \( X \) is the outcome of the die roll, and each outcome (1 through 6) has a probability of \( \frac{1}{6} \).
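
The die example can be checked with a few lines of Python. This is a minimal sketch: the name `die_pmf` is illustrative, and exact fractions are used so the probabilities sum to exactly 1.

```python
from fractions import Fraction

# PMF of a fair six-sided die, stored as a mapping from outcome to probability.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

# A valid PMF has every probability in [0, 1] and a total of exactly 1.
assert all(0 <= p <= 1 for p in die_pmf.values())
total = sum(die_pmf.values())

print(die_pmf[3])  # probability of rolling a 3 -> 1/6
print(total)       # total probability -> 1
```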

#### Probability Density Function (PDF)

- **Definition**: The PDF is used for continuous random variables. It describes the likelihood of a random variable falling within a particular range of values. The probability of the variable taking on a specific value is zero, but the PDF allows for calculation of probabilities over intervals.
- **Formula**: For a continuous random variable \( X \), the PDF is \( f(x) \), where \( x \) is any value of \( X \). The probability of \( X \) falling within an interval \( [a, b] \) is given by:
  \[
  P(a \leq X \leq b) = \int_a^b f(x) \, dx
  \]
- **Properties**:
  - The area under the PDF curve over the entire range of possible values is 1.
  - \( f(x) \geq 0 \) for all \( x \).
  - The PDF value itself is not a probability but a density, so \( f(x) \) can be greater than 1.

- **Example**: For a normal distribution with mean \( \mu \) and standard deviation \( \sigma \), the PDF is given by:
  \[
  f(x) = \frac{1}{\sigma \sqrt{2 \pi}} e^{-\frac{(x - \mu)^2}{2 \sigma^2}}
  \]
  Here, \( X \) is a continuous random variable (e.g., height of a person), and \( f(x) \) gives the density at any point \( x \). To find the probability that \( X \) is within a certain range, you integrate the PDF over that range.
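
As a rough illustration, the sketch below evaluates the normal PDF formula directly and integrates it numerically over an interval. The height parameters (mean 170 cm, standard deviation 10 cm) and the helper names `normal_pdf` and `integrate` are assumptions made for this example only. The result is cross-checked against `statistics.NormalDist` from the Python standard library.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    coeff = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

def integrate(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

# Hypothetical height model: mean 170 cm, standard deviation 10 cm.
mu, sigma = 170.0, 10.0

# P(160 <= X <= 180) is the area under the PDF between 160 and 180.
prob_numeric = integrate(lambda x: normal_pdf(x, mu, sigma), 160.0, 180.0)

# Cross-check against the standard library's normal CDF.
heights = NormalDist(mu, sigma)
prob_exact = heights.cdf(180.0) - heights.cdf(160.0)

# A density can exceed 1 when sigma is small: the peak is 1 / (sigma * sqrt(2 * pi)).
peak_narrow = normal_pdf(0.0, 0.0, 0.1)

print(round(prob_numeric, 4), round(prob_exact, 4))
print(peak_narrow > 1)
```

The last line shows why a density is not a probability: with \( \sigma = 0.1 \) the peak value is well above 1, yet the total area under the curve is still 1.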

### Summary

- **PMF**: Used for discrete variables, provides the probability of specific outcomes.
  - **Example**: Probability of rolling a 3 on a fair six-sided die.

- **PDF**: Used for continuous variables, gives the density at each point; probabilities are obtained by integrating the density over an interval.
  - **Example**: The density function of heights in a population where heights follow a normal distribution.

### Q2: What is the Cumulative Distribution Function (CDF)? Explain with an example. Why is the CDF used?

### Cumulative Distribution Function (CDF)

The Cumulative Distribution Function (CDF) is a fundamental concept in probability theory and statistics. It provides the probability that a random variable takes on a value less than or equal to a given point. It can be used for both discrete and continuous random variables.

#### Definition

- **For a random variable \( X \)**, the CDF is denoted by \( F(x) \) and is defined as:
  \[
  F(x) = P(X \leq x)
  \]
  where \( x \) is any real number.

- **For Discrete Variables**: The CDF is the sum of the probabilities of all outcomes less than or equal to \( x \).
  \[
  F(x) = \sum_{k \leq x} P(X = k)
  \]

- **For Continuous Variables**: The CDF is the integral of the Probability Density Function (PDF) up to \( x \).
  \[
  F(x) = \int_{-\infty}^x f(t) \, dt
  \]
  where \( f(t) \) is the PDF of the random variable.

#### Properties

1. **Non-Decreasing**: The CDF is a non-decreasing function; it either increases or stays constant as \( x \) increases.
2. **Range**: The CDF ranges from 0 to 1.
   - As \( x \to -\infty \), \( F(x) \to 0 \).
   - As \( x \to \infty \), \( F(x) \to 1 \).
3. **Right-Continuous**: The CDF is right-continuous with left limits.

#### Example

**Discrete Example**: Rolling a fair six-sided die.
- Let \( X \) be the outcome of the die roll.
- The PMF of \( X \) is \( P(X = x) = \frac{1}{6} \) for \( x = 1, 2, 3, 4, 5, 6 \).

The CDF \( F(x) \) is calculated as follows:
- For \( x < 1 \): \( F(x) = 0 \)
- For \( 1 \leq x < 2 \): \( F(x) = P(X \leq 1) = \frac{1}{6} \)
- For \( 2 \leq x < 3 \): \( F(x) = P(X \leq 2) = \frac{1}{6} + \frac{1}{6} = \frac{2}{6} \)
- Continue this way up to \( x \geq 6 \), where \( F(x) = 1 \).
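
The step-function shape of this CDF can be sketched in Python. The function name `die_cdf` is illustrative, and exact fractions are used so the values match the hand calculation above.

```python
import math
from fractions import Fraction

def die_cdf(x):
    """CDF of a fair six-sided die: F(x) = P(X <= x)."""
    # Count the faces 1..6 that are <= x, clamped to the range [0, 6].
    faces_at_or_below = min(max(math.floor(x), 0), 6)
    return Fraction(faces_at_or_below, 6)

for x in (0.5, 1, 2.5, 6):
    print(x, die_cdf(x))
```

The CDF stays flat between integers and jumps by \( \frac{1}{6} \) at each face value, which is the typical shape for a discrete random variable.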

**Continuous Example**: Normal Distribution.
- For a normal random variable \( X \) with mean \( \mu \) and standard deviation \( \sigma \), the CDF is:
  \[
  F(x) = \Phi\left(\frac{x - \mu}{\sigma}\right)
  \]
  where \( \Phi \) is the CDF of the standard normal distribution.
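
A short sketch of this relationship, assuming illustrative values \( \mu = 50 \), \( \sigma = 10 \), and \( x = 65 \). Here \( \Phi \) is written with the error function, \( \Phi(z) = \frac{1}{2}\left[1 + \operatorname{erf}\left(z / \sqrt{2}\right)\right] \), and compared with `statistics.NormalDist.cdf`.

```python
import math
from statistics import NormalDist

def phi(z):
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

mu, sigma = 50.0, 10.0   # illustrative parameters, not from a real dataset
x = 65.0

via_standardizing = phi((x - mu) / sigma)    # F(x) = Phi((x - mu) / sigma)
via_library = NormalDist(mu, sigma).cdf(x)   # same quantity from the library

print(round(via_standardizing, 4), round(via_library, 4))
```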

#### Why CDF is Used

1. **Probability Calculations**: The CDF helps in calculating the probability that a random variable falls within a specific range. For any random variable, the probability that \( X \) lies in \( (a, b] \) is:
   \[
   P(a < X \leq b) = F(b) - F(a)
   \]
   For a continuous random variable, \( P(X = a) = 0 \), so the same expression also gives \( P(a \leq X \leq b) \).

2. **Understanding Distribution**: The CDF provides a complete description of the probability distribution of a random variable. It shows how probabilities accumulate up to different values.

3. **Statistical Analysis**: CDFs are useful in various statistical methods, including hypothesis testing, statistical inference, and when working with empirical data.

4. **Quantile Calculation**: The CDF is used to find quantiles, which are critical in understanding the distribution of data and making predictions. The median, quartiles, and other percentiles are found by inverting the CDF, that is, by finding the value \( x \) at which \( F(x) \) equals the desired probability.
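
Two of these uses, range probabilities and quantiles, can be sketched with `statistics.NormalDist`. The distribution parameters (mean 100, standard deviation 15) are chosen only for illustration.

```python
from statistics import NormalDist

scores = NormalDist(mu=100, sigma=15)   # illustrative distribution

# Probability of falling in a range: difference of two CDF values.
p_between = scores.cdf(115) - scores.cdf(85)

# Quantiles invert the CDF: inv_cdf(p) returns the x with F(x) = p.
q1 = scores.inv_cdf(0.25)
median = scores.inv_cdf(0.50)
q3 = scores.inv_cdf(0.75)

print(round(p_between, 4))
print(round(q1, 2), round(median, 2), round(q3, 2))
```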

In summary, the CDF is a comprehensive tool that provides insights into the distribution and probability of random variables, making it essential for both theoretical and applied statistics.

### Q3: What are some examples of situations where the normal distribution might be used as a model? Explain how the parameters of the normal distribution relate to the shape of the distribution.

The normal distribution, often called the Gaussian distribution, is a key concept in statistics and probability. It's widely used in various fields due to its natural occurrence in many real-world scenarios. Here are some examples of situations where the normal distribution might be used as a model:

#### Examples of Situations

1. **Height of Individuals**:
   - **Description**: Heights of people within a specific population (e.g., adult women in a country) often follow a normal distribution.
   - **Application**: Understanding average height and variability in a population for health studies or designing products.

2. **Test Scores**:
   - **Description**: Scores from standardized tests or exams are often approximately normal. One common explanation appeals to the Central Limit Theorem: a total score is the sum of many item scores, and the sum (or average) of a large number of independent random variables with finite variance tends toward a normal distribution, regardless of the distribution of the individual variables.
   - **Application**: Assessing the performance of students and making decisions about grading and admissions.

3. **Measurement Errors**:
   - **Description**: Errors in measurements, such as those from scientific instruments, often follow a normal distribution due to various small, random factors affecting the measurements.
   - **Application**: Analyzing the accuracy and precision of measurements in experiments and quality control.

4. **Stock Prices**:
   - **Description**: Daily returns of stocks or other financial assets are often modeled as approximately normal, although observed returns tend to have heavier tails than the normal distribution predicts.
   - **Application**: Risk assessment, portfolio management, and financial modeling.

5. **Natural Phenomena**:
   - **Description**: Some natural quantities, such as annual rainfall totals in some regions or the sizes of natural objects of one kind, are approximately normal, although many others (daily rainfall, for example) are noticeably skewed.
   - **Application**: Environmental studies, resource management, and predicting natural events.

#### Parameters of the Normal Distribution

The normal distribution is characterized by two key parameters:

1. **Mean (\(\mu\))**:
   - **Description**: The mean represents the center of the distribution. It is the value around which the data is symmetrically distributed.
   - **Effect on Shape**: The mean determines the location of the peak of the bell curve. Shifting the mean moves the entire distribution left or right along the x-axis.

2. **Standard Deviation (\(\sigma\))**:
   - **Description**: The standard deviation measures the spread or dispersion of the distribution. It quantifies how much the values deviate from the mean.
   - **Effect on Shape**: The standard deviation affects the width of the bell curve:
     - A **smaller standard deviation** results in a **narrower** and **taller** curve, indicating that the data values are closer to the mean.
     - A **larger standard deviation** results in a **wider** and **flatter** curve, indicating that the data values are more spread out from the mean.
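
The effect of each parameter can be seen numerically. The sketch below uses made-up parameter values. It compares the peak height, which for a normal distribution equals \( \frac{1}{\sigma \sqrt{2\pi}} \), across three curves.

```python
from statistics import NormalDist

narrow = NormalDist(mu=50, sigma=5)     # small spread
wide = NormalDist(mu=50, sigma=20)      # large spread, same center
shifted = NormalDist(mu=70, sigma=5)    # same spread as `narrow`, different center

# The peak of each curve is the density evaluated at its own mean.
peak_narrow = narrow.pdf(50)
peak_wide = wide.pdf(50)
peak_shifted = shifted.pdf(70)

print(peak_narrow > peak_wide)   # smaller sigma gives a taller peak
print(peak_narrow / peak_wide)   # ratio of peak heights equals 20 / 5
```

Changing \( \mu \) moves the curve without changing its height, while multiplying \( \sigma \) by 4 divides the peak height by 4.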

#### Visual Representation

- **Mean (\(\mu\))**: The peak of the bell curve is at the mean. For example, if \(\mu = 50\), the highest point of the curve is at \(x = 50\).

- **Standard Deviation (\(\sigma\))**:
  - **1 Standard Deviation (\(\mu \pm \sigma\))**: Approximately 68% of the data falls within one standard deviation of the mean.
  - **2 Standard Deviations (\(\mu \pm 2\sigma\))**: Approximately 95% of the data falls within two standard deviations.
  - **3 Standard Deviations (\(\mu \pm 3\sigma\))**: Approximately 99.7% of the data falls within three standard deviations.
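
These percentages follow from the standard normal CDF, which a short sketch can confirm. The variable names are illustrative.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, standard deviation 1

# Probability mass within k standard deviations of the mean.
coverage = {k: z.cdf(k) - z.cdf(-k) for k in (1, 2, 3)}

for k, p in coverage.items():
    print(f"within {k} sd: {p:.4f}")
# within 1 sd: 0.6827
# within 2 sd: 0.9545
# within 3 sd: 0.9973
```

Because any normal variable can be standardized, the same percentages hold for every choice of \( \mu \) and \( \sigma \).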

#### Summary

The normal distribution is a powerful model used in various applications to describe data that is symmetrically distributed around a mean. The shape of the distribution is primarily influenced by the mean and standard deviation, which determine the location of the peak and the spread of the data, respectively. Understanding these parameters helps in interpreting and analyzing normally distributed data effectively.

### Q4: Explain the importance of Normal Distribution. Give a few real-life examples of Normal Distribution.

#### Importance of Normal Distribution

The normal distribution, also known as the Gaussian distribution, is crucial in statistics and probability for several reasons:

1. **Central Limit Theorem (CLT)**:
   - **Description**: The CLT states that the sum (or average) of a large number of independent and identically distributed random variables with finite variance will approximately follow a normal distribution, regardless of the original distribution of the variables.
   - **Importance**: This theorem underpins many statistical methods and justifies the use of normal distribution models in various fields.

2. **Simplifies Analysis**:
   - **Description**: Many statistical methods and tests are based on the assumption of normality. The normal distribution's properties make it easier to perform statistical analysis and hypothesis testing.
   - **Importance**: It allows for straightforward application of statistical techniques such as confidence intervals and significance tests.

3. **Modeling Natural Phenomena**:
   - **Description**: Many natural and human-made phenomena approximate a normal distribution, making it a valuable tool for modeling and prediction.
   - **Importance**: Accurate modeling leads to better predictions and decision-making.

4. **Quantifying Uncertainty**:
   - **Description**: The normal distribution helps quantify the uncertainty and variability of data. It provides a way to estimate probabilities and make inferences about populations.
   - **Importance**: It aids in risk assessment, quality control, and decision-making.

### Real-Life Examples of Normal Distribution

1. **Height of Individuals**:
   - **Description**: Heights of people in a given population (e.g., adult men in a country) often follow a normal distribution. Most people are of average height, with fewer individuals being very short or very tall.
   - **Example**: If the average height of adult men in the U.S. is 70 inches with a standard deviation of 3 inches, the distribution of heights will approximate a normal curve centered around 70 inches.

2. **Test Scores**:
   - **Description**: Scores from standardized tests, such as the SAT or GRE, often approximate a normal distribution. Most students score near the average, with fewer students achieving very high or very low scores.
   - **Example**: If the average score on a standardized test is 500 with a standard deviation of 100, the distribution of scores will be normal, with most students scoring close to 500.

3. **Measurement Errors**:
   - **Description**: Errors in scientific measurements often follow a normal distribution. This is due to the accumulation of many small, random factors that influence the measurements.
   - **Example**: If you measure the length of a metal rod multiple times, the measurement errors will typically be normally distributed around the true length of the rod.

4. **Stock Market Returns**:
   - **Description**: The daily returns of stocks or other financial assets are often modeled as approximately normal, although observed returns tend to have heavier tails than a normal distribution.
   - **Example**: If the average daily return of a stock is 0.1% with a standard deviation of 2%, a normal model places most returns close to 0.1%, with about 95% of them between -3.9% and 4.1%.

5. **IQ Scores**:
   - **Description**: IQ scores are designed to follow a normal distribution, with the majority of individuals scoring around the average and fewer individuals scoring very high or very low.
   - **Example**: The average IQ is set to 100 with a standard deviation of 15, leading to a normal distribution of scores where most individuals score close to 100 (about 68% between 85 and 115).

### Q5: What is the Bernoulli Distribution? Give an example. What is the difference between the Bernoulli Distribution and the Binomial Distribution?

### Bernoulli Distribution

The Bernoulli distribution is a discrete probability distribution for a random variable that has exactly two possible outcomes: success (usually coded as 1) and failure (usually coded as 0). It is named after the Swiss mathematician Jacob Bernoulli.

#### Definition

- **Random Variable**: \( X \)
- **Possible Outcomes**: 0 or 1
- **Probability of Success**: \( p \)
- **Probability of Failure**: \( 1 - p \)
- **Probability Mass Function (PMF)**:
  \[
  P(X = x) = 
  \begin{cases} 
  p & \text{for } x = 1 \\
  1 - p & \text{for } x = 0
  \end{cases}
  \]
  where \( 0 \leq p \leq 1 \).

#### Example

Consider a single coin flip where heads is considered a success (1) and tails is considered a failure (0). If the coin is fair, the probability of heads (success) is \( p = 0.5 \) and the probability of tails (failure) is \( 1 - p = 0.5 \). The distribution of the coin flip outcome follows a Bernoulli distribution with \( p = 0.5 \).
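
A minimal sketch of the coin-flip example follows. The helper names `bernoulli_pmf` and `bernoulli_sample` are made up for illustration, and the seed is fixed only so the simulation is reproducible.

```python
import random

def bernoulli_pmf(x, p):
    """P(X = x) for a Bernoulli(p) random variable; x must be 0 or 1."""
    if x not in (0, 1):
        raise ValueError("a Bernoulli variable only takes the values 0 and 1")
    return p if x == 1 else 1 - p

def bernoulli_sample(p, rng):
    """Draw one Bernoulli(p) outcome: 1 with probability p, otherwise 0."""
    return 1 if rng.random() < p else 0

rng = random.Random(0)  # fixed seed so the run is reproducible
flips = [bernoulli_sample(0.5, rng) for _ in range(10_000)]
heads_fraction = sum(flips) / len(flips)

print(bernoulli_pmf(1, 0.5), bernoulli_pmf(0, 0.5))
print(heads_fraction)  # should be close to 0.5
```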

### Difference between Bernoulli Distribution and Binomial Distribution

Both Bernoulli and Binomial distributions are related, but they describe different types of scenarios:

1. **Bernoulli Distribution**:
   - **Description**: Models a single trial with two possible outcomes (success or failure).
   - **Number of Trials**: 1
   - **Parameters**: \( p \) (probability of success)
   - **Usage**: Used when analyzing the outcome of a single experiment or trial.

2. **Binomial Distribution**:
   - **Description**: Models the number of successes in a fixed number of independent Bernoulli trials.
   - **Number of Trials**: \( n \) (where \( n \geq 1 \); with \( n = 1 \) the Binomial distribution reduces to the Bernoulli distribution)
   - **Parameters**: \( n \) (number of trials), \( p \) (probability of success in each trial)
   - **Probability Mass Function (PMF)**:
     \[
     P(X = k) = \binom{n}{k} p^k (1 - p)^{n - k}
     \]
     where \( k \) is the number of successes, and \( \binom{n}{k} \) is the binomial coefficient.
   - **Usage**: Used when analyzing multiple trials or experiments and counting the number of successes.

#### Example to Illustrate the Difference

- **Bernoulli Distribution**: Rolling a single die and checking whether it lands on a 6 (success) or not (failure). If the die is fair, \( p = \frac{1}{6} \) for landing on a 6.

- **Binomial Distribution**: Rolling the die 10 times and counting how many times it lands on a 6. Here, you have \( n = 10 \) trials and \( p = \frac{1}{6} \) for each trial. The number of times the die lands on a 6 follows a Binomial distribution with parameters \( n = 10 \) and \( p = \frac{1}{6} \).
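
The die example can be worked through with the binomial PMF. This is a sketch using exact fractions, and `binomial_pmf` is an illustrative helper name. Setting \( n = 1 \) recovers the Bernoulli probabilities.

```python
from fractions import Fraction
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k): probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

n, p = 10, Fraction(1, 6)   # ten rolls, success = rolling a 6

pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]
prob_two_sixes = pmf[2]

# With a single trial the binomial PMF is just the Bernoulli PMF.
bernoulli_case = [binomial_pmf(k, 1, p) for k in (0, 1)]

print(float(prob_two_sixes))   # probability of exactly two sixes in ten rolls
print(sum(pmf))                # the PMF sums to 1
print(bernoulli_case)
```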

### Summary

- **Bernoulli Distribution**: Describes the probability of a single trial with two outcomes.
- **Binomial Distribution**: Describes the number of successes in multiple independent Bernoulli trials.

### Q6: Consider a dataset with a mean of 50 and a standard deviation of 10. If we assume that the dataset is normally distributed, what is the probability that a randomly selected observation will be greater than 60? Use the appropriate formula and show your calculations.

To find the probability that a randomly selected observation from a normally distributed dataset will be greater than 60, given that the dataset has a mean of 50 and a standard deviation of 10, you need to use the properties of the normal distribution.

Here's the step-by-step process:

### Step-by-Step Solution

1. **Standardize the Observation**:
   Convert the observation to a standard normal variable (Z-score). The Z-score represents how many standard deviations an observation is from the mean.

   The formula to calculate the Z-score is:
   \[
   Z = \frac{X - \mu}{\sigma}
   \]
   where:
   - \( X \) = value of interest (60 in this case)
   - \( \mu \) = mean of the distribution (50)
   - \( \sigma \) = standard deviation of the distribution (10)

   Plug in the values:
   \[
   Z = \frac{60 - 50}{10} = \frac{10}{10} = 1
   \]

2. **Find the Probability**:
   Use the standard normal distribution table (Z-table) or a computational tool to find the probability associated with the Z-score.

   - The Z-table provides the probability that a standard normal variable is less than or equal to a given Z-score. For \( Z = 1 \), the Z-table gives us the probability of \( P(Z \leq 1) \), which is approximately 0.8413.

   - To find the probability of an observation being greater than 60, subtract the probability from 1:
     \[
     P(X > 60) = 1 - P(Z \leq 1)
     \]
     \[
     P(X > 60) = 1 - 0.8413 = 0.1587
     \]

### Summary

The probability that a randomly selected observation from this normally distributed dataset will be greater than 60 is approximately **0.1587**, or **15.87%**.
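
The same calculation can be reproduced with `statistics.NormalDist` from the Python standard library, used here in place of a Z-table. The variable names are illustrative.

```python
from statistics import NormalDist

mu, sigma, x = 50, 10, 60

z_score = (x - mu) / sigma                  # standardize the observation
p_greater = 1 - NormalDist().cdf(z_score)   # upper-tail probability P(Z > z)

# Same result without standardizing by hand.
p_direct = 1 - NormalDist(mu, sigma).cdf(x)

print(z_score)               # 1.0
print(round(p_greater, 4))   # 0.1587
```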

### Q7: Explain uniform Distribution with an example.

#### Uniform Distribution

The uniform distribution is a type of probability distribution in which all outcomes are equally likely. It has a constant probability mass function (discrete case) or a constant probability density function (continuous case) over its range.

#### Discrete Uniform Distribution

In a discrete uniform distribution, the probability of each possible outcome is equal. 

- **Definition**: If a discrete random variable \( X \) can take \( n \) distinct values, and each value is equally likely, then \( X \) follows a discrete uniform distribution.
- **Probability Mass Function (PMF)**:
  \[
  P(X = x) = \frac{1}{n}
  \]
  where \( x \) is any of the possible values of \( X \) and \( n \) is the total number of possible values.

**Example**: Rolling a fair six-sided die.
- **Description**: Each of the six faces (1 through 6) has an equal probability of landing face up.
- **Probability**: The probability of rolling any specific number (say 3) is:
  \[
  P(X = 3) = \frac{1}{6}
  \]

#### Continuous Uniform Distribution

In a continuous uniform distribution, the probability density function is constant within a given range.

- **Definition**: If a continuous random variable \( X \) can take any value between \( a \) and \( b \), and every subinterval of the same length within \( [a, b] \) is equally likely, then \( X \) follows a continuous uniform distribution.
- **Probability Density Function (PDF)**:
  \[
  f(x) = \frac{1}{b - a} \quad \text{for } a \leq x \leq b
  \]
  where \( a \) and \( b \) are the minimum and maximum values of \( X \), respectively.

- **Cumulative Distribution Function (CDF)**:
  \[
  F(x) =
  \begin{cases}
  0 & \text{for } x < a \\
  \frac{x - a}{b - a} & \text{for } a \leq x \leq b \\
  1 & \text{for } x > b
  \end{cases}
  \]

**Example**: Selecting a random number from the interval [0, 10].
- **Description**: Each number between 0 and 10 is equally likely to be chosen.
- **Probability Density Function**: The PDF is constant at \( \frac{1}{10 - 0} = \frac{1}{10} = 0.1 \) for \( 0 \leq x \leq 10 \).

- **Probability Calculation**: To find the probability that the number falls between 3 and 7, calculate the area under the PDF from 3 to 7:
  \[
  P(3 \leq X \leq 7) = \text{PDF} \times \text{Length of interval} = 0.1 \times (7 - 3) = 0.4
  \]
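
The interval calculation can also be done through the CDF. The sketch below uses illustrative helper names, `uniform_pdf` and `uniform_cdf`, and defines both functions for all real \( x \), returning 0 or 1 outside \( [a, b] \) as appropriate.

```python
def uniform_pdf(x, a, b):
    """Density of the continuous uniform distribution on [a, b]."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def uniform_cdf(x, a, b):
    """P(X <= x) for the continuous uniform distribution on [a, b]."""
    if x < a:
        return 0.0
    if x > b:
        return 1.0
    return (x - a) / (b - a)

a, b = 0.0, 10.0

density = uniform_pdf(5.0, a, b)                            # constant height 0.1
p_3_to_7 = uniform_cdf(7.0, a, b) - uniform_cdf(3.0, a, b)  # area between 3 and 7

print(density)
print(round(p_3_to_7, 4))   # 0.4
```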

### Summary

- **Discrete Uniform Distribution**: Each outcome in a finite set of possible outcomes has an equal probability.
- **Continuous Uniform Distribution**: Any value within a specific interval has an equal probability density, and the density integrates to 1 across the interval.

### Q8: What is the z score? State the importance of the z score.

### Z-Score

The Z-score, also known as the standard score, measures how many standard deviations a data point is from the mean of the data set. It is a way to standardize scores on the same scale, allowing for comparison across different distributions.

#### Formula

The Z-score for a given data point \( X \) is calculated using the following formula:
\[
Z = \frac{X - \mu}{\sigma}
\]
where:
- \( X \) = data point
- \( \mu \) = mean of the data set
- \( \sigma \) = standard deviation of the data set

#### Importance of the Z-Score

1. **Standardization**:
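   - **Description**: Converting to Z-scores rescales the data to have a mean of 0 and a standard deviation of 1. If the original data are normally distributed, the Z-scores follow the standard normal distribution; otherwise the shape of the distribution is unchanged.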
   - **Importance**: This allows for comparing data points from different distributions or datasets by putting them on a common scale.

2. **Identifying Outliers**:
   - **Description**: A Z-score indicates how far a data point is from the mean in terms of standard deviations.
   - **Importance**: Data points with Z-scores greater than 3 or less than -3 are commonly flagged as potential outliers, helping to identify unusual or extreme values.

3. **Probability Calculation**:
   - **Description**: The Z-score is used to calculate probabilities and percentiles in a standard normal distribution.
   - **Importance**: It helps in determining the likelihood of a value occurring within a certain range, which is useful in hypothesis testing and confidence interval estimation.

4. **Comparing Different Distributions**:
   - **Description**: When data from different distributions are transformed into Z-scores, comparisons can be made more easily.
   - **Importance**: This is useful in various fields, such as comparing test scores across different exams or assessing performance across different datasets.

5. **Normalization**:
   - **Description**: Z-scores are used to normalize data in preprocessing steps for machine learning algorithms.
   - **Importance**: Normalization puts features on a comparable scale so that no feature dominates merely because of its units, which often improves the performance and convergence of algorithms.
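
A small sketch of standardization as a preprocessing step is shown below. The data values are made up, `standardize` is an illustrative helper, and the population standard deviation (`statistics.pstdev`) is assumed as the scaling factor.

```python
from statistics import fmean, pstdev

def standardize(values):
    """Convert raw values to Z-scores using their own mean and standard deviation."""
    mean = fmean(values)
    sd = pstdev(values)   # population standard deviation
    if sd == 0:
        raise ValueError("cannot standardize values with zero spread")
    return [(v - mean) / sd for v in values]

raw_scores = [55, 60, 70, 85, 90]   # made-up data for illustration
z_scores = standardize(raw_scores)

print([round(z, 2) for z in z_scores])
print(round(fmean(z_scores), 10), round(pstdev(z_scores), 10))
```

After the transformation the values have mean 0 and standard deviation 1, but their relative ordering and the shape of their distribution are unchanged.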

#### Example

Suppose you have test scores from two different exams. Exam A has a mean score of 70 with a standard deviation of 10, and Exam B has a mean score of 80 with a standard deviation of 15. If a student scores 85 on Exam A and 95 on Exam B, you can calculate the Z-scores to compare their performance:

- **Exam A**:
  \[
  Z_A = \frac{85 - 70}{10} = 1.5
  \]
- **Exam B**:
  \[
  Z_B = \frac{95 - 80}{15} = 1
  \]

Even though the student scored higher on Exam B, the Z-score indicates that the performance on Exam A was relatively better compared to the mean of that exam.
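
The comparison above amounts to two applications of the Z-score formula, sketched here with an illustrative helper `z_score`:

```python
def z_score(x, mean, sd):
    """Number of standard deviations that x lies above (or below) the mean."""
    return (x - mean) / sd

z_a = z_score(85, mean=70, sd=10)   # Exam A
z_b = z_score(95, mean=80, sd=15)   # Exam B

print(z_a, z_b)    # 1.5 1.0
print(z_a > z_b)   # True: relatively stronger result on Exam A
```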



### Q9: What is Central Limit Theorem? State the significance of the Central Limit Theorem.


### Central Limit Theorem (CLT)

The Central Limit Theorem (CLT) is a fundamental theorem in probability theory and statistics. It states that, regardless of the original distribution of a population, the distribution of the sample mean approaches a normal distribution as the sample size becomes large, provided the samples are independent and identically distributed (i.i.d.).

#### Formal Statement

If \( \{X_1, X_2, \ldots, X_n\} \) are i.i.d. random variables with a mean \( \mu \) and a finite variance \( \sigma^2 \), then the sampling distribution of the sample mean \( \bar{X} \) approaches a normal distribution with mean \( \mu \) and variance \( \frac{\sigma^2}{n} \) as \( n \) (the sample size) becomes large.

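Mathematically, for large \( n \):
\[
\bar{X} \approx N\left(\mu, \frac{\sigma^2}{n}\right), \quad \text{or more precisely} \quad \frac{\sqrt{n}\,(\bar{X} - \mu)}{\sigma} \xrightarrow{d} N(0, 1) \text{ as } n \to \infty
\]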

where \( \bar{X} \) is the sample mean.

#### Significance of the Central Limit Theorem

1. **Facilitates Inference**:
   - **Description**: CLT allows for the use of normal distribution properties even when the population distribution is not normal, provided the sample size is sufficiently large.
   - **Significance**: This simplifies the process of making inferences about population parameters, such as estimating confidence intervals and performing hypothesis tests.

2. **Enables Use of Parametric Tests**:
   - **Description**: Many statistical tests and methods (e.g., t-tests, ANOVA) assume normality. The CLT justifies the use of these tests on sample means.
   - **Significance**: It allows for the application of a wide range of statistical methods and tests in practical scenarios, even if the original data is not normally distributed.

3. **Improves Accuracy of Estimations**:
   - **Description**: As sample size increases, the sample mean becomes a more accurate estimate of the population mean, because its standard error \( \frac{\sigma}{\sqrt{n}} \) shrinks.
   - **Significance**: This improves the reliability of statistical estimates and predictions derived from sample data.

4. **Foundation for Statistical Quality Control**:
   - **Description**: The CLT underpins quality control techniques such as control charts, which rely on the normal distribution of sample means to monitor process quality.
   - **Significance**: It supports effective monitoring and control of manufacturing processes and service quality.

5. **Predicts Long-Term Behavior**:
   - **Description**: The CLT predicts that sample means will follow a normal distribution, which helps in understanding and predicting the long-term behavior of processes.
   - **Significance**: This is useful in fields like finance, insurance, and operations management for forecasting and risk assessment.

#### Example

Suppose you have a population with a skewed distribution (e.g., income distribution). If you take a large number of random samples from this population and compute the sample means, the distribution of these sample means will approximate a normal distribution. For instance, with samples of size 30 or more, the distribution of the sample means is often close to normal even though the original income distribution is not, although strongly skewed populations may need larger samples.
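
A small simulation can make this concrete. The sketch below is only an illustration: it uses an exponential distribution with rate 1 (mean 1, standard deviation 1) as a stand-in for a skewed population, a sample size of 30, and a fixed seed for reproducibility.

```python
import random
from statistics import fmean, stdev

rng = random.Random(42)   # fixed seed so the run is reproducible

def sample_mean(n):
    """Mean of n draws from a right-skewed Exponential(1) population."""
    return fmean(rng.expovariate(1.0) for _ in range(n))

n = 30
means = [sample_mean(n) for _ in range(5_000)]

center = fmean(means)   # CLT: close to the population mean, 1
spread = stdev(means)   # CLT: close to sigma / sqrt(n) = 1 / sqrt(30)

# For a normal distribution, about 68% of values lie within one standard error.
standard_error = 1 / n ** 0.5
within_one_se = sum(abs(m - 1.0) <= standard_error for m in means) / len(means)

print(round(center, 3), round(spread, 3), round(standard_error, 3))
print(round(within_one_se, 3))
```

Even though individual draws are strongly skewed, the sample means cluster around 1 with a spread near \( \frac{1}{\sqrt{30}} \approx 0.183 \), roughly as the theorem predicts.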



### Q10: State the assumptions of the Central Limit Theorem.

#### Assumptions of the Central Limit Theorem (CLT)

The Central Limit Theorem (CLT) relies on several key assumptions to ensure that the sample mean approximates a normal distribution. Here are the main assumptions:

1. **Independence**:
   - **Description**: The sampled observations must be independent of each other. This means the value of one observation should not influence or be influenced by the value of another observation.
   - **Significance**: Independence ensures that the variance of the sample mean is \( \frac{\sigma^2}{n} \). Correlated observations carry less information than independent ones and can invalidate the normal approximation.

2. **Identically Distributed**:
   - **Description**: The samples must be drawn from the same probability distribution with the same mean (\( \mu \)) and variance (\( \sigma^2 \)).
   - **Significance**: Ensures that all samples are comparable and have the same underlying characteristics, which is crucial for accurately approximating the normal distribution.

3. **Sample Size**:
   - **Description**: The sample size \( n \) should be sufficiently large. While there is no strict rule for what constitutes a "large" sample size, a common rule of thumb is that \( n \geq 30 \) is often considered adequate.
   - **Significance**: Larger sample sizes lead to a better approximation of the normal distribution for the sample mean. For populations with highly skewed distributions or large variances, a larger sample size might be required.

4. **Finite Variance**:
   - **Description**: The population from which the samples are drawn should have a finite variance (\( \sigma^2 \)).
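   - **Significance**: Ensures that the spread of the data is not infinite, which supports the convergence of the sample mean to a normal distribution. For heavy-tailed distributions with infinite variance, such as the Cauchy distribution, the sample mean does not converge to a normal distribution.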

#### Summary of Assumptions

1. **Independence**: Observations must be independent.
2. **Identically Distributed**: Observations must be drawn from the same distribution.
3. **Sample Size**: Sample size should be large (typically \( n \geq 30 \)).
4. **Finite Variance**: Population variance must be finite.

These assumptions collectively ensure that the distribution of the sample mean approaches a normal distribution as the sample size increases, which allows for reliable statistical inference using the properties of the normal distribution.