**Q1: What are the Probability Mass Function (PMF) and Probability Density Function (PDF)? Explain with
an example.**

**Probability Mass Function (PMF):**

The Probability Mass Function (PMF) is a concept used in probability theory to describe the probability distribution of a discrete random variable. It gives the probability that a discrete random variable is exactly equal to some value. Mathematically, for a discrete random variable X, the PMF is denoted by P(X = x), where x is a specific value the random variable can take.

For example, consider a fair six-sided die. The PMF for rolling a particular number on the die is:

$P(X = x) = \frac{1}{6}$

where X is the random variable representing the outcome of the die roll, and x can be any of the numbers 1 through 6.
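The die PMF above can be sketched in a few lines of Python. This is only an illustration; the function name `die_pmf` is made up for this example, and exact fractions are used so the probabilities sum to exactly 1.

```python
from fractions import Fraction

def die_pmf(x):
    """Return P(X = x) for a fair six-sided die."""
    # Each face 1..6 has probability 1/6; any other value has probability 0.
    return Fraction(1, 6) if x in range(1, 7) else Fraction(0)

pmf_values = {x: die_pmf(x) for x in range(1, 7)}
total = sum(pmf_values.values())

print(pmf_values[3])  # 1/6
print(total)          # 1
```

A valid PMF is non-negative everywhere and sums to 1 over all possible values, which is what the last line checks.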

**Probability Density Function (PDF):**

On the other hand, the Probability Density Function (PDF) is used for continuous random variables. It describes how probability is spread over the possible values: where the density is higher, values are relatively more likely. Unlike the PMF, the PDF does not give the probability of a specific value. Its value at a point is a density, and probabilities are obtained by integrating it over a range of values.

For example, consider a continuous random variable Y representing the height of people. The PDF might be denoted as f(y), where y is a height value. The probability that Y falls within a certain range [a, b] is given by the integral of the PDF over that range:

$P(a \leq Y \leq b) = \int_{a}^{b} f(y) \, dy$

It's important to note that for a continuous random variable, the probability at a specific point is technically zero, and probabilities are defined over intervals.
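To make the integral concrete, here is a small sketch that assumes, purely for illustration, that heights follow a normal distribution with mean 170 cm and standard deviation 10 cm. Those numbers and the helper name `prob_between` are assumptions, not values from the text. The integral is approximated with the midpoint rule.

```python
from statistics import NormalDist

# Hypothetical height model (assumed values): mean 170 cm, sd 10 cm.
heights = NormalDist(mu=170, sigma=10)

def prob_between(dist, a, b, steps=10_000):
    """Approximate the integral of dist.pdf over [a, b] with the midpoint rule."""
    width = (b - a) / steps
    return sum(dist.pdf(a + (i + 0.5) * width) for i in range(steps)) * width

p_interval = prob_between(heights, 160, 180)  # P(160 <= Y <= 180)
p_point = prob_between(heights, 170, 170)     # interval of zero width

print(round(p_interval, 4))  # 0.6827
print(p_point)               # 0.0
```

The zero-width interval has probability 0 even though the density at 170 is positive, which is the point made above about single values.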

In summary, the PMF is used for discrete random variables, providing the probability of specific values, while the PDF is used for continuous random variables, providing the probability density over a range of values.

**Q2: What is the Cumulative Distribution Function (CDF)? Explain with an example. Why is the CDF used?**

**Cumulative Distribution Function (CDF):**

The Cumulative Distribution Function (CDF), sometimes loosely called the cumulative density function, describes the cumulative probability distribution of a random variable. For both discrete and continuous random variables, the CDF gives the probability that the random variable takes on a value less than or equal to a given value.

Mathematically, for a random variable X, the CDF is denoted by F(x), and it is defined as:

$F(x) = P(X \leq x)$

For a discrete random variable, the CDF is the sum of the probabilities up to a certain value, and for a continuous random variable, it is the integral of the probability density function (PDF) up to a certain value.

**Example:**

Let's consider a fair six-sided die again. The CDF for rolling a value less than or equal to x is given by:

$F(x) = P(X \leq x)$

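For a fair six-sided die, where each outcome is equally likely, the CDF is a step function: it jumps by $\frac{1}{6}$ at each integer from 1 to 6 and is flat in between. For $1 \leq x \leq 6$, the CDF is:

$F(x) = \frac{\lfloor x \rfloor}{6}$

with $F(x) = 0$ for $x < 1$ and $F(x) = 1$ for $x > 6$. At the integer values $x = 1, 2, \ldots, 6$ this reduces to $F(x) = \frac{x}{6}$.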

So, for example, to find the probability of rolling a number less than or equal to 3, plug $x = 3$ into the CDF:

$F(3) = \frac{3}{6} = \frac{1}{2}$
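A minimal sketch of the die's CDF as a step function, valid for any real $x$ and not only the integers 1 through 6. The name `die_cdf` is chosen for illustration.

```python
import math
from fractions import Fraction

def die_cdf(x):
    """Return F(x) = P(X <= x) for a fair six-sided die, for any real x."""
    if x < 1:
        return Fraction(0)
    if x >= 6:
        return Fraction(1)
    # Between the jumps the CDF is flat, so only the integer part of x matters.
    return Fraction(math.floor(x), 6)

print(die_cdf(3))    # 1/2
print(die_cdf(3.7))  # 1/2
print(die_cdf(0))    # 0
print(die_cdf(10))   # 1
```

Note that `die_cdf(3.7)` equals `die_cdf(3)`: no probability is added between the faces 3 and 4.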

**Why CDF is used:**

1. **Cumulative Probability:** The CDF provides a way to calculate the cumulative probability of a random variable up to a certain point. It gives the probability that the random variable is less than or equal to a specific value.

2. **Ease of Calculation:** In many cases, it's easier to work with cumulative probabilities, especially when dealing with complex distributions. Once the CDF is known (from a formula, a table, or software), a probability is obtained by evaluating it at one or two points instead of summing or integrating the probability mass or density function each time.

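3. **Probability Intervals:** The CDF is particularly useful for finding probabilities within intervals. The probability that the random variable falls in the range $(a, b]$ is $P(a < X \leq b) = F(b) - F(a)$. For a continuous random variable this also equals $P(a \leq X \leq b)$, because a single point has probability zero. For a discrete random variable, $P(X = a)$ must be added if the endpoint $a$ is to be included.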
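As a rough illustration of interval probabilities computed from a CDF, the sketch below uses the standard normal distribution for the continuous case and the fair die for the discrete case. The helper `die_cdf` is a hypothetical name. It also shows that, for a discrete variable, $F(b) - F(a)$ leaves out the left endpoint $a$.

```python
import math
from fractions import Fraction
from statistics import NormalDist

# Continuous case: standard normal (mean 0, sd 1).
std_normal = NormalDist()
p_cont = std_normal.cdf(1) - std_normal.cdf(-1)  # P(-1 <= Z <= 1)

def die_cdf(x):
    """CDF of a fair six-sided die."""
    if x < 1:
        return Fraction(0)
    if x >= 6:
        return Fraction(1)
    return Fraction(math.floor(x), 6)

# Discrete case: F(b) - F(a) gives P(a < X <= b), excluding a itself.
p_half_open = die_cdf(4) - die_cdf(2)  # P(2 < X <= 4)  -> faces 3, 4
p_closed = die_cdf(4) - die_cdf(1)     # P(2 <= X <= 4) -> faces 2, 3, 4

print(round(p_cont, 4))  # 0.6827
print(p_half_open)       # 1/3
print(p_closed)          # 1/2
```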

In summary, the Cumulative Distribution Function is a valuable tool in probability theory, providing a convenient way to express and calculate cumulative probabilities for random variables, both discrete and continuous.

**Q3: What are some examples of situations where the normal distribution might be used as a model?**<br>
Explain how the parameters of the normal distribution relate to the shape of the distribution.

**Examples of Situations for Normal Distribution:**

The normal distribution, also known as the Gaussian distribution or bell curve, is commonly used to model various phenomena in fields such as statistics, physics, finance, and natural sciences. Some examples of situations where the normal distribution might be used as a model include:

1. **Height of Individuals:** The distribution of human heights tends to follow a normal distribution.

2. **IQ Scores:** IQ scores are often modeled using a normal distribution, and across a large population they approximate one closely.

3. **Measurement Errors:** Many measurement errors in experimental sciences are assumed to be normally distributed.

4. **Financial Returns:** Returns on financial assets are often modeled as approximately normal, although observed returns tend to have heavier tails than the normal distribution. Stock prices themselves are more commonly modeled as log-normal.

5. **Blood Pressure:** Blood pressure measurements in a population may be modeled using a normal distribution.

**Parameters of the Normal Distribution:**

The normal distribution is characterized by two parameters: the mean $\mu$ and the standard deviation $\sigma$. These parameters play a crucial role in determining the shape of the normal distribution:

1. **Mean $\mu$:** The mean represents the central location of the distribution. It is the point around which the data is symmetrically distributed. Shifting the mean to the right or left moves the entire distribution along the horizontal axis.

2. **Standard Deviation $\sigma$:** The standard deviation measures the spread or dispersion of the distribution. A larger standard deviation results in a wider, more spread-out distribution, while a smaller standard deviation produces a narrower, more concentrated distribution.

The probability density function (PDF) of the normal distribution is given by the formula:

$f(x; \mu, \sigma) = \frac{1}{\sqrt{2\pi}\sigma} e^{-\frac{(x - \mu)^2}{2\sigma^2}}$

Here, $x$ is the value at which the density is evaluated, $\mu$ is the mean, $\sigma$ is the standard deviation, $\pi$ is a mathematical constant (approximately 3.14159), and $e$ is the base of the natural logarithm.
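The formula can be written out directly to see how the two parameters act on the curve. This is a small sketch; `normal_pdf` is a name chosen for illustration, and the standard library's `statistics.NormalDist` is used only as a cross-check.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma):
    """Density of the normal distribution with mean mu and sd sigma at x."""
    coeff = 1.0 / (math.sqrt(2.0 * math.pi) * sigma)
    return coeff * math.exp(-((x - mu) ** 2) / (2.0 * sigma ** 2))

peak_narrow = normal_pdf(0, mu=0, sigma=1)  # peak height for sigma = 1
peak_wide = normal_pdf(0, mu=0, sigma=2)    # doubling sigma halves the peak
peak_shifted = normal_pdf(5, mu=5, sigma=1) # shifting mu moves the peak, same height

print(round(peak_narrow, 4))  # 0.3989
print(round(peak_wide, 4))    # 0.1995
print(math.isclose(peak_narrow, NormalDist(0, 1).pdf(0)))  # True
```

The peak always sits at $x = \mu$, and its height is $\frac{1}{\sqrt{2\pi}\sigma}$, so a larger $\sigma$ gives a lower, wider curve.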

In summary, the normal distribution is a versatile model that can represent a wide range of natural phenomena. The mean and standard deviation parameters control the location and spread of the distribution, respectively, influencing its shape and characteristics.

**Q4: Explain the importance of Normal Distribution. Give a few real-life examples of Normal
Distribution.**

**Importance of Normal Distribution:**

The normal distribution is of great importance in various fields due to its mathematical properties and its prevalence in describing the distribution of many natural phenomena. Here are some reasons why the normal distribution is crucial:

1. **Statistical Inference:** The normal distribution is fundamental in statistical inference. Many statistical methods and tests, such as hypothesis testing and confidence intervals, assume that the data is approximately normally distributed.

2. **Central Limit Theorem:** The Central Limit Theorem states that the sum (or average) of a large number of independent, identically distributed random variables with finite variance will be approximately normally distributed, regardless of their original distribution. This theorem underpins the validity of many statistical procedures.

3. **Modeling Uncertainty:** In many cases, when there is uncertainty about a quantity or a process, the normal distribution is used to model that uncertainty. This is because of its simplicity and the fact that it arises naturally in various situations.

4. **Predictive Modeling:** In predictive modeling and machine learning, assumptions about the distribution of errors or residuals are often based on the normal distribution. Linear regression models, for example, often assume that the residuals are normally distributed.

5. **Quality Control:** The normal distribution is frequently used in quality control processes to model variations in manufacturing processes. It helps in setting control limits and identifying outliers.

**Real-life Examples of Normal Distribution:**

1. **IQ Scores:** Intelligence Quotient (IQ) scores are designed to follow a normal distribution with a mean of 100 and a standard deviation of 15. This distribution allows for the classification of intelligence levels.

2. **Height of Individuals:** Human height is often modeled as a normal distribution. While actual height data may deviate slightly from a perfect normal distribution, the approximation is often close enough for practical purposes.

3. **Exam Scores:** In educational settings, the distribution of exam scores for a large population of students often approximates a normal distribution. This is particularly true for exams that are well-designed and cover a diverse range of topics.

4. **Blood Pressure:** Blood pressure in a population is often modeled using a normal distribution. Normal blood pressure values are centered around a mean value, and deviations from this value are expected to follow a bell-shaped curve.

5. **Financial Returns:** In finance, the daily returns of stock prices are often assumed to be normally distributed. This assumption is foundational in various financial models.

6. **Error Terms in Regression:** In linear regression models, the errors or residuals are often assumed to be normally distributed. This assumption is important for making statistical inferences and constructing prediction intervals.

The normal distribution's ubiquity in various real-world scenarios makes it a valuable tool for statistical analysis and modeling, aiding in making predictions, drawing inferences, and understanding natural variability.

**Q5: What is Bernoulli Distribution? Give an Example. What is the difference between Bernoulli
Distribution and Binomial Distribution?**

**Bernoulli Distribution:**

The Bernoulli distribution is a discrete probability distribution representing a random variable that can take on one of two possible outcomes, typically labeled as success and failure. It is named after Jacob Bernoulli, a Swiss mathematician. The distribution is characterized by a single parameter, $p$, which is the probability of success.

The probability mass function (PMF) of a Bernoulli-distributed random variable is given by:

$P(X = k) = 
  \begin{cases} 
    p & \text{if } k = 1 \\
    1 - p & \text{if } k = 0 
  \end{cases}
$

Here, $X$ is the random variable, $k$ is the outcome (1 for success, 0 for failure), and $p$ is the probability of success.

**Example of Bernoulli Distribution:**

Consider a single coin flip, where success is defined as getting heads $H$ and failure as getting tails $T$. Let's say the probability of getting heads is $p = 0.5$. The Bernoulli distribution for this scenario is:

$P(X = k) = 
  \begin{cases} 
    0.5 & \text{if } k = 1 \text{ (heads)} \\
    0.5 & \text{if } k = 0 \text{ (tails)} 
  \end{cases}
$

**Difference between Bernoulli Distribution and Binomial Distribution:**

1. **Number of Trials:**
   - **Bernoulli Distribution:** Describes a single trial with two possible outcomes (success or failure).
   - **Binomial Distribution:** Describes the number of successes in a fixed number of independent and identical Bernoulli trials.

2. **Random Variable:**
   - **Bernoulli Distribution:** The random variable can only take two values, typically 0 and 1.
   - **Binomial Distribution:** The random variable represents the number of successes in a fixed number of trials, and it can take values from 0 to the total number of trials.

3. **Parameter:**
   - **Bernoulli Distribution:** Characterized by a single parameter $p$, the probability of success in a single trial.
   - **Binomial Distribution:** Characterized by two parameters: $n$, the number of trials, and $p$, the probability of success in a single trial.

4. **Probability Mass Function (PMF):**
   - **Bernoulli Distribution:** $P(X = k) = p^k (1 - p)^{1-k}$ for $k = 0, 1$.
   - **Binomial Distribution:** $P(X = k) = \binom{n}{k} p^k (1 - p)^{n-k}$ for $k = 0, 1, 2, \ldots, n$.

In summary, the Bernoulli distribution is the special case of the binomial distribution with a single trial ($n = 1$). The binomial distribution extends the concept to multiple independent and identical trials, modeling the number of successes across those trials.
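The relationship between the two PMFs can be checked with a short sketch. The function names `bernoulli_pmf` and `binomial_pmf` are illustrative, and the three-flip example is an assumption added here.

```python
from math import comb

def bernoulli_pmf(k, p):
    """P(X = k) for a single trial with success probability p."""
    if k not in (0, 1):
        return 0.0
    return p ** k * (1 - p) ** (1 - k)

def binomial_pmf(k, n, p):
    """P(X = k) successes in n independent trials with success probability p."""
    if not 0 <= k <= n:
        return 0.0
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

# With n = 1 the binomial PMF coincides with the Bernoulli PMF.
same_for_n1 = all(
    bernoulli_pmf(k, 0.3) == binomial_pmf(k, 1, 0.3) for k in (0, 1)
)

# Example: probability of exactly 2 heads in 3 fair coin flips.
two_heads_in_three = binomial_pmf(2, 3, 0.5)

print(same_for_n1)         # True
print(two_heads_in_three)  # 0.375
```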

**Q6. Consider a dataset with a mean of 50 and a standard deviation of 10. If we assume that the dataset
is normally distributed, what is the probability that a randomly selected observation will be greater
than 60? Use the appropriate formula and show your calculations.**

To find the probability that a randomly selected observation from a normal distribution is greater than a specific value, we can use the Z-score formula and standard normal distribution tables.

The Z-score is calculated as follows:

$Z = \frac{(X - \mu)}{\sigma}$

where:
- $X$ is the specific value (in this case, 60),
- $\mu$ is the mean of the distribution (given as 50),
- $\sigma$ is the standard deviation of the distribution (given as 10).

Let's calculate the Z-score:

$ Z = \frac{(60 - 50)}{10} = 1 $

Now, we look up the corresponding probability from the standard normal distribution table for a Z-score of 1. The standard normal distribution table provides the cumulative probability up to a given Z-score.

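From the table, the cumulative probability corresponding to a Z-score of 1 is approximately 0.8413. This is the probability of an observation being less than or equal to 60: $P(X \leq 60) = P(Z \leq 1) \approx 0.8413$.

Because the question asks for the probability of an observation greater than 60, we take the complement:

$P(X > 60) = 1 - P(Z \leq 1) \approx 1 - 0.8413 = 0.1587$

So, the probability that a randomly selected observation from the dataset is greater than 60 is approximately 0.1587, or 15.87%.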
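The calculation can be checked numerically instead of with a table. The sketch below uses Python's standard-library `statistics.NormalDist`; the variable names are chosen for illustration.

```python
from statistics import NormalDist

mu, sigma, x = 50, 10, 60

z = (x - mu) / sigma            # standardize the value 60
p_below = NormalDist().cdf(z)   # P(Z <= 1), the table value
p_above = 1 - p_below           # P(Z > 1), the upper-tail probability

print(z)                  # 1.0
print(round(p_below, 4))  # 0.8413
print(round(p_above, 4))  # 0.1587
```

The table value 0.8413 is the area to the left of $Z = 1$, so the area to the right, which is the probability of exceeding 60, is its complement.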

**Q7: Explain uniform Distribution with an example.**

**Uniform Distribution:**

The uniform distribution is a probability distribution in which all values within a given range are equally likely to occur. It is characterized by a constant probability density function (PDF) over its entire range. The probability density is uniform, meaning that every possible outcome has the same likelihood of occurring.

**Probability Density Function (PDF) of the Uniform Distribution:**

For a continuous uniform distribution over the interval $[a, b]$, the PDF is defined as:

$ f(x) = 
  \begin{cases} 
    \frac{1}{b-a} & \text{for } a \leq x \leq b \\
    0 & \text{otherwise}
  \end{cases}
$

Here, $a$ and $b$ are the parameters defining the interval, and the PDF is flat (constant) over that interval.
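A brief sketch of the continuous case, using a hypothetical waiting time that is uniform on $[0, 10]$ minutes. The scenario and the function names are assumptions for illustration. Because the density is flat, the probability of a sub-interval is simply its length divided by $b - a$.

```python
def uniform_pdf(x, a, b):
    """Density of the continuous uniform distribution on [a, b]."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def uniform_prob(c, d, a, b):
    """P(c <= X <= d) for X uniform on [a, b]: overlap length / (b - a)."""
    lo, hi = max(a, c), min(b, d)
    return max(0.0, hi - lo) / (b - a)

# Hypothetical example: waiting time uniform on [0, 10] minutes.
density = uniform_pdf(3, 0, 10)
p_2_to_5 = uniform_prob(2, 5, 0, 10)

print(density)   # 0.1
print(p_2_to_5)  # 0.3
```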

**Example of Uniform Distribution:**

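The PDF above describes the continuous case, but the same idea applies to discrete outcomes. Let's consider a uniform distribution representing the outcome of rolling a fair six-sided die. In this case, the uniform distribution is discrete, as the possible outcomes are integers, so it is described by a PMF rather than a PDF.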

For a fair six-sided die, the outcomes are 1, 2, 3, 4, 5, and 6. The probability of each outcome is $ \frac{1}{6} $ because all outcomes are equally likely.

So, the discrete uniform distribution for the die roll is:

$ P(X = k) = 
  \begin{cases} 
    \frac{1}{6} & \text{for } k = 1, 2, 3, 4, 5, 6 \\
    0 & \text{otherwise}
  \end{cases}
$

This means that each outcome has an equal probability of $ \frac{1}{6} $, making it a uniform distribution.

In summary, the uniform distribution is characterized by a constant probability density over an interval (continuous case) or an equal probability for each outcome (discrete case), and it is often used to model situations where each outcome within the range has an equal likelihood of occurring.

**Q8: What is the z score? State the importance of the z score.**

**Z-Score:**

The Z-score, also known as the standard score or z-value, is a measure of how many standard deviations a particular data point is from the mean of a distribution. It is calculated using the following formula:

$ Z = \frac{(X - \mu)}{\sigma} $

where:
- $ Z $ is the Z-score,
- $ X $ is the individual data point,
- $ \mu $ is the mean of the distribution,
- $ \sigma $ is the standard deviation of the distribution.

The Z-score indicates whether a data point is typical or unusual in comparison to the rest of the distribution. A positive Z-score means the data point is above the mean, while a negative Z-score means it is below the mean.
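As a small worked example, the sketch below computes Z-scores for a made-up dataset, treating it as the whole population so that the population standard deviation (`pstdev`) is used. The data values are assumptions chosen so the arithmetic is easy to follow.

```python
from statistics import mean, pstdev

# Hypothetical data, chosen so that the mean is 5 and the population sd is 2.
data = [2, 4, 4, 4, 5, 5, 7, 9]

mu = mean(data)
sigma = pstdev(data)

# Z-score of each point: distance from the mean in units of sd.
z_scores = [(x - mu) / sigma for x in data]

print(mu, sigma)  # 5 2.0
print(z_scores)   # [-1.5, -0.5, -0.5, -0.5, 0.0, 0.0, 1.0, 2.0]
```

The value 9 lies two standard deviations above the mean ($Z = 2$), while 2 lies one and a half standard deviations below it ($Z = -1.5$).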

**Importance of Z-Score:**

1. **Standardization:** Z-scores are used to standardize data and make different distributions comparable. By expressing data points in terms of standard deviations from the mean, comparisons can be made across different scales and units.

2. **Identifying Outliers:** Z-scores help identify outliers or extreme values in a dataset. Data points whose Z-scores are far from zero may be considered outliers; a common rule of thumb flags points with $|Z| > 3$.

3. **Probability Calculation:** Z-scores are used in probability calculations. In a standard normal distribution (with a mean of 0 and standard deviation of 1), Z-scores correspond to the probabilities of observing values below or above a certain point.

4. **Data Analysis and Inference:** Z-scores are widely used in statistical analysis and hypothesis testing. They help in making decisions about sample means and comparing data from different populations.

5. **Quality Control:** In manufacturing and quality control processes, Z-scores are used to assess how far a particular measurement is from the expected value, helping identify potential issues.

6. **Determination of Percentiles:** Z-scores can be used to determine the percentile rank of a data point within a distribution. This is useful in understanding the relative position of a value in a dataset.

7. **Standardizing Scores in Education:** Z-scores are often used in educational assessments to standardize scores across different tests, allowing for a fair comparison of performance.

In summary, the Z-score is a valuable statistical tool that provides a standardized measure of a data point's position within a distribution. It aids in comparing and interpreting data, identifying outliers, and making probabilistic assessments.

**Q9: What is Central Limit Theorem? State the significance of the Central Limit Theorem.**

**Central Limit Theorem (CLT):**

The Central Limit Theorem (CLT) is a fundamental concept in statistics that describes the shape of the sampling distribution of the sample mean (or sum) for a sufficiently large sample size, regardless of the shape of the original population distribution. It states that, as the sample size increases, the sampling distribution of the sample mean approaches a normal distribution, even if the population distribution is not normal.

The Central Limit Theorem is crucial for inferential statistics, hypothesis testing, and confidence interval construction. It is applicable when drawing repeated random samples from a population, making it a powerful tool in statistical analysis.

**Statement of the Central Limit Theorem:**

Let $X_1, X_2, \ldots, X_n$ be a random sample of $n$ independent, identically distributed observations drawn from any population with mean $\mu$ and finite standard deviation $\sigma$. As $n$ becomes large, the sampling distribution of the sample mean $\bar{X}$ will be approximately normally distributed with mean $\mu$ and standard deviation $\frac{\sigma}{\sqrt{n}}$.
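The statement can be illustrated with a small simulation. The sketch below assumes an exponential population with rate 1 (mean 1, standard deviation 1), which is strongly right-skewed. The choice of population, the sample size of 50, the number of repetitions, and the seed are all assumptions made for this illustration.

```python
import random
from statistics import mean, stdev

random.seed(0)  # fixed seed so the run is reproducible

pop_mean, pop_sd = 1.0, 1.0  # exponential(rate=1): mean 1, sd 1
n = 50                       # size of each sample
num_samples = 5000           # number of repeated samples

# Draw many samples and record the mean of each one.
sample_means = [
    sum(random.expovariate(1.0) for _ in range(n)) / n
    for _ in range(num_samples)
]

mean_of_means = mean(sample_means)
sd_of_means = stdev(sample_means)
predicted_sd = pop_sd / n ** 0.5  # sigma / sqrt(n) from the CLT

# Share of sample means within 1.96 predicted sds of the population mean;
# for a normal distribution this would be about 95%.
within = sum(
    abs(m - pop_mean) <= 1.96 * predicted_sd for m in sample_means
) / num_samples

print(mean_of_means, sd_of_means, predicted_sd, within)
```

Even though individual observations are far from normal, the sample means cluster around $\mu$ with a spread close to $\frac{\sigma}{\sqrt{n}}$, and roughly 95% of them fall within 1.96 of those standard deviations.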

**Significance of the Central Limit Theorem:**

1. **Normality of the Sample Mean Distribution:** The CLT allows us to assume that the sampling distribution of the sample mean is approximately normal, regardless of the shape of the original population distribution. This is particularly powerful because the normal distribution is well understood and has many desirable properties.

2. **Statistical Inference:** The CLT is the foundation for many statistical inference procedures. It enables the use of normal distribution-based methods, such as calculating confidence intervals and conducting hypothesis tests, even when dealing with non-normally distributed populations.

3. **Large Sample Sizes:** The CLT justifies the normal approximation for large samples, although the theorem itself does not say how large is large enough. As a general rule of thumb, a sample size of 30 or more is often considered sufficient, with more needed when the population is heavily skewed.

4. **Independence from the Population Distribution:** The CLT is applicable to populations with unknown or non-normal distributions. This makes it a versatile tool for a wide range of applications where the underlying population distribution might not be known.

5. **Averages Approach Normality:** The CLT states that the distribution of sample means approaches normality even if the original population distribution is not normal. This holds true for both symmetric and skewed populations.

In summary, the Central Limit Theorem is a cornerstone of statistical theory, providing a bridge between the characteristics of a population and the properties of the sample mean distribution. It enables the use of normal distribution-based methods in statistical inference, contributing to the robustness and applicability of statistical techniques in practice.

**Q10: State the assumptions of the Central Limit Theorem.**

The Central Limit Theorem (CLT) is a powerful statistical concept, but it relies on certain assumptions to hold true. Here are the key assumptions of the Central Limit Theorem:

1. **Random Sampling:** The samples must be drawn randomly from the population. Each individual in the population has an equal chance of being selected in the sample.

2. **Independence:** The observations in the sample must be independent of each other. The outcome of one observation should not influence the outcome of another. In the case of sampling without replacement, the sample size should be small relative to the population size to maintain independence.

3. **Sample Size:** While the CLT doesn't specify an exact sample size, the sample size should be "sufficiently large." In practice, a commonly used guideline is that a sample size of 30 or more is often considered large enough for the CLT to apply. However, the appropriateness of this rule depends on the nature of the population distribution.

4. **Finite Variance:** The population from which the samples are drawn must have a finite variance ($\sigma^2$). If the population variance is infinite, as with the Cauchy distribution, the classical CLT does not apply.

5. **Identically Distributed:** The sampled observations should come from an identical distribution. In other words, each observation in the sample should follow the same probability distribution.

It's important to note that while the CLT provides an approximation of normality for the distribution of the sample mean, the underlying population distribution can be non-normal. The CLT is particularly robust for larger sample sizes, even when the population distribution is skewed or not well-behaved.

Violating these assumptions might lead to biased or unreliable results when relying on the Central Limit Theorem. Understanding and checking these assumptions are crucial when applying statistical methods based on the CLT in practice.