# Q1: What are the Probability Mass Function (PMF) and Probability Density Function (PDF)? Explain with an example.
## Probability Mass Function (PMF) and Probability Density Function (PDF)
### Probability Mass Function (PMF)
- **Context**: Discrete random variables.
- **Definition**: The PMF gives the probability that a discrete random variable is exactly equal to some value.
- **Example**: If \( X \) is the outcome of rolling a fair six-sided die, the PMF \( P(X=x) \) for \( x = 1, 2, 3, 4, 5, 6 \) is \( \frac{1}{6} \).
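
The die example can be sketched in a few lines of Python. This is a minimal illustration, not a general-purpose implementation; the names `die_pmf` and `pmf` are chosen here for the example, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

# PMF of a fair six-sided die: each face 1..6 has probability 1/6.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

def pmf(x):
    """Return P(X = x); any value that is not a face of the die has probability 0."""
    return die_pmf.get(x, Fraction(0))

print(pmf(3))                 # 1/6
print(pmf(7))                 # 0
print(sum(die_pmf.values()))  # 1
```

The last line checks the defining property of a PMF: the probabilities over all possible values sum to 1.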

### Probability Density Function (PDF)
- **Context**: Continuous random variables.
- **Definition**: The PDF \( f(x) \) describes the relative likelihood of a continuous random variable taking values near a given point. The probability of any single exact value is 0; probabilities are obtained for intervals by integrating the PDF: \( P(a \leq X \leq b) = \int_a^b f(x) \, dx \).
- **Example**: If \( Y \) is the height of adult males in a certain country and follows a normal distribution with mean \( \mu = 70 \) inches and standard deviation \( \sigma = 3 \) inches, the PDF \( f_Y(y) \) describes the relative likelihood of different heights. The probability that \( Y \) lies between 68 and 72 inches is found by integrating \( f_Y(y) \) over this interval.
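
As a rough sketch of the height example, Python's standard-library `statistics.NormalDist` can evaluate the density and the interval probability. The variable names and the helper `midpoint_integral` are assumptions made for this illustration; the numerical integral is included only to show that integrating the PDF gives the same answer as the difference of two CDF values.

```python
from statistics import NormalDist

heights = NormalDist(mu=70, sigma=3)  # parameters from the example above

# Density values are relative likelihoods, not probabilities.
print(heights.pdf(70))  # density at the mean (the peak)
print(heights.pdf(76))  # lower density two standard deviations above the mean

# P(68 <= Y <= 72) is the area under the PDF between 68 and 72.
p_between = heights.cdf(72) - heights.cdf(68)
print(round(p_between, 4))

def midpoint_integral(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / steps
    return sum(f(a + (i + 0.5) * width) for i in range(steps)) * width

p_numeric = midpoint_integral(heights.pdf, 68, 72)
print(round(p_numeric, 4))  # agrees with p_between to four decimals
```

The result is a little under one half, which makes sense: the interval covers two thirds of a standard deviation on each side of the mean.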


# Cumulative Distribution Function (CDF)

## Definition
The Cumulative Distribution Function (CDF) describes the probability that a random variable \(X\) will take a value less than or equal to \(x\).

- **Formula**:
  - For a discrete random variable \(X\), \(F_X(x) = P(X \leq x) = \sum_{t \leq x} P(X = t)\).
  - For a continuous random variable \(X\), \(F_X(x) = P(X \leq x) = \int_{-\infty}^x f_X(t) \, dt\), where \(f_X(t)\) is the probability density function (PDF).

## Example
- **Discrete Example**: If \(X\) is the outcome of rolling a fair six-sided die, the CDF \(F_X(x)\) at \(x = 3\) is the sum of the probabilities of rolling a 1, 2, or 3:
  \[
  F_X(3) = P(X \leq 3) = P(X=1) + P(X=2) + P(X=3) = \frac{1}{6} + \frac{1}{6} + \frac{1}{6} = \frac{3}{6} = 0.5
  \]
- **Continuous Example**: If \(Y\) is the height of adult males in a certain country following a normal distribution with mean \(\mu = 70\) inches and standard deviation \(\sigma = 3\) inches, the CDF \(F_Y(y)\) at \(y = 72\) is the probability that a randomly selected male is 72 inches or shorter. This is calculated using the integral of the PDF from \(-\infty\) to 72.
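
Both examples can be checked with a short sketch. The function name `die_cdf` is an assumption for this illustration, and `statistics.NormalDist` from the standard library stands in for a table lookup or a manual integral.

```python
from fractions import Fraction
from statistics import NormalDist

def die_cdf(x):
    """F_X(x) = P(X <= x) for a fair die: add up the PMF over all faces t <= x."""
    return sum(Fraction(1, 6) for t in range(1, 7) if t <= x)

print(die_cdf(3))    # 1/2
print(die_cdf(3.5))  # 1/2, the CDF is flat between faces
print(die_cdf(6))    # 1

# Continuous case: probability that a height is 72 inches or less.
heights = NormalDist(mu=70, sigma=3)
p_at_most_72 = heights.cdf(72)
print(round(p_at_most_72, 4))
```

The discrete CDF is a step function that jumps at each face, while the continuous CDF rises smoothly from 0 to 1.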

## Why CDF is Used
- **Cumulative Probability**: The CDF provides the cumulative probability up to a certain value, which helps in understanding the likelihood of a range of outcomes.
- **Probability Calculations**: It is useful for calculating probabilities of intervals, comparing distributions, and finding percentiles.
- **Intuitive Understanding**: The CDF gives an intuitive understanding of how probabilities accumulate over the range of possible values of the random variable.


# Q3: Examples of Situations Where the Normal Distribution Might Be Used as a Model

## Examples of Situations:
1. **Heights of People**: Heights of adult men, or of adult women, within a given population are approximately normally distributed.
2. **Test Scores**: Standardized test scores such as the SAT or IQ scores are often approximately normal.
3. **Measurement Errors**: Random errors in repeated scientific measurements are often approximately normal.
4. **Blood Pressure**: The distribution of blood pressure readings in a healthy population can often be modeled using a normal distribution.
5. **Daily Stock Returns**: The daily returns of stock prices for large, diversified portfolios are often assumed to be normally distributed.

## Parameters of the Normal Distribution:
- **Mean (\(\mu\))**: This is the central value of the distribution. It determines the location of the peak of the curve. For example, if the average height of adult men is 70 inches, then \(\mu = 70\).
- **Standard Deviation (\(\sigma\))**: This measures the spread or dispersion of the distribution. It determines the width of the bell curve. A smaller \(\sigma\) gives a narrower, taller curve, while a larger \(\sigma\) gives a wider, flatter curve. For instance, if the standard deviation of heights is 3 inches, then \(\sigma = 3\).

## Shape of the Distribution:
- **Symmetry**: The normal distribution is symmetric about the mean.
- **Bell-shaped Curve**: The highest point on the curve corresponds to the mean, and the curve tails off symmetrically on both sides.
- **68-95-99.7 Rule**:
  - About 68% of the data falls within one standard deviation (\(\mu \pm \sigma\)) of the mean.
  - About 95% falls within two standard deviations (\(\mu \pm 2\sigma\)).
  - About 99.7% falls within three standard deviations (\(\mu \pm 3\sigma\)).

## Example Visualization:
If we consider the heights of adult males with \(\mu = 70\) inches and \(\sigma = 3\) inches, the distribution will have its peak at 70 inches. About 68% of the males will have heights between 67 and 73 inches (70 \(\pm 3\) inches), about 95% will have heights between 64 and 76 inches (70 \(\pm 6\) inches), and about 99.7% will have heights between 61 and 79 inches (70 \(\pm 9\) inches).

This example shows how the mean and standard deviation determine the location and spread of the normal distribution.
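
The 68-95-99.7 figures for the height example can be reproduced numerically. The sketch below assumes the same \(\mu = 70\) and \(\sigma = 3\) and uses the standard-library `statistics.NormalDist`; the dictionary name `coverage` is chosen for this illustration.

```python
from statistics import NormalDist

heights = NormalDist(mu=70, sigma=3)

coverage = {}
for k in (1, 2, 3):
    low = heights.mean - k * heights.stdev
    high = heights.mean + k * heights.stdev
    # Probability of falling within k standard deviations of the mean.
    coverage[k] = heights.cdf(high) - heights.cdf(low)
    print(f"within {k} sd: {low:.0f} to {high:.0f} inches, probability {coverage[k]:.4f}")
```

The three probabilities come out to roughly 0.6827, 0.9545, and 0.9973, which is where the rounded 68%, 95%, and 99.7% come from.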


# Q5: Bernoulli Distribution and Binomial Distribution

## Bernoulli Distribution
- **Definition**: The Bernoulli distribution is a discrete probability distribution for a random variable which takes the value 1 with probability \( p \) and the value 0 with probability \( 1 - p \).
- **Example**: Flipping a coin where heads (success) is coded as 1 and tails (failure) is coded as 0. If the coin is fair, then \( p = 0.5 \).
  - PMF of a Bernoulli random variable \( X \):
    \[
    P(X = x) = \begin{cases} 
    p & \text{if } x = 1 \\
    1 - p & \text{if } x = 0 
    \end{cases}
    \]

## Binomial Distribution
- **Definition**: The Binomial distribution is the discrete probability distribution of the number of successes in a sequence of \( n \) independent trials, each with exactly two possible outcomes: success (with the same probability \( p \) on every trial) or failure (with probability \( 1 - p \)).
- **Example**: Flipping a coin 10 times and counting the number of heads. Here, each flip is a Bernoulli trial with \( p = 0.5 \), and we are interested in the total number of successes (heads) in 10 trials.
  - PMF of a Binomial random variable \( X \) (number of successes in \( n \) trials):
    \[
    P(X = k) = \binom{n}{k} p^k (1 - p)^{n - k}
    \]
    where \( \binom{n}{k} \) is the binomial coefficient, representing the number of ways to choose \( k \) successes out of \( n \) trials.
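
The binomial PMF translates directly into code. This is a minimal sketch using `math.comb` from the standard library; the function name `binomial_pmf` is an assumption for this illustration.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k): probability of exactly k successes in n independent trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

n, p = 10, 0.5  # ten flips of a fair coin

print(binomial_pmf(5, n, p))                             # 0.24609375 (252 / 1024)
print(sum(binomial_pmf(k, n, p) for k in range(n + 1)))  # 1.0

# With n = 1 the formula reduces to the Bernoulli PMF.
print(binomial_pmf(1, 1, p), binomial_pmf(0, 1, p))      # 0.5 0.5
```

Setting `n = 1` shows that the Bernoulli distribution is the special case of the binomial distribution with a single trial.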

## Differences Between Bernoulli and Binomial Distributions:
1. **Number of Trials**:
   - **Bernoulli**: A single trial.
   - **Binomial**: Multiple trials (\( n \)).

2. **Random Variable**:
   - **Bernoulli**: The random variable represents a single trial outcome (0 or 1).
   - **Binomial**: The random variable represents the number of successes in \( n \) trials.

3. **Parameters**:
   - **Bernoulli**: Only one parameter \( p \) (the probability of success).
   - **Binomial**: Two parameters \( n \) (number of trials) and \( p \) (probability of success).
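
The relationship between the two distributions can also be seen by simulation: a binomial draw is just the sum of \( n \) Bernoulli trials. The sketch below is illustrative only; the function names, the seed, and the number of repetitions are arbitrary choices made for this example.

```python
import random

rng = random.Random(0)  # fixed seed (arbitrary) so the run is repeatable

def bernoulli_trial(p):
    """One Bernoulli trial: 1 (success) with probability p, otherwise 0."""
    return 1 if rng.random() < p else 0

def binomial_draw(n, p):
    """One binomial draw: the number of successes in n Bernoulli trials."""
    return sum(bernoulli_trial(p) for _ in range(n))

single_trials = [bernoulli_trial(0.5) for _ in range(10)]
counts = [binomial_draw(10, 0.5) for _ in range(10_000)]

print(single_trials)              # a list of 0s and 1s
print(sum(counts) / len(counts))  # average count, expected to be near n * p = 5
```

Each Bernoulli outcome is 0 or 1, while each binomial outcome is a count between 0 and \( n \).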


# Q6: Probability of a Randomly Selected Observation Being Greater Than 60

Given a normally distributed dataset with:
- Mean (\(\mu\)) = 50
- Standard deviation (\(\sigma\)) = 10

We need to find \( P(X > 60) \).

## Steps to Calculate:

1. **Convert to the standard normal distribution (Z-score):**
   The Z-score is calculated using the formula:
   \[
   Z = \frac{X - \mu}{\sigma}
   \]
   For \( X = 60 \):
   \[
   Z = \frac{60 - 50}{10} = 1
   \]

2. **Find the probability corresponding to the Z-score:**
   Using standard normal distribution tables or a calculator, find the cumulative probability \( P(Z \leq 1) \).
   \[
   P(Z \leq 1) \approx 0.8413
   \]

3. **Calculate the probability of \( X \) being greater than 60:**
   \[
   P(X > 60) = P(Z > 1) = 1 - P(Z \leq 1) = 1 - 0.8413 = 0.1587
   \]

Therefore, the probability that a randomly selected observation from this normally distributed dataset will be greater than 60 is approximately \( 0.1587 \) or 15.87%.
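
The three steps above can be reproduced with the standard-library `statistics.NormalDist` in place of a Z-table. The variable names are chosen for this sketch.

```python
from statistics import NormalDist

mu, sigma, x = 50, 10, 60

# Step 1: convert to a Z-score.
z = (x - mu) / sigma

# Step 2: cumulative probability P(Z <= z) under the standard normal.
p_below = NormalDist().cdf(z)

# Step 3: complement gives P(X > 60).
p_above = 1 - p_below

print(z)                  # 1.0
print(round(p_below, 4))  # 0.8413
print(round(p_above, 4))  # 0.1587

# Same answer without standardizing first.
p_direct = 1 - NormalDist(mu, sigma).cdf(x)
print(round(p_direct, 4))  # 0.1587
```

Standardizing is only needed when working from a table; software can evaluate the CDF of any normal distribution directly.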


# Q7: Uniform Distribution

## Definition
The uniform distribution is a probability distribution in which all outcomes in its range are equally likely. In the continuous case, this means every interval of the same length within the range has the same probability.

## Types
1. **Discrete Uniform Distribution**: Every value in a finite set of values is equally likely.
2. **Continuous Uniform Distribution**: The density is constant over a continuous range \([a, b]\), so no part of the range is favored over another part of the same length.

## Discrete Uniform Distribution Example
- **Example**: Rolling a fair six-sided die.
  - Possible outcomes: {1, 2, 3, 4, 5, 6}
  - Each outcome has an equal probability of \( \frac{1}{6} \)

## Continuous Uniform Distribution Example
- **Example**: Suppose we have a continuous random variable \( X \) that is uniformly distributed between 0 and 10.
  - Notation: \( X \sim U(a, b) \), where \( a \) is the minimum value and \( b \) is the maximum value. Here, \( X \sim U(0, 10) \).
  - Probability Density Function (PDF): 
    \[
    f(x) = \begin{cases} 
    \frac{1}{b - a} & a \leq x \leq b \\
    0 & \text{otherwise}
    \end{cases}
    \]
    For \( a = 0 \) and \( b = 10 \):
    \[
    f(x) = \begin{cases} 
    \frac{1}{10 - 0} = \frac{1}{10} & 0 \leq x \leq 10 \\
    0 & \text{otherwise}
    \end{cases}
    \]

## Properties of Continuous Uniform Distribution
1. **Mean** (\(\mu\)):
   \[
   \mu = \frac{a + b}{2}
   \]
   For \( a = 0 \) and \( b = 10 \):
   \[
   \mu = \frac{0 + 10}{2} = 5
   \]

2. **Variance** (\(\sigma^2\)):
   \[
   \sigma^2 = \frac{(b - a)^2}{12}
   \]
   For \( a = 0 \) and \( b = 10 \):
   \[
   \sigma^2 = \frac{(10 - 0)^2}{12} = \frac{100}{12} = \frac{25}{3} \approx 8.33
   \]
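
The formulas for \( U(0, 10) \) can be checked with a few helper functions. This is a minimal sketch; the function names are assumptions for this illustration, and `uniform_interval_prob` is an added helper that assumes the interval lies inside \([a, b]\).

```python
def uniform_pdf(x, a, b):
    """Density of U(a, b): constant 1 / (b - a) inside [a, b], zero outside."""
    return 1 / (b - a) if a <= x <= b else 0.0

def uniform_mean(a, b):
    return (a + b) / 2

def uniform_variance(a, b):
    return (b - a) ** 2 / 12

def uniform_interval_prob(low, high, a, b):
    """P(low <= X <= high), assuming a <= low <= high <= b."""
    return (high - low) / (b - a)

a, b = 0, 10
print(uniform_pdf(4, a, b))                 # 0.1
print(uniform_pdf(12, a, b))                # 0.0
print(uniform_mean(a, b))                   # 5.0
print(round(uniform_variance(a, b), 2))     # 8.33
print(uniform_interval_prob(2, 5, a, b))    # 0.3
```

Because the density is flat, the probability of an interval depends only on its length relative to \( b - a \).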

## Application
Uniform distributions are often used in simulations where each possible outcome is equally likely, such as random sampling, computer simulations, and games of chance.


# Q8: Z-Score

## Definition
The z-score, also known as the standard score, measures how many standard deviations an element is from the mean of the distribution. It indicates the relative position of a value within a distribution.

## Formula
The z-score is calculated using the formula:

$$
Z = \frac{X - \mu}{\sigma}
$$

where:
- \( X \) is the value of the observation,
- \( \mu \) is the mean of the distribution,
- \( \sigma \) is the standard deviation of the distribution.

## Example
Suppose you have a dataset with a mean score of 80 and a standard deviation of 10. For a test score of 90:
\[
Z = \frac{90 - 80}{10} = 1
\]
This means the score of 90 is 1 standard deviation above the mean.
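
The calculation is a one-line function. In the sketch below the function name `z_score` is chosen for illustration, and the second test (mean 500, standard deviation 100, score 650) is a made-up example added to show how scores on different scales can be compared.

```python
def z_score(x, mu, sigma):
    """Number of standard deviations that x lies above (+) or below (-) the mean."""
    if sigma <= 0:
        raise ValueError("standard deviation must be positive")
    return (x - mu) / sigma

print(z_score(90, 80, 10))  # 1.0
print(z_score(65, 80, 10))  # -1.5

# Hypothetical second test on a different scale.
print(z_score(650, 500, 100))  # 1.5
```

On these made-up numbers, the 650 is further above its own mean (1.5 standard deviations) than the 90 is above its mean (1 standard deviation), even though the raw scores are not directly comparable.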

## Importance of the Z-Score

1. **Standardization**: Z-scores standardize different data points so they can be compared directly, even if they come from different distributions or have different units.

2. **Probability Calculations**: In a normal distribution, the z-score allows for the calculation of probabilities and percentiles. It helps in determining how likely an observation is within a given range.

3. **Outlier Detection**: Z-scores help identify outliers. A common rule of thumb flags values with \( |z| > 3 \) as outliers; a looser threshold of \( |z| > 2 \) is sometimes used to mark values as unusual.

4. **Comparing Data Across Different Scales**: Z-scores are useful when comparing data that come from different scales or units, making it easier to see which values are unusually high or low relative to their own distributions.

5. **Statistical Inference**: In hypothesis testing and confidence intervals, z-scores are used to standardize test statistics and make decisions based on standard normal distribution tables.
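
As an illustration of the outlier-detection use, the sketch below flags values whose absolute z-score exceeds a threshold. The data are made up, the function name `find_outliers` is an assumption, and the threshold is a parameter because the cutoff is a convention rather than a fixed rule. The population standard deviation (`statistics.pstdev`) is used here; using the sample standard deviation instead is an equally reasonable choice.

```python
import statistics

def find_outliers(values, threshold=2.0):
    """Return the values whose |z-score| exceeds the threshold."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)  # population standard deviation
    return [v for v in values if abs((v - mu) / sigma) > threshold]

readings = [10, 12, 11, 13, 12, 11, 40]  # made-up data with one extreme value

flagged = find_outliers(readings)
print(flagged)                       # [40]
print(find_outliers(readings, 3.0))  # []
```

The second call shows a limitation worth knowing: in a very small dataset a single extreme value inflates the standard deviation so much that its own z-score cannot get very large, so a strict threshold may flag nothing.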


# Central Limit Theorem (CLT)

The **Central Limit Theorem** (CLT) states that if you repeatedly draw independent samples of a sufficiently large size from a population with a finite mean and variance, the distribution of the sample means will be approximately normal, regardless of the shape of the population's distribution.

## Importance of the CLT

1. **Simplification**: By working with the sampling distribution of the mean (which is approximately normal), we simplify complex statistical problems.
2. **Inference**: We can make informed inferences about population parameters based on sample statistics.
3. **Statistical Techniques**: The CLT allows us to use techniques that assume a normal distribution, even when the original data isn't normally distributed.
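
A small simulation makes the theorem concrete. The sketch below assumes an exponential population with rate 1 (mean 1, standard deviation 1), which is strongly right-skewed; the sample size of 50, the 5,000 repetitions, and the seed are arbitrary choices for this illustration.

```python
import random
import statistics

rng = random.Random(42)  # fixed seed (arbitrary) so the run is repeatable

def sample_mean(n):
    """Mean of n independent draws from an exponential population (rate 1)."""
    return statistics.fmean([rng.expovariate(1.0) for _ in range(n)])

n = 50
means = [sample_mean(n) for _ in range(5_000)]

center = statistics.fmean(means)
spread = statistics.stdev(means)
print(center)  # expected to be close to the population mean, 1
print(spread)  # expected to be close to 1 / sqrt(50), about 0.141

# For a normal distribution, about 68% of values lie within one
# standard deviation of the mean; check how close the sample means come.
share = sum(abs(m - center) <= spread for m in means) / len(means)
print(share)
```

Even though individual exponential draws are far from normal, the sample means cluster symmetrically around 1 with a spread close to \( \sigma / \sqrt{n} \), and roughly 68% of them fall within one standard deviation of the center.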

In short, the CLT is the bridge that connects what we observe in a sample to what we can conclude about the population.
