### 1
- Probability Mass Function (PMF): The PMF is a function that describes the probability distribution of a discrete random variable. It assigns a probability to each possible outcome or value that the discrete random variable can take. For example, when rolling a fair six-sided die, the PMF would assign a probability of 1/6 to each of the six possible outcomes (1, 2, 3, 4, 5, and 6).

- Probability Density Function (PDF): The PDF is a function that describes the probability distribution of a continuous random variable. Its value at a point is a density, not a probability: the probability of any single exact value is zero, and the probability that the variable falls in an interval is the area under the PDF over that interval. For instance, in a normal distribution, the density is highest near the mean and falls off for values farther from the mean, so intervals near the mean carry more probability than equally wide intervals in the tails.
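A minimal sketch contrasting the two functions, using only the Python standard library. The narrow normal distribution (mean 0, standard deviation 0.1) is an illustrative assumption, chosen because its density exceeds 1 at the mean, which shows that a PDF value is not itself a probability.

```python
from fractions import Fraction
from statistics import NormalDist

# PMF of a fair six-sided die: each face gets probability 1/6.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}
pmf_total = sum(die_pmf.values())  # a valid PMF sums to exactly 1

# PDF of a normal distribution; mu=0 and sigma=0.1 are illustrative choices.
narrow = NormalDist(mu=0.0, sigma=0.1)
density_at_mean = narrow.pdf(0.0)  # a density, which may be larger than 1

# Probabilities come from areas under the PDF, computed here via the CDF.
prob_within_one_sigma = narrow.cdf(0.1) - narrow.cdf(-0.1)

print(pmf_total)
print(density_at_mean)
print(prob_within_one_sigma)
```

The PMF values are probabilities and sum to 1, while the PDF only yields a probability once it is integrated over an interval.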

### 2
- Cumulative Distribution Function (CDF): The CDF is a function that gives the cumulative probability that a random variable takes a value less than or equal to a specified value, F(x) = P(X ≤ x). It applies to both discrete and continuous random variables and describes the distribution over its entire range. The CDF is non-decreasing: it approaches 0 at the low end of the range and 1 at the high end.

Example: Consider rolling a fair six-sided die. The CDF evaluated at 3, that is, the probability of an outcome less than or equal to 3, is calculated as follows:
  - P(X ≤ 3) = P(X = 1) + P(X = 2) + P(X = 3) = 1/6 + 1/6 + 1/6 = 1/2

The CDF is useful because it provides a complete picture of the probabilities associated with a random variable. It can be used to find probabilities for various ranges of values, calculate percentiles, and make probability comparisons.
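As a rough illustration, the die CDF can be built as a running sum of the PMF. The variable names are hypothetical, and exact fractions are used so the results match the hand calculation.

```python
from fractions import Fraction
from itertools import accumulate

faces = range(1, 7)
pmf = [Fraction(1, 6) for _ in faces]

# The running sum of the PMF gives the CDF at each face value.
cdf = dict(zip(faces, accumulate(pmf)))

p_at_most_3 = cdf[3]

# Probability of a range: P(2 < X <= 5) = F(5) - F(2).
p_between = cdf[5] - cdf[2]

print(p_at_most_3, p_between)
```

Subtracting two CDF values is how the CDF yields probabilities for ranges of values.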

### 3
The normal distribution is commonly used as a model in various real-world situations, including:

1. Heights of Adults: The distribution of heights in a population often follows a normal distribution.
2. IQ Scores: Intelligence quotient (IQ) scores tend to be normally distributed with a mean of 100 and a standard deviation of 15.
3. Test Scores: In educational testing, scores on standardized tests, such as the SAT, are often assumed to be normally distributed.
4. Errors in Measurements: Errors in measurements, such as errors in scientific experiments or manufacturing processes, are often assumed to be normally distributed.
5. Financial Data: Stock returns are often modeled as approximately normal, although observed returns typically have heavier tails than a normal distribution predicts.
6. Biological Traits: Characteristics like birth weights, blood pressure, and cholesterol levels in a population can exhibit a normal distribution.

Parameters of the Normal Distribution:
- Mean (μ): The mean represents the central location or average of the data. It determines the location of the peak of the normal distribution.
- Standard Deviation (σ): The standard deviation measures the spread or variability of the data. A larger σ results in a wider and flatter distribution, while a smaller σ results in a narrower and taller distribution.
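The effect of σ on the shape can be checked numerically. This sketch reuses the IQ parameters mentioned above (μ = 100, σ = 15) and compares them with a hypothetical wider distribution (σ = 30) that is assumed purely for contrast.

```python
from statistics import NormalDist

iq = NormalDist(mu=100, sigma=15)
wider = NormalDist(mu=100, sigma=30)  # hypothetical comparison distribution

# The peak sits at the mean; its height is 1 / (sigma * sqrt(2 * pi)).
peak_iq = iq.pdf(100)
peak_wider = wider.pdf(100)

# Share of values within one standard deviation of the mean.
share_within_one_sd = iq.cdf(115) - iq.cdf(85)

print(peak_iq, peak_wider, share_within_one_sd)
```

Doubling σ halves the peak height, while the share of values within one standard deviation of the mean stays at roughly 68%.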

### 4
The normal distribution is important in statistics and data analysis for several reasons:

1. Modeling Real-World Data: Many natural phenomena and measurements in the real world follow a normal distribution or can be approximated by it. Understanding this distribution helps in analyzing and making predictions about various aspects of these phenomena.

2. Statistical Inference: The normal distribution is a fundamental concept in inferential statistics. Many statistical tests and confidence interval calculations are based on the assumption of normality.

3. Central Limit Theorem: The normal distribution is closely tied to the Central Limit Theorem, which states that the distribution of sample means approaches a normal distribution as the sample size grows, regardless of the shape of the underlying population distribution, provided the observations are independent and the population variance is finite. This theorem is essential for hypothesis testing and confidence intervals.

Examples of Real-Life Normal Distributions:
- Heights of adult humans in a population.
- IQ scores in a large sample of people.
- Exam scores in a class when the grading process is fair and unbiased.
- Errors in manufacturing processes when the errors are small and independent.
- Natural variation in the measurements of physical quantities.

### 5
- Bernoulli Distribution: The Bernoulli distribution models a single trial or experiment with two possible outcomes, typically labeled as "success" and "failure." It is characterized by a single parameter, p, which represents the probability of success. The random variable X following a Bernoulli distribution is equal to 1 if success occurs and 0 if failure occurs.

Example: Tossing a fair coin, where "Heads" is considered success (X = 1) and "Tails" is considered failure (X = 0).

Difference between Bernoulli Distribution and Binomial Distribution:
- The Bernoulli distribution models a single trial or experiment with two outcomes.
- The Binomial distribution models the number of successes in a fixed number (n) of independent and identical Bernoulli trials.
- In a Bernoulli distribution, you have a single random variable (X) with two possible values (0 or 1).
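- In a Binomial distribution, you also have a single random variable, but it takes the values 0, 1, ..., n: it is the sum X₁ + X₂ + ... + Xₙ of n independent Bernoulli variables that share the same p. The Bernoulli distribution is the special case n = 1.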
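A small sketch of the relationship, assuming a fair coin (p = 0.5) and n = 10 tosses. The helper names `bernoulli` and `binomial_pmf` are hypothetical, and the seed is fixed only to make the run repeatable.

```python
import random
from math import comb

def bernoulli(p, rng):
    """Return 1 (success) with probability p, otherwise 0 (failure)."""
    return 1 if rng.random() < p else 0

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

rng = random.Random(0)
n, p = 10, 0.5

one_toss = bernoulli(p, rng)                            # a single Bernoulli outcome
heads_in_n = sum(bernoulli(p, rng) for _ in range(n))   # one binomial outcome

p_five_heads = binomial_pmf(5, n, p)  # 252/1024 for a fair coin

print(one_toss, heads_in_n, p_five_heads)
```

Summing n Bernoulli outcomes produces one draw from the binomial distribution, and `binomial_pmf` gives the probability of each possible total.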

### 6
To find the probability that a randomly selected observation from a normally distributed dataset with mean μ = 50 and standard deviation σ = 10 is greater than 60, you can use the standard normal distribution (Z) and the z-score formula:

Z = (X - μ) / σ

Where:
- X is the value you want to find the probability for (in this case, X = 60).
- μ is the mean of the distribution (μ = 50).
- σ is the standard deviation of the distribution (σ = 10).

Calculate the z-score:

Z = (60 - 50) / 10 = 1.0

Now, you need to find the probability associated with this z-score using a standard normal distribution table or calculator. The probability that Z is greater than 1.0 is approximately 0.1587.

So, the probability that a randomly selected observation from the dataset is greater than 60 is approximately 0.1587 or 15.87%.
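The table lookup can be reproduced with `statistics.NormalDist` from the Python standard library. This is only a sketch of the calculation above. The second computation skips the standardization step to show that both routes agree.

```python
from statistics import NormalDist

mu, sigma, x = 50, 10, 60

# Standardize, then use the standard normal CDF: P(Z > z) = 1 - Phi(z).
z = (x - mu) / sigma
p_via_z = 1 - NormalDist().cdf(z)

# Equivalent calculation directly on the original scale.
p_direct = 1 - NormalDist(mu, sigma).cdf(x)

print(z, round(p_via_z, 4))  # 1.0 0.1587
```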

### 7
- Uniform Distribution: The uniform distribution is a probability distribution in which all values within a given range have equal probabilities of occurring. In other words, each possible outcome has the same likelihood of being observed.

Example: Rolling a fair six-sided die is an example of a discrete uniform distribution. There are six possible outcomes (1, 2, 3, 4, 5, and 6), and each outcome has a probability of 1/6 because the die is fair and unbiased. The probability mass function (PMF) for a discrete uniform distribution is constant across all possible values.

For a continuous uniform distribution, an example is selecting a random point within a specified interval, such as choosing a random real number between 0 and 1. In this case, the probability density function (PDF) is constant within the interval, equal to 1 divided by the interval's length, and zero outside it.
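A minimal sketch of the continuous case. The function names `uniform_pdf` and `uniform_cdf` are hypothetical helpers, and the interval defaults to 0 through 1 as in the example above.

```python
def uniform_pdf(x, a=0.0, b=1.0):
    """Density of the continuous uniform distribution on [a, b]."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def uniform_cdf(x, a=0.0, b=1.0):
    """P(X <= x) for the continuous uniform distribution on [a, b]."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)

# Probability of landing in [0.25, 0.75] equals the length of that interval
# divided by the length of the whole range.
p_interval = uniform_cdf(0.75) - uniform_cdf(0.25)

print(uniform_pdf(0.3), p_interval)
```

Because the density is flat, the probability of any sub-interval depends only on its length, not on where it sits within the range.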

### 8
- Z-Score (Standard Score): The z-score, also known as the standard score, is a measure of how many standard deviations a data point is away from the mean of a dataset. It is calculated using the formula:

Z = (X - μ) / σ

Where:
- X is the individual data point.
- μ is the mean (average) of the dataset.
- σ is the standard deviation of the dataset.

Importance of the Z-Score:
1. Standardization: Z-scores standardize data, making it possible to compare data from different distributions and scales. This is particularly useful in statistical analysis and hypothesis testing.

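2. Outlier Detection: Z-scores help identify outliers in a dataset. Data points whose z-scores are far from 0 are potential outliers; a common rule of thumb flags points with an absolute z-score above about 3, although the cutoff is a judgment call and works less well for small or strongly skewed datasets.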

3. Probability Calculations: Z-scores are used in calculating probabilities associated with specific values in a normal distribution. They allow you to find the probability of a data point falling above, below, or between certain values.

4. Normal Distribution Analysis: In a standard normal distribution (with mean μ = 0 and standard deviation σ = 1), z-scores directly represent the number of standard deviations from the mean. This simplifies normal distribution analysis.
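The formula and the outlier check can be sketched as follows. The data values are made up for illustration, and the cutoff of 2.5 is an assumption chosen because the dataset is tiny: with only 10 points and the population standard deviation, no z-score can exceed 3 in absolute value.

```python
from statistics import mean, pstdev

# Made-up measurements with one suspicious value at the end.
data = [48, 50, 52, 49, 51, 50, 47, 53, 50, 90]

mu = mean(data)
sigma = pstdev(data)  # population standard deviation of the dataset

# Z = (X - mu) / sigma for every data point.
z_scores = [(x - mu) / sigma for x in data]

threshold = 2.5  # illustrative cutoff, not a universal rule
outliers = [x for x, z in zip(data, z_scores) if abs(z) > threshold]

print(outliers)
```

By construction, the z-scores have mean 0 and standard deviation 1, which is what makes values from different scales comparable.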

### 9
- Central Limit Theorem (CLT): The Central Limit Theorem is a fundamental concept in statistics. It states that the sampling distribution of the mean, computed from sufficiently large samples drawn from a population, is approximately normal regardless of the population's underlying distribution, as long as certain conditions are met.

Significance of the Central Limit Theorem:
1. Foundation for Inference: The CLT is a cornerstone of statistical inference. It allows us to make inferences about a population based on the distribution of sample means, even when we don't know the population's distribution.

2. Hypothesis Testing: It enables the use of normal distribution-based tests and confidence intervals for various statistical analyses, including hypothesis testing and parameter estimation.

3. Real-World Applications: In practice, many datasets may not follow a normal distribution, but the CLT allows us to apply the principles of the normal distribution to analyze and make predictions about real-world data.

4. Large Sample Sizes: The CLT is particularly valuable when dealing with large sample sizes because the normal approximation to the distribution of sample means improves as the sample size increases. How quickly it improves depends on how skewed or heavy-tailed the population is.

5. Quality Control: Industries often rely on the CLT to assess the quality of products or processes by analyzing sample data and making inferences about population characteristics.
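A small simulation can make the theorem concrete. This sketch assumes an exponential population with rate 1 (mean 1, standard deviation 1, strongly right-skewed). The sample size, number of samples, and seed are arbitrary illustrative choices.

```python
import random
from math import sqrt
from statistics import mean, stdev

rng = random.Random(42)
n = 50              # observations per sample
num_samples = 5000  # number of repeated samples

# Draw many samples from the skewed population and record each sample mean.
sample_means = [
    sum(rng.expovariate(1.0) for _ in range(n)) / n
    for _ in range(num_samples)
]

mean_of_means = mean(sample_means)  # should be close to the population mean, 1
sd_of_means = stdev(sample_means)   # should be close to 1 / sqrt(n)
standard_error = 1 / sqrt(n)

# For a normal distribution, about 68% of values lie within one
# standard deviation of the centre.
share_within_one_se = sum(
    abs(m - 1.0) <= standard_error for m in sample_means
) / num_samples

print(mean_of_means, sd_of_means, share_within_one_se)
```

Even though individual observations are far from normal, the sample means cluster around the population mean with a spread near σ/√n, as the theorem predicts.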

### 10
The Central Limit Theorem (CLT) relies on certain assumptions to hold true for the distribution of sample means to be approximately normally distributed:

1. Random Sampling: Samples must be selected randomly from the population of interest. This keeps the sample representative of the population and avoids systematic bias in the sample means.

2. Sample Size: The sample size should be sufficiently large. While there is no strict rule, a common guideline is that the sample size should be greater than or equal to 30. In some cases, smaller sample sizes may be acceptable if the population distribution is not heavily skewed.

3. Independence: Observations must be drawn independently. In practical terms, this means that the value of one observation should not influence the value of another.

4. Finite Variance: The population from which the samples are drawn should have a finite variance (finite second moment). If the population variance is infinite or undefined, as with the Cauchy distribution, the classical CLT does not apply.

If these assumptions are met, the CLT allows us to use the properties of the normal distribution for inference and analysis, even when the population distribution is non-normal.
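The finite-variance assumption can be illustrated by comparing sample means from a standard normal population with sample means from a standard Cauchy population, whose variance is undefined. This is a sketch under assumed settings (n = 100, 2000 samples, fixed seed). The helpers `cauchy` and `iqr` are hypothetical, and the interquartile range is used as the spread measure because the standard deviation is not meaningful for Cauchy data.

```python
import math
import random
from statistics import quantiles

rng = random.Random(7)

def cauchy(rng):
    """Draw from the standard Cauchy distribution by inverse-CDF sampling."""
    return math.tan(math.pi * (rng.random() - 0.5))

def iqr(values):
    """Interquartile range: a spread measure that exists even without a variance."""
    q1, _, q3 = quantiles(values, n=4)
    return q3 - q1

n, num_samples = 100, 2000

cauchy_means = [
    sum(cauchy(rng) for _ in range(n)) / n for _ in range(num_samples)
]
normal_means = [
    sum(rng.gauss(0, 1) for _ in range(n)) / n for _ in range(num_samples)
]

iqr_cauchy = iqr(cauchy_means)  # does not shrink as n grows
iqr_normal = iqr(normal_means)  # shrinks roughly in proportion to 1 / sqrt(n)

print(iqr_cauchy, iqr_normal)
```

The mean of n Cauchy draws has the same distribution as a single draw, so averaging does not concentrate the sample means, whereas the normal sample means tighten as expected.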