#### Q1: What are the Probability Mass Function (PMF) and Probability Density Function (PDF)? Explain with an example.

The PMF is used to describe the probability distribution of a discrete random variable. A discrete random variable is a variable that can take on only a finite or countably infinite set of distinct values. The PMF assigns a probability to each possible value of the random variable, and the sum of all probabilities equals 1.
Suppose we have a random variable X that represents the outcome of a fair six-sided die roll. X can take on the values 1, 2, 3, 4, 5, and 6, each with probability 1/6. This is an example of a discrete random variable.

The PMF of X is given by:

P(X = 1) = 1/6
P(X = 2) = 1/6
P(X = 3) = 1/6
P(X = 4) = 1/6
P(X = 5) = 1/6
P(X = 6) = 1/6

In [None]:
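# Empirical PMF: relative frequency of each value in an observed sample
def pmf(X, x_values):
    """
    Estimate the Probability Mass Function (PMF) of a discrete random variable
    from a sample X (a list of observed outcomes), evaluated at each value in x_values.
    """
    n = len(X)
    # The share of observations equal to x approximates P(X = x)
    return [X.count(x) / n for x in x_values]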

Note that the six probabilities of the die sum to 1, as required of any PMF.
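As a minimal sketch, the die example can be checked with exact fractions. The name `die_pmf` is hypothetical and chosen only for illustration.

```python
from fractions import Fraction

# Exact PMF of a fair six-sided die: each face has probability 1/6.
die_pmf = {face: Fraction(1, 6) for face in range(1, 7)}

# A valid PMF is non-negative and sums to exactly 1.
total = sum(die_pmf.values())
print(total)  # 1

# The probability of an event is the sum of the PMF over its outcomes,
# e.g. P(X is even) = P(2) + P(4) + P(6).
p_even = sum(p for face, p in die_pmf.items() if face % 2 == 0)
print(p_even)  # 1/2
```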

Now suppose we have a random variable Y that represents the height of a randomly selected person in a certain population. Y can take on any value within a certain range, say from 4 feet to 7 feet. This is an example of a continuous random variable.

The PDF of Y can be any function that satisfies the following conditions:

The PDF is always non-negative.
The area under the PDF curve within the range of values of Y is equal to 1.

One example of a PDF for Y is the normal distribution with mean 5.5 feet and standard deviation 0.5 feet:

f(y) = (1 / (0.5 * sqrt(2*pi))) * exp(-((y - 5.5)**2) / (2 * (0.5**2)))

In [3]:
import math

def pdf(x, mu, sigma):
    """
    Calculate the Probability Density Function (PDF) of a normal distribution with mean mu and standard deviation sigma
    at a given point x.
    """
    return (1 / (sigma * math.sqrt(2*math.pi))) * math.exp(-((x - mu)**2) / (2 * (sigma**2)))

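This PDF assigns a higher probability density to heights close to the mean of 5.5 feet and a lower probability density to heights further away from the mean. A density value is not itself a probability: for a continuous variable, probabilities come from the area under the curve over an interval.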
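The two PDF conditions can be checked numerically. The sketch below assumes the same mean and standard deviation as the example and uses a simple trapezoidal rule; `normal_pdf` and `trapezoid_area` are hypothetical helper names.

```python
import math

def normal_pdf(y, mu=5.5, sigma=0.5):
    """Normal density with the mean and standard deviation used in the text."""
    return math.exp(-((y - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

def trapezoid_area(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the trapezoidal rule."""
    h = (b - a) / steps
    inner = sum(f(a + i * h) for i in range(1, steps))
    return h * (0.5 * f(a) + inner + 0.5 * f(b))

# The density is the height of the curve, and it peaks at the mean.
print(normal_pdf(5.5) > normal_pdf(6.5))  # True

# Probability is area. Between 4 ft and 7 ft (3 standard deviations either
# side of the mean) the area is close to, but slightly below, 1.
area = trapezoid_area(normal_pdf, 4.0, 7.0)
print(round(area, 4))  # 0.9973
```

The area falls just short of 1 because a normal distribution places a small amount of probability outside any finite range.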

In summary, the PMF and PDF are probability distribution functions used to describe the probability distribution of discrete and continuous random variables, respectively.

#### Q2: What is the Cumulative Distribution Function (CDF)? Explain with an example. Why is the CDF used?

The Cumulative Distribution Function (CDF), sometimes loosely called the cumulative density function, describes the probability distribution of a random variable. The CDF gives the probability that the random variable takes a value less than or equal to a given value.

For a discrete random variable, the CDF is defined as the sum of the probabilities of all values less than or equal to the given value. For a continuous random variable, the CDF is defined as the integral of the PDF from negative infinity up to the given value.

The CDF is useful in many statistical applications, such as hypothesis testing and confidence interval estimation. It allows us to calculate probabilities for a range of values of a random variable, and to compare different probability distributions.

Here's an example to illustrate the CDF:

Suppose we have a random variable X that represents the number of heads obtained in two fair coin flips. X can take on the values 0, 1, or 2, with probabilities 1/4, 1/2, and 1/4 respectively (one head can occur in two ways: HT or TH). This is an example of a discrete random variable.

The CDF of X is given by:


>F(x) = P(X <= x)

 >For x < 0: F(x) = 0
 
 >For 0 <= x < 1: F(x) = 1/4
 
 >For 1 <= x < 2: F(x) = 3/4
 
 >For x >= 2: F(x) = 1

The CDF is a step function that increases in discrete jumps at the possible values of the random variable. For example, F(0) = 1/4, F(1) = 3/4, and F(2) = 1.
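The step values above can be reproduced by enumerating the four equally likely outcomes of two fair flips. This is a small sketch; the names `pmf` and `cdf` are chosen here for illustration.

```python
from fractions import Fraction
from itertools import product

# Enumerate the four equally likely outcomes of two fair coin flips.
outcomes = list(product("HT", repeat=2))

# PMF of X = number of heads.
pmf = {k: Fraction(0) for k in range(3)}
for outcome in outcomes:
    pmf[outcome.count("H")] += Fraction(1, len(outcomes))

def cdf(x):
    """F(x) = P(X <= x): sum the PMF over all values not exceeding x."""
    return sum(p for k, p in pmf.items() if k <= x)

print([str(pmf[k]) for k in range(3)])             # ['1/4', '1/2', '1/4']
print([str(cdf(x)) for x in (-1, 0, 0.5, 1, 2)])   # ['0', '1/4', '1/4', '3/4', '1']
```

The CDF stays flat between the possible values (for example at x = 0.5) and jumps by the PMF value at each of 0, 1, and 2.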

We can use the CDF to calculate probabilities for a range of values of X. For example, the probability of getting at most one head in two coin flips is:


P(X <= 1) = F(1) = 3/4

In summary, the CDF is a function used to describe the probability distribution of a random variable. It gives the probability that the value of the random variable is less than or equal to a certain value. The CDF is useful in many statistical applications, and allows us to calculate probabilities for a range of values of a random variable.

#### Q3: What are some examples of situations where the normal distribution might be used as a model? Explain how the parameters of the normal distribution relate to the shape of the distribution.

The normal distribution is one of the most widely used probability distributions in statistics and has a wide range of applications. Here are some examples of situations where the normal distribution might be used as a model:

- Height and weight measurements: In many populations, the distribution of heights and weights follows a normal distribution.

- IQ scores: Intelligence quotient (IQ) scores are often modeled using a normal distribution.

- Errors in measurement: In many scientific experiments, measurements contain random error that is normally distributed.

- Financial markets: The daily returns of stocks and other financial assets are often assumed to follow a normal distribution, although observed returns typically show heavier tails than the normal model predicts.

The normal distribution is characterized by two parameters: the mean (μ) and the standard deviation (σ). The mean represents the center of the distribution, while the standard deviation represents the spread of the distribution. The shape of the normal distribution is symmetric and bell-shaped, with the peak of the curve at the mean.

The standard deviation determines the width of the bell curve. A smaller standard deviation produces a narrower bell curve, while a larger standard deviation produces a wider bell curve. The mean determines the center of the distribution, so if the mean is shifted to the right, the entire distribution will shift to the right. Conversely, if the mean is shifted to the left, the entire distribution will shift to the left.
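A short sketch of both effects, using a hypothetical helper `normal_pdf`: the peak height equals 1 / (σ√(2π)), so a smaller σ gives a taller, narrower curve, while changing μ only slides the curve along the axis.

```python
import math

def normal_pdf(x, mu, sigma):
    """Density of a normal distribution with mean mu and standard deviation sigma."""
    return math.exp(-((x - mu) ** 2) / (2 * sigma ** 2)) / (sigma * math.sqrt(2 * math.pi))

# The peak sits at x = mu and has height 1 / (sigma * sqrt(2*pi)),
# so halving sigma doubles the peak height.
for sigma in (0.5, 1.0, 2.0):
    peak = normal_pdf(0.0, mu=0.0, sigma=sigma)
    print(f"sigma={sigma}: peak height={peak:.4f}")

# Changing mu only shifts the curve: the density one unit above the mean
# is the same whether the mean is 0 or 3.
print(math.isclose(normal_pdf(1.0, mu=0.0, sigma=1.0),
                   normal_pdf(4.0, mu=3.0, sigma=1.0)))  # True
```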

#### Q4: Explain the importance of Normal Distribution. Give a few real-life examples of Normal Distribution.

The normal distribution is an important statistical concept with wide-ranging applications in various fields. Some key reasons why normal distribution is important are:

- Common occurrence: Many natural and human-made phenomena follow a normal distribution, which makes it a useful tool for modeling and understanding the world around us.

- Central Limit Theorem: The normal distribution is central to the Central Limit Theorem, which states that the sum or average of a large number of independent and identically distributed random variables with finite variance is approximately normally distributed, even if the original variables themselves are not. This theorem underpins many statistical analyses and inference procedures.

- Statistical inference: The normal distribution is widely used in statistical inference, such as hypothesis testing and confidence interval estimation. This is because many test statistics have normal distributions under certain assumptions, which allows us to make probabilistic statements about the population parameters of interest.

Some real-life examples of normal distribution are:

>Heights of people: The distribution of heights in many populations follows a normal distribution, with the mean height being around the center of the distribution.

>IQ scores: Intelligence quotient (IQ) scores are often modeled using a normal distribution, with the mean being set at 100 and the standard deviation at 15.

>Exam scores: In many educational settings, exam scores tend to follow a normal distribution, with the mean and standard deviation reflecting the difficulty of the exam.

>Financial markets: The daily returns of stocks and other financial assets are often assumed to follow a normal distribution, with the mean reflecting the average rate of return and the standard deviation reflecting the volatility of the asset.

#### Q5: What is the Bernoulli Distribution? Give an example. What is the difference between the Bernoulli Distribution and the Binomial Distribution?

The Bernoulli distribution is a discrete probability distribution that models the outcome of a single binary experiment, where there are only two possible outcomes: success (1) or failure (0). The Bernoulli distribution is named after Swiss mathematician Jacob Bernoulli, who introduced the concept in the early 18th century.

An example of the Bernoulli distribution is flipping a fair coin, where the outcome is either heads (success) or tails (failure). Another example is rolling a die and considering a success to be rolling a six.

The Bernoulli distribution is a special case of the binomial distribution, which models the number of successes in a fixed number of independent Bernoulli trials. The key difference between the Bernoulli and binomial distributions is that the Bernoulli distribution models a single trial, while the binomial distribution models multiple trials.

In the Bernoulli distribution, there is only one parameter, which is the probability of success (denoted by p). In the binomial distribution, there are two parameters: the number of trials (n) and the probability of success (p).

For example, suppose we flip a fair coin 10 times and count the number of heads. The number of heads is a random variable that follows a binomial distribution with parameters n=10 and p=0.5. The probability of getting exactly k heads (where k can be any integer from 0 to 10) is given by the binomial probability mass function.
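The binomial probability mass function is P(X = k) = C(n, k) p^k (1 - p)^(n - k). A minimal sketch for the coin example follows; `binomial_pmf` is a hypothetical helper name.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p): C(n, k) * p^k * (1 - p)^(n - k)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

n, p = 10, 0.5
probs = [binomial_pmf(k, n, p) for k in range(n + 1)]

print(binomial_pmf(5, n, p))   # 0.24609375 (= 252/1024)
print(sum(probs))              # 1.0

# With n = 1 the binomial reduces to the Bernoulli distribution.
print(binomial_pmf(1, 1, p), binomial_pmf(0, 1, p))  # 0.5 0.5
```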

In summary, the Bernoulli distribution models the outcome of a single binary experiment, while the binomial distribution models the number of successes in a fixed number of independent Bernoulli trials. The Bernoulli distribution is a special case of the binomial distribution, with n=1.







#### Q6: Consider a dataset with a mean of 50 and a standard deviation of 10. If we assume that the dataset is normally distributed, what is the probability that a randomly selected observation will be greater than 60? Use the appropriate formula and show your calculations.

To find the probability that a randomly selected observation will be greater than 60 from a normally distributed dataset with mean μ = 50 and standard deviation σ = 10, we need to standardize the value of 60 using the z-score formula:

z = (x - μ) / σ

where x is the observed value, μ is the mean, and σ is the standard deviation.

Substituting the values, we get:

z = (60 - 50) / 10 = 1

Now, we need to find the area under the standard normal distribution curve to the right of z = 1, which represents the probability of getting a value greater than 60. We can use a z-table or a calculator to find this area, which is approximately 0.1587.

Therefore, the probability of a randomly selected observation being greater than 60 is 0.1587 or 15.87%.
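The table lookup can be cross-checked in code. This sketch uses `statistics.NormalDist` from the Python standard library (available from Python 3.8).

```python
from statistics import NormalDist

dist = NormalDist(mu=50, sigma=10)

# Standardize the value 60.
z = (60 - dist.mean) / dist.stdev

# P(X > 60) is the complement of the CDF at 60.
p_greater = 1 - dist.cdf(60)

print(z)                    # 1.0
print(round(p_greater, 4))  # 0.1587
```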


#### Q7: Explain the Uniform Distribution with an example.

The uniform distribution is a continuous probability distribution that models a situation where any value within a given range is equally likely to occur. In other words, the probability density function of the uniform distribution is a constant over a specified range.

A closely related case is rolling a fair die, where each of the six outcomes (1, 2, 3, 4, 5, or 6) is equally likely to occur. Because the outcomes are countable, this is a discrete uniform distribution, described by a PMF that assigns probability 1/6 to each outcome rather than by a PDF.

Another example is choosing a random number between 0 and 1. Since any value between 0 and 1 is equally likely to be chosen, the probability density function of this distribution is a horizontal line with a height of 1 over the range [0, 1].
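For a continuous uniform distribution on [a, b], the density is 1 / (b - a) and the probability of an interval is its length times that density. A minimal sketch, with hypothetical helper names `uniform_pdf` and `uniform_cdf`:

```python
def uniform_pdf(x, a=0.0, b=1.0):
    """Density of Uniform(a, b): constant 1 / (b - a) on [a, b], zero elsewhere."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def uniform_cdf(x, a=0.0, b=1.0):
    """P(X <= x) for Uniform(a, b): rises linearly from 0 at a to 1 at b."""
    if x <= a:
        return 0.0
    if x >= b:
        return 1.0
    return (x - a) / (b - a)

print(uniform_pdf(0.3))  # 1.0

# P(0.2 <= X <= 0.5) is the interval length times the constant density.
p = uniform_cdf(0.5) - uniform_cdf(0.2)
print(round(p, 10))  # 0.3

# A wider range gives a lower density: Uniform(0, 4) has height 0.25.
print(uniform_pdf(1.0, a=0.0, b=4.0))  # 0.25
```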

In general, the uniform distribution is useful for modeling situations where there is no preference or bias for any particular value within a given range. It is often used in simulations and modeling, as well as in statistical inference and hypothesis testing.





#### Q8: What is the z score? State the importance of the z score.

The z-score, also known as the standard score, is a dimensionless quantity that represents the number of standard deviations an observation or data point is from the mean of a distribution. The z-score is calculated as:

z = (x - μ) / σ

where x is the observed value, μ is the mean of the distribution, and σ is the standard deviation of the distribution.

The z-score allows us to standardize observations from different normal distributions, making it possible to compare them on a common scale. The z-score can be positive, negative, or zero, depending on whether the observed value is above, below, or equal to the mean of the distribution.

The importance of the z-score lies in its usefulness in statistical analysis, particularly in hypothesis testing and confidence intervals. By converting observations to z-scores, we can calculate the probability of observing a value as extreme or more extreme than the observed value, assuming a normal distribution. This probability can be used to make inferences about the population from which the sample was drawn.

The z-score is also used in quality control to identify outliers, which are observations that are more than a certain number of standard deviations away from the mean. Outliers can indicate problems in the data collection process or suggest the presence of unusual or unexpected phenomena.
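A small sketch of z-score based outlier screening. The data are made up for illustration, and the cutoff of 3 standard deviations is a common convention assumed here, not something fixed by the definition.

```python
from statistics import mean, pstdev

# Hypothetical measurements; the last value is deliberately unusual.
data = [48, 52, 50, 47, 53, 49, 51, 50, 46, 54, 50, 90]

mu = mean(data)
sigma = pstdev(data)  # standard deviation of the data, treated as the whole population

# z = (x - mu) / sigma for every observation.
z_scores = [(x - mu) / sigma for x in data]

# Flag observations more than 3 standard deviations from the mean.
outliers = [x for x, z in zip(data, z_scores) if abs(z) > 3]
print(outliers)  # [90]
```

Note that an extreme value inflates both the mean and the standard deviation it is measured against, so this kind of screening works best when outliers are rare.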

#### Q9: What is Central Limit Theorem? State the significance of the Central Limit Theorem.

The Central Limit Theorem (CLT) is a fundamental theorem in statistics that describes the behavior of the sample mean as the sample size increases. The CLT states that, under certain conditions, the distribution of sample means approximates a normal distribution, regardless of the shape of the population distribution.

In more formal terms, the CLT states that as the sample size n increases, the distribution of the sample mean X̄ approaches a normal distribution with mean μ and standard deviation σ/√n, where μ and σ are the mean and standard deviation of the population, respectively.
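The statement can be illustrated with a small simulation. This is a sketch under assumed settings (an exponential population with rate 1, samples of size 50, and 5000 repetitions), chosen only for illustration.

```python
import random
import statistics

random.seed(0)  # fixed seed so the run is repeatable

# Population: exponential with rate 1, which is strongly right-skewed.
# Its mean and standard deviation are both 1.
mu, sigma = 1.0, 1.0
n = 50              # size of each sample
num_samples = 5000  # number of sample means to collect

sample_means = [
    statistics.fmean(random.expovariate(1.0) for _ in range(n))
    for _ in range(num_samples)
]

# The CLT predicts the sample means centre on mu with spread sigma / sqrt(n).
print("mean of sample means:", statistics.fmean(sample_means))
print("std of sample means: ", statistics.stdev(sample_means))
print("sigma / sqrt(n):     ", sigma / n ** 0.5)
```

Even though individual exponential draws are far from bell-shaped, the first two printed values should land close to μ and σ/√n respectively.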

The significance of the CLT lies in its usefulness in statistical inference. It allows us to make inferences about a population from a sample, even if the population distribution is unknown or non-normal. Because the sample mean is approximately normal with standard deviation σ/√n, we can attach a margin of error to an estimate of the population mean, for example by building a confidence interval around X̄ or testing a hypothesis about μ.

The CLT also has practical applications in many areas of science, engineering, and finance. For example, it is commonly used in quality control to monitor the quality of a product by taking multiple samples and analyzing the means, in finance to model the behavior of stock prices and returns, and in physics to model the behavior of particles in a gas or liquid.

#### Q10: State the assumptions of the Central Limit Theorem.

The Central Limit Theorem (CLT) is based on certain assumptions that must be satisfied for the theorem to hold true. The assumptions of the CLT are:

>Independence: The observations in the sample must be independent of each other.

>Sample size: The sample size n must be sufficiently large. The general rule of thumb is that the sample size should be greater than or equal to 30. However, if the population is highly skewed or has heavy tails, a larger sample size may be required.

>Identical distribution: The observations in the sample must be drawn from the same population distribution.

>Finite variance: The population from which the sample is drawn must have a finite variance.

>Randomness: The sample must be selected at random from the population.

If these assumptions are met, then the distribution of the sample mean approaches a normal distribution as the sample size increases. The CLT is a powerful tool that allows us to make statistical inferences about a population, but it is important to ensure that the assumptions are satisfied before relying on the theorem.
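To see why the finite-variance assumption matters, the sketch below averages draws from the standard Cauchy distribution, which has no finite mean or variance. The Cauchy is an illustrative choice made here, and the helper names are hypothetical.

```python
import math
import random
import statistics

random.seed(1)  # fixed seed so the run is repeatable

def cauchy_draw():
    """One draw from the standard Cauchy distribution via the inverse CDF."""
    return math.tan(math.pi * (random.random() - 0.5))

def iqr_of_sample_means(n, num_samples=2000):
    """Interquartile range of num_samples sample means, each over n draws."""
    means = [statistics.fmean(cauchy_draw() for _ in range(n))
             for _ in range(num_samples)]
    q1, _, q3 = statistics.quantiles(means, n=4)
    return q3 - q1

# For a finite-variance population the spread would shrink like 1 / sqrt(n).
# For the Cauchy it stays near 2 (the IQR of a single draw) however large n is.
for n in (1, 10, 200):
    print(n, round(iqr_of_sample_means(n), 2))
```

The spread of the sample means does not narrow as n grows, so averaging more observations brings no gain in precision and the normal approximation never takes hold.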