Q1. What is the Probability density function?

A probability density function (PDF) is a mathematical function that describes the probability distribution of a continuous random variable.

The PDF describes the relative likelihood of the random variable taking values near a given point. It is a non-negative function whose total area is 1, and the area under the curve over an interval gives the probability that the variable falls in that interval. The value f(x) itself is a density rather than a probability, so it can exceed 1.

The PDF is defined as the derivative of the cumulative distribution function (CDF) of the random variable. In other words, if F(x) is the CDF of the random variable X, then the PDF f(x) is defined as:

      f(x) = dF(x)/dx

where d/dx denotes the derivative with respect to x.
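
As a rough numerical check of this relationship, the sketch below uses the standard normal distribution as an assumed example. The helper names `std_normal_cdf` and `std_normal_pdf` are illustrative, and the step size `h` is an arbitrary choice. It compares a finite-difference derivative of the CDF with the PDF, and computes an interval probability as F(b) - F(a).

```python
import math

def std_normal_cdf(x):
    """CDF F(x) of the standard normal distribution, via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def std_normal_pdf(x):
    """PDF f(x) of the standard normal distribution."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

x, h = 0.7, 1e-5
# Central-difference approximation of dF/dx at x
numeric_derivative = (std_normal_cdf(x + h) - std_normal_cdf(x - h)) / (2 * h)
print(numeric_derivative, std_normal_pdf(x))

# The probability of landing in [a, b] is the area under the PDF, i.e. F(b) - F(a)
a, b = -1.0, 1.0
prob_interval = std_normal_cdf(b) - std_normal_cdf(a)
print(prob_interval)
```

The two printed values on the first line should agree to several decimal places, and the interval probability should be close to the familiar 68% for one standard deviation either side of the mean.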

The PDF can be used to calculate various statistical properties of the random variable, such as mean, variance, and moments. It is an essential concept in probability theory and statistics and is used in various fields, including engineering, economics, physics, and finance.


Q2. What are the types of Probability distribution?

There are several types of probability distributions, but the most common ones are:

1. Normal Distribution: A continuous, bell-shaped distribution that is symmetric around its mean. Many natural phenomena are approximately normal, and it is widely used in statistical inference.

2. Binomial Distribution: A discrete distribution that describes the number of successes in a fixed number of independent trials, where each trial has the same probability of success.

3. Poisson Distribution: A discrete distribution that describes the number of events occurring in a fixed interval of time or space, given that the events occur independently and at a constant average rate.

4. Exponential Distribution: A continuous distribution that describes the time between two successive events in a Poisson process.

5. Uniform Distribution: A continuous distribution with constant density on a given interval, so that sub-intervals of equal length are equally likely.

6. Gamma Distribution: A continuous distribution commonly used to model the waiting time until a specified number of events have occurred, given that events occur at a constant rate. The exponential distribution is the special case of waiting for a single event.

7. Beta Distribution: A continuous distribution on the interval [0, 1] that is used to model probabilities or proportions.

These are some of the most common types of probability distributions. There are many other probability distributions, each with its own unique characteristics and applications.
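
To get a feel for some of these distributions, the following sketch draws samples with Python's standard `random` module and compares each sample mean with the theoretical mean. It covers only the continuous distributions from the list, since this sketch sticks to samplers available in every recent Python version. All parameter values, the seed, and the sample size are arbitrary assumptions chosen for illustration.

```python
import random
from statistics import mean

rng = random.Random(0)  # fixed seed so the run is repeatable
N = 20_000              # sample size (arbitrary)

# name -> (sampler, theoretical mean); parameter values are illustrative only
distributions = {
    "normal(mu=0, sigma=1)": (lambda: rng.gauss(0.0, 1.0), 0.0),
    "exponential(rate=2)": (lambda: rng.expovariate(2.0), 1 / 2.0),
    "uniform(0, 10)": (lambda: rng.uniform(0.0, 10.0), 5.0),
    "gamma(shape=3, scale=2)": (lambda: rng.gammavariate(3.0, 2.0), 3.0 * 2.0),
    "beta(2, 5)": (lambda: rng.betavariate(2.0, 5.0), 2.0 / (2.0 + 5.0)),
}

sample_means = {}
for name, (draw, expected) in distributions.items():
    sample_means[name] = mean(draw() for _ in range(N))
    print(f"{name:26s} sample mean = {sample_means[name]:.3f}  theory = {expected:.3f}")
```

Each sample mean should land close to its theoretical value, with the gap shrinking as N grows.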


Q3. Write a Python function to calculate the probability density function of a normal distribution with
given mean and standard deviation at a given point.

Here's an example Python function to calculate the probability density function (PDF) of a normal distribution with a given mean and standard deviation at a given point:

import math

def normal_pdf(x, mu, sigma):
    """
    Calculates the PDF of a normal distribution with the given mean and standard deviation at the given point.
    
    Args:
    x (float): The point at which to calculate the PDF.
    mu (float): The mean of the normal distribution.
    sigma (float): The standard deviation of the normal distribution.
    
    Returns:
    float: The PDF of the normal distribution at the given point.
    """
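    # The standard deviation must be positive for the PDF to be defined
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    coefficient = 1 / (sigma * math.sqrt(2 * math.pi))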
    exponent = -0.5 * ((x - mu) / sigma) ** 2
    pdf = coefficient * math.exp(exponent)
    return pdf

To use this function, simply call it with the desired parameters:
    
mu = 0.0  # mean of the normal distribution
sigma = 1.0  # standard deviation of the normal distribution
x = 1.5  # point at which to calculate the PDF

pdf = normal_pdf(x, mu, sigma)
print(pdf)

This will output the PDF of the normal distribution with mean 0.0 and standard deviation 1.0 at the point x=1.5, which is approximately 0.1295.
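
One way to gain confidence in a hand-written formula is to compare it with an independent implementation. The sketch below repeats the same formula and checks it against `statistics.NormalDist` from the Python standard library (available from Python 3.8). The test points in `checks` are arbitrary assumptions.

```python
import math
from statistics import NormalDist

def normal_pdf(x, mu, sigma):
    """Normal PDF at x, using the same formula as the function above."""
    coefficient = 1 / (sigma * math.sqrt(2 * math.pi))
    exponent = -0.5 * ((x - mu) / sigma) ** 2
    return coefficient * math.exp(exponent)

# Compare against the standard library at a few arbitrary test points
checks = [(1.5, 0.0, 1.0), (0.0, 0.0, 1.0), (3.2, 2.0, 0.5)]
for x, mu, sigma in checks:
    ours = normal_pdf(x, mu, sigma)
    reference = NormalDist(mu, sigma).pdf(x)
    print(f"x={x}, mu={mu}, sigma={sigma}: {ours:.6f} vs {reference:.6f}")
```

The two columns should match at every test point.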

Q4. What are the properties of Binomial distribution? Give two examples of events where binomial
distribution can be applied.

Properties of Binomial Distribution:

1. Fixed number of trials: The number of trials n is fixed in advance, and the distribution describes the number of successes among them.

2. Two possible outcomes: Each trial has only two possible outcomes, either success or failure.

3. Constant probability: The probability of success p is the same for every trial.

4. Independent trials: The outcome of each trial is independent of the outcome of any other trial.

Examples of events where binomial distribution can be applied:

1. Coin tosses: If we toss a coin n times, the number of heads obtained follows a binomial distribution, where success is defined as obtaining a head and failure as obtaining a tail. For a fair coin, p = 0.5.

2. Product defects: A manufacturing company produces a large number of products, and some of them may be defective. If we inspect a random sample of n products and count the defective items, the count can be modeled using a binomial distribution, where success is defined as finding a defective item and failure as finding a non-defective item. The probability of success is the probability that a product is defective, assumed to be the same for every item.
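
Both examples can be turned into numbers with the binomial probability mass function. The sketch below is a minimal illustration using only `math.comb`. The function name `binomial_pmf`, the sample size of 20, and the 3% defect rate are hypothetical values chosen for the example, not figures from the text above.

```python
import math

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

# Coin tosses: exactly 5 heads in 10 tosses of a fair coin
p_five_heads = binomial_pmf(5, 10, 0.5)
print(p_five_heads)

# Product defects: at most 1 defective item in a sample of 20,
# assuming a hypothetical 3% defect rate
p_at_most_one = binomial_pmf(0, 20, 0.03) + binomial_pmf(1, 20, 0.03)
print(p_at_most_one)
```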


Q5. Generate a random sample of size 1000 from a binomial distribution with probability of success 0.4
and plot a histogram of the results using matplotlib.

Here's an example Python code to generate a random sample of size 1000 from a binomial distribution with probability of success 0.4 and plot a histogram of the results using matplotlib:
    
import numpy as np
import matplotlib.pyplot as plt

# Set the parameters of the binomial distribution
n = 1  # number of trials
p = 0.4  # probability of success

# Generate a random sample of size 1000 from the binomial distribution
sample = np.random.binomial(n, p, size=1000)

# Plot a histogram of the sample using matplotlib
plt.hist(sample, bins=range(n+2), align='left', rwidth=0.8)
plt.xlabel('Number of successes')
plt.ylabel('Frequency')
plt.title('Binomial distribution with n={} and p={}'.format(n, p))
plt.show()

In this code, we first set the parameters of the binomial distribution, which are the number of trials (n) and the probability of success (p). We then use the np.random.binomial function from the numpy library to generate a random sample of size 1000 from the binomial distribution with these parameters. Finally, we use the plt.hist function from the matplotlib library to plot a histogram of the sample, where each bar represents the frequency of a certain number of successes.

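Because n = 1 here, each draw is a single trial with outcome 0 or 1, so the histogram has only two bars: roughly 60% of the sample at 0 and 40% at 1. The sample mean should be close to the expected value np = 0.4. Setting n to a larger value, such as 10, produces the more familiar binomial shape centered near np.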
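
The question does not fix the number of trials, so as a complementary sketch the following simulates the same kind of sample with an assumed n = 10, using only the standard library and printing a text histogram instead of a plot. The seed and the scale of one `#` per 10 observations are arbitrary choices.

```python
import random
from collections import Counter

rng = random.Random(42)      # fixed seed for repeatability
n, p, size = 10, 0.4, 1000   # n = 10 is an assumed value for illustration

# One binomial draw = number of successes in n Bernoulli(p) trials
sample = [sum(rng.random() < p for _ in range(n)) for _ in range(size)]

counts = Counter(sample)
for k in range(n + 1):
    # one '#' per 10 observations
    print(f"{k:2d} {'#' * (counts[k] // 10)} ({counts[k]})")

sample_mean = sum(sample) / size
print("sample mean:", sample_mean)
```

With these assumed parameters the bars should peak around 4 successes, which is the expected value np = 10 * 0.4.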


Q6. Write a Python function to calculate the cumulative distribution function of a Poisson distribution
with given mean at a given point.

Here's an example Python function to calculate the cumulative distribution function (CDF) of a Poisson distribution with a given mean at a given point:
    
import math

def poisson_cdf(k, mu):
    """
    Calculates the CDF of a Poisson distribution with the given mean at the given point.
    
    Args:
    k (int): The point at which to calculate the CDF.
    mu (float): The mean of the Poisson distribution.
    
    Returns:
    float: The CDF of the Poisson distribution at the given point.
    """
    cdf = 0.0
    for i in range(k+1):
        cdf += math.exp(-mu) * mu**i / math.factorial(i)
    return cdf

To use this function, simply call it with the desired parameters:


mu = 2.5  # mean of the Poisson distribution
k = 4  # point at which to calculate the CDF

cdf = poisson_cdf(k, mu)
print(cdf)

This will output the CDF of the Poisson distribution with mean 2.5 at the point k=4, which is approximately 0.8912. The CDF is the probability that the random variable takes a value less than or equal to k, so the output is the probability of observing 4 or fewer events when the event count follows a Poisson distribution with mean 2.5.
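
One limitation of summing `mu**i / math.factorial(i)` directly is that `math.factorial(i)` becomes too large to convert to a float once i passes roughly 170, which raises an OverflowError for large k. A possible alternative, sketched below under the hypothetical name `poisson_cdf_recurrence`, builds each term from the previous one instead. It is a variation on the function above, not a replacement prescribed by the question, and it still underflows to 0 for very large mu because it starts from `math.exp(-mu)`.

```python
import math

def poisson_cdf_recurrence(k, mu):
    """P(X <= k) for X ~ Poisson(mu), built from the term recurrence."""
    if k < 0:
        return 0.0
    term = math.exp(-mu)   # P(X = 0)
    cdf = term
    for i in range(1, k + 1):
        term *= mu / i     # P(X = i) = P(X = i - 1) * mu / i
        cdf += term
    return min(cdf, 1.0)   # guard against rounding slightly above 1

print(poisson_cdf_recurrence(4, 2.5))    # same point as the example above
print(poisson_cdf_recurrence(500, 2.5))  # large k, no factorial needed
```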


Q7. How is the Binomial distribution different from the Poisson distribution?

Binomial and Poisson distributions are both discrete probability distributions, but they have some key differences:

1. Number of trials: The binomial distribution describes the number of successes in a fixed number of independent trials, so its values range from 0 to n. The Poisson distribution describes the number of events occurring in a fixed interval of time or space, with no upper bound on the count.

2. Probability of success: In the binomial distribution, the probability of success p is constant for each trial, while in the Poisson distribution, events occur at a constant average rate per unit of time or space.

3. Relationship between mean and variance: In the binomial distribution, the mean is np and the variance is np(1-p), where n is the number of trials and p is the probability of success, so the variance is smaller than the mean whenever p > 0. In the Poisson distribution, the variance is equal to the mean.

4. Assumptions: The binomial distribution assumes a fixed number of trials and a constant probability of success, while the Poisson distribution assumes that events occur independently at a constant average rate. The Poisson distribution also arises as the limit of the binomial distribution when n is large and p is small, with np held fixed.

5. Application: The binomial distribution is commonly used to model the probability of a certain number of successes in a fixed number of trials, while the Poisson distribution is commonly used to model the probability of a certain number of events occurring in a fixed interval of time or space.


In summary, the binomial distribution is used when there is a fixed number of independent trials and the probability of success is constant for each trial, while the Poisson distribution is used to count independent events that occur at a constant average rate over an interval, or to approximate a binomial distribution with many trials and a small probability of success.
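
A well-known link between the two distributions is that a binomial distribution with many trials and a small success probability is close to a Poisson distribution with mean np. The sketch below compares the two probability mass functions for assumed values n = 1000 and p = 0.005. The helper names and the range of k are illustrative choices.

```python
import math

def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n independent trials."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, mu):
    """Probability of exactly k events for a Poisson distribution with mean mu."""
    return math.exp(-mu) * mu**k / math.factorial(k)

n, p = 1000, 0.005   # many trials, small success probability (assumed values)
mu = n * p           # matching Poisson mean

max_diff = 0.0
for k in range(0, 16):
    b, q = binomial_pmf(k, n, p), poisson_pmf(k, mu)
    max_diff = max(max_diff, abs(b - q))
    print(f"k={k:2d}  binomial={b:.5f}  poisson={q:.5f}")
print("largest gap:", max_diff)
```

The two columns should agree closely, and the gap generally narrows further as n grows while np stays fixed.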


Q8. Generate a random sample of size 1000 from a Poisson distribution with mean 5 and calculate the
sample mean and variance.

Here's an example Python code to generate a random sample of size 1000 from a Poisson distribution with mean 5 and calculate the sample mean and variance:
     
import numpy as np

# Set the parameters of the Poisson distribution
mu = 5  # mean

# Generate a random sample of size 1000 from the Poisson distribution
sample = np.random.poisson(mu, size=1000)

# Calculate the sample mean and variance
sample_mean = np.mean(sample)
sample_var = np.var(sample, ddof=1)  # use ddof=1 for unbiased variance

print("Sample mean: {:.2f}".format(sample_mean))
print("Sample variance: {:.2f}".format(sample_var))

In this code, we first set the parameter of the Poisson distribution, which is the mean (mu). We then use the np.random.poisson function from the numpy library to generate a random sample of size 1000 from the Poisson distribution with this mean. Finally, we use the np.mean and np.var functions from the numpy library to calculate the sample mean and variance, respectively.

The output of this code will be the sample mean and variance of the generated sample. Both should be close to 5, since the population mean and variance of a Poisson distribution with mean 5 are both equal to 5.


Q9. How are mean and variance related in the Binomial distribution and the Poisson distribution?

In the Binomial distribution, the mean (μ) and variance (σ^2) are related by the formula:

      σ^2 = μ(1 - p)

where μ = np is the mean of the distribution, n is the number of trials, and p is the probability of success in each trial. Because 1 - p is at most 1, the variance never exceeds the mean. For a fixed mean μ, the variance increases as p decreases; for a fixed number of trials n, the variance np(1 - p) reaches its maximum at p = 0.5.

In the Poisson distribution, the mean (μ) and variance (σ^2) are equal:

      σ^2 = μ

This means that the mean alone determines the spread of the distribution. As the mean increases, the variance increases by the same amount.

In summary, in the Binomial distribution, the variance depends on both the mean and the probability of success, while in the Poisson distribution, the variance depends only on the mean. In both cases, the variance is an important measure of the spread or dispersion of the distribution, and it provides information about how much the random variable deviates from its mean value.
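
Both relationships can be checked numerically by computing the mean and variance directly from each probability mass function. The following is a minimal sketch. The helper `mean_and_variance`, the parameters n = 10, p = 0.4, and lam = 5.0, and the truncation point for the Poisson sum are all assumptions made for illustration.

```python
import math

def mean_and_variance(pmf, support):
    """Mean and variance of a discrete distribution from its PMF."""
    mean = sum(k * pmf(k) for k in support)
    var = sum((k - mean) ** 2 * pmf(k) for k in support)
    return mean, var

n, p = 10, 0.4   # illustrative binomial parameters
lam = 5.0        # illustrative Poisson mean

binom_mean, binom_var = mean_and_variance(
    lambda k: math.comb(n, k) * p**k * (1 - p) ** (n - k), range(n + 1)
)
# The Poisson support is infinite; truncating at 100 leaves a negligible tail for lam = 5
pois_mean, pois_var = mean_and_variance(
    lambda k: math.exp(-lam) * lam**k / math.factorial(k), range(101)
)

print(binom_mean, binom_var, binom_mean * (1 - p))  # variance should match mean * (1 - p)
print(pois_mean, pois_var)                          # variance should match the mean
```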


Q10. In a normal distribution, where does the least frequent data appear relative to the mean?

In a normal distribution, the least frequent data appears in the tails of the distribution, which are the regions furthest from the mean. The probability density function is highest at the mean and decreases symmetrically as we move away from it in either direction, so values far from the mean are the least likely to be observed.

For example, in a standard normal distribution (with a mean of 0 and a standard deviation of 1), about 95% of values fall within 2 standard deviations of the mean and about 99.7% within 3, so values in the tails beyond these points are rare. Such extreme values are sometimes treated as outliers or anomalies, and they may be of special interest in certain applications or analyses.
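
To put rough numbers on how rare the tails are, the sketch below uses `statistics.NormalDist` from the standard library to compute the share of a standard normal distribution lying more than 1, 2, and 3 standard deviations from the mean. The dictionary name `tail_mass` is an illustrative choice.

```python
from statistics import NormalDist

z = NormalDist(mu=0.0, sigma=1.0)  # standard normal

tail_mass = {}
for k in (1, 2, 3):
    # P(|Z| > k): combined mass in both tails beyond k standard deviations
    tail_mass[k] = 2 * (1 - z.cdf(k))
    print(f"beyond {k} sd: {tail_mass[k]:.4f}")
```

The printed shares drop quickly, from roughly a third of the data beyond 1 standard deviation to well under 1% beyond 3.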
    