Q1. What is the Probability density function?

Answer = The probability density function (PDF) of a continuous random variable describes how probability is spread over the variable's possible values. The probability that the variable falls within a range is the area under the PDF over that range, so the PDF is never negative and its total area is 1. The value of the PDF at a single point is a density, not a probability: it indicates relative likelihood and can be greater than 1.
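
To make the "area under the curve" idea concrete, here is a minimal sketch that approximates the probability of a standard normal variable landing between -1 and 1 by numerically integrating its PDF. The helper names `standard_normal_pdf` and `probability_between` are illustrative choices, and the midpoint rule is just one simple way to approximate the integral.

```python
import math

def standard_normal_pdf(x):
    """Density of the standard normal distribution (mean 0, std dev 1) at x."""
    return math.exp(-0.5 * x * x) / math.sqrt(2 * math.pi)

def probability_between(pdf, a, b, steps=10_000):
    """Approximate P(a <= X <= b) as the area under pdf, using the midpoint rule."""
    width = (b - a) / steps
    return sum(pdf(a + (i + 0.5) * width) for i in range(steps)) * width

# The density at a point is not a probability; the area over a range is.
density_at_zero = standard_normal_pdf(0.0)
prob_within_one_sd = probability_between(standard_normal_pdf, -1.0, 1.0)

print(f"Density at x=0: {density_at_zero:.4f}")
print(f"P(-1 <= X <= 1) is approximately {prob_within_one_sd:.4f}")
```

The second number is the familiar "about 68% within one standard deviation" figure, obtained here purely from the PDF.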

Q2. What are the types of Probability distribution?
Answer = Here are some of the most common types of probability distributions:

Binomial distribution: The binomial distribution is used to describe the number of successes in a fixed number of independent trials. For example, the number of heads you get when you flip a coin 10 times is a binomial random variable.

Normal distribution: The normal distribution, also known as the Gaussian distribution, is a bell-shaped curve that is used to describe many natural phenomena. For example, the heights of adults in a population are approximately normally distributed.

Poisson distribution: The Poisson distribution is used to describe the number of events that occur in a fixed interval of time or space. For example, the number of traffic accidents that occur on a given highway in a year is a Poisson random variable.

Uniform distribution: The uniform distribution is used to describe random variables that are equally likely to take on any value within a certain range. For example, the outcome of rolling a fair six-sided die follows a discrete uniform distribution, and a random number drawn between 0 and 1 by a random number generator follows a continuous uniform distribution.



Q3. Write a Python function to calculate the probability density function of a normal distribution with
given mean and standard deviation at a given point.



In [1]:
import math

def normal_pdf(x, mean, std_dev):
    """
    Calculate the probability density function (PDF) of a normal distribution at a given point.

    Parameters:
    - x: The point at which to evaluate the PDF.
    - mean: The mean of the normal distribution.
    - std_dev: The standard deviation of the normal distribution.

    Returns:
    The PDF value at the given point.
    """
    coefficient = 1 / (math.sqrt(2 * math.pi) * std_dev)
    exponent = -((x - mean) ** 2) / (2 * (std_dev ** 2))
    pdf_value = coefficient * math.exp(exponent)
    return pdf_value

# Example usage:
mean_value = 0
std_dev_value = 1
point = 1.5

pdf_at_point = normal_pdf(point, mean_value, std_dev_value)
print(f"The PDF at x={point} is: {pdf_at_point}")


The PDF at x=1.5 is: 0.12951759566589174


Q4. What are the properties of Binomial distribution? Give two examples of events where binomial
distribution can be applied.

Answer = The binomial distribution is a discrete probability distribution that describes the number of successes in a sequence of independent trials, where each trial has only two possible outcomes, such as success or failure. Here are some of the properties of the binomial distribution:

Finite number of trials: The binomial distribution applies to a fixed number of trials, denoted by n. Each trial has two possible outcomes, often labeled as success (S) or failure (F).

Independence of trials: The trials are assumed to be independent, meaning that the outcome of one trial does not affect the outcome of any other trial.

Constant probability of success: The probability of success (p) remains constant for each trial. This implies that the probability of failure (q) is 1 - p.

Discrete outcomes: The outcomes are discrete, meaning that only a finite or countably infinite number of values are possible. In the case of the binomial distribution, the possible outcomes are the number of successes (X) ranging from 0 to n.

Here are two examples of events where the binomial distribution can be applied:

Flipping a coin: Flipping a coin is a classic example of a binomial experiment. Each flip has two possible outcomes: heads (success) or tails (failure). The probability of success (heads) is p, and the probability of failure (tails) is q = 1 - p. The binomial distribution can be used to calculate the probability of getting a specific number of heads in a given number of flips.

Product quality control: In product quality control, a random sample of products is inspected to determine the number of defective items. Each item can be categorized as either defective (success) or non-defective (failure). The probability of a defective item (p) is the proportion of defective products in the entire batch. Provided the batch is large relative to the sample, so that the draws are approximately independent, the binomial distribution gives the probability of finding a certain number of defective items in the sample.
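
Both examples can be worked through with the binomial probability mass function. The sketch below is illustrative only: `binomial_pmf` is a helper name chosen here, and the quality-control numbers (a sample of 20 items and a 5% defect rate) are assumed values, not taken from any real batch.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p): k successes in n independent trials."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

# Coin example: probability of exactly 5 heads in 10 flips of a fair coin.
prob_five_heads = binomial_pmf(5, 10, 0.5)

# Quality-control example (assumed numbers): probability of finding
# at most 1 defective item in a sample of 20 when 5% of items are defective.
prob_at_most_one_defect = sum(binomial_pmf(k, 20, 0.05) for k in range(2))

print(f"P(exactly 5 heads in 10 flips) = {prob_five_heads:.4f}")
print(f"P(at most 1 defect in 20 items) = {prob_at_most_one_defect:.4f}")
```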

Q5. Generate a random sample of size 1000 from a binomial distribution with probability of success 0.4
and plot a histogram of the results using matplotlib.

In [None]:
import numpy as np
import matplotlib.pyplot as plt

# Set the parameters for the binomial distribution
n = 10  # Number of trials (not specified in the question; 10 is an assumed value)
p = 0.4  # Probability of success

# Generate a random sample of size 1000 from a binomial distribution
random_sample = np.random.binomial(n, p, size=1000)

# Plot the histogram
plt.hist(random_sample, bins=np.arange(0, n+2)-0.5, edgecolor='black')
plt.title('Binomial Distribution - Random Sample')
plt.xlabel('Number of Successes')
plt.ylabel('Frequency')
plt.grid(axis='y', linestyle='--', alpha=0.7)
plt.show()


Q6. Write a Python function to calculate the cumulative distribution function of a Poisson distribution
with given mean at a given point.

In [None]:
import math

def poisson_cdf(k, mean):
    """
    Calculate the cumulative distribution function (CDF) of a Poisson distribution at a given point.

    Parameters:
    - k: The point at which to evaluate the CDF.
    - mean: The mean of the Poisson distribution.

    Returns:
    The CDF value at the given point.
    """
    cdf_value = 0
    for i in range(k + 1):
        cdf_value += math.exp(-mean) * (mean ** i) / math.factorial(i)
    return cdf_value

# Example usage:
mean_value = 2
point = 3

cdf_at_point = poisson_cdf(point, mean_value)
print(f"The CDF at k={point} is: {cdf_at_point}")


Q7. How is the Binomial distribution different from the Poisson distribution?


Answer = The Binomial distribution and the Poisson distribution are both discrete probability distributions used to model counts. The Binomial counts successes in a fixed number of trials, while the Poisson counts events in a fixed interval of time or space. The key differences between the two are:

Nature of the Process:

Binomial Distribution: It is used to model the number of successes in a fixed number of independent and identical Bernoulli trials, where each trial has only two possible outcomes (success or failure).
Poisson Distribution: It is used to model the number of events that occur in a fixed interval of time or space, where events occur independently of one another at a constant average rate.

Number of Trials:

Binomial Distribution: The number of trials (n) is fixed and known in advance.
Poisson Distribution: There is no fixed number of trials. It is used when the average rate of events per unit of time or space (λ) is known and we are interested in the number of events occurring in a specific interval.

Probability of Success:

Binomial Distribution: The probability of success (p) is constant for each trial.
Poisson Distribution: There is no per-trial probability of success; the rate λ alone determines the distribution. The Poisson distribution arises as the limit of the Binomial distribution when the number of trials n becomes large and p becomes small while the product n⋅p = λ remains constant.

Distribution Function:

Binomial Distribution: The probability mass function (PMF) is P(X = k) = C(n, k) p^k (1 - p)^(n - k), where C(n, k) is the binomial coefficient, defined for k = 0, 1, ..., n.
Poisson Distribution: The probability mass function is P(X = k) = e^(-λ) λ^k / k!, defined for k = 0, 1, 2, ... with no upper limit.

Conditions for Use:

Binomial Distribution: It is suitable when there is a fixed number of trials, each with a constant probability of success, and the trials are independent.
Poisson Distribution: It is suitable when events occur independently at a known average rate and there is no natural upper limit on the count. It also works as an approximation to the Binomial distribution when n is large and p is small.
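
The link between the two distributions can be checked numerically. The sketch below compares the two PMFs for a large number of trials and a small success probability; the values 1000 and 0.005 are assumed purely for illustration, and the helper names are not from any library.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)

# Assumed illustrative values: many trials, small probability of success.
trials = 1000
success_prob = 0.005
rate = trials * success_prob  # lambda = n * p

for k in (0, 2, 5, 8):
    b = binomial_pmf(k, trials, success_prob)
    q = poisson_pmf(k, rate)
    print(f"k={k}: binomial={b:.5f}, poisson={q:.5f}")

# Largest pointwise difference between the two PMFs over k = 0..20.
largest_gap = max(
    abs(binomial_pmf(k, trials, success_prob) - poisson_pmf(k, rate))
    for k in range(21)
)
print(f"Largest gap: {largest_gap:.6f}")
```

With these values the two PMFs agree closely at every k, which is why the Poisson distribution is often used in place of the Binomial for rare events.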

Q8. Generate a random sample of size 1000 from a Poisson distribution with mean 5 and calculate the
sample mean and variance.

In [None]:
import numpy as np

# Set the parameters for the Poisson distribution
mean_value = 5

# Generate a random sample of size 1000 from a Poisson distribution
random_sample = np.random.poisson(mean_value, size=1000)

# Calculate the sample mean and variance
sample_mean = np.mean(random_sample)
# ddof=1 gives the unbiased sample variance (divides by n - 1 instead of n)
sample_variance = np.var(random_sample, ddof=1)

# Print the results
print(f"Sample Mean: {sample_mean}")
print(f"Sample Variance: {sample_variance}")


Q9. How are the mean and variance related in the Binomial distribution and the Poisson distribution?

Answer = In both the Binomial and Poisson distributions, the mean and variance are closely related.

Binomial Distribution

In the Binomial distribution, the mean (µ) and variance (σ²) are given by the following formulas:

µ = np

σ² = npq

where:

n is the number of trials
p is the probability of success
q is the probability of failure (q = 1 - p)

Combining the two formulas gives σ² = µq = µ(1 - p). For a fixed p, the variance is therefore proportional to the mean, so increasing the number of trials increases both. Because q is less than 1 whenever p is greater than 0, the variance of the Binomial distribution is smaller than its mean.

Poisson Distribution

In the Poisson distribution, the mean (µ) and variance (σ²) are also closely related, and they are given by the following formulas:

µ = λ

σ² = λ

where:

λ is the average number of events per unit time or space

This means that the mean and variance of the Poisson distribution are equal. The single parameter λ determines both, so a Poisson variable with a larger average count is also more spread out.

Comparison

The relationship between the mean and variance is different for the Binomial and Poisson distributions. In the Binomial distribution, the variance is smaller than the mean, since σ² = µ(1 - p). In the Poisson distribution, the variance is equal to the mean. Both distributions are discrete; the difference comes from the factor q = 1 - p. As p approaches 0 while np stays fixed, q approaches 1, so the Binomial variance approaches its mean and the Binomial distribution approaches the Poisson distribution.
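
These formulas can be verified directly from the probability mass functions. The following is a minimal sketch with assumed parameter values (n = 10, p = 0.4, λ = 5); the Poisson sum is truncated at 100 terms, where the remaining probability is negligible.

```python
import math

def mean_and_variance(pmf_values):
    """Mean and variance of a distribution given as {value: probability}."""
    mean = sum(k * prob for k, prob in pmf_values.items())
    variance = sum((k - mean) ** 2 * prob for k, prob in pmf_values.items())
    return mean, variance

# Binomial(n=10, p=0.4): expect mean = np = 4 and variance = npq = 2.4.
num_trials, prob_success = 10, 0.4
binomial = {
    k: math.comb(num_trials, k) * prob_success**k * (1 - prob_success) ** (num_trials - k)
    for k in range(num_trials + 1)
}
binomial_mean, binomial_variance = mean_and_variance(binomial)

# Poisson(lambda=5): expect mean = variance = 5. The support is infinite,
# so the sum is truncated at k = 99.
event_rate = 5
poisson = {
    k: math.exp(-event_rate) * event_rate**k / math.factorial(k)
    for k in range(100)
}
poisson_mean, poisson_variance = mean_and_variance(poisson)

print(f"Binomial: mean={binomial_mean:.4f}, variance={binomial_variance:.4f}")
print(f"Poisson:  mean={poisson_mean:.4f}, variance={poisson_variance:.4f}")
```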

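Here is a table summarizing the relationship between the mean and variance for the Binomial and Poisson distributions:

Distribution    Mean (µ)    Variance (σ²)    Relationship
Binomial        np          npq              σ² = µ(1 - p), so σ² < µ
Poisson         λ           λ                σ² = µ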

Q10. In a normal distribution, where does the least frequent data appear relative to the mean?

Answer = In a normal distribution, also known as a Gaussian distribution or bell curve, the least frequent data points appear in the tails of the distribution. The normal distribution is symmetric, meaning that the shape of the distribution is the same on both sides of the mean.

Here's how the data distribution is typically organized in a normal distribution:

The highest frequency of data points is at the mean of the distribution.
As you move away from the mean in either direction, the frequency of data points gradually decreases.
The least frequent data points are found in the tails of the distribution, farthest from the mean.

In any normal distribution, approximately 68% of the data falls within one standard deviation of the mean, 95% within two standard deviations, and 99.7% within three standard deviations. Therefore, the data points in the tails, beyond three standard deviations from the mean, are rare and make up only about 0.3% of the distribution.
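
The percentages above can be reproduced with the error function from Python's standard library. This is a small sketch; `prob_within` is a helper name chosen here, and it relies on the identity P(|X - mean| <= k standard deviations) = erf(k / sqrt(2)) for a normal distribution.

```python
import math

def prob_within(k):
    """Probability that a normal variable lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2))

coverage = {k: prob_within(k) for k in (1, 2, 3)}

for k, prob in coverage.items():
    print(f"Within {k} SD: {prob:.4f}, in the tails: {1 - prob:.4f}")
```

The shrinking tail probabilities show how quickly data becomes rare as the distance from the mean grows.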
