Q1. What is the Probability density function?

ANS-1

The Probability Density Function (PDF) is a fundamental concept in probability theory and statistics. It is a function that describes the relative likelihood of a continuous random variable taking values near a given point. The PDF value at a point is a density, not a probability: for a continuous random variable, the probability of any single exact value is zero, and probabilities are obtained by integrating the PDF over a range of values.

For a continuous random variable X, the probability density function is denoted as f(x) and satisfies the following properties:

1. Non-negativity: f(x) ≥ 0 for all x in the range of X.
2. Total area under the curve: The integral of f(x) over the entire range of X equals 1. This means that the probability of X taking any value in its entire range is 1.

Mathematically, the probability that X lies within the interval [a, b] can be calculated by integrating the PDF over that interval:

P(a ≤ X ≤ b) = ∫[a to b] f(x) dx

It is important to note that the PDF is only defined for continuous random variables. For discrete random variables, we use the Probability Mass Function (PMF) to describe the probability distribution.

In summary, the PDF helps us understand the likelihood of different values occurring in a continuous random variable, and it serves as the continuous analog of the PMF for discrete random variables.
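
The two properties and the interval formula can be checked numerically. The sketch below is a minimal illustration, assuming the standard normal density as the example f(x) and a simple trapezoidal rule for the integral; the helper names `standard_normal_pdf` and `integrate` are illustrative choices, not part of any library.

```python
import math

def standard_normal_pdf(x):
    """Density of the standard normal distribution (mean 0, std dev 1)."""
    return math.exp(-0.5 * x * x) / math.sqrt(2 * math.pi)

def integrate(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the trapezoidal rule."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h

# Total area: the tails beyond +/-10 are negligible, so this is close to 1.
total_area = integrate(standard_normal_pdf, -10, 10)

# P(-1 <= X <= 1): roughly 0.68 for a standard normal variable.
prob_within_one_sd = integrate(standard_normal_pdf, -1, 1)

print("Total area:", total_area)
print("P(-1 <= X <= 1):", prob_within_one_sd)
```

The same `integrate` helper works for any non-negative function that integrates to 1, as long as the interval covers the region where the density is not negligible.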





Q2. What are the types of probability distributions?
ANS-2

There are several types of probability distributions, each of which describes the probability of different outcomes for different types of random variables. The two main categories of probability distributions are:

1. Discrete Probability Distributions:
   - Bernoulli Distribution: Describes the probability distribution of a random variable that can take only two possible outcomes, typically denoted as 1 (success) and 0 (failure), with probabilities p and q = 1 - p, respectively.
   - Binomial Distribution: Models the number of successes in a fixed number of independent Bernoulli trials, each with the same probability of success (p).
   - Poisson Distribution: Describes the number of events that occur in a fixed interval of time or space, given the average rate of occurrence (λ).

2. Continuous Probability Distributions:
   - Uniform Distribution: Provides a constant probability over a specific interval, with all values within that interval being equally likely to occur.
   - Normal Distribution (Gaussian Distribution): One of the most common distributions in statistics, it has a bell-shaped curve and is fully described by its mean (μ) and standard deviation (σ). Many natural phenomena follow a normal distribution.
   - Exponential Distribution: Models the time between events in a Poisson process, where events occur continuously and independently at a constant average rate (λ).
   - Gamma Distribution: A generalization of the exponential distribution, it models the waiting time until a given number of events have occurred in a Poisson process.
   - Beta Distribution: Describes random variables that have values between 0 and 1, often used in Bayesian statistics for modeling probabilities.
   - Weibull Distribution: Often used to model reliability and lifetime data.

These are just some of the most commonly encountered probability distributions. There are many other specialized distributions used in various fields of study, such as the t-distribution, chi-square distribution, F-distribution, etc. The choice of distribution depends on the nature of the random variable and the problem being analyzed.
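
To make the discrete/continuous split concrete, the sketch below draws samples from a few of the distributions listed above and compares each sample mean with its theoretical mean. It uses only Python's standard `random` and `statistics` modules; the parameter values (n = 10, p = 0.3, λ = 2) and the sample size are arbitrary choices for illustration.

```python
import random
import statistics

random.seed(0)  # fixed seed so repeated runs give the same numbers
N = 20_000

# Discrete example: Binomial(n=10, p=0.3) built as a sum of 10 Bernoulli trials.
binomial_draws = [sum(random.random() < 0.3 for _ in range(10)) for _ in range(N)]

# Continuous examples.
uniform_draws = [random.uniform(0, 1) for _ in range(N)]
normal_draws = [random.gauss(0, 1) for _ in range(N)]
exponential_draws = [random.expovariate(2.0) for _ in range(N)]  # rate λ = 2

# Sample means should land near the theoretical means.
print("Binomial    mean:", statistics.mean(binomial_draws), "(theory 3.0)")
print("Uniform     mean:", statistics.mean(uniform_draws), "(theory 0.5)")
print("Normal      mean:", statistics.mean(normal_draws), "(theory 0.0)")
print("Exponential mean:", statistics.mean(exponential_draws), "(theory 0.5)")

# The discrete variable only takes integer values between 0 and n.
print("Distinct binomial values:", sorted(set(binomial_draws)))
```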






Q3. Write a Python function to calculate the probability density function of a normal distribution with
given mean and standard deviation at a given point.


ANS-3

To calculate the probability density function (PDF) of a normal distribution at a given point for a given mean (μ) and standard deviation (σ), you can use the following Python function:

```python
import math

def normal_pdf(x, mean, std_dev):
    """
    Calculate the probability density function (PDF) of a normal distribution at a given point.

    Parameters:
        x (float): The point at which to calculate the PDF.
        mean (float): The mean (μ) of the normal distribution.
        std_dev (float): The standard deviation (σ) of the normal distribution.

    Returns:
        float: The PDF value at the given point.
    """
    coefficient = 1 / (std_dev * math.sqrt(2 * math.pi))
    exponent = -0.5 * ((x - mean) / std_dev) ** 2
    pdf_value = coefficient * math.exp(exponent)
    return pdf_value

# Example usage:
mean = 0.0
std_dev = 1.0
point = 1.5
pdf_value_at_point = normal_pdf(point, mean, std_dev)
print("PDF at x =", point, "is:", pdf_value_at_point)
```

In this function, we use the formula for the PDF of a normal distribution:

PDF(x | μ, σ) = (1 / (σ * sqrt(2π))) * exp(-0.5 * ((x - μ) / σ)^2)

Where:
- x is the point at which we want to calculate the PDF.
- μ is the mean of the normal distribution.
- σ is the standard deviation of the normal distribution.

The function `normal_pdf` takes these three parameters as input and returns the PDF value at the given point `x`. The `math.exp` function is used to calculate the exponential term of the PDF formula. The resulting PDF value is a density: it indicates how likely values near `x` are relative to other points under the normal distribution with the given mean and standard deviation. It is not the probability that the random variable equals `x`, which is zero for any continuous distribution.
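
A density value can exceed 1, which is a quick way to see that it is not itself a probability. The sketch below uses `statistics.NormalDist` from the Python standard library (available from Python 3.8) instead of the hand-written function; the choice of σ = 0.1 and the interval [-0.05, 0.05] are only for illustration.

```python
from statistics import NormalDist

# A narrow normal distribution: mean 0, standard deviation 0.1.
narrow = NormalDist(mu=0.0, sigma=0.1)

# Density at the mean: 1 / (0.1 * sqrt(2π)), which is larger than 1.
density_at_mean = narrow.pdf(0.0)

# A genuine probability comes from the area under the curve over an interval.
prob_near_mean = narrow.cdf(0.05) - narrow.cdf(-0.05)

print("Density at the mean:", density_at_mean)
print("P(-0.05 <= X <= 0.05):", prob_near_mean)
```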



Q4. What are the properties of Binomial distribution? Give two examples of events where binomial
distribution can be applied.

ANS-4


The Binomial distribution is a discrete probability distribution that models the number of successes in a fixed number of independent Bernoulli trials, where each trial has only two possible outcomes: success (S) or failure (F). The distribution is characterized by two parameters: the number of trials (n) and the probability of success in each trial (p).

Properties of the Binomial distribution:

1. Fixed number of trials: The number of trials (n) is fixed and known in advance.

2. Independent trials: Each trial is independent of the others, meaning the outcome of one trial does not affect the outcomes of the other trials.

3. Binary outcomes: Each trial has two possible outcomes, usually denoted as success (S) and failure (F).

4. Constant probability: The probability of success (p) remains constant across all trials.

5. Discrete values: The random variable representing the number of successes can only take on discrete values from 0 to n.

Examples of events where Binomial distribution can be applied:

1. Coin Flips: Consider flipping a fair coin (where the probability of getting heads, p, is 0.5) ten times. The number of times we get heads in these ten flips can be modeled using a Binomial distribution. Here, n = 10 (number of trials) and p = 0.5 (probability of success, i.e., getting heads).

2. Quality Control: In a manufacturing process, a large batch of products is tested for defects. Let's say that 10% of the products in the batch are defective (p = 0.10). If we randomly select 20 products from the batch and count the number of defective items, the number of defective products can be modeled using a Binomial distribution with n = 20 (number of trials) and p = 0.10 (probability of success, i.e., selecting a defective product). Because items are drawn without replacement, this is an approximation that works well when the batch is much larger than the sample.

In both of these examples, the Binomial distribution is used to model the number of successes (e.g., heads in coin flips or defective products in quality control) in a fixed number of independent trials, where each trial has two possible outcomes (success or failure) with a constant probability of success.
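
The two examples can be turned into numbers with the Binomial probability mass function, P(X = k) = C(n, k) * p^k * (1 - p)^(n - k). The sketch below is a minimal illustration using `math.comb` (Python 3.8+); the function name `binomial_pmf` and the specific questions asked of each example are illustrative choices.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

# Coin flips: probability of exactly 5 heads in 10 fair flips (about 0.246).
p_five_heads = binomial_pmf(5, 10, 0.5)

# Quality control: probability of at most 1 defective item among 20.
p_at_most_one_defect = binomial_pmf(0, 20, 0.10) + binomial_pmf(1, 20, 0.10)

# The probabilities over all possible counts 0..n add up to 1.
total = sum(binomial_pmf(k, 10, 0.5) for k in range(11))

print("P(5 heads in 10 flips):", p_five_heads)
print("P(at most 1 defect in 20):", p_at_most_one_defect)
print("Sum of PMF over k = 0..10:", total)
```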



Q5. Generate a random sample of size 1000 from a binomial distribution with probability of success 0.4
and plot a histogram of the results using matplotlib.


ANS-5
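
A minimal sketch of the sampling and plotting task, using NumPy and matplotlib. Q5 does not state the number of trials, so n = 10 is an assumption made here; the fixed seed and the bin layout are also illustrative choices.

```python
import numpy as np
import matplotlib.pyplot as plt

# Assumption: Q5 does not specify the number of trials, so n = 10 is chosen here.
n_trials = 10
p_success = 0.4
sample_size = 1000

rng = np.random.default_rng(seed=42)  # seeded generator for reproducibility
sample = rng.binomial(n_trials, p_success, size=sample_size)

# One bin per integer outcome 0..n, centred on the integer values.
bins = np.arange(n_trials + 2) - 0.5
fig, ax = plt.subplots()
ax.hist(sample, bins=bins, edgecolor="black")
ax.set_xticks(range(n_trials + 1))
ax.set_xlabel("Number of successes")
ax.set_ylabel("Frequency")
ax.set_title(f"Binomial sample (n={n_trials}, p={p_success}, size={sample_size})")
plt.show()
```

With these parameters the histogram should peak around n * p = 4 successes.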





Q6. Write a Python function to calculate the cumulative distribution function of a Poisson distribution
with given mean at a given point.

ANS-6

To calculate the Cumulative Distribution Function (CDF) of a Poisson distribution at a given point for a given mean (λ), you can use the following Python function:

```python
import math

def poisson_cdf(point, mean):
    """
    Calculate the Cumulative Distribution Function (CDF) of a Poisson distribution at a given point.

    Parameters:
        point (int): The point at which to calculate the CDF.
        mean (float): The mean (λ) of the Poisson distribution.

    Returns:
        float: The CDF value at the given point.
    """
    cdf_value = 0.0
    for k in range(point + 1):
        cdf_value += (math.exp(-mean) * (mean ** k)) / math.factorial(k)
    return cdf_value

# Example usage:
mean = 3.5
point = 2
cdf_value_at_point = poisson_cdf(point, mean)
print("CDF at x =", point, "is:", cdf_value_at_point)
```

In this function, we use the formula for the CDF of a Poisson distribution:

CDF(k | λ) = Σ[from i=0 to k] (e^(-λ) * λ^i) / i!

Where:
- k is the point at which we want to calculate the CDF.
- λ is the mean of the Poisson distribution.
- e is the base of the natural logarithm (approximately 2.71828).

The function `poisson_cdf` takes these two parameters as input and returns the CDF value at the given `point`. The loop iterates from k=0 to the given point (inclusive), and for each value of k, it adds the corresponding term of the CDF formula to the cumulative sum. The resulting CDF value represents the probability that the Poisson-distributed random variable is less than or equal to the given point.
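
For large points, `math.factorial(k)` becomes too large to convert to a float, so the direct formula can raise an `OverflowError`. One possible alternative, sketched below, builds each term from the previous one using P(X = i) = P(X = i - 1) * λ / i. The name `poisson_cdf_iterative` and the handling of negative or non-integer points are assumptions of this sketch, not part of the original function.

```python
import math

def poisson_cdf_iterative(point, mean):
    """P(X <= point) for X ~ Poisson(mean), building each term from the previous one."""
    if point < 0:
        return 0.0  # a Poisson variable is never negative
    term = math.exp(-mean)  # P(X = 0)
    cdf_value = term
    for i in range(1, math.floor(point) + 1):
        term *= mean / i  # P(X = i) = P(X = i - 1) * mean / i
        cdf_value += term
    return cdf_value

print("CDF at x = 2 with mean 3.5:", poisson_cdf_iterative(2, 3.5))
print("CDF at x = 200 with mean 3.5:", poisson_cdf_iterative(200, 3.5))
```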





Q7. How is the Binomial distribution different from the Poisson distribution?

ANS-7


The Binomial distribution and the Poisson distribution are both discrete probability distributions used to model counts: the Binomial counts successes in a fixed number of trials, while the Poisson counts events in a fixed interval. They differ in their assumptions and characteristics:

1. Nature of Trials:
   - Binomial Distribution: The Binomial distribution is used to model a fixed number of independent trials, where each trial can have two possible outcomes: success or failure.
   - Poisson Distribution: The Poisson distribution is used to model the number of events that occur in a fixed interval of time or space, where the events happen randomly and independently.

2. Number of Possible Outcomes:
   - Binomial Distribution: Each trial in the Binomial distribution has two possible outcomes: success or failure.
   - Poisson Distribution: The Poisson distribution deals with counts of events, so the number of possible outcomes can range from 0 to positive infinity.

3. Parameters:
   - Binomial Distribution: The Binomial distribution is characterized by two parameters: the number of trials (n) and the probability of success in each trial (p).
   - Poisson Distribution: The Poisson distribution is characterized by a single parameter, the average rate of occurrence (λ). This parameter represents the average number of events that occur in the given time interval or space.

4. Type of Random Variable:
   - Binomial Distribution: The random variable in the Binomial distribution represents the number of successes in a fixed number of trials.
   - Poisson Distribution: The random variable in the Poisson distribution represents the number of events that occur in a fixed interval.

5. Assumptions:
   - Binomial Distribution: Assumes that the trials are independent and the probability of success (p) remains constant for all trials.
   - Poisson Distribution: Assumes that events occur randomly and independently at a constant average rate (λ) in the given interval.

6. Range of Values:
   - Binomial Distribution: The random variable in the Binomial distribution can take only discrete values from 0 to the number of trials (n).
   - Poisson Distribution: The random variable in the Poisson distribution can take discrete values from 0 to positive infinity.

In summary, the main difference between the Binomial and Poisson distributions lies in the nature of the events they model. The Binomial distribution deals with a fixed number of independent trials with two possible outcomes, while the Poisson distribution deals with the number of events that occur randomly and independently in a fixed interval, with a constant average rate of occurrence.




Q8. Generate a random sample of size 1000 from a Poisson distribution with mean 5 and calculate the
sample mean and variance.


ANS-8


```python
import numpy as np

def sample_mean_and_variance(sample):
    """
    Calculate the sample mean and variance of a given sample.

    Parameters:
        sample (numpy.ndarray): The input sample.

    Returns:
        float: Sample mean.
        float: Sample variance.
    """
    sample_mean = np.mean(sample)
    sample_variance = np.var(sample, ddof=1)  # Setting ddof=1 for unbiased sample variance
    return sample_mean, sample_variance

# Generate a random sample of size 1000 from a Poisson distribution with mean 5
mean = 5
sample_size = 1000
random_sample = np.random.poisson(mean, size=sample_size)

# Calculate the sample mean and variance
sample_mean, sample_variance = sample_mean_and_variance(random_sample)

# Print the results
print("Sample Mean:", sample_mean)
print("Sample Variance:", sample_variance)
```





Q9. How are the mean and variance related in the Binomial distribution and the Poisson distribution?


ANS-9


In both the Binomial and Poisson distributions, the mean and variance are related in a specific way, and this relationship provides valuable insights into the characteristics of these distributions.

Binomial Distribution:
For a Binomial distribution with parameters n (number of trials) and p (probability of success in each trial), the mean (μ) and variance (σ^2) are related as follows:

Mean (μ) = n * p
Variance (σ^2) = n * p * (1 - p)

Here, the mean represents the average number of successes in n trials, and the variance quantifies the spread or dispersion of the number of successes around the mean. The variance is largest when p = 0.5 (i.e., the probability of success and failure are equal), and it decreases as p moves away from 0.5 towards either 0 or 1.

Poisson Distribution:
For a Poisson distribution with parameter λ (average rate of occurrence), the mean (μ) and variance (σ^2) are related as follows:

Mean (μ) = λ
Variance (σ^2) = λ

In the Poisson distribution, both the mean and variance are equal to the average rate of occurrence λ. This means the spread of the data is tied directly to the mean: a larger λ gives a larger variance, and no other parameter is involved.

Comparison:
In summary, the mean and variance of the Binomial distribution depend on both the number of trials (n) and the probability of success (p), whereas in the Poisson distribution, the mean and variance are both equal to the average rate of occurrence (λ).

Furthermore, as the number of trials in the Binomial distribution (n) becomes very large and the probability of success (p) becomes very small in such a way that np stays fixed at some value λ, the Binomial distribution approaches the Poisson distribution with parameter λ. In this limit, the Binomial mean np equals λ and the Binomial variance np(1 - p) approaches λ, because 1 - p approaches 1. This is known as the "Poisson limit of the Binomial distribution."
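
Both the moment formulas and the Poisson limit can be checked numerically. The sketch below computes the Binomial mean and variance directly from the probability mass function, then holds n * p fixed and lets n grow to see the Binomial probabilities move toward the Poisson ones. The helper names, the parameter values (n = 20, p = 0.1, λ = 5), and the range k = 0..10 are illustrative assumptions.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)

# Mean and variance of Binomial(n=20, p=0.1) computed directly from the PMF.
n, p = 20, 0.1
mean = sum(k * binomial_pmf(k, n, p) for k in range(n + 1))
variance = sum((k - mean) ** 2 * binomial_pmf(k, n, p) for k in range(n + 1))
print("Binomial mean:", mean, "vs n*p =", n * p)
print("Binomial variance:", variance, "vs n*p*(1-p) =", n * p * (1 - p))

# Poisson limit: hold n*p = lam fixed and let n grow.
lam = 5.0
max_gaps = []
for n_trials in (10, 100, 1000, 10000):
    p_small = lam / n_trials
    gap = max(
        abs(binomial_pmf(k, n_trials, p_small) - poisson_pmf(k, lam))
        for k in range(0, 11)
    )
    max_gaps.append(gap)
    print(f"n = {n_trials:>5}: largest PMF gap for k = 0..10 is {gap:.6f}")
```

The printed gaps should shrink as n increases, which is the behaviour the limit describes.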



Q10. In a normal distribution, where does the least frequent data appear relative to the mean?

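ANS-10

In a normal distribution, the least frequent data appear in the tails, at the points farthest from the mean on either side. The density is highest at the mean and falls off symmetrically as the distance from the mean grows, so values several standard deviations away are the rarest. About 99.7% of the data lie within three standard deviations of the mean.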
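
For a normal distribution, the density is highest at the mean and drops as the distance from the mean grows, so the rarest values sit in the tails. The sketch below illustrates this with `statistics.NormalDist` from the standard library (Python 3.8+), assuming a standard normal distribution; the chosen distances are arbitrary.

```python
from statistics import NormalDist

dist = NormalDist(mu=0.0, sigma=1.0)

# Density at increasing distances from the mean, measured in standard deviations.
distances = [0, 1, 2, 3, 4]
densities = [dist.pdf(d) for d in distances]
for d, value in zip(distances, densities):
    print(f"{d} standard deviation(s) from the mean: density = {value:.6f}")

# Probability of landing more than 3 standard deviations from the mean (both tails).
tail_probability = 2 * (1 - dist.cdf(3))
print("P(|X - mean| > 3 sd):", tail_probability)
```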



