In [None]:
Q1. What is the Probability density function?

In [None]:
The Probability Density Function (PDF) describes how probability is distributed over the values of a continuous random variable. The density at a single point is not itself a probability; the probability that the variable falls within a range of values is the area under the PDF over that range (its integral). A PDF is non-negative everywhere and integrates to 1 over all possible values.
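
The link between a density and a probability can be checked numerically. The following is a minimal sketch, assuming a standard normal distribution and a simple trapezoidal rule; the helper names `standard_normal_pdf` and `integrate` are illustrative, not part of any library.

```python
import math

def standard_normal_pdf(x):
    """Density of the standard normal distribution (mean 0, std dev 1)."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def integrate(f, a, b, steps=10_000):
    """Approximate the integral of f over [a, b] with the trapezoidal rule."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h

# Probability of falling within one standard deviation of the mean
prob_within_one_sd = integrate(standard_normal_pdf, -1.0, 1.0)
print(f"P(-1 <= X <= 1) ~ {prob_within_one_sd:.4f}")
```

The printed value should be close to the familiar 68% figure for one standard deviation.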

In [None]:
Q2. What are the types of probability distributions?

In [None]:

There are several types of probability distributions, and they can be broadly classified into two main categories: discrete probability distributions and continuous probability distributions. Each category has specific distributions associated with it.

Discrete Probability Distributions:

Bernoulli Distribution: A distribution with two possible outcomes, often labeled as success (1) and failure (0).
Binomial Distribution: Represents the number of successes in a fixed number of independent Bernoulli trials.
Poisson Distribution: Describes the number of events that occur in a fixed interval of time or space, given a known average rate of occurrence.

Continuous Probability Distributions:

Normal Distribution (Gaussian Distribution): A bell-shaped curve characterized by its mean and standard deviation. Many natural phenomena approximately follow a normal distribution.
Uniform Distribution: Every value in a given interval has the same density, forming a rectangular-shaped distribution.
Exponential Distribution: Describes the time between events in a Poisson process, such as the time between arrivals of customers at a service point.
Gamma Distribution: Generalizes the exponential distribution and is used to model the waiting time until a fixed number of events have occurred in a Poisson process.
Beta Distribution: Used to model random variables that are constrained to the interval between 0 and 1, such as proportions.

These are just a few examples, and there are many other probability distributions with specific characteristics and use cases. The choice of distribution depends on the nature of the random variable being modeled and the underlying assumptions about the data.

Each distribution is described by a probability mass function (PMF) if it is discrete or a probability density function (PDF) if it is continuous. The PMF gives the probability of each individual outcome, while the PDF gives the density from which probabilities of ranges of values are obtained. Probability distributions are fundamental tools in statistics and probability theory, enabling the modeling and analysis of various real-world phenomena.
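
As a rough illustration of the two categories, the sketch below draws a few values from each distribution listed above using NumPy's random generator. The parameter values and the seed are arbitrary choices made for the example, and the Bernoulli case is expressed as a binomial with a single trial.

```python
import numpy as np

rng = np.random.default_rng(seed=0)  # seed chosen arbitrarily for reproducibility
size = 5

samples = {
    "bernoulli(p=0.5)": rng.binomial(n=1, p=0.5, size=size),
    "binomial(n=10, p=0.5)": rng.binomial(n=10, p=0.5, size=size),
    "poisson(lam=3)": rng.poisson(lam=3, size=size),
    "normal(mean=0, sd=1)": rng.normal(loc=0.0, scale=1.0, size=size),
    "uniform(0, 1)": rng.uniform(low=0.0, high=1.0, size=size),
    "exponential(scale=1)": rng.exponential(scale=1.0, size=size),
    "gamma(shape=2, scale=1)": rng.gamma(shape=2.0, scale=1.0, size=size),
    "beta(a=2, b=5)": rng.beta(a=2.0, b=5.0, size=size),
}

# Discrete distributions come back as integers, continuous ones as floats
for name, values in samples.items():
    kind = "discrete" if np.issubdtype(values.dtype, np.integer) else "continuous"
    print(f"{name:25s} [{kind}] {values}")
```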

In [None]:
Q3. Write a Python function to calculate the probability density function of a normal distribution with
given mean and standard deviation at a given point.

In [None]:
import math

def normal_pdf(x, mean, std_dev):
    """
    Calculate the probability density function (PDF) of a normal distribution at a given point.

    Parameters:
    - x: The point at which to calculate the PDF.
    - mean: The mean of the normal distribution.
    - std_dev: The standard deviation of the normal distribution.

    Returns:
    The PDF value at the given point.
    """
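    if std_dev <= 0:
        raise ValueError("std_dev must be positive")
    # Normalizing constant 1 / (sigma * sqrt(2 * pi))
    coefficient = 1 / (std_dev * math.sqrt(2 * math.pi))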
    exponent = -((x - mean)**2) / (2 * std_dev**2)
    pdf_value = coefficient * math.exp(exponent)
    return pdf_value

# Example usage:
mean = 0
std_dev = 1
point = 1.5

pdf_result = normal_pdf(point, mean, std_dev)
print(f"The PDF at x={point} is: {pdf_result}")
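
One way to sanity-check a hand-written normal density is to compare it with `statistics.NormalDist` from the Python standard library. The sketch below writes the closed-form expression out inline, so that it is self-contained, and reuses the same example values (mean 0, standard deviation 1, point 1.5).

```python
import math
from statistics import NormalDist

mean, std_dev, point = 0.0, 1.0, 1.5

# Closed-form density, written out directly
manual = math.exp(-((point - mean) ** 2) / (2 * std_dev ** 2)) / (
    std_dev * math.sqrt(2 * math.pi)
)

# Reference value from the standard library
reference = NormalDist(mu=mean, sigma=std_dev).pdf(point)

print(f"manual={manual:.6f}, reference={reference:.6f}")
```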


In [None]:
Q4. What are the properties of Binomial distribution? Give two examples of events where binomial
distribution can be applied.

In [None]:
The Binomial distribution is a discrete probability distribution that models the number of successes in a fixed number of independent Bernoulli trials, where each trial has two possible outcomes: success (S) or failure (F). The Binomial distribution has several key properties:

Fixed Number of Trials (n): The number of trials (experiments or observations) is fixed in advance.

Independent Trials: Each trial is independent of the others. The outcome of one trial does not affect the outcome of another.

Two Possible Outcomes: Each trial results in one of two outcomes, typically labeled as success (S) or failure (F).

Constant Probability of Success (p): The probability of success (denoted as p) remains constant across all trials.

Discrete Distribution: The Binomial distribution is discrete, meaning that it describes a set of distinct possible values (whole numbers).

Count of Successes: The random variable X represents the number of successes in the fixed number of trials.
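
These properties lead to the standard binomial probability mass function, P(X = k) = C(n, k) p^k (1 - p)^(n - k). A minimal sketch follows; the function name `binomial_pmf` and the values n = 5, p = 0.5 are chosen only for illustration.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

n, p = 5, 0.5
pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

total = sum(pmf)                                 # probabilities over 0..n sum to 1
mean = sum(k * pk for k, pk in enumerate(pmf))   # equals n * p

print(pmf)
print(f"total={total}, mean={mean}")
```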

In [None]:
Examples of Events Where Binomial Distribution Can Be Applied:

Coin Flipping:

Example: Tossing a fair coin 5 times and counting the number of heads.
Properties:
Fixed number of trials (n = 5).
Each trial is independent.
Two possible outcomes (heads or tails).
Constant probability of success (0.5 for a fair coin).

Quality Control:

Example: Inspecting a batch of products and counting the number of defective items.
Properties:
Fixed number of trials (n = the batch size).
Each item is defective independently of the others.
Two possible outcomes (defective or non-defective).
Constant probability of "success", which here means the probability that an item is defective.
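
The coin-flipping example can be simulated directly. The sketch below is an illustration only: the seed and the number of repeated experiments are arbitrary, and it estimates the probability of exactly 3 heads in 5 tosses, whose theoretical value is C(5, 3) / 2^5 = 0.3125.

```python
import random

random.seed(42)  # arbitrary seed so the run is repeatable

n_tosses = 5
n_experiments = 20_000

# Each experiment: toss a fair coin n_tosses times and count the heads
head_counts = [
    sum(random.random() < 0.5 for _ in range(n_tosses))
    for _ in range(n_experiments)
]

freq_three_heads = head_counts.count(3) / n_experiments
print(f"Empirical P(3 heads) ~ {freq_three_heads:.3f} (theory: 0.3125)")
```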

In [None]:
Q5. Generate a random sample of size 1000 from a binomial distribution with probability of success 0.4
and plot a histogram of the results using matplotlib.

In [None]:
import numpy as np
import matplotlib.pyplot as plt

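# Set the parameters for the binomial distribution
n_trials = 1000     # Number of trials per draw (not given in the question; assumed here)
p_success = 0.4     # Probability of success
sample_size = 1000  # Number of draws in the random sample

# Generate a random sample from the binomial distribution
random_sample = np.random.binomial(n=n_trials, p=p_success, size=sample_size)

# Plot the histogram with one bin per integer across the observed range only
# (binning over 0..n_trials would squeeze all the data into a thin spike)
bins = np.arange(random_sample.min() - 0.5, random_sample.max() + 1.5, 1)
plt.hist(random_sample, bins=bins, edgecolor='black')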
plt.title(f'Binomial Distribution\n(n={n_trials}, p={p_success})')
plt.xlabel('Number of Successes')
plt.ylabel('Frequency')
plt.grid(axis='y', linestyle='--', alpha=0.7)
plt.show()


In [None]:
Q6. Write a Python function to calculate the cumulative distribution function of a Poisson distribution
with given mean at a given point.

In [None]:
import math

def poisson_cdf(k, mean):
    """
    Calculate the Cumulative Distribution Function (CDF) of a Poisson distribution at a given point.

    Parameters:
    - k: The point at which to calculate the CDF.
    - mean: The mean of the Poisson distribution.

    Returns:
    The CDF value at the given point.
    """
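    if mean < 0:
        raise ValueError("mean must be non-negative")
    k = math.floor(k)  # the CDF is a step function, so non-integer points round down
    # Sum the PMF from 0 to k; for k < 0 the range is empty and the CDF is 0
    cdf_value = 0.0
    for i in range(k + 1):
        cdf_value += math.exp(-mean) * (mean**i) / math.factorial(i)
    return cdf_value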

# Example usage:
mean = 3
point = 2

cdf_result = poisson_cdf(point, mean)
print(f"The CDF at k={point} is: {cdf_result}")


In [None]:
Q7. How is the Binomial distribution different from the Poisson distribution?

In [None]:
The Binomial and Poisson distributions are both probability distributions used in statistical modeling, but they differ in terms of the scenarios they are applicable to and their key characteristics. Here are the main differences between the Binomial and Poisson distributions:

1. Number of Trials:

Binomial Distribution: Describes the number of successes in a fixed number of independent Bernoulli trials, where each trial has two possible outcomes (success or failure).
Poisson Distribution: Describes the number of events that occur in a fixed interval of time or space, given a known average rate of occurrence. There is no fixed number of trials; events can occur at any point in the interval, so the count has no upper bound.

2. Nature of Outcomes:

Binomial Distribution: Each trial can only take on two values (success or failure), and the count of successes ranges from 0 to n.
Poisson Distribution: The count of events is also discrete, but it can be any non-negative integer.

3. Independence:

Binomial Distribution: Assumes that the trials are independent, meaning the outcome of one trial does not affect the outcome of another.
Poisson Distribution: Assumes that the events are independent within the fixed interval.

4. Probability of Success:

Binomial Distribution: Requires a constant probability of success (p) for each trial.
Poisson Distribution: Describes events that occur randomly at a constant average rate, and the probability of an event occurring in a small interval is proportional to the length of the interval.

5. Parameters:

Binomial Distribution: Requires two parameters - the number of trials (n) and the probability of success in each trial (p).
Poisson Distribution: Requires a single parameter - the average rate of occurrence (λ), which is both the mean and the variance of the distribution.
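
The two distributions are also connected: when the number of trials is large and the success probability is small, a Binomial(n, p) is closely approximated by a Poisson distribution with rate n * p. The sketch below compares the two probability mass functions under that assumption; the values n = 1000 and p = 0.003 and the helper names are illustrative only.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)

n, p = 1000, 0.003  # many trials, small success probability (illustrative values)
lam = n * p         # matching Poisson rate

for k in range(7):
    b = binomial_pmf(k, n, p)
    q = poisson_pmf(k, lam)
    print(f"k={k}: binomial={b:.5f}  poisson={q:.5f}")

# Largest pointwise difference over the values printed above
max_gap = max(abs(binomial_pmf(k, n, p) - poisson_pmf(k, lam)) for k in range(7))
print(f"max gap: {max_gap:.6f}")
```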

In [None]:
Q8. Generate a random sample of size 1000 from a Poisson distribution with mean 5 and calculate the
sample mean and variance.

In [None]:
import numpy as np

# Set the parameters for the Poisson distribution
mean = 5
sample_size = 1000

# Generate a random sample from the Poisson distribution
random_sample = np.random.poisson(lam=mean, size=sample_size)

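# Calculate the sample mean and the unbiased sample variance
# (ddof=1 divides by n - 1; NumPy's default ddof=0 gives the population variance)
sample_mean = np.mean(random_sample)
sample_variance = np.var(random_sample, ddof=1)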

# Print the results
print(f"Sample Mean: {sample_mean}")
print(f"Sample Variance: {sample_variance}")


In [None]:
Q9. How are the mean and variance related in the Binomial distribution and the Poisson distribution?
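
For reference, the standard results are that a Binomial(n, p) variable has mean n * p and variance n * p * (1 - p), so its variance is smaller than its mean whenever 0 < p < 1, while a Poisson variable with rate λ has mean and variance both equal to λ. The following is a small simulation sketch of these relationships; the parameter values, seed, and sample size are arbitrary choices for the example.

```python
import numpy as np

rng = np.random.default_rng(seed=1)  # arbitrary seed for reproducibility

n, p, lam = 20, 0.3, 6.0  # illustrative parameter values
size = 200_000

binom = rng.binomial(n=n, p=p, size=size)
pois = rng.poisson(lam=lam, size=size)

print("Binomial: theory mean", n * p, "var", n * p * (1 - p))
print("Binomial: sample mean", binom.mean(), "var", binom.var(ddof=1))
print("Poisson:  theory mean", lam, "var", lam)
print("Poisson:  sample mean", pois.mean(), "var", pois.var(ddof=1))
```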

In [None]:
Q10. In a normal distribution, where does the least frequent data appear relative to the mean?

In [None]:
In a normal distribution, also known as a Gaussian distribution or bell curve, the least frequent data appears in the tails of the distribution, farthest from the mean. Specifically:

Left Tail (Negative Side): The data values on the left side of the mean (to the left of the peak of the curve) are smaller than the mean. These represent the lower values in the distribution. The farther you move to the left, the less frequent the data becomes.

Right Tail (Positive Side): The data values on the right side of the mean (to the right of the peak of the curve) are larger than the mean. These represent the higher values in the distribution. The farther you move to the right, the less frequent the data becomes.

The normal distribution is symmetric, and the tails extend infinitely in both directions. However, the probability of observing data values in the tails decreases rapidly as you move away from the mean. The majority of the data is concentrated in the central portion of the distribution, near the mean.

In summary, the least frequent data in a normal distribution is found in the extreme tails, both to the left and right of the mean.
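
How quickly the tails thin out can be quantified. For any normal distribution, the probability of landing more than k standard deviations from the mean, on either side, equals erfc(k / sqrt(2)). The sketch below evaluates this with the standard library; the function name `two_sided_tail` is illustrative.

```python
import math

def two_sided_tail(k):
    """P(|X - mean| > k standard deviations) for any normal distribution."""
    return math.erfc(k / math.sqrt(2))

for k in (1, 2, 3, 4):
    print(f"beyond {k} sd: {two_sided_tail(k):.5f}")
```

Roughly 32% of values lie beyond one standard deviation, about 4.6% beyond two, and about 0.27% beyond three.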