# Assignment

# Q1

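In probability theory, a probability density function (PDF) is a function that describes the relative likelihood of the values that a continuous random variable can take. The value f(x) is a density rather than a probability: probabilities are obtained only by integrating the PDF over an interval.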

Specifically, the probability density function of a random variable X is a function f(x) such that the probability of X being in the interval [a, b] is given by the integral of f(x) over that interval, i.e., the area under the curve of the PDF between a and b:

$$P(a \le X \le b) = \int_a^b f(x)\,dx$$

The probability density function is non-negative, i.e., f(x) ≥ 0 for all x, and its integral over the entire range of the random variable must be equal to 1, i.e.,

$$\int_{-\infty}^{\infty} f(x)\,dx = 1$$

The PDF can be used to calculate various properties of a continuous random variable, such as its mean, variance, and higher-order moments. It is a fundamental concept in probability theory and has numerous applications in statistics, engineering, physics, and other fields.
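The two defining properties above can be checked numerically. The following is a minimal sketch, assuming the standard normal density as the example PDF and a simple trapezoidal rule for the integral. The helper `integrate` and the integration limits of ±10 are illustrative choices, not part of the original answer.

```python
import math

def normal_pdf(x, mean=0.0, stdev=1.0):
    """Density of a normal distribution at x."""
    z = (x - mean) / stdev
    return math.exp(-0.5 * z * z) / (stdev * math.sqrt(2 * math.pi))

def integrate(f, a, b, steps=100_000):
    """Approximate the integral of f over [a, b] with the trapezoidal rule."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h

# The total area under the PDF should be (very close to) 1.
# The range [-10, 10] stands in for (-inf, inf); the tails beyond it are negligible.
total_area = integrate(normal_pdf, -10.0, 10.0)

# P(-1 <= X <= 1) is the area under the curve between -1 and 1.
prob_within_one_sd = integrate(normal_pdf, -1.0, 1.0)

print("Total area:", total_area)
print("P(-1 <= X <= 1):", prob_within_one_sd)
```

The second value should come out near 0.68, the familiar share of a normal distribution that lies within one standard deviation of the mean.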

# Q2

There are many types of probability distributions, but some of the most commonly used ones are:

Normal distribution: Also known as the Gaussian distribution, it is the best-known probability distribution and is widely used in statistical analysis. It is a continuous distribution with a bell-shaped curve that is symmetric about its mean.

Binomial distribution: This is a discrete distribution that describes the number of successes in a fixed number of independent trials, where each trial has only two possible outcomes (e.g., success or failure).

Poisson distribution: This is another discrete distribution. It describes the number of events that occur in a fixed interval of time or space when events happen independently at a known, constant average rate.

Exponential distribution: This is a continuous distribution that describes the time between events in a Poisson process, where events occur randomly and independently at a constant rate.

Uniform distribution: This is a continuous distribution where all values in a given range are equally likely to occur.

Gamma distribution: This is a continuous distribution that is often used to model waiting times or the lifetime of products, among other things.

Beta distribution: This is a continuous distribution that is often used to model probabilities or proportions, such as the probability of success in a Bernoulli trial.
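As a rough illustration of the list above, the sketch below draws samples from several of these distributions with Python's standard `random` module and compares each sample mean with the theoretical mean. The parameter values, the seed, and the helper `binomial_draw` are assumptions chosen for the example. The Poisson distribution is left out because the standard library has no sampler for it.

```python
import random
from statistics import fmean

rng = random.Random(123)  # seeded for reproducibility
SIZE = 20_000             # draws per distribution

def binomial_draw(n, p):
    """One binomial draw: count the successes in n independent Bernoulli(p) trials."""
    return sum(rng.random() < p for _ in range(n))

# name -> (function returning one draw, theoretical mean)
samplers = {
    "normal(mu=0, sigma=1)": (lambda: rng.gauss(0.0, 1.0), 0.0),
    "binomial(n=10, p=0.3)": (lambda: binomial_draw(10, 0.3), 3.0),
    "exponential(rate=2)": (lambda: rng.expovariate(2.0), 0.5),
    "uniform(0, 1)": (lambda: rng.uniform(0.0, 1.0), 0.5),
    "gamma(shape=2, scale=1.5)": (lambda: rng.gammavariate(2.0, 1.5), 3.0),
    "beta(2, 5)": (lambda: rng.betavariate(2.0, 5.0), 2.0 / 7.0),
}

# Average SIZE draws from each sampler.
sample_means = {
    name: fmean(draw() for _ in range(SIZE))
    for name, (draw, _expected) in samplers.items()
}

for name, (_draw, expected) in samplers.items():
    print(f"{name:28s} sample mean = {sample_means[name]:.3f}, theoretical mean = {expected:.3f}")
```

With this many draws, each sample mean should land close to its theoretical value, though the exact figures depend on the seed.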

# Q3

In [2]:
import math

def normal_pdf(x, mean, stdev):
    """Return the normal PDF at x.

    f(x) = (1 / (sqrt(2 * pi) * stdev)) * e^(-(x - mean)^2 / (2 * stdev^2))
    """
    exp_term = math.exp(-(math.pow(x - mean, 2) / (2 * math.pow(stdev, 2))))
    return (1 / (math.sqrt(2 * math.pi) * stdev)) * exp_term

# Evaluate the standard normal PDF (mean 0, standard deviation 1) at x = 1
pdf_value = normal_pdf(1, 0, 1)
print(pdf_value)


0.24197072451914337
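One way to sanity-check the printed value is to compare it with an independent implementation. The sketch below uses `statistics.NormalDist` from the Python standard library (available from Python 3.8). The variable name `reference_value` is just an illustrative choice.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1
reference_value = NormalDist(mu=0, sigma=1).pdf(1)
print(reference_value)
```

The result should agree with the hand-written function to many decimal places.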


# Q4

The binomial distribution has several important properties, including:

- It is a discrete probability distribution: the random variable counts successes, so it takes only the integer values 0, 1, ..., n.
- Each trial is independent and has only two possible outcomes, usually labeled "success" and "failure".
- The probability of success (p) is constant across all trials.
- The number of successes in a fixed number of trials (n) is a random variable that follows a binomial distribution.
- The mean of the distribution is μ = np, and the variance is σ^2 = np(1-p).

Two examples of events where the binomial distribution can be applied are:

Flipping a coin: If we flip a fair coin (i.e., with a probability of success of 0.5) 10 times, we can model the number of heads obtained using a binomial distribution. Here, n = 10 and p = 0.5.

Quality control in manufacturing: A manufacturer inspects a sample of 100 products and checks whether they meet certain quality standards. If each product passes inspection independently with probability 0.95, we can model the number of products that pass using a binomial distribution. Here, n = 100 and p = 0.95. Because the number of defective products is 100 minus the number that pass, the same distribution gives the probability of finding any given number of defective products in the sample.
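Both examples can be worked through with the binomial probability mass function, P(X = k) = C(n, k) p^k (1 - p)^(n - k). The sketch below is a minimal illustration. The helper name `binomial_pmf` and the specific questions asked (exactly 5 heads, at most 2 defective products) are assumptions made for the example.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

# Coin flips: probability of exactly 5 heads in 10 fair flips (252 / 1024).
p_five_heads = binomial_pmf(5, 10, 0.5)

# Quality control: "at most 2 defective" is the same event as "at least 98 pass".
p_at_most_two_defective = sum(binomial_pmf(k, 100, 0.95) for k in range(98, 101))

# The mean computed from the PMF should match the formula n * p.
mean_from_pmf = sum(k * binomial_pmf(k, 10, 0.5) for k in range(11))

print("P(exactly 5 heads):", p_five_heads)
print("P(at most 2 defective):", p_at_most_two_defective)
print("Mean from PMF:", mean_from_pmf)
```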

# Q5

In [None]:
import numpy as np
import matplotlib.pyplot as plt

# Set the random seed for reproducibility
np.random.seed(123)

# Generate a random sample of size 1000 from a binomial distribution with n=1 and p=0.4
sample = np.random.binomial(n=1, p=0.4, size=1000)

# Plot a histogram of the sample; bin edges at -0.5, 0.5 and 1.5 centre the bars on 0 and 1
plt.hist(sample, bins=[-0.5, 0.5, 1.5], edgecolor='black')
plt.xticks([0, 1], ['Failure', 'Success'])
plt.xlabel('Outcome')
plt.ylabel('Frequency')
plt.title('Histogram of Random Sample from a Binomial Distribution')
plt.show()

# Q6

In [None]:
from scipy.stats import poisson

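def poisson_cdf(mean, x):
    """Return P(X <= x) for a Poisson random variable with the given mean."""
    return poisson.cdf(x, mu=mean)

# Example: P(X <= 4) when the mean is 2.5
mean = 2.5
x = 4
cdf = poisson_cdf(mean, x)
print(cdf)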
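For comparison, the same quantity can be computed without SciPy by summing the Poisson probability mass function, P(X = k) = e^(-λ) λ^k / k!, for k = 0, ..., x. This is a minimal sketch using only the standard library. The function name `poisson_cdf_manual` is hypothetical, and the code assumes x is a non-negative integer.

```python
import math

def poisson_cdf_manual(mean, x):
    """P(X <= x) for X ~ Poisson(mean), summing the PMF term by term."""
    return sum(
        math.exp(-mean) * mean**k / math.factorial(k)
        for k in range(int(x) + 1)
    )

manual_cdf = poisson_cdf_manual(2.5, 4)
print(manual_cdf)
```

For a mean of 2.5 and x = 4 the result should be roughly 0.89, in line with what `poisson.cdf` returns.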



# Q7

The binomial and Poisson distributions are two common probability distributions used to model counts of discrete events.

The main difference between the two distributions is in the way they are parameterized and the types of events they are used to model:

Parameterization:

- The binomial distribution has two parameters: n, the number of trials, and p, the probability of success for each trial.
- The Poisson distribution has one parameter: λ (lambda), which represents the average rate of occurrence of the event.

Types of events:

- The binomial distribution is used to model events that have a fixed number of trials, where each trial is independent and has the same probability of success. Examples include flipping a coin a fixed number of times, or counting the number of defective items in a sample of a fixed size.
- The Poisson distribution is used to model events where the number of occurrences in a fixed interval of time or space is of interest, such as the number of customers arriving at a store per hour, or the number of earthquakes per year in a certain region.

In terms of shape, the binomial distribution is symmetric when p = 0.5 and approximately bell-shaped when both np and n(1 − p) are large, but it is skewed when p is close to 0 or 1. The Poisson distribution is skewed to the right for small λ and becomes more symmetric as λ grows. A binomial count is also bounded above by n, whereas a Poisson count has no upper limit.
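The shape of each distribution can be quantified with its skewness, computed directly from the probability mass function. The sketch below is illustrative only: the parameter values are assumptions, and the Poisson PMF is truncated at a point where the remaining tail is negligible.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)

def skewness(pmf_values):
    """Skewness of a distribution given as [P(X=0), P(X=1), P(X=2), ...]."""
    mean = sum(k * pk for k, pk in enumerate(pmf_values))
    var = sum((k - mean) ** 2 * pk for k, pk in enumerate(pmf_values))
    third = sum((k - mean) ** 3 * pk for k, pk in enumerate(pmf_values))
    return third / var**1.5

skew_binom_fair = skewness([binomial_pmf(k, 10, 0.5) for k in range(11)])
skew_binom_rare = skewness([binomial_pmf(k, 10, 0.1) for k in range(11)])
skew_pois_small = skewness([poisson_pmf(k, 1.0) for k in range(60)])
skew_pois_large = skewness([poisson_pmf(k, 25.0) for k in range(100)])

print("Binomial(10, 0.5):", skew_binom_fair)
print("Binomial(10, 0.1):", skew_binom_rare)
print("Poisson(1):       ", skew_pois_small)
print("Poisson(25):      ", skew_pois_large)
```

A skewness near zero indicates a symmetric shape, and a positive value indicates a longer right tail. In this sketch the binomial with p = 0.5 is symmetric while the one with p = 0.1 is right-skewed, and the Poisson skewness shrinks as λ increases.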

# Q8

In [None]:
import numpy as np

# Set the parameters
mean = 5
n = 1000

# Generate the random sample
sample = np.random.poisson(mean, n)

# Calculate the sample mean and variance
sample_mean = np.mean(sample)
sample_var = np.var(sample, ddof=1)

# Print the results
print("Sample Mean:", sample_mean)
print("Sample Variance:", sample_var)


# Q9

In [None]:
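# In a binomial distribution with parameters n and p, the mean and variance are
# mean = n * p and variance = n * p * (1 - p).
n, p = 10, 0.5  # example values, taken from the coin-flip example in Q4
mean = n * p
variance = n * p * (1 - p)
print(mean, variance)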

In [None]:
# In a Poisson distribution with parameter λ, the mean and variance are both equal to λ.
lam = 5  # example value, matching the Poisson mean used in Q8
mean = lam
variance = lam
print(mean, variance)

In both distributions the variance is tied to the mean, but in different ways. In a Poisson distribution the variance equals the mean, so the two always rise and fall together. In a binomial distribution the variance is np(1 − p), which is the mean multiplied by (1 − p). It is therefore proportional to the mean only when p is held fixed, and it is smaller than the mean whenever p > 0.

The important difference is that the binomial variance depends on both n and p, whereas the Poisson variance is equal to the mean for every value of λ.
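The relationship can be checked by computing the mean and variance directly from each probability mass function. The sketch below is a minimal illustration under assumed parameters (n = 10, p = 0.5, λ = 5), chosen so that both distributions have the same mean. The helper `moments` is hypothetical, and the Poisson PMF is truncated where the tail is negligible.

```python
import math

def moments(pmf_values):
    """Mean and variance of a distribution given as [P(X=0), P(X=1), ...]."""
    mean = sum(k * pk for k, pk in enumerate(pmf_values))
    var = sum((k - mean) ** 2 * pk for k, pk in enumerate(pmf_values))
    return mean, var

n, p, lam = 10, 0.5, 5.0  # assumed example values; both means equal 5

binom = [math.comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
pois = [math.exp(-lam) * lam**k / math.factorial(k) for k in range(80)]

binom_mean, binom_var = moments(binom)
pois_mean, pois_var = moments(pois)

print("Binomial: mean =", binom_mean, "variance =", binom_var)
print("Poisson:  mean =", pois_mean, "variance =", pois_var)
```

With matching means, the binomial variance comes out smaller than the Poisson variance, reflecting the extra factor (1 − p).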

# Q10

In a normal distribution, the data is symmetric around the mean, and the least frequent values appear in the tails of the distribution, farthest from the mean in either direction.

The normal distribution is a bell-shaped curve with its peak at the mean. As we move away from the mean in either direction, the probability density decreases, forming the tails of the distribution. For example, about 99.7% of values fall within three standard deviations of the mean, so only about 0.3% lie beyond that range.
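To put numbers on how rare the tail values are, the sketch below computes the probability of falling more than k standard deviations from the mean for a standard normal distribution. It uses `statistics.NormalDist` from the standard library. The helper `two_sided_tail` and the choice of k = 1, 2, 3 are illustrative assumptions.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def two_sided_tail(k):
    """P(|Z| > k): probability of landing more than k standard deviations from the mean."""
    return 2 * (1 - standard_normal.cdf(k))

tails = {k: two_sided_tail(k) for k in (1, 2, 3)}

for k, prob in tails.items():
    print(f"P(|Z| > {k}) = {prob:.4f}")
```

The probabilities fall quickly as k grows, which is why values far out in the tails are the least frequent.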