Answer 1:
    
The probability density function (PDF) of a continuous random variable describes how probability is distributed over the variable's possible values. The value of the PDF at a point is a density, not a probability: for a continuous variable, the probability of any single exact value is zero. Probabilities come from integrating the PDF over a range, so the area under the curve between a and b gives P(a ≤ X ≤ b). A PDF is non-negative everywhere and its total area is 1.
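
To make the difference between a density and a probability concrete, here is a minimal sketch that approximates P(a ≤ X ≤ b) as the area under a PDF. The choice of the standard normal density, the helper names `standard_normal_pdf` and `probability_between`, and the use of the trapezoidal rule are all assumptions made for illustration.

```python
import math

def standard_normal_pdf(x):
    """Density of the standard normal distribution (mean 0, standard deviation 1)."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

def probability_between(pdf, a, b, steps=10_000):
    """Approximate P(a <= X <= b) as the area under pdf, using the trapezoidal rule."""
    width = (b - a) / steps
    total = 0.5 * (pdf(a) + pdf(b))
    for i in range(1, steps):
        total += pdf(a + i * width)
    return total * width

# The density at a point is not a probability; the area over an interval is.
density_at_zero = standard_normal_pdf(0.0)
prob_within_one = probability_between(standard_normal_pdf, -1.0, 1.0)
print("Density at x = 0:", density_at_zero)
print("P(-1 <= X <= 1):", prob_within_one)
```

The density at 0 is only a height of the curve, while the integral over [-1, 1] is an actual probability.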




Answer 2:


Probability distributions are mathematical functions that describe the likelihood of observing different outcomes in a random experiment or process. There are many types of probability distributions, each with its own characteristics and applications. Here are some of the main types of probability distributions:

Discrete Probability Distributions:

Bernoulli Distribution: Models a single trial with two possible outcomes (success or failure), often used for binary events.
Binomial Distribution: Models the number of successes in a fixed number of independent Bernoulli trials.
Poisson Distribution: Models the number of events occurring in a fixed interval of time or space, given the average rate of occurrence.

Continuous Probability Distributions:

Normal Distribution (Gaussian Distribution): One of the most widely used distributions, characterized by a bell-shaped curve that is symmetric about its mean. Many natural phenomena are approximately normal, which the central limit theorem helps explain: sums or averages of many independent random effects tend toward a normal distribution.
Uniform Distribution: All outcomes are equally likely within a specified range, resulting in a rectangular-shaped distribution.
Exponential Distribution: Models the time between events in a Poisson process, such as the time between arrivals of customers at a service counter.
Gamma Distribution: Generalization of the exponential distribution, often used to model waiting times or durations until an event occurs.
Beta Distribution: Used to model random variables that represent proportions or probabilities, such as the success rate of a product or the probability of an event occurring.

Multivariate Probability Distributions:

Multinomial Distribution: Generalization of the binomial distribution to more than two categories.
Multivariate Normal Distribution: Generalization of the normal distribution to multiple dimensions, used to model correlated random variables.

Other Discrete Distributions:

Geometric Distribution: Models the number of trials needed to achieve the first success in a sequence of independent Bernoulli trials.
Hypergeometric Distribution: Models the number of successes in a sample drawn without replacement from a finite population of two types of items.
Negative Binomial Distribution: Generalization of the geometric distribution that models the number of failures before a specified number of successes occurs.
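
As a rough illustration, the sketch below draws samples from a few of the distributions listed above with NumPy and compares each sample mean to the theoretical mean. The parameter values, the seed, and the sample size are arbitrary assumptions chosen for the example.

```python
import numpy as np

rng = np.random.default_rng(seed=0)  # arbitrary seed, only for reproducibility
size = 100_000

# Each entry maps a label to (sample, theoretical mean).
checks = {
    "binomial(n=10, p=0.3)": (rng.binomial(n=10, p=0.3, size=size), 10 * 0.3),
    "poisson(lam=4)": (rng.poisson(lam=4, size=size), 4.0),
    "exponential(scale=2)": (rng.exponential(scale=2.0, size=size), 2.0),
    "uniform(0, 1)": (rng.uniform(0.0, 1.0, size=size), 0.5),
    # NumPy's geometric counts the trials up to and including the first success.
    "geometric(p=0.25)": (rng.geometric(p=0.25, size=size), 1 / 0.25),
}

for name, (sample, expected_mean) in checks.items():
    print(f"{name}: sample mean = {sample.mean():.3f}, theoretical mean = {expected_mean:.3f}")
```

With a sample this large, each sample mean should land close to its theoretical value, though the exact figures depend on the seed.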





Answer 3:
    
import math

def normal_pdf(x, mu, sigma):
    """
    Calculate the probability density function (PDF) of a normal distribution
    at a given point x with mean mu and standard deviation sigma.
    
    Parameters:
    - x: The point at which to evaluate the PDF.
    - mu: The mean of the normal distribution.
    - sigma: The standard deviation of the normal distribution.
    
    Returns:
    - pdf: The value of the PDF at the given point x.
    """
    # Calculate the exponential term in the PDF formula
    exp_term = math.exp(-((x - mu) ** 2) / (2 * sigma ** 2))
    
    # Calculate the coefficient term in the PDF formula
    coeff_term = 1 / (math.sqrt(2 * math.pi) * sigma)
    
    # Calculate the PDF value
    pdf = coeff_term * exp_term
    
    return pdf

# Example usage:
mean = 0  # Mean of the normal distribution
std_dev = 1  # Standard deviation of the normal distribution
point = 1  # Point at which to evaluate the PDF

pdf_value = normal_pdf(point, mean, std_dev)
print("PDF value at x =", point, ":", pdf_value)





Answer 4:
    

The binomial distribution is a discrete probability distribution that describes the number of successes in a fixed number of independent Bernoulli trials, where each trial has only two possible outcomes (usually labeled as success and failure) and the probability of success remains constant across trials. The properties of the binomial distribution include:

Fixed Number of Trials: The binomial distribution is defined for a fixed number of independent trials, denoted as n.

Two Possible Outcomes: Each trial in the binomial distribution has only two possible outcomes: success and failure.

Constant Probability of Success: The probability of success, denoted as p, remains constant across all trials.

Independent Trials: The outcome of one trial does not affect the outcome of any other trial. The trials are assumed to be independent.

Discrete Distribution: The binomial distribution is a discrete probability distribution, meaning it is defined for discrete values of the random variable representing the number of successes.
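
These properties lead to the probability mass function P(X = k) = C(n, k) p^k (1 - p)^(n - k). The following minimal sketch evaluates it and checks numerically that the probabilities sum to 1 and that the mean and variance match n·p and n·p·(1 - p). The helper name `binomial_pmf` and the values n = 10, p = 0.4 are illustrative assumptions.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p): C(n, k) * p^k * (1 - p)^(n - k)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

n, p = 10, 0.4  # illustrative values
pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

total = sum(pmf)
mean = sum(k * prob for k, prob in enumerate(pmf))
variance = sum((k - mean) ** 2 * prob for k, prob in enumerate(pmf))

print("Sum of probabilities:", total)
print("Mean:", mean, "| n * p =", n * p)
print("Variance:", variance, "| n * p * (1 - p) =", n * p * (1 - p))
```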




Answer 5:
    
import numpy as np
import matplotlib.pyplot as plt

# Parameters
sample_size = 1000  # Number of draws in the sample
n_trials = 1        # Trials per draw (n = 1 makes each draw a single Bernoulli trial)
p = 0.4             # Probability of success

# Generate random sample from binomial distribution
random_sample = np.random.binomial(n=n_trials, p=p, size=sample_size)

# Plot histogram, with one bin centered on each outcome (0 and 1)
plt.figure(figsize=(8, 6))
plt.hist(random_sample, bins=[-0.5, 0.5, 1.5], color='skyblue', edgecolor='black', alpha=0.7)
plt.xlabel('Outcome (0 = Failure, 1 = Success)')
plt.ylabel('Frequency')
plt.title('Histogram of Random Sample from Binomial Distribution (n = 1, p = 0.4)')
plt.xticks([0, 1], ['Failure', 'Success'])
plt.grid(axis='y', linestyle='--', alpha=0.7)
plt.show()






Answer 6:

import math

def poisson_cdf(k, lam):
    """
    Calculate the cumulative distribution function (CDF) of a Poisson distribution
    at a given point k with mean lam.
    
    Parameters:
    - k: The point at which to evaluate the CDF.
    - lam: The mean of the Poisson distribution.
    
    Returns:
    - cdf: The cumulative distribution function value at the given point k.
    """
    # Initialize the cumulative probability
    cdf = 0
    
    # Calculate the cumulative probability using the Poisson distribution formula
    for i in range(k + 1):
        cdf += (math.exp(-lam) * (lam ** i)) / math.factorial(i)
    
    return cdf

# Example usage:
mean = 2.5  # Mean of the Poisson distribution
point = 3  # Point at which to evaluate the CDF

cdf_value = poisson_cdf(point, mean)
print("CDF value at k =", point, ":", cdf_value)





Answer 7:
    

The binomial distribution and the Poisson distribution are both probability distributions used to model the number of events occurring within a fixed interval or a fixed number of trials. However, they differ in terms of their underlying assumptions, characteristics, and applications. Here are the main differences between the binomial and Poisson distributions:

Nature of Events:

Binomial Distribution: The binomial distribution is used to model the number of successes (or failures) in a fixed number of independent Bernoulli trials, where each trial has only two possible outcomes (success or failure).
Poisson Distribution: The Poisson distribution is used to model the number of events occurring in a fixed interval of time or space, given the average rate of occurrence, without specifying the number of trials.

Number of Trials:

Binomial Distribution: The binomial distribution is defined for a fixed number of independent trials, denoted as n.
Poisson Distribution: The Poisson distribution does not involve a fixed number of trials. It models events occurring over a continuous interval, such as time or space.

Probability of Success:

Binomial Distribution: In the binomial distribution, the probability of success, denoted as p, remains constant across all trials.
Poisson Distribution: The Poisson distribution has no per-trial probability of success. Instead, the rate parameter λ represents the average number of events in the interval, and it equals both the mean and the variance of the distribution.

Assumptions:

Binomial Distribution: The binomial distribution assumes that the trials are independent, and the probability of success is constant across all trials.
Poisson Distribution: The Poisson distribution assumes that events occur randomly in time or space, and the occurrence of one event does not affect the occurrence of another event.

Applicability:

Binomial Distribution: The binomial distribution is applicable when there is a fixed number of trials and each trial has only two possible outcomes.
Poisson Distribution: The Poisson distribution is applicable when events occur randomly and independently over a continuous interval, and the average rate of occurrence is known.
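
The two distributions are also closely related: when n is large and p is small, a binomial distribution is well approximated by a Poisson distribution with λ = n·p. The sketch below compares the two probability mass functions under that assumption. The helper names and the values n = 1000, p = 0.003 are illustrative choices.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for X ~ Binomial(n, p)."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """P(X = k) for X ~ Poisson(lam)."""
    return math.exp(-lam) * lam**k / math.factorial(k)

n, p = 1000, 0.003  # many trials, small success probability
lam = n * p         # matching Poisson rate

max_diff = 0.0
for k in range(11):
    b = binomial_pmf(k, n, p)
    q = poisson_pmf(k, lam)
    max_diff = max(max_diff, abs(b - q))
    print(f"k = {k:2d}: binomial = {b:.5f}, poisson = {q:.5f}")

print("Largest absolute difference:", max_diff)
```

The approximation degrades as p grows, so the comparison is only meaningful in the large-n, small-p regime.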




Answer 8:
    

To generate a random sample of size 1000 from a Poisson distribution with mean λ = 5 and calculate the sample mean and variance, you can use NumPy, which provides functions for random number generation and statistical calculations. Here's how you can do it in Python:

import numpy as np

# Parameters
sample_size = 1000
mean = 5

# Generate random sample from Poisson distribution
random_sample = np.random.poisson(mean, size=sample_size)

# Calculate sample mean and unbiased sample variance (ddof=1 divides by n - 1)
sample_mean = np.mean(random_sample)
sample_variance = np.var(random_sample, ddof=1)

# Print results
print("Sample Mean:", sample_mean)
print("Sample Variance:", sample_variance)





Answer 10:
    

In a normal distribution, the least frequent data points appear farthest from the mean. Since the normal distribution is symmetric around its mean, the least frequent data points are located in the tails of the distribution, farthest from the center.

To be more precise, in a normal distribution:

Approximately 68% of the data falls within one standard deviation (σ) of the mean.
Approximately 95% of the data falls within two standard deviations (2σ) of the mean.
Approximately 99.7% of the data falls within three standard deviations (3σ) of the mean.
Only about 0.3% of the data lies beyond three standard deviations from the mean, and values become increasingly rare the farther out they lie.

Therefore, the least frequent data points in a normal distribution are located in the tails, more than three standard deviations away from the mean, both to the left and right of the mean.
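
The percentages above can be checked numerically. For a normal random variable, the probability of falling within k standard deviations of the mean is erf(k / √2). The following minimal sketch evaluates that expression with Python's standard library. The helper name `prob_within` is an assumption made for illustration.

```python
import math

def prob_within(k):
    """P(|X - mu| <= k * sigma) for a normal random variable, via the error function."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    inside = prob_within(k)
    print(f"Within {k} sigma: {inside:.4%}, outside: {1 - inside:.4%}")
```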
