Q1. **What is the Probability density function?**

The probability density function (PDF) of a continuous random variable describes the relative likelihood of the variable taking values near a given point. Its value at a point is a density, not a probability: for a continuous variable, the probability of any single exact value is zero. Probabilities come from integrating the PDF over a range, \( P(a \le X \le b) = \int_a^b f(x)\,dx \). The PDF is non-negative and integrates to 1 over its whole domain.
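
As a minimal sketch of the "area under the curve" idea, the snippet below integrates a PDF numerically and compares the result with the difference of CDF values. The standard normal distribution and the interval [-1, 1] are assumptions chosen for illustration only.

```python
from scipy.integrate import quad
from scipy.stats import norm

# Assumed example: standard normal distribution, interval [-1, 1]
a, b = -1.0, 1.0

# Probability of a <= X <= b as the area under the PDF between a and b
prob_by_integration, _ = quad(norm.pdf, a, b)

# The same probability from the CDF: F(b) - F(a)
prob_by_cdf = norm.cdf(b) - norm.cdf(a)

# The PDF integrates to 1 over the whole real line
total_area, _ = quad(norm.pdf, float("-inf"), float("inf"))

print("P(a <= X <= b) by integration:", prob_by_integration)
print("P(a <= X <= b) by CDF:", prob_by_cdf)
print("Total area under the PDF:", total_area)
```

Both approaches should agree at roughly 0.68, the familiar share of a normal distribution within one standard deviation of the mean.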

Q2. **What are the types of Probability distribution?**

There are several types of probability distributions, including:

- **Normal Distribution**: Describes a symmetrical, bell-shaped curve.
- **Binomial Distribution**: Models the number of successes in a fixed number of independent Bernoulli trials.
- **Poisson Distribution**: Models the number of events occurring in a fixed interval of time or space.
- **Uniform Distribution**: All outcomes are equally likely.
- **Exponential Distribution**: Describes the time between events in a Poisson process.
- **Chi-Square Distribution**: Arises in tests of statistical significance.

Q3. **Write a Python function to calculate the probability density function of a normal distribution with given mean and standard deviation at a given point.**

```python
import numpy as np

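def normal_pdf(x, mean, std_dev):
    """Return the normal PDF with the given mean and standard deviation (> 0) at x."""
    # Normalizing constant: 1 / (sigma * sqrt(2 * pi))
    coefficient = 1 / (std_dev * np.sqrt(2 * np.pi))
    # Exponent: -(x - mu)^2 / (2 * sigma^2)
    exponent = -0.5 * ((x - mean) / std_dev) ** 2
    return coefficient * np.exp(exponent)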
```

Q4. **What are the properties of Binomial distribution? Give two examples of events where binomial distribution can be applied.**

Properties of the Binomial distribution:

- It models the number of successes (e.g., heads in coin flips, successes in trials) out of a fixed number of independent Bernoulli trials.
- It has two parameters: the number of trials (n) and the probability of success on each trial (p).
- The mean of a Binomial distribution is \( \mu = np \) and the variance is \( \sigma^2 = np(1-p) \).

Examples of events where Binomial distribution can be applied:
1. Tossing a coin multiple times and counting the number of heads.
2. Conducting a series of medical tests and counting the number of positive results.

Q5. **Generate a random sample of size 1000 from a binomial distribution with probability of success 0.4 and plot a histogram of the results using matplotlib.**

```python
import numpy as np
import matplotlib.pyplot as plt

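# The question does not specify the number of trials; n = 10 is an assumption.
# (n = 1 would reduce the sample to Bernoulli 0/1 outcomes.)
n_trials = 10

# Generate random sample
sample = np.random.binomial(n=n_trials, p=0.4, size=1000)

# Plot histogram with unit-width bins centered on the integers 0..n, so that
# density=True yields relative frequencies rather than rescaled densities
bins = np.arange(-0.5, n_trials + 1.5, 1)
plt.hist(sample, bins=bins, density=True, alpha=0.7)
plt.xlabel('Number of successes')
plt.ylabel('Relative frequency')
plt.title('Histogram of Binomial Distribution (n=10, p=0.4)')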
plt.show()
```

Q6. **Write a Python function to calculate the cumulative distribution function of a Poisson distribution with given mean at a given point.**

```python
from scipy.stats import poisson

def poisson_cdf(x, mean):
    return poisson.cdf(x, mu=mean)
```

Q7. **How is Binomial distribution different from Poisson distribution?**

The main differences between Binomial and Poisson distributions are:

- **Number of Trials**: A Binomial distribution counts successes in a fixed, finite number of trials (n), so its values range from 0 to n. A Poisson distribution has no fixed number of trials; it counts events in an interval, and the count has no upper bound.
- **Probability of Success**: In a Binomial distribution, each trial has a constant probability of success (p). In a Poisson distribution, events occur independently at a constant average rate, so the expected number of events is proportional to the length of the interval.
- **Shape**: A Binomial distribution is symmetric when \( p = 0.5 \) and approximately symmetric when \( np(1-p) \) is large; it is skewed to the right for \( p < 0.5 \) and to the left for \( p > 0.5 \). A Poisson distribution is always skewed to the right, with the skew shrinking as \( \lambda \) grows.
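
The shape difference can be checked numerically. The sketch below uses the `stats` method of `scipy.stats.binom` and `scipy.stats.poisson` to read off the skewness; the parameter values (n = 20, p = 0.5 and 0.1, and means of 1, 5 and 25) are assumptions chosen for illustration.

```python
from scipy.stats import binom, poisson

# Assumed parameters for illustration
n = 20

# Binomial skewness is zero at p = 0.5 and positive (right skew) for small p
binom_skew_symmetric = float(binom.stats(n, 0.5, moments="s"))
binom_skew_small_p = float(binom.stats(n, 0.1, moments="s"))

# Poisson skewness is always positive and shrinks as the mean grows
poisson_skew = {lam: float(poisson.stats(lam, moments="s")) for lam in (1, 5, 25)}

print("Binomial skewness, p = 0.5:", binom_skew_symmetric)
print("Binomial skewness, p = 0.1:", binom_skew_small_p)
for lam, skew in poisson_skew.items():
    print(f"Poisson skewness, mean = {lam}:", skew)
```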

Q8. **Generate a random sample of size 1000 from a Poisson distribution with mean 5 and calculate the sample mean and variance.**

```python
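import numpy as np

# Draw 1000 values from a Poisson distribution with mean 5
sample = np.random.poisson(lam=5, size=1000)
sample_mean = np.mean(sample)
# ddof=1 gives the unbiased sample variance (np.var defaults to ddof=0)
sample_variance = np.var(sample, ddof=1)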
print("Sample Mean:", sample_mean)
print("Sample Variance:", sample_variance)
```

Q9. **How are mean and variance related in Binomial distribution and Poisson distribution?**

In a Binomial distribution, the mean is \( \mu = np \) and the variance is \( \sigma^2 = np(1-p) \), so \( \sigma^2 = \mu(1-p) \). Because \( 0 \le 1-p \le 1 \), the variance never exceeds the mean.

In a Poisson distribution, the mean and variance are equal: \( \mu = \sigma^2 = \lambda \).
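
These relationships can be confirmed with the theoretical moments reported by SciPy. This is a small sketch; the values n = 20, p = 0.3 and a Poisson mean of 4 are assumptions chosen for illustration.

```python
from scipy.stats import binom, poisson

# Assumed parameters for illustration
n, p = 20, 0.3
lam = 4.0

# Theoretical mean and variance of each distribution
binom_mean, binom_var = (float(v) for v in binom.stats(n, p, moments="mv"))
poisson_mean, poisson_var = (float(v) for v in poisson.stats(lam, moments="mv"))

# Binomial: variance = mean * (1 - p), so variance <= mean
print("Binomial mean, variance:", binom_mean, binom_var)
print("mean * (1 - p):", binom_mean * (1 - p))

# Poisson: variance = mean
print("Poisson mean, variance:", poisson_mean, poisson_var)
```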

Q10. **In normal distribution with respect to mean position, where does the least frequent data appear?**

In a normal distribution, the least frequent data appears at the tails, which are the extreme ends of the distribution. As you move away from the mean in either direction, the frequency of data points decreases.
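
To illustrate, the sketch below evaluates the normal density at the mean and at 1 to 4 standard deviations on either side of it. The standard normal parameters are an assumption; any mean and positive standard deviation show the same pattern.

```python
from scipy.stats import norm

# Assumed parameters for illustration
mean, std_dev = 0.0, 1.0
offsets = [0, 1, 2, 3, 4]  # distance from the mean, in standard deviations

# Density to the right and to the left of the mean
right = [norm.pdf(mean + k * std_dev, loc=mean, scale=std_dev) for k in offsets]
left = [norm.pdf(mean - k * std_dev, loc=mean, scale=std_dev) for k in offsets]

for k, r, l in zip(offsets, right, left):
    print(f"{k} std dev from mean: density right = {r:.6f}, left = {l:.6f}")
```

The density is highest at the mean, identical on both sides, and falls steadily toward the tails.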

Data can be categorized into different types based on their nature and characteristics. The main types of data are:

1. Nominal Data:
Nominal data consist of categories or labels with no inherent order or ranking. They are qualitative: categories may be coded with numbers for convenience, but those codes are arbitrary and do not represent any value or order.

Example: 
Gender (male, female, other), marital status (single, married, divorced), eye color (blue, brown, green) are examples of nominal data. Each category represents a distinct group, but there is no inherent order or numerical significance to the categories.

2. Ordinal Data:
Ordinal data represent categories with a meaningful order or ranking but do not have a consistent unit of measurement or equal intervals between categories. While there is a ranking, the differences between the categories may not be equal or quantifiable.

Example:
Educational attainment (high school diploma, bachelor's degree, master's degree, PhD) is an example of ordinal data. While there is a clear ranking in terms of educational achievement, the difference between each level of education is not necessarily uniform or quantifiable.

3. Interval Data:
Interval data are numeric values with a meaningful order, where the intervals between values are equal and measurable. However, interval data do not have a true zero point: zero does not represent the absence of the measured attribute but is simply a point on the scale, so ratios between values are not meaningful.

Example:
Temperature measured in Celsius or Fahrenheit is an example of interval data. The difference between 20°C and 30°C is the same as the difference between 30°C and 40°C, but zero degrees does not represent the absence of temperature.

4. Ratio Data:
Ratio data are similar to interval data but have a true zero point, which represents the absence of the measured attribute. Ratio data have a meaningful order, equal intervals between values, and a true zero point, so ratios and the full range of arithmetic operations are meaningful.

Example:
Height, weight, and age are examples of ratio data. For example, if someone's height is twice another person's height, they are truly twice as tall. Similarly, a weight of zero kilograms means there is no weight at all.
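
The distinctions above can be sketched in code. The example below is illustrative only: it assumes pandas is available, uses an ordered `pd.Categorical` to represent the ordinal education levels, and uses made-up temperatures of 20°C and 40°C to show why ratios on an interval scale are misleading.

```python
import pandas as pd

# Ordinal data: categories with an explicit order
levels = ["high school diploma", "bachelor's degree", "master's degree", "PhD"]
education = pd.Categorical(
    ["PhD", "high school diploma", "master's degree"],
    categories=levels,
    ordered=True,
)
highest = education.max()                            # order makes max meaningful
above_bachelors = education > "bachelor's degree"    # order-based comparison

# Interval vs. ratio data: Celsius has an arbitrary zero, Kelvin a true zero
temp_a_c, temp_b_c = 20.0, 40.0                      # assumed example values
celsius_ratio = temp_b_c / temp_a_c                  # 2.0, but not "twice as hot"
kelvin_ratio = (temp_b_c + 273.15) / (temp_a_c + 273.15)

print("Highest education level:", highest)
print("Above bachelor's degree:", list(above_bachelors))
print("Ratio in Celsius:", celsius_ratio)
print("Ratio in Kelvin:", kelvin_ratio)
```

The Celsius ratio is 2.0 while the Kelvin ratio is only slightly above 1, which is why ratio statements belong to scales with a true zero.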

In summary, the main types of data are nominal, ordinal, interval, and ratio data, each differing in terms of the nature of categories, the presence of order or ranking, and the presence of a true zero point. Understanding the type of data being analyzed is crucial for selecting appropriate statistical methods and interpreting results accurately.