
## Binomial Distribution
<center>
    <img src="https://statisticsbyjim.com/wp-content/uploads/2022/10/negative_binomial_distribution_plot.png" alt="Probability distribution plot" width="auto">
</center>

### Introduction

The **binomial distribution** is a discrete probability distribution that models the number of successes in a fixed number of independent trials, each with the same probability of success. It applies whenever each trial has only two possible outcomes, such as success/failure, true/false, or yes/no. Because it is simple and widely applicable, it is used throughout economics, biology, engineering, and data science.

### Definition

The binomial distribution gives the probability of obtaining exactly $k$ successes in $n$ independent trials, where each trial has only two outcomes: success (with probability $p$) and failure (with probability $1 - p$).

The probability mass function (PMF) of the binomial distribution is defined as:

$$ 
P(X = k) = \binom{n}{k} p^k (1 - p)^{n-k}
$$ 

Where:
- $P(X = k)$: Probability of getting exactly $k$ successes.
- $n$: Number of trials, a positive integer.
- $k$: Number of successes, an integer with $0 \le k \le n$.
- $p$: Probability of success in each trial, with $0 \le p \le 1$.
- $1 - p$: Probability of failure in each trial.
- $\binom{n}{k}$: The binomial coefficient, representing the number of ways to choose which $k$ of the $n$ trials are successes, calculated as $\frac{n!}{k!(n-k)!}$.
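The PMF translates almost directly into code. The following is a minimal sketch in Python using only the standard library; the function name `binomial_pmf` and its argument order are illustrative choices, not part of any established API.

```python
from math import comb


def binomial_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n independent trials.

    Hypothetical helper for illustration: each trial succeeds with
    probability p, matching the PMF defined above.
    """
    if n < 0:
        raise ValueError("n must be a non-negative integer")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    if k < 0 or k > n:
        return 0.0  # outside the support {0, 1, ..., n}
    # comb(n, k) is the binomial coefficient n! / (k! (n - k)!)
    return comb(n, k) * p**k * (1 - p) ** (n - k)


# Three fair coin flips: P(exactly 1 head) = 3 * 0.5 * 0.25
print(binomial_pmf(1, 3, 0.5))  # 0.375
```

Returning 0 for values of $k$ outside $0, \dots, n$ is a design choice made here so the function can be summed over any range of integers.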

### Formula

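The general formula for the binomial distribution is the PMF given in the Definition section, valid for every possible number of successes:

$$
P(X = k) = \binom{n}{k} p^k (1 - p)^{n-k}, \quad k = 0, 1, \dots, n
$$

It gives the probability of observing $k$ successes in $n$ trials when each trial has a success probability of $p$. By the binomial theorem, these probabilities sum to 1 over $k = 0, 1, \dots, n$.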

Some key quantities associated with the binomial distribution include:
- **Mean** ($\mu$): $\mu = np$
- **Variance** ($\sigma^2$): $\sigma^2 = np(1 - p)$
- **Standard Deviation** ($\sigma$): $\sigma = \sqrt{np(1 - p)}$
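These closed forms can be checked numerically by computing the mean and variance directly from the PMF. The sketch below is one way to do that in Python; the helper name `binomial_moments` and the values $n = 20$, $p = 0.3$ are arbitrary choices made for illustration.

```python
from math import comb, sqrt


def binomial_moments(n: int, p: float) -> tuple[float, float, float]:
    """Mean, variance and standard deviation computed from the PMF itself."""
    # PMF values for k = 0, 1, ..., n
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    mean = sum(k * q for k, q in enumerate(pmf))
    variance = sum((k - mean) ** 2 * q for k, q in enumerate(pmf))
    return mean, variance, sqrt(variance)


n, p = 20, 0.3  # illustrative values, not taken from the text
mean, variance, std = binomial_moments(n, p)

# Each pair should agree up to floating-point rounding
print(mean, n * p)
print(variance, n * p * (1 - p))
print(std, sqrt(n * p * (1 - p)))
```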

### Key Characteristics

- **Discrete Distribution**: The number of successes takes only the integer values $0, 1, \dots, n$.
- **Fixed Number of Trials**: The number of trials ($n$) is fixed in advance.
- **Binary Outcomes**: Each trial results in either a success or a failure.
- **Independence**: The outcome of each trial is independent of the others.
- **Constant Probability**: The probability of success ($p$) remains constant across all trials.

### When to Use Binomial Distribution

The binomial distribution is used when the following conditions are met:
1. **Binary outcomes**: There are only two possible outcomes for each trial.
2. **Fixed number of trials**: The number of trials ($n$) is predetermined.
3. **Constant probability**: The probability of success ($p$) remains the same for all trials.
4. **Independence**: Each trial is independent of the others.

This distribution is commonly used in situations where we are interested in counting the number of successes in a sequence of independent binary trials.
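When these conditions hold, repeating the experiment many times should produce success counts whose frequencies match the PMF. The following simulation is a rough sketch of that idea; the parameters ($n = 10$, $p = 0.3$, 20,000 repetitions), the seed, and the helper name `simulate_successes` are assumptions chosen only for illustration.

```python
import random
from math import comb


def simulate_successes(n: int, p: float, rng: random.Random) -> int:
    """Run n independent trials with success probability p; count successes."""
    return sum(rng.random() < p for _ in range(n))


rng = random.Random(0)  # fixed seed so the run is repeatable
n, p, runs = 10, 0.3, 20_000

# counts[k] = number of simulated experiments with exactly k successes
counts = [0] * (n + 1)
for _ in range(runs):
    counts[simulate_successes(n, p, rng)] += 1

for k in range(n + 1):
    empirical = counts[k] / runs
    exact = comb(n, k) * p**k * (1 - p) ** (n - k)
    print(f"k={k:2d}  simulated={empirical:.4f}  exact={exact:.4f}")
```

The simulated and exact columns should be close but not identical; the gap shrinks as the number of repetitions grows.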

### Real-World Applications

The binomial distribution is applied in a wide range of real-world problems:
- **Business**: Predicting the number of successful sales calls in a given number of attempts.
- **Medicine**: Estimating the number of patients who will recover after a specific treatment.
- **Manufacturing**: Assessing the number of defective items in a batch of products.
- **Economics**: Analyzing voter behavior in elections where individuals either vote for or against a candidate.
- **Data Science**: Modeling the success rate of email campaigns or A/B testing outcomes.

### Problem Example

**Problem:**

Suppose a factory produces light bulbs, each of which is defective with probability $0.05$, independently of the others. If a random sample of 10 light bulbs is selected, what is the probability that exactly 2 bulbs will be defective?

**Solution:**

Here, we have:
- $n = 10$ (number of light bulbs tested),
- $p = 0.05$ (probability of a defective bulb),
- $k = 2$ (we want the probability of exactly 2 defective bulbs).

Using the binomial probability formula:

$$ 
P(X = 2) = \binom{10}{2} (0.05)^2 (1 - 0.05)^{8}
$$ 

First, calculate $  \binom{10}{2} $ :
$$ 
\binom{10}{2} = \frac{10!}{2!(10 - 2)!} = \frac{10 \times 9}{2} = 45
$$ 

Now, calculate $P(X = 2)$:
$$
P(X = 2) = 45 \times (0.05)^2 \times (0.95)^8
$$
$$
P(X = 2) \approx 45 \times 0.0025 \times 0.6634 \approx 0.0746
$$

Thus, the probability that exactly 2 light bulbs will be defective is approximately $0.0746$, or about 7.46%.
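As a cross-check on the hand calculation, the same expression can be evaluated numerically. This is a small Python sketch using only the standard library, with variable names chosen to mirror the symbols in the problem.

```python
from math import comb

n, k, p = 10, 2, 0.05  # sample size, defective count, defect probability

coefficient = comb(n, k)  # number of ways to pick the 2 defective bulbs
prob = coefficient * p**k * (1 - p) ** (n - k)

print(coefficient)     # 45
print(round(prob, 4))  # 0.0746
```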

### Conclusion

The binomial distribution models the number of successes in a fixed number of independent binary trials with a constant success probability. It applies to practical problems in quality control, medical studies, and business decisions. Knowing its formula, its characteristics, and the conditions under which it holds lets us estimate probabilities in real-world scenarios, which makes it a core tool in statistics and data-driven decision-making.