## Bayes' Theorem

<center>
    <img src="https://upload.wikimedia.org/wikipedia/commons/thumb/c/c9/Bayes_theorem_visual_proof.svg/512px-Bayes_theorem_visual_proof.svg.png" alt="Visual proof of Bayes' theorem" width="auto">
</center>

### Introduction
**Bayes' Theorem** is a result in probability theory that tells us how to update our beliefs in light of new evidence. Named after the 18th-century English statistician Thomas Bayes, it is widely applied in fields such as medicine, finance, and data science to refine predictions and make decisions under uncertainty.

In simple terms, Bayes' Theorem answers the question: "Given what I know now, how likely is something to be true?" It balances prior knowledge with new evidence to update probabilities, making it particularly useful for dealing with uncertain or incomplete data.

### The Basics of Bayes’ Theorem

Mathematically, Bayes' Theorem is expressed as:

$$
P(H|E) = \frac{P(E|H) \cdot P(H)}{P(E)}
$$

Where:
- $P(H|E)$ is the **posterior probability**: the updated probability of a hypothesis $H$ being true after considering new evidence $E$.
- $P(E|H)$ is the **likelihood**: the probability of observing the evidence $E$, assuming the hypothesis $H$ is true.
- $P(H)$ is the **prior probability**: the initial probability of the hypothesis before seeing the new evidence.
- $P(E)$ is the **marginal likelihood**: the total probability of observing the evidence, regardless of the hypothesis.
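
When $P(E)$ is not given directly, it can usually be obtained from the law of total probability. For a hypothesis that is either true or false, and writing $\neg H$ for "the hypothesis is false" (a symbol introduced here for illustration), the denominator expands to:

$$
P(E) = P(E|H) \cdot P(H) + P(E|\neg H) \cdot P(\neg H)
$$

Substituting this into the formula above shows that the posterior depends only on the prior and on how likely the evidence is under each alternative.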

### Key Concepts: Conditional, Joint, and Marginal Probabilities

Before deriving Bayes' Theorem, it's important to understand three foundational concepts: **conditional probability**, **joint probability**, and **marginal probability**.

#### 1. **Conditional Probability**
Conditional probability represents the probability of an event occurring, given that another event has already occurred. For example, let's consider the probability of a person having long hair given that they are a woman:

$$
P(\text{Long Hair | Woman}) = \frac{P(\text{Woman and Long Hair})}{P(\text{Woman})}
$$

If 60% of women have long hair, the conditional probability $P(\text{Long Hair | Woman})$ would be 0.6. This is different from $P(\text{Woman | Long Hair})$, which represents the probability that a person is a woman, given that they have long hair.
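
The difference between the two directions is easiest to see with counts. The sketch below assumes a hypothetical group of 1,000 people, half of them women. The count of long-haired men is an assumed figure chosen purely for illustration.

```python
# Hypothetical counts for a group of 1,000 people (illustrative only).
women = 500
women_long_hair = 300  # 60% of women
men_long_hair = 100    # assumed figure, not derived from the text above

# P(Long Hair | Woman): restrict attention to women.
p_long_given_woman = women_long_hair / women

# P(Woman | Long Hair): restrict attention to people with long hair.
long_hair_total = women_long_hair + men_long_hair
p_woman_given_long = women_long_hair / long_hair_total

print(p_long_given_woman)  # 0.6
print(p_woman_given_long)  # 0.75
```

The numerator is the same in both cases. Only the group we condition on, and therefore the denominator, changes.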

#### 2. **Joint Probability**
Joint probability refers to the probability of two events happening together. For example, the probability of a person being a woman and having long hair is:

$$
P(\text{Woman and Long Hair}) = P(\text{Woman}) \times P(\text{Long Hair | Woman})
$$

If 50% of people are women and 60% of women have long hair, the joint probability would be:

$$
P(\text{Woman and Long Hair}) = 0.5 \times 0.6 = 0.3
$$

This means 30% of the population are women with long hair.

#### 3. **Marginal Probability**
Marginal probability refers to the overall probability of an event occurring, regardless of other conditions. In our example, the marginal probability of having long hair, considering both men and women, would be:

$$
P(\text{Long Hair}) = P(\text{Woman and Long Hair}) + P(\text{Man and Long Hair})
$$

If 30% of people are women with long hair and 10% are men with long hair, then:

$$
P(\text{Long Hair}) = 0.3 + 0.1 = 0.4
$$

This means 40% of the population has long hair.
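
As a rough sketch, the same marginalisation can be done by summing over a joint probability table. The "short" entries below are assumptions filled in so that the table sums to 1, taking the population to be half women and half men. The helper name `marginal` is hypothetical.

```python
# Joint probabilities P(gender and hair length).
# The two "long" entries come from the example above; the "short" entries
# are assumed so that each gender accounts for 0.5 of the population.
joint = {
    ("woman", "long"): 0.30,
    ("woman", "short"): 0.20,
    ("man", "long"): 0.10,
    ("man", "short"): 0.40,
}


def marginal(table, position, value):
    """Sum every joint entry whose key matches `value` at `position`."""
    return sum(p for key, p in table.items() if key[position] == value)


p_long_hair = marginal(joint, 1, "long")  # sums over both genders
p_woman = marginal(joint, 0, "woman")     # sums over both hair lengths

print(round(p_long_hair, 2), round(p_woman, 2))
```

Marginalising over gender leaves the probability of long hair, and marginalising over hair length leaves the probability of being a woman.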

### Deriving Bayes’ Theorem
Now, let's connect these ideas to derive Bayes' Theorem. Suppose we want to calculate the probability that a person is a man, given that they have long hair:

$$
P(\text{Man | Long Hair})
$$

Using the concept of joint probability:

$$
P(\text{Man and Long Hair}) = P(\text{Man}) \times P(\text{Long Hair | Man})
$$

This is equivalent to:

$$
P(\text{Man and Long Hair}) = P(\text{Long Hair}) \times P(\text{Man | Long Hair})
$$

Setting these equal:

$$
P(\text{Man}) \times P(\text{Long Hair | Man}) = P(\text{Long Hair}) \times P(\text{Man | Long Hair})
$$

Solving for $P(\text{Man | Long Hair})$:

$$
P(\text{Man | Long Hair}) = \frac{P(\text{Man}) \times P(\text{Long Hair | Man})}{P(\text{Long Hair})}
$$
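
The same derivation holds for any pair of events. As a quick numeric check, the sketch below applies it to the figures for women quoted in the earlier sections (50% of people are women, 60% of women have long hair, 40% of people have long hair). It uses exact fractions so the equality test is not affected by floating-point rounding.

```python
from fractions import Fraction

p_woman = Fraction(1, 2)             # P(Woman)
p_long_given_woman = Fraction(3, 5)  # P(Long Hair | Woman)
p_long = Fraction(2, 5)              # P(Long Hair)

# First factorisation of the joint probability.
joint = p_woman * p_long_given_woman

# Solve the second factorisation for the reversed conditional.
p_woman_given_long = joint / p_long

# Both factorisations describe the same joint probability.
assert p_long * p_woman_given_long == joint

print(p_woman_given_long)  # 3/4
```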

### Example of Bayes’ Theorem
Let's plug in the figures from the previous sections:

- $P(\text{Man}) = 0.5$ (50% of the population are men).
- $P(\text{Long Hair | Man}) = 0.2$ (20% of men have long hair, so men with long hair make up $0.5 \times 0.2 = 0.1$ of the population, matching the marginal probability example).
- $P(\text{Long Hair}) = 0.4$ (40% of the population has long hair).

Using Bayes' Theorem:

$$
P(\text{Man | Long Hair}) = \frac{0.5 \times 0.2}{0.4} = \frac{0.1}{0.4} = 0.25
$$

So, given that a person has long hair, there is a **25% chance** they are a man.

### Real-World Applications

#### 1. **Medical Diagnosis**
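In healthcare, Bayes' Theorem is used to interpret test results by considering both the accuracy of the test and the prevalence of the disease. For example, if a disease affects 1% of the population and a test detects 90% of true cases (its sensitivity), Bayes' Theorem combines these figures with the test's false-positive rate to estimate the probability that a person who tests positive actually has the disease. Because the disease is rare, that probability can be far lower than 90%.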
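
A minimal sketch of this calculation follows. The 1% prevalence and 90% detection rate come from the example above. The 5% false-positive rate is an assumption added here, since the posterior cannot be computed without one, and the function name is hypothetical.

```python
def posterior_given_positive(prevalence, sensitivity, false_positive_rate):
    """Return P(disease | positive test) using Bayes' Theorem."""
    # P(positive and disease)
    true_positives = sensitivity * prevalence
    # P(positive and no disease)
    false_positives = false_positive_rate * (1.0 - prevalence)
    # The denominator is P(positive), by the law of total probability.
    return true_positives / (true_positives + false_positives)


p_disease = posterior_given_positive(
    prevalence=0.01,           # from the example
    sensitivity=0.90,          # from the example
    false_positive_rate=0.05,  # assumed for illustration
)
print(f"{p_disease:.3f}")  # 0.154
```

Under these assumptions, only about 15% of positive results correspond to real cases, because the small share of false positives among the many healthy people outweighs the true positives among the few who are ill.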

#### 2. **Spam Filtering**
Email services apply Bayes' Theorem to detect spam. Given that certain keywords are more likely to appear in spam emails, the algorithm calculates the probability that a message is spam based on its content.
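
One common way to do this is a naive Bayes classifier, which treats each word as independent evidence. The sketch below is a toy version under stated assumptions: the word likelihoods and the 40% spam prior are invented for illustration, and real filters estimate them from large sets of labelled messages.

```python
import math

# Hypothetical per-word likelihoods, invented for illustration only.
P_WORD_GIVEN_SPAM = {"free": 0.30, "winner": 0.20, "meeting": 0.02}
P_WORD_GIVEN_HAM = {"free": 0.03, "winner": 0.01, "meeting": 0.25}
P_SPAM = 0.4  # assumed prior share of spam among all messages


def spam_probability(words, p_spam=P_SPAM):
    """Posterior P(spam | words), treating known words as independent."""
    # Work in log space so that long messages do not underflow to zero.
    log_spam = math.log(p_spam)
    log_ham = math.log(1.0 - p_spam)
    for word in words:
        if word in P_WORD_GIVEN_SPAM:  # skip words outside the toy vocabulary
            log_spam += math.log(P_WORD_GIVEN_SPAM[word])
            log_ham += math.log(P_WORD_GIVEN_HAM[word])
    # Normalise the two scores so they sum to 1.
    spam_score = math.exp(log_spam)
    ham_score = math.exp(log_ham)
    return spam_score / (spam_score + ham_score)


print(spam_probability(["free", "winner"]) > 0.9)  # True
print(spam_probability(["meeting"]) < 0.1)         # True
```

Each word multiplies the prior by its likelihood under "spam" and under "not spam", which is Bayes' Theorem applied once per piece of evidence.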

#### 3. **Risk Assessment in Finance**
Bayes' Theorem helps financial institutions assess the likelihood of loan defaults by updating predictions based on new financial data like credit scores or payment history.

### Conclusion
By combining **conditional**, **joint**, and **marginal probabilities**, Bayes' Theorem becomes a powerful tool for updating our beliefs in light of new evidence. Whether in data science, medicine, or everyday decisions, it helps us make more informed choices in uncertain situations.
