### **Q1. What is Bayes' Theorem?**

Bayes' Theorem is a mathematical formula used for determining the **conditional probability** of an event, based on prior knowledge of conditions that might be related to the event. It helps us **update our beliefs** based on new evidence.

---

### **Q2. What is the formula for Bayes' Theorem?**

\[
P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}
\]

Where:
- \( P(A|B) \) is the **posterior probability**: the probability of A given that B is true.
- \( P(B|A) \) is the **likelihood**: the probability of B given that A is true.
- \( P(A) \) is the **prior probability** of A, before B is observed.
- \( P(B) \) is the **marginal probability** of B (the evidence), which must be nonzero.
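
As a quick numerical check of the formula, here is a minimal Python sketch. The function name `bayes_posterior` and the numbers in the usage lines (a 1% prior, a 95% likelihood, and a 5% chance of B when A is false) are illustrative assumptions, not values from this notebook. The sketch expands \( P(B) \) with the law of total probability, \( P(B) = P(B|A) \cdot P(A) + P(B|\neg A) \cdot P(\neg A) \).

```python
def bayes_posterior(prior_a, likelihood_b_given_a, likelihood_b_given_not_a):
    """Return P(A|B) from P(A), P(B|A), and P(B|not A)."""
    # Marginal P(B) via the law of total probability.
    marginal_b = (likelihood_b_given_a * prior_a
                  + likelihood_b_given_not_a * (1.0 - prior_a))
    if marginal_b == 0:
        raise ValueError("P(B) is zero, so P(A|B) is undefined")
    return likelihood_b_given_a * prior_a / marginal_b


# Hypothetical numbers: P(A) = 0.01, P(B|A) = 0.95, P(B|not A) = 0.05
posterior = bayes_posterior(0.01, 0.95, 0.05)
print(f"P(A|B) = {posterior:.4f}")  # P(A|B) = 0.1610
```

Even with a high likelihood, the posterior stays modest here because the assumed prior is small, which is the "updating beliefs" behaviour described in Q1.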

---

### **Q3. How is Bayes' Theorem used in practice?**

Bayes’ Theorem is used in many fields:
- **Spam detection** (classifying emails as spam or not)
- **Medical diagnosis** (updating disease probability given symptoms)
- **Machine learning** (especially in Naive Bayes classifiers)
- **Risk assessment** (updating risk levels based on new data)

---

### **Q4. What is the relationship between Bayes' Theorem and conditional probability?**

Bayes' Theorem **is derived from** the definition of conditional probability. It essentially **reverses the condition**:

\[
P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}
\]

This lets us compute the probability of A given B from the probability of B given A, provided we also know the prior \( P(A) \) and the marginal \( P(B) \).
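
The derivation takes two steps. Assuming \( P(A) > 0 \) and \( P(B) > 0 \), the definition of conditional probability gives:

\[
P(A|B) = \frac{P(A \cap B)}{P(B)}, \qquad P(B|A) = \frac{P(A \cap B)}{P(A)}
\]

Solving the second expression for the joint probability:

\[
P(A \cap B) = P(B|A) \cdot P(A)
\]

Substituting this into the first expression yields Bayes' Theorem:

\[
P(A|B) = \frac{P(B|A) \cdot P(A)}{P(B)}
\]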

---

### **Q5. How do you choose which type of Naive Bayes classifier to use?**

The choice depends on the nature of your **features**:

| Type               | Use When Features Are...               |
|--------------------|----------------------------------------|
| **Gaussian NB**    | Continuous and normally distributed    |
| **Multinomial NB** | Discrete counts (e.g., word counts)    |
| **Bernoulli NB**   | Binary (0 or 1) features               |

So, check the type and distribution of your features: for word frequencies use **Multinomial**, for binary flags (presence/absence) use **Bernoulli**, and for continuous features use **Gaussian**.
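
The rule of thumb in the table can be sketched as a small helper. `suggest_nb_variant` is a hypothetical function written for illustration, not part of any library, and it only inspects the value types; it does not test whether continuous features are actually normally distributed.

```python
def suggest_nb_variant(rows):
    """Suggest a Naive Bayes variant from the values in a feature matrix.

    `rows` is a list of lists of numbers (one inner list per sample).
    Hypothetical rule of thumb mirroring the table above.
    """
    values = [v for row in rows for v in row]
    if not values:
        raise ValueError("need at least one feature value")
    if all(v in (0, 1) for v in values):
        return "Bernoulli"      # binary presence/absence flags
    if all(float(v).is_integer() and v >= 0 for v in values):
        return "Multinomial"    # non-negative integer counts
    return "Gaussian"           # anything else is treated as continuous


print(suggest_nb_variant([[0, 1], [1, 1]]))           # Bernoulli
print(suggest_nb_variant([[3, 0], [1, 7]]))           # Multinomial
print(suggest_nb_variant([[0.5, 1.2], [3.1, -0.4]]))  # Gaussian
```

In practice the suggestion is only a starting point: comparing the candidate classifiers with cross-validation is a safer way to settle the choice.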

---

### **Q6. Assignment (Naive Bayes classification)**

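We are given, for each class, how many training instances take each feature value. The table below lists only the counts for the queried values, X1 = 3 and X2 = 4; the per-class totals used later sum the counts over all values of each feature.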

#### Frequencies:

| Class | X1=3 | X2=4 |
|-------|------|------|
| A     | 4    | 3    |
| B     | 1    | 3    |

- Assume equal priors:  
  \[
  P(A) = P(B) = 0.5
  \]

Now, we compute the probabilities of each class for a new instance where **X1 = 3** and **X2 = 4** using **Naive Bayes**:

\[
P(A|X1=3, X2=4) \propto P(X1=3|A) \cdot P(X2=4|A) \cdot P(A)
\]
\[
P(B|X1=3, X2=4) \propto P(X1=3|B) \cdot P(X2=4|B) \cdot P(B)
\]

We compute relative likelihoods using **frequency-based probabilities**:

#### For Class A:
- Total counts for X1: \(3 + 3 + 4 = 10\) → \(P(X1=3|A) = \frac{4}{10} = 0.4\)
- Total counts for X2: \(4 + 3 + 3 + 3 = 13\) → \(P(X2=4|A) = \frac{3}{13} \approx 0.2308\)

\[
P(A|X1=3, X2=4) \propto 0.4 \cdot 0.2308 \cdot 0.5 \approx 0.0462
\]

#### For Class B:
- Total counts for X1: \(2 + 2 + 1 = 5\) → \(P(X1=3|B) = \frac{1}{5} = 0.2\)
- Total counts for X2: \(2 + 2 + 2 + 3 = 9\) → \(P(X2=4|B) = \frac{3}{9} \approx 0.3333\)

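\[
P(B|X1=3, X2=4) \propto 0.2 \cdot 0.3333 \cdot 0.5 \approx 0.0333
\]

These values are unnormalized scores, not probabilities:
- Score for A: \( 0.0462 \)
- Score for B: \( 0.0333 \)

Dividing each score by their sum (\( 0.0462 + 0.0333 = 0.0795 \)) gives \( P(A|X1=3, X2=4) \approx 0.58 \) and \( P(B|X1=3, X2=4) \approx 0.42 \). Since the score for A is larger, Naive Bayes predicts **Class A** for the new instance.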
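
The arithmetic above can be checked with a short Python sketch. The counts and totals are copied from the worked solution; the variable names (`counts`, `priors`, `scores`, `posteriors`) are illustrative choices, and exact fractions are used to avoid rounding drift.

```python
from fractions import Fraction

# Counts taken from the worked solution above:
# class -> (count of X1=3, total X1 count, count of X2=4, total X2 count)
counts = {
    "A": (4, 10, 3, 13),
    "B": (1, 5, 3, 9),
}
priors = {"A": Fraction(1, 2), "B": Fraction(1, 2)}

# Unnormalized Naive Bayes score: P(X1=3|c) * P(X2=4|c) * P(c)
scores = {
    c: Fraction(n1, t1) * Fraction(n2, t2) * priors[c]
    for c, (n1, t1, n2, t2) in counts.items()
}

# Normalize so the two posteriors sum to 1.
total = sum(scores.values())
posteriors = {c: s / total for c, s in scores.items()}

# The predicted class is the one with the largest score.
predicted = max(scores, key=scores.get)

for c in scores:
    print(f"{c}: score = {float(scores[c]):.4f}, "
          f"posterior = {float(posteriors[c]):.4f}")
print("Predicted class:", predicted)
# A: score = 0.0462, posterior = 0.5806
# B: score = 0.0333, posterior = 0.4194
# Predicted class: A
```

Normalizing does not change which class wins; it only rescales the two scores so they can be read as probabilities.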
