### 1. **Bernoulli Naïve Bayes**

* **Assumption:** Features are binary (0 or 1).
* **Distribution used:** Bernoulli distribution.
* **When to use:** Presence/absence type features.
* **Examples:**

  * Word present vs absent in email (spam classification).
  * Churn prediction from binary customer attributes (e.g., has contract = 1, no contract = 0).

**Formula (for one feature):**

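$$
P(x_i \mid y) = p_{iy}^{x_i}(1-p_{iy})^{1-x_i}, \quad x_i \in \{0,1\}
$$

Here $p_{iy} = P(x_i = 1 \mid y)$ is the probability that feature $i$ is present in class $y$.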
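
As a minimal sketch, the Bernoulli likelihood for a single binary feature can be evaluated in plain Python. The function name `bernoulli_likelihood` and the probability value 0.8 are illustrative assumptions, not taken from any dataset.

```python
def bernoulli_likelihood(x_i: int, p: float) -> float:
    """Return P(x_i | y) for a binary feature, where p = P(x_i = 1 | y)."""
    if x_i not in (0, 1):
        raise ValueError("x_i must be 0 or 1")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a probability in [0, 1]")
    # The exponents select p when the feature is present and (1 - p) when absent.
    return p ** x_i * (1 - p) ** (1 - x_i)


# Hypothetical value: the word "free" appears in 80% of spam emails.
p_free_given_spam = 0.8
present = bernoulli_likelihood(1, p_free_given_spam)  # word present
absent = bernoulli_likelihood(0, p_free_given_spam)   # word absent
print(present, absent)
```

Note that an absent word still contributes a factor of `1 - p`, which is what distinguishes the Bernoulli variant from one that only looks at the words that occur.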

---

### 2. **Multinomial Naïve Bayes**

* **Assumption:** Features are discrete counts or frequencies.
* **Distribution used:** Multinomial distribution.
* **When to use:** Text classification with word counts, term frequency.
* **Examples:**

  * Document classification (sports, politics, tech).
  * Spam detection with Bag of Words features (TF-IDF values are fractional rather than true counts, but often work in practice).

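**Formula (for the whole count vector):**

$$
P(x_1, \dots, x_k \mid y) = \frac{n!}{x_1!\,x_2!\dots x_k!} \prod_{j=1}^k p_{jy}^{x_j}, \quad n = \sum_{j=1}^k x_j
$$

Here $x_j$ is the count of feature $j$ (e.g., how often word $j$ occurs in the document) and $p_{jy}$ is the probability of feature $j$ in class $y$.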
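
The multinomial likelihood of a vector of counts can be sketched in plain Python as follows. The name `multinomial_likelihood` and the three-word vocabulary with its probabilities are hypothetical, chosen only to illustrate the formula.

```python
from math import factorial, prod


def multinomial_likelihood(counts, probs):
    """Return P(counts | y) given per-feature probabilities for class y."""
    if len(counts) != len(probs):
        raise ValueError("counts and probs must have the same length")
    n = sum(counts)
    # Multinomial coefficient n! / (x_1! x_2! ... x_k!), kept as an exact integer.
    coeff = factorial(n)
    for c in counts:
        coeff //= factorial(c)
    # Product of p_j ** x_j over all features.
    return coeff * prod(p ** c for p, c in zip(probs, counts))


# Hypothetical 3-word vocabulary with class-conditional word probabilities.
word_probs_given_sports = [0.5, 0.3, 0.2]
doc_counts = [2, 1, 0]  # word 1 twice, word 2 once, word 3 never
print(multinomial_likelihood(doc_counts, word_probs_given_sports))
```

When comparing classes for the same document, the coefficient is identical across classes and cancels out, so in practice only the product term (usually as a sum of logs) matters.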

---

### 3. **Gaussian Naïve Bayes**

* **Assumption:** Features are continuous and, within each class, approximately normally distributed.
* **Distribution used:** Gaussian (normal) distribution.
* **When to use:** Real-valued input data.
* **Examples:**

  * Iris dataset (sepal length, petal width).
  * Medical data like blood pressure, weight, age.

**Formula (for one feature):**

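$$
p(x_i \mid y) = \frac{1}{\sqrt{2\pi\sigma_{iy}^2}} \exp\left(-\frac{(x_i-\mu_{iy})^2}{2\sigma_{iy}^2}\right)
$$

Here $\mu_{iy}$ and $\sigma_{iy}^2$ are the mean and variance of feature $i$ among training examples of class $y$. Because $x_i$ is continuous, this is a probability density rather than a probability.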
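
A minimal sketch of the Gaussian likelihood for one feature, using only the standard library. The function name `gaussian_density` and the mean and variance values are illustrative assumptions; in a real model they would be estimated per class from the training data.

```python
import math


def gaussian_density(x_i: float, mu: float, sigma2: float) -> float:
    """Return the normal density at x_i for class mean mu and variance sigma2."""
    if sigma2 <= 0:
        raise ValueError("sigma2 must be positive")
    norm = 1.0 / math.sqrt(2 * math.pi * sigma2)
    return norm * math.exp(-((x_i - mu) ** 2) / (2 * sigma2))


# Hypothetical class statistics for a feature such as petal width (cm).
mu, sigma2 = 1.3, 0.04
print(gaussian_density(1.3, mu, sigma2))  # density at the class mean
print(gaussian_density(2.0, mu, sigma2))  # density far from the mean
```

The returned value is a density, so it can exceed 1 when the variance is small; only the relative size across classes matters for classification.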

---

### Key Difference Table

| Type        | Feature Type      | Example Use Case                       |
| ----------- | ----------------- | -------------------------------------- |
| Bernoulli   | Binary (0/1)      | Email: word present/absent             |
| Multinomial | Discrete counts   | Document classification (Bag of Words) |
| Gaussian    | Continuous values | Iris dataset, medical diagnosis        |
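
The table can be read as a simple decision rule on the feature type. The helper below is a rough heuristic sketch, not a standard API: `suggest_variant` is a hypothetical name, and it only inspects the sample of values it is given.

```python
def suggest_variant(values):
    """Suggest a Naive Bayes variant from a sample of feature values (heuristic)."""
    vals = list(values)
    if not vals:
        raise ValueError("need at least one value")
    # Only 0/1 values: treat as presence/absence features.
    if all(v in (0, 1) for v in vals):
        return "Bernoulli"
    # Non-negative whole numbers: treat as counts.
    if all(v >= 0 and float(v).is_integer() for v in vals):
        return "Multinomial"
    # Anything else: treat as continuous.
    return "Gaussian"


print(suggest_variant([0, 1, 1, 0]))     # binary features
print(suggest_variant([3, 0, 7, 2]))     # word counts
print(suggest_variant([5.1, 3.5, 1.4]))  # real-valued measurements
```

A check like this is only a starting point; whether a count-like column is better modeled as counts or as a continuous value still depends on the problem.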

