# Naive Bayes Example

To illustrate the Naive Bayes classifier, let's consider a practical example involving weather conditions and playing golf.

### Example: Weather and Golf

Imagine we have a dataset that records whether golf was played under various weather conditions. The dataset includes the following features: **Outlook** (Sunny, Overcast, Rainy), **Temperature** (Hot, Mild, Cool), **Humidity** (High, Normal), and **Windy** (True, False). The target variable is whether golf was played (Yes or No).

#### Sample Dataset

| Outlook  | Temperature | Humidity | Windy | Play Golf |
|----------|-------------|----------|-------|-----------|
| Sunny    | Hot         | High     | False | No        |
| Sunny    | Hot         | High     | True  | No        |
| Overcast | Hot         | High     | False | Yes       |
| Rainy    | Mild        | High     | False | Yes       |
| Rainy    | Cool        | Normal   | False | Yes       |
| Rainy    | Cool        | Normal   | True  | No        |
| Overcast | Cool        | Normal   | True  | Yes       |
| Sunny    | Mild        | High     | False | No        |
| Sunny    | Cool        | Normal   | False | Yes       |
| Sunny    | Mild        | Normal   | True  | Yes       |
| Rainy    | Mild        | Normal   | False | Yes       |
| Overcast | Mild        | High     | True  | Yes       |
| Overcast | Hot         | Normal   | False | Yes       |
| Rainy    | Mild        | High     | True  | No        |

### Step 1: Calculate Prior Probabilities

First, we calculate the prior probabilities for the target variable (Play Golf):

- **Total instances**: 14
- **Play Golf = Yes**: 9 instances
- **Play Golf = No**: 5 instances

The prior probabilities are:

$$
P(Yes) = \frac{9}{14} \approx 0.643
$$

$$
P(No) = \frac{5}{14} \approx 0.357
$$
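The same priors can be reproduced with a few lines of Python. This is a minimal sketch: the `labels` list is built directly from the class counts stated above (9 Yes, 5 No) rather than read from a file, and the variable names are illustrative.

```python
from collections import Counter

# Play Golf labels for the 14 instances, built from the stated counts.
labels = ["Yes"] * 9 + ["No"] * 5

counts = Counter(labels)
total = len(labels)

# Prior probability of each class = class count / total count.
priors = {label: count / total for label, count in counts.items()}

for label, p in priors.items():
    print(f"P({label}) = {counts[label]}/{total} = {p:.3f}")
```

Running it prints `P(Yes) = 9/14 = 0.643` and `P(No) = 5/14 = 0.357`.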

### Step 2: Calculate Likelihoods

Next, we calculate the likelihood of each feature value given the target variable. For example, to predict whether golf will be played on a **Sunny** day with **High** humidity, we need the following likelihoods:

- **P(Sunny | Yes)**: Out of 9 instances where golf was played, 2 were Sunny.

$$
P(Sunny | Yes) = \frac{2}{9} \approx 0.222
$$

- **P(High | Yes)**: Out of 9 instances where golf was played, 3 had High humidity.

$$
P(High | Yes) = \frac{3}{9} \approx 0.333
$$

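- **P(Sunny | No)**: Out of 5 instances where golf was not played, 3 were Sunny.

$$
P(Sunny | No) = \frac{3}{5} = 0.600
$$

- **P(High | No)**: Out of 5 instances where golf was not played, 4 had High humidity.

$$
P(High | No) = \frac{4}{5} = 0.800
$$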
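Counting by hand is error-prone, so it can help to let code do the counting. The sketch below is illustrative only: the helper name `likelihood` is hypothetical, each row keeps just the Outlook, Humidity, and Play Golf columns, and the data is assumed to be the 14-instance dataset that the counts in Step 1 describe (9 Yes, 5 No, with two Sunny days among the Yes instances).

```python
# Each row: (outlook, humidity, play_golf). Assumed 14-instance dataset.
rows = [
    ("Sunny", "High", "No"),
    ("Sunny", "High", "No"),
    ("Overcast", "High", "Yes"),
    ("Rainy", "High", "Yes"),
    ("Rainy", "Normal", "Yes"),
    ("Rainy", "Normal", "No"),
    ("Overcast", "Normal", "Yes"),
    ("Sunny", "High", "No"),
    ("Sunny", "Normal", "Yes"),
    ("Rainy", "Normal", "Yes"),
    ("Sunny", "Normal", "Yes"),
    ("Overcast", "High", "Yes"),
    ("Overcast", "Normal", "Yes"),
    ("Rainy", "High", "No"),
]

OUTLOOK, HUMIDITY = 0, 1  # column positions within a row


def likelihood(rows, feature_index, value, label):
    """Estimate P(feature = value | Play Golf = label) by counting rows."""
    in_class = [r for r in rows if r[-1] == label]
    matches = sum(1 for r in in_class if r[feature_index] == value)
    return matches / len(in_class)


for label in ("Yes", "No"):
    p_sunny = likelihood(rows, OUTLOOK, "Sunny", label)
    p_high = likelihood(rows, HUMIDITY, "High", label)
    print(f"P(Sunny | {label}) = {p_sunny:.3f}, P(High | {label}) = {p_high:.3f}")
```

Each likelihood is simply the fraction of rows in a class that have the given feature value, which is what the hand calculation does.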

### Step 3: Apply Naive Bayes Formula

Now, we can use the Naive Bayes formula to predict whether golf will be played given that it is Sunny with High humidity:

$$
P(Yes | Sunny, High) = \frac{P(Sunny | Yes) \times P(High | Yes) \times P(Yes)}{P(Sunny, High)}
$$

Since $$ P(Sunny, High) $$ does not depend on the class, it is the same for Yes and No, so we can drop it when comparing the two. Thus, we calculate:

$$
P(Yes | Sunny, High) \propto P(Sunny | Yes) \times P(High | Yes) \times P(Yes)
$$

Substituting the values:

$$
P(Yes | Sunny, High) \propto 0.222 \times 0.333 \times 0.643 \approx 0.048
$$

Now for the No class, where 3 of the 5 No instances are Sunny and 4 of the 5 have High humidity:

$$
P(No | Sunny, High) \propto P(Sunny | No) \times P(High | No) \times P(No)
$$

$$
P(No | Sunny, High) \propto 0.600 \times 0.800 \times 0.357 \approx 0.171
$$
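To avoid rounding drift when multiplying several small numbers, the two products can be checked with exact fractions. This is a sketch under the assumption that the counts are those of the 14-instance dataset (for example, 4 of the 5 No instances have High humidity); the dictionary names are illustrative.

```python
from fractions import Fraction

# Priors and likelihoods as exact fractions (counts from the dataset).
prior = {"Yes": Fraction(9, 14), "No": Fraction(5, 14)}
p_sunny = {"Yes": Fraction(2, 9), "No": Fraction(3, 5)}
p_high = {"Yes": Fraction(3, 9), "No": Fraction(4, 5)}

# Unnormalized score: class prior times the per-feature likelihoods.
score = {c: p_sunny[c] * p_high[c] * prior[c] for c in prior}

for c, s in score.items():
    print(f"score({c}) = {s} = {float(s):.3f}")
```

This prints `score(Yes) = 1/21 = 0.048` and `score(No) = 6/35 = 0.171`.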

### Step 4: Make a Prediction

Comparing the two scores (each is proportional to the posterior, not the posterior itself):

- $$ P(Yes | Sunny, High) \propto 0.048 $$
- $$ P(No | Sunny, High) \propto 0.171 $$

Since the score for No is larger, $$ P(No | Sunny, High) > P(Yes | Sunny, High) $$, and we predict that golf will **not** be played on a Sunny day with High humidity.
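If actual posterior probabilities are wanted rather than just a comparison, the two scores can be divided by their sum, which plays the role of the dropped denominator. The following is a minimal sketch; the helper name `normalize` is hypothetical, and the scores are computed from the counts of the 14-instance dataset (4 of the 5 No instances have High humidity).

```python
def normalize(scores):
    """Rescale unnormalized class scores so they sum to 1."""
    total = sum(scores.values())
    return {label: s / total for label, s in scores.items()}


# Scores for (Sunny, High): likelihoods times prior, from the dataset counts.
scores = {
    "Yes": (2 / 9) * (3 / 9) * (9 / 14),
    "No": (3 / 5) * (4 / 5) * (5 / 14),
}

posteriors = normalize(scores)

# The predicted class is the one with the largest posterior.
prediction = max(posteriors, key=posteriors.get)

for label, p in posteriors.items():
    print(f"P({label} | Sunny, High) = {p:.3f}")
print("Predicted Play Golf:", prediction)
```

Under these counts the posteriors come out to roughly 0.217 for Yes and 0.783 for No, so the prediction is No. Normalizing does not change which class wins; it only makes the numbers interpretable as probabilities.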

### Conclusion

This example demonstrates how the Naive Bayes classifier combines prior probabilities and likelihoods to make predictions, under the assumption that features are conditionally independent given the class. Despite this "naive" assumption, it often works well in practice, for example in text classification and spam detection.