# **Comparison of Classification Reports after using undersampling (Before vs. After PCA)**

## **RandomForest Classification**

| Metric        | **Before PCA** | **After PCA (150 Components)** |
|--------------|--------------|---------------------------|
| F1-score (-1) | **0.84** | **0.86** (Improved) |
| F1-score (1)  | **0.26** | **0.19** (Dropped) |
| Accuracy      | **0.74** | **0.76** (Slight Improvement) |
| Macro Avg F1  | **0.55** | **0.53** (Slight Drop) |
| Weighted Avg F1 | **0.80** | **0.82** (Improved) |

### **Observations:**
- The majority class (-1) is classified slightly better after PCA: its F1-score rises from 0.84 to 0.86.
- The minority class (1) is classified worse: its F1-score falls from 0.26 to 0.19.
- Overall accuracy improves slightly (0.74 to 0.76), but the model struggles more with the minority class, as the drop in macro-average F1 (0.55 to 0.53) shows.
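
Macro and weighted F1 move in opposite directions here because the two averages are defined differently: macro F1 gives each class equal weight, while weighted F1 weights each class by its share of the test samples. The sketch below recomputes both from the per-class F1 scores in the table. The 7% minority share is an assumption inferred from the reported numbers, not a figure stated in this report, so the results match the table only to within rounding.

```python
def macro_f1(f1_scores):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1_scores) / len(f1_scores)


def weighted_f1(f1_scores, supports):
    """Mean of the per-class F1 scores, weighted by class support."""
    total = sum(supports)
    return sum(f * s for f, s in zip(f1_scores, supports)) / total


# ASSUMPTION: class shares of the test set, ordered as [class -1, class 1].
supports = [0.93, 0.07]

# Per-class F1 scores taken from the RandomForest table above.
before = [0.84, 0.26]
after = [0.86, 0.19]

for label, scores in (("before PCA", before), ("after PCA", after)):
    print(
        label,
        f"macro={macro_f1(scores):.3f}",
        f"weighted={weighted_f1(scores, supports):.3f}",
    )
```

Because class -1 dominates the weighted average, a 0.02 gain on it outweighs the 0.07 loss on class 1, whereas the macro average counts both changes equally.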

---

## **SVM Classification**

| Metric        | **Before PCA** | **After PCA (150 Components)** |
|--------------|--------------|---------------------------|
| F1-score (-1) | **0.74** | **0.01** (Major Drop) |
| F1-score (1)  | **0.18** | **0.13** (Slight Drop) |
| Accuracy      | **0.60** | **0.07** (Severe Drop) |
| Macro Avg F1  | **0.46** | **0.07** (Major Drop) |
| Weighted Avg F1 | **0.70** | **0.01** (Severe Drop) |

### **Observations:**
- SVM performance collapsed after PCA: accuracy fell from 0.60 to 0.07.
- The model fails to classify the majority class (-1) correctly: its F1-score fell from 0.74 to 0.01.
- Overall, **SVM is not performing well after PCA**.
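
These numbers are close to what a classifier would score if it labelled every sample as class 1. As a sanity check, the sketch below scores such a degenerate predictor on a hypothetical test set of 100 samples, 7 of them in the minority class. The split is an assumption chosen to match the reported 7% accuracy; it is not the real data.

```python
def binary_metrics(y_true, y_pred, positive):
    """Return (precision, recall, f1) for the class treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# ASSUMPTION: 7 minority (1) and 93 majority (-1) samples.
y_true = [1] * 7 + [-1] * 93
# Degenerate model: every sample is predicted as class 1.
y_pred = [1] * 100

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
minority = binary_metrics(y_true, y_pred, positive=1)
majority = binary_metrics(y_true, y_pred, positive=-1)

print(f"accuracy={accuracy:.2f}")
print("class  1: precision=%.2f recall=%.2f f1=%.2f" % minority)
print("class -1: precision=%.2f recall=%.2f f1=%.2f" % majority)
```

Under these assumptions the degenerate predictor reaches 0.07 accuracy and a minority F1 of about 0.13, in line with the table. The reported majority F1 of 0.01 rather than 0.00 suggests the real model still predicts class -1 for a handful of samples.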

---

## **NaiveBayes Classification**

| Metric        | **Before PCA** | **After PCA (150 Components)** |
|--------------|--------------|---------------------------|
| F1-score (-1) | **0.94** | **0.94** (No Change) |
| F1-score (1)  | **0.22** | **0.15** (Dropped) |
| Accuracy      | **0.89** | **0.89** (No Change) |
| Macro Avg F1  | **0.58** | **0.55** (Slight Drop) |
| Weighted Avg F1 | **0.89** | **0.89** (No Change) |

### **Observations:**
- Naïve Bayes remains **consistent** before and after PCA: majority-class F1, accuracy, and weighted F1 are all unchanged.
- The F1-score of the minority class (1) dropped from 0.22 to 0.15.
- No major impact on accuracy.

---

## **Final Conclusion**
- **RandomForest** saw **slight improvements in accuracy**, but minority class classification worsened.
- **SVM performed very poorly after PCA**, showing **severe drops** in all metrics.
- **Naïve Bayes remained stable**, with only the minority-class F1 dropping.
- **PCA (150 components) slightly helped RandomForest on accuracy and the majority class, at the expense of the minority class, and significantly hurt SVM.**

### **Recommendation**
- If using PCA, **RandomForest is the best choice**, but fine-tuning is needed for better minority class detection.
- **SVM should not be used with PCA in this case** as it failed completely.
- **Naïve Bayes remains stable**, making it a **safe option**.
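
For context, random undersampling balances the training set by discarding majority-class samples until every class is the same size as the smallest one. The following is a minimal standard-library sketch of that idea. The function name `random_undersample` and the toy data are hypothetical, and this report does not state which implementation was actually used.

```python
import random
from collections import defaultdict


def random_undersample(X, y, seed=0):
    """Keep a random subset of each class, sized to the smallest class."""
    rng = random.Random(seed)
    indices_by_class = defaultdict(list)
    for i, label in enumerate(y):
        indices_by_class[label].append(i)

    n_smallest = min(len(idx) for idx in indices_by_class.values())

    keep = []
    for idx in indices_by_class.values():
        # Sample without replacement so no row is duplicated.
        keep.extend(rng.sample(idx, n_smallest))
    keep.sort()  # preserve the original row order

    return [X[i] for i in keep], [y[i] for i in keep]


# Toy data (hypothetical): 7 minority samples (1) and 93 majority samples (-1).
X = [[float(i), float(i % 5)] for i in range(100)]
y = [1 if i < 7 else -1 for i in range(100)]

X_bal, y_bal = random_undersample(X, y)
print(y_bal.count(-1), y_bal.count(1))  # prints: 7 7
```

Resampling of this kind is normally applied to the training split only, so that the test set keeps the original class imbalance.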


# **Comparison of Classification Reports after using SMOTE (Before vs. After PCA)**

## **RandomForest Classification**

| Metric        | **Before PCA** | **After PCA (3 Components)** |
|--------------|--------------|---------------------------|
| F1-score (-1) | **0.97** | **0.88** (Dropped) |
| F1-score (1)  | **0.00** | **0.13** (Improved) |
| Accuracy      | **0.93** | **0.79** (Dropped) |
| Macro Avg F1  | **0.48** | **0.50** (Slight Improvement) |
| Weighted Avg F1 | **0.90** | **0.83** (Dropped) |

### **Observations:**
- After PCA, **minority class (1) classification improved slightly**, but still remains poor.
- Overall **accuracy dropped significantly** after PCA.
- **Macro F1 improved slightly**, but this is entirely due to the minority-class F1 rising from 0.00 to 0.13.
- **PCA negatively impacted RandomForest performance.**

---

## **SVM Classification**

| Metric        | **Before PCA** | **After PCA (3 Components)** |
|--------------|--------------|---------------------------|
| F1-score (-1) | **0.97** | **0.67** (Major Drop) |
| F1-score (1)  | **0.09** | **0.16** (Slight Improvement) |
| Accuracy      | **0.93** | **0.52** (Severe Drop) |
| Macro Avg F1  | **0.53** | **0.41** (Dropped) |
| Weighted Avg F1 | **0.91** | **0.63** (Severe Drop) |

### **Observations:**
- PCA **drastically hurt SVM performance**.
- Accuracy **dropped from 93% to 52%**, a severe degradation.
- Minority class classification saw a **small improvement**, but at the cost of the majority class.
- **SVM is not recommended after PCA in this scenario.**
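
The trade-off can be made explicit: accuracy is the prevalence-weighted average of the per-class recalls, so on a test set dominated by class -1, a loss in majority recall costs far more accuracy than an equal gain in minority recall earns. The sketch below uses the SVM recall values from the summary table at the end of this report and assumes a minority share of about 7% (inferred from the reported numbers, not stated in this report).

```python
def accuracy_from_recalls(recall_majority, recall_minority, minority_share):
    """Accuracy expressed as the prevalence-weighted mean of per-class recall."""
    return recall_majority * (1 - minority_share) + recall_minority * minority_share


MINORITY_SHARE = 0.07  # ASSUMPTION: inferred share of class 1 in the test set

# Recalls for SVM after PCA (SMOTE, 3 components), from the summary table.
acc = accuracy_from_recalls(0.51, 0.67, MINORITY_SHARE)
print(f"{acc:.2f}")  # prints: 0.52

# Effect on accuracy of a 0.10 change in each recall, taken separately.
gain_minority = accuracy_from_recalls(0.51, 0.77, MINORITY_SHARE) - acc
loss_majority = acc - accuracy_from_recalls(0.41, 0.67, MINORITY_SHARE)
print(f"+0.10 minority recall: +{gain_minority:.3f} accuracy")
print(f"-0.10 majority recall: -{loss_majority:.3f} accuracy")
```

Under this assumption the reconstructed accuracy matches the reported 52%, and a given change in majority recall moves accuracy roughly thirteen times as much as the same change in minority recall.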

---

## **NaiveBayes Classification**

| Metric        | **Before PCA** | **After PCA (3 Components)** |
|--------------|--------------|---------------------------|
| F1-score (-1) | **0.33** | **0.45** (Improved) |
| F1-score (1)  | **0.13** | **0.15** (Slight Improvement) |
| Accuracy      | **0.24** | **0.33** (Improved) |
| Macro Avg F1  | **0.23** | **0.30** (Improved) |
| Weighted Avg F1 | **0.31** | **0.43** (Improved) |

### **Observations:**
- Naïve Bayes **benefited from PCA**, showing a **slight improvement** in all metrics.
- **Accuracy increased** from **24% to 33%**.
- However, performance remains **poor overall** compared to other models.
- **Naïve Bayes performs better after PCA but is still not ideal.**

---

## **Final Conclusion**
- **RandomForest performed well before PCA** on accuracy, but accuracy **dropped after PCA** (0.93 to 0.79) while the minority-class F1 improved slightly (0.00 to 0.13).
- **SVM suffered significantly** after PCA, making it **unsuitable** in this case.
- **Naïve Bayes saw slight improvements** after PCA, but still **remains weak overall**.

### **Recommendation**
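- If using SMOTE **without PCA**, **RandomForest is the best choice** on accuracy (0.93, level with SVM), but its minority-class F1 of 0.00 means it needs further tuning before it can detect class 1.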
- **Avoid using SVM with PCA in this case** as it performs poorly.
- **Naïve Bayes shows slight improvement with PCA** but is still **not a strong candidate**.
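
For context, SMOTE balances the classes by synthesising new minority samples instead of discarding majority ones: each synthetic point lies on the line segment between a minority sample and one of its nearest minority neighbours. The sketch below is a simplified standard-library illustration of that interpolation step. `smote_like_oversample` and its parameters are hypothetical names, and this is not the implementation used to produce the results above.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the chosen sample (excluding itself).
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy minority samples (hypothetical), all inside the unit square.
minority = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]]
new_points = smote_like_oversample(minority, n_new=10)

for point in new_points[:3]:
    print([round(v, 3) for v in point])
```

Every synthetic point is a convex combination of two existing minority samples, so the new points always fall inside the region the original minority samples already span.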


# **Final Comparison: Before & After PCA (Undersampling vs. SMOTE)**

## **Summary of Classification Performance**

| **Model**         | **Scenario**                               | **Accuracy** | **Pass Precision** | **Pass Recall** | **Fail Precision** | **Fail Recall** |
|-------------------|-------------------------------------------|-------------|--------------------|----------------|-------------------|----------------|
| **Random Forest** | Before PCA (Undersampling)               | 74%         | 0.97               | 0.75           | 0.16              | 0.67           |
| **Random Forest** | After PCA (Undersampling, 150 components) | 76%         | 0.95               | 0.78           | 0.12              | 0.43           |
| **Random Forest** | After PCA (SMOTE, 3 components)          | 79%         | 0.94               | 0.83           | 0.09              | 0.24           |
| **SVM**          | Before PCA (Undersampling)               | 60%         | 0.96               | 0.60           | 0.11              | 0.67           |
| **SVM**          | After PCA (Undersampling, 150 components) | 7%          | 1.00               | 0.00           | 0.07              | 1.00           |
| **SVM**          | After PCA (SMOTE, 3 components)          | 52%         | 0.96               | 0.51           | 0.09              | 0.67           |
| **Naive Bayes**  | Before PCA (Undersampling)               | 89%         | 0.94               | 0.94           | 0.21              | 0.24           |
| **Naive Bayes**  | After PCA (Undersampling, 150 components) | 89%         | 0.94               | 0.95           | 0.17              | 0.14           |
| **Naive Bayes**  | After PCA (SMOTE, 3 components)          | 33%         | 0.97               | 0.29           | 0.08              | 0.86           |
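
The precision and recall values in this table can be cross-checked against the per-class F1 scores reported earlier, since F1 is the harmonic mean of the two. The sketch below takes Pass as class -1 and Fail as class 1, which is the mapping the matching numbers suggest. Because the table values are already rounded to two decimals, agreement is only expected to within about 0.01.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (scenario, pass precision, pass recall, fail precision, fail recall),
# copied from the "Before PCA (Undersampling)" rows of the table above.
rows = [
    ("Random Forest", 0.97, 0.75, 0.16, 0.67),
    ("SVM", 0.96, 0.60, 0.11, 0.67),
    ("Naive Bayes", 0.94, 0.94, 0.21, 0.24),
]

for name, pass_p, pass_r, fail_p, fail_r in rows:
    print(
        f"{name}: "
        f"F1(Pass)={f1_score(pass_p, pass_r):.3f}, "
        f"F1(Fail)={f1_score(fail_p, fail_r):.3f}"
    )
```

The recomputed values can be compared with the F1-score (-1) and F1-score (1) rows of the undersampling tables (0.84/0.26, 0.74/0.18, and 0.94/0.22).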
