# Performance Metrics

Performance metrics are essential for evaluating the effectiveness of machine learning models. The choice of metric depends on the type of problem (classification, regression, clustering, or time series) and the specific goals of the analysis. Below, we break down the most commonly used metrics for each type of problem.

Learning Curves: Plot training and validation performance over time to diagnose overfitting or underfitting.

## Classification Metrics
- Confusion Matrix: Table of true positives, false positives, true negatives, and false negatives. When to use: Inspecting which kinds of errors a classifier makes.
- Accuracy: Proportion of correctly classified instances. When to use: Balanced classes, equal misclassification costs.
- Precision/Recall: Focus on positive-class performance. When to use: Imbalanced classes, asymmetric costs (e.g., fraud detection).
- F1-Score: Harmonic mean of precision and recall. When to use: Need a balance between precision and recall.
- ROC-AUC: Area under the ROC curve; measures how well the model ranks positives above negatives. When to use: Ranking quality, threshold-independent evaluation.
- PR-AUC: Area under the precision-recall curve. When to use: Highly imbalanced datasets.
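To make the difference between accuracy and the positive-class metrics concrete, here is a minimal sketch in plain Python. The function name `classification_metrics` and the counts in the example are hypothetical, chosen only to illustrate an imbalanced problem such as fraud detection.

```python
def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, and F1 from confusion-matrix counts."""
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # Precision: of everything predicted positive, how much was truly positive.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of everything truly positive, how much was found.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical data: 1000 transactions, 10 of them fraudulent.
metrics = classification_metrics(tp=5, fp=5, fn=5, tn=985)
print(metrics)
```

With these made-up counts, accuracy is 0.99 while precision, recall, and F1 are all 0.5, which is why accuracy alone can be misleading on imbalanced classes.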

## Regression Metrics
- Mean Squared Error (MSE): Average squared difference between predictions and actuals. When to use: General purpose, penalizes large errors.
- Mean Absolute Error (MAE): Average absolute difference between predictions and actuals. When to use: Need for interpretability, less sensitivity to outliers than MSE.
- R-squared: Proportion of variance explained. When to use: Comparing models, explaining fit to stakeholders.
- MAPE: Mean absolute percentage error. When to use: Comparing errors across different scales.
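The four regression metrics above can be sketched in a few lines of plain Python. The function name `regression_metrics` and the sample values are hypothetical; the sketch assumes no actual value is zero, since MAPE is undefined in that case.

```python
def regression_metrics(y_true, y_pred):
    """Compute MSE, MAE, R-squared, and MAPE (in percent) for paired values."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]

    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n

    # R-squared: 1 - (residual sum of squares / total sum of squares).
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1 - ss_res / ss_tot if ss_tot else float("nan")

    # MAPE: assumes every actual value is non-zero.
    mape = 100 * sum(abs(e / t) for e, t in zip(errors, y_true)) / n

    return {"mse": mse, "mae": mae, "r2": r2, "mape": mape}


# Hypothetical actuals and predictions.
scores = regression_metrics([100, 200, 300, 400], [110, 190, 310, 360])
print(scores)
```

In this made-up example the single 40-unit miss accounts for most of the MSE but has a much smaller effect on the MAE, which illustrates why MSE is described as penalizing large errors.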

## Clustering Metrics
- Silhouette score
- Davies-Bouldin index

## Time Series Metrics
- Mean Absolute Percentage Error (MAPE)
- Akaike Information Criterion (AIC)
- Bayesian Information Criterion (BIC)

### Learning Curves

Plot training and validation performance (e.g., accuracy, loss) over time (epochs or iterations) to diagnose overfitting or underfitting.

- When to use:
  - During model training to monitor performance.
  - To determine whether the model needs more data, regularization, or architectural changes.
- Interpretation:
  - Overfitting: Training performance improves, but validation performance plateaus or worsens.
  - Underfitting: Both training and validation performance are poor.
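The interpretation rules above can be turned into a rough check over recorded loss histories. This is only a sketch: the function name `diagnose_curves`, the `high_loss` threshold, and the `tolerance` value are all hypothetical and would need tuning per task. It flags overfitting only when validation loss clearly worsens, not when it merely plateaus.

```python
def diagnose_curves(train_loss, val_loss, high_loss=0.5, tolerance=0.01):
    """Label a pair of per-epoch loss curves (lower is better).

    high_loss and tolerance are illustrative defaults, not standard values.
    """
    # Epoch at which validation loss was lowest.
    best_epoch = min(range(len(val_loss)), key=val_loss.__getitem__)

    # Overfitting: validation got worse after its best epoch
    # while training loss kept improving.
    val_worsened = val_loss[-1] > val_loss[best_epoch] + tolerance
    train_improved = train_loss[-1] < train_loss[best_epoch]
    if val_worsened and train_improved:
        return "overfitting"

    # Underfitting: both curves end at a poor (high) loss.
    if train_loss[-1] > high_loss and val_loss[-1] > high_loss:
        return "underfitting"

    return "ok"


# Hypothetical loss histories.
print(diagnose_curves([1.0, 0.6, 0.4, 0.2, 0.1], [1.1, 0.7, 0.6, 0.7, 0.9]))
print(diagnose_curves([1.0, 0.95, 0.9], [1.1, 1.0, 0.98]))
```

In practice the curves are usually plotted and read by eye; a check like this is mainly useful as an automated warning during long training runs.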

### Classification Metrics

#### Confusion Matrix

A table that visualizes the performance of a classification model by showing true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN).
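As an illustration, the four cells can be counted directly from paired labels for a binary problem. The function name `confusion_counts` and the sample labels are hypothetical; the sketch assumes the positive class is encoded as `1` unless told otherwise.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, TN, FP, and FN for a binary classification problem."""
    tp = tn = fp = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1  # predicted positive, actually positive
            else:
                fp += 1  # predicted positive, actually negative
        else:
            if actual == positive:
                fn += 1  # predicted negative, actually positive
            else:
                tn += 1  # predicted negative, actually negative
    return {"TP": tp, "TN": tn, "FP": fp, "FN": fn}


# Hypothetical labels for eight samples.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]
counts = confusion_counts(y_true, y_pred)
print(counts)
```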


#### Accuracy