# Evaluation Metrics for Classification and Regression

Evaluation metrics quantify how closely a machine learning model's predictions match the true values, for both classification and regression tasks. Here's an overview of the key metrics used for each type of task:

#### Evaluation Metrics for Classification
1. Accuracy:

- The ratio of correctly predicted instances to the total instances.
- Accuracy = Number of Correct Predictions / Total Number of Predictions = (TP + TN) / (TP + TN + FP + FN)
2. Precision (Positive Predictive Value):

- The ratio of correctly predicted positive observations to the total predicted positives.
- Precision = TP / (TP + FP)
- Important when the cost of False Positives is high (e.g., spam detection).

3. Recall (Sensitivity, Hit Rate, or True Positive Rate):

- The ratio of correctly predicted positive observations to all observations in the actual positive class.
- Recall = TP / (TP + FN)
- Important when the cost of False Negatives is high (e.g., fraud detection).
4. F1 Score (Harmonic Mean of Precision and Recall):

- Combines Precision and Recall through their harmonic mean, so it is low whenever either of the two is low.
- F1 Score = 2 × (Precision × Recall) / (Precision + Recall)
- Useful when you seek a balance between Precision and Recall.
5. Confusion Matrix:

- A table used to describe the performance of a classification model.
- Shows TP (True Positives), TN (True Negatives), FP (False Positives), and FN (False Negatives).
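
The four confusion-matrix counts are enough to compute Accuracy, Precision, Recall, and F1 Score. The following is a minimal sketch in plain Python for a binary task. The function names, the `positive` parameter, the sample labels, and the choice to return `0.0` when a denominator is zero are all assumptions made for illustration, not part of any particular library.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count (TP, TN, FP, FN) for a binary task; `positive` marks the positive class."""
    tp = tn = fp = fn = 0
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            if actual == positive:
                tp += 1  # predicted positive, actually positive
            else:
                fp += 1  # predicted positive, actually negative
        else:
            if actual == positive:
                fn += 1  # predicted negative, actually positive
            else:
                tn += 1  # predicted negative, actually negative
    return tp, tn, fp, fn


def classification_metrics(y_true, y_pred, positive=1):
    """Derive accuracy, precision, recall and F1 from the confusion counts."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred, positive)
    total = tp + tn + fp + fn
    # Assumption: report 0.0 when a ratio is undefined (zero denominator).
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical labels: 4 actual positives, 6 actual negatives.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 1, 0, 0, 0, 0]

print(confusion_counts(y_true, y_pred))  # (3, 4, 2, 1) as (TP, TN, FP, FN)
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```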
6. ROC (Receiver Operating Characteristic) Curve:

- A graphical representation of the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings.
7. AUC (Area Under the ROC Curve):

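- A measure of the ability of a classifier to distinguish between classes.
- Higher AUC indicates a better performing model: 1.0 means every positive is scored above every negative, while 0.5 is no better than random guessing.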
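
To make the link between the ROC curve and AUC concrete, here is a rough sketch that sweeps the decision threshold from high to low, records one (FPR, TPR) point per distinct score, and integrates the curve with the trapezoidal rule. It assumes labels are encoded as 0 and 1 and that a higher score means "more likely positive". The function names and the sample scores are made up for illustration.

```python
def roc_curve_points(y_true, scores):
    """Return (FPR, TPR) points obtained by lowering the threshold step by step."""
    positives = sum(1 for y in y_true if y == 1)
    negatives = len(y_true) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("ROC needs at least one positive and one negative example")

    # Highest scores first: each step admits more examples as predicted positive.
    ranked = sorted(zip(scores, y_true), key=lambda pair: pair[0], reverse=True)
    points = [(0.0, 0.0)]
    tp = fp = 0
    for index, (score, label) in enumerate(ranked):
        if label == 1:
            tp += 1
        else:
            fp += 1
        # Emit a point only after the last example sharing this score (handles ties).
        is_last = index == len(ranked) - 1
        if is_last or ranked[index + 1][0] != score:
            points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under a piecewise-linear curve, using the trapezoidal rule."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Hypothetical scores from a classifier for four examples.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]

points = roc_curve_points(y_true, scores)
print(points)       # [(0.0, 0.0), (0.0, 0.5), (0.5, 0.5), (0.5, 1.0), (1.0, 1.0)]
print(auc(points))  # 0.75
```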
#### Evaluation Metrics for Regression
1. Mean Absolute Error (MAE):

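- The mean of the absolute values of the individual prediction errors over all instances in the test set.
- $\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} \lvert y_i - \hat{y}_i \rvert$, where $y_i$ is the actual value, $\hat{y}_i$ is the predicted value, and $n$ is the number of instances.
- Represents the average magnitude of the error.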
2. Mean Squared Error (MSE):

- The mean of the squared differences between the actual and the predicted values.
- $\mathrm{MSE} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2$
- More sensitive to outliers than MAE, because squaring magnifies large errors.
3. Root Mean Squared Error (RMSE):

- The square root of the MSE.
- $\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2}$
- Often preferred over MSE for reporting because it is in the same units as the response variable.
4. R-squared (Coefficient of Determination):

- The proportion of the variance in the dependent variable that is predictable from the independent variables.
- $R^2 = 1 - \frac{\text{Sum of Squares of Residuals}}{\text{Total Sum of Squares}} = 1 - \frac{\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2}$, where $\bar{y}$ is the mean of the actual values.
- Indicates how well the regression predictions approximate the real data points.
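
The four regression metrics above share the same ingredients (actual values, predicted values, and their differences), so they can be sketched together in a few lines of plain Python. The function names and the sample values are illustrative assumptions. The sketch also assumes the actual values are not all identical, since R-squared is undefined when the total sum of squares is zero.

```python
import math


def mae(y_true, y_pred):
    """Mean of the absolute errors."""
    return sum(abs(y - p) for y, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean of the squared errors."""
    return sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Square root of the MSE, in the same units as the target."""
    return math.sqrt(mse(y_true, y_pred))


def r_squared(y_true, y_pred):
    """1 - (sum of squares of residuals) / (total sum of squares)."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)
    return 1 - ss_res / ss_tot  # assumes ss_tot != 0


# Hypothetical actual and predicted values.
y_true = [3.0, -0.5, 2.0, 7.0]
y_pred = [2.5, 0.0, 2.0, 8.0]

print(mae(y_true, y_pred))        # 0.5
print(mse(y_true, y_pred))        # 0.375
print(rmse(y_true, y_pred))       # about 0.612
print(r_squared(y_true, y_pred))  # about 0.949
```

Here the single largest error (1.0) contributes half of the MAE total but two thirds of the MSE total, which is the outlier sensitivity mentioned above.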

5. Adjusted R-squared:

- Adjusts R-squared for the number of predictors in the model, penalizing predictors that do not improve the fit enough to justify the added complexity.
- More useful than R-squared when comparing models with different numbers of predictors.
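
The text does not give a formula for Adjusted R-squared. A commonly used definition is $1 - (1 - R^2)\,\frac{n - 1}{n - p - 1}$, where $n$ is the number of observations and $p$ the number of predictors. The sketch below assumes that definition. The function name, parameter names, and numbers are hypothetical.

```python
def adjusted_r_squared(r2, n_samples, n_predictors):
    """Adjusted R-squared under the assumed formula 1 - (1 - R^2)(n - 1)/(n - p - 1)."""
    dof = n_samples - n_predictors - 1
    if dof <= 0:
        raise ValueError("need more samples than predictors + 1")
    return 1 - (1 - r2) * (n_samples - 1) / dof


# Same R-squared and sample size, different numbers of predictors.
small_model = adjusted_r_squared(0.90, n_samples=50, n_predictors=5)
large_model = adjusted_r_squared(0.90, n_samples=50, n_predictors=20)

print(small_model)  # about 0.889
print(large_model)  # about 0.831
```

With identical R-squared, the model that needed 20 predictors scores lower than the one that needed 5, which is why the adjusted value is the fairer basis for comparing models of different sizes.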

Choosing the right evaluation metric is crucial as it influences how the performance of a machine learning model is measured and compared. These metrics provide insights into areas that require improvement and help in selecting the model that best fits the business needs.