## Regression Model Evaluation Metrics
- The following metrics are used to evaluate a regression model:
  - MAE
  - MSE
  - RMSE
  - R2
  - Adjusted R2


#### Mean Absolute Error (MAE)
- MAE measures the average magnitude of the errors in a set of forecasts, without considering their direction
- It measures accuracy for continuous variables
- It is the average, over the verification sample, of the absolute differences between each forecast and the corresponding observation
- MAE is a linear score, which means that all individual differences are weighted equally in the average
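The definition above can be sketched in a few lines of plain Python. This is a minimal illustration, not a library implementation; the function name `mean_absolute_error` and the sample values are assumptions made for the example.

```python
def mean_absolute_error(y_true, y_pred):
    """Average of |observation - forecast| over all samples."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


# Illustrative values only (not taken from any real data set).
y_true = [3.0, 5.0, 2.5, 7.0]
y_pred = [2.5, 5.0, 4.0, 8.0]

# Absolute errors are 0.5, 0.0, 1.5 and 1.0; each one counts equally.
mae = mean_absolute_error(y_true, y_pred)
print(mae)
```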

#### Mean Squared Error (MSE)
- In statistics, the mean squared error (MSE), or mean squared deviation (MSD), of an estimator (a procedure for estimating an unobserved quantity) measures the average of the squares of the errors
- That is, the average squared difference between the estimated values and the actual values
- MSE is a risk function, corresponding to the expected value of the squared error loss
- MSE is almost always strictly positive (and not zero), either because of randomness or because the estimator does not account for information that could produce a more accurate estimate
- MSE is a measure of the quality of an estimator
- Because it is derived from the square of the Euclidean distance, it is never negative, and values closer to zero indicate a better estimator
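As a rough sketch of the definition above, MSE can be computed by squaring each difference before averaging. The function name and the sample values below are assumptions chosen for illustration.

```python
def mean_squared_error(y_true, y_pred):
    """Average of (actual - estimate)^2 over all samples."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Illustrative values only (not taken from any real data set).
y_true = [3.0, 5.0, 2.5, 7.0]
y_pred = [2.5, 5.0, 4.0, 8.0]

# Squared errors are 0.25, 0.0, 2.25 and 1.0, so the result cannot be negative.
mse = mean_squared_error(y_true, y_pred)
print(mse)
```

The value is zero only when every estimate matches the actual value exactly.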

#### Root Mean Squared Error (RMSE)
- RMSE is one of the most popular evaluation metrics for regression problems
- It assumes that the errors are unbiased and follow a normal distribution
- Here are the key points to consider on RMSE:
  - The square root expresses the metric in the same units as the target, which makes large deviations easy to interpret
  - Squaring the errors prevents positive and negative errors from cancelling each other out
- It avoids absolute error values, which are awkward in mathematical calculations because the absolute-value function is not differentiable at zero
- With more samples, reconstructing the error distribution using RMSE is considered to be more reliable
- RMSE is highly affected by outliers. Hence, make sure you have removed outliers from your data set before using this metric
- Compared to MAE, RMSE gives more weight to large errors and penalizes them more heavily
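The sensitivity to large errors can be seen in a small comparison. In this sketch, two hypothetical sets of predictions have the same MAE, but one concentrates all of its error in a single sample. The helper names and data are assumptions made for the example.

```python
import math


def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Square root of the mean squared error."""
    mse = sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


# Illustrative values only.
y_true = [10.0, 12.0, 11.0, 13.0, 12.0]
y_pred_spread = [11.0, 11.0, 12.0, 12.0, 13.0]   # every error has magnitude 1
y_pred_outlier = [10.0, 12.0, 11.0, 13.0, 17.0]  # one error of 5, the rest 0

mae_spread, mae_outlier = mae(y_true, y_pred_spread), mae(y_true, y_pred_outlier)
rmse_spread, rmse_outlier = rmse(y_true, y_pred_spread), rmse(y_true, y_pred_outlier)

print("MAE :", mae_spread, mae_outlier)
print("RMSE:", rmse_spread, rmse_outlier)
```

Both prediction sets have the same total absolute error, so MAE cannot tell them apart, while RMSE is noticeably larger for the set containing the single large miss.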



#### R-Squared (R2)
- A lower RMSE indicates a better-performing model
- But the value alone is not intuitive
- RMSE on its own gives us no benchmark to compare against
- This is where the R-Squared metric helps
- It tells us how good our regression model is compared to a very simple baseline model that always predicts the mean value of the target from the train set
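The comparison against a mean-predicting baseline can be sketched as one minus the ratio of the model's squared error to the baseline's squared error. This is a minimal sketch under one simplifying assumption: the baseline mean is taken from the same targets that are being scored, which is the common textbook form. The function name and data are illustrative.

```python
def r2_score(y_true, y_pred):
    """1 - (model squared error) / (mean-baseline squared error)."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # model error
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)             # baseline error
    if ss_tot == 0:
        raise ValueError("R-Squared is undefined when the target is constant")
    return 1.0 - ss_res / ss_tot


# Illustrative values only.
y_true = [3.0, 5.0, 2.5, 7.0]
y_pred = [2.5, 5.0, 4.0, 8.0]

r2_model = r2_score(y_true, y_pred)
r2_baseline = r2_score(y_true, [sum(y_true) / len(y_true)] * len(y_true))
r2_perfect = r2_score(y_true, y_true)

print(r2_model, r2_baseline, r2_perfect)
```

A model that is worse than the baseline produces a negative value under this definition.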

#### Adjusted R-Squared
- A model that performs the same as the baseline gives an R-Squared of 0
- The better the model, the higher the R-Squared value
- The best model, with all predictions correct, gives an R-Squared of 1
- However, when new features are added to the model, the R-Squared value either increases or remains the same
- R-Squared does not penalize adding features that add no value to the model
- An improved version of R-Squared that does apply such a penalty is the adjusted R-Squared

![image.png](attachment:faf1ed97-ce8d-4fc0-93cc-45c4e36968f3.png)

- k: number of features
- n: number of samples
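The adjusted R-Squared is commonly written as `1 - (1 - R2) * (n - 1) / (n - k - 1)`, with k the number of features and n the number of samples. The sketch below assumes that standard form. The function name and the input numbers are made up for illustration.

```python
def adjusted_r2(r2, n, k):
    """Adjusted R-Squared: 1 - (1 - R2) * (n - 1) / (n - k - 1).

    r2: plain R-Squared of the model
    n:  number of samples
    k:  number of features
    """
    if n - k - 1 <= 0:
        raise ValueError("need more samples than features plus one")
    return 1.0 - (1.0 - r2) * (n - 1) / (n - k - 1)


# Same R-Squared and sample count, different feature counts (illustrative).
adj_few = adjusted_r2(0.80, n=50, k=3)
adj_many = adjusted_r2(0.80, n=50, k=20)

print(adj_few, adj_many)
```

For the same R-Squared, the model that needed more features gets the lower adjusted score, which is the penalty that plain R-Squared lacks.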

## Classification Model Evaluation Metrics


- The following metrics are used to evaluate a classification model:
  - Confusion Matrix
  - F1 Score
  - AUC-ROC


### Sensitivity
- Sensitivity tells us what proportion of the positive class was correctly classified
- For example, it is the proportion of people who are actually sick that the model correctly detects
- Also known as
  - True Positive Rate (TPR)
  - Recall

### False Negative Rate
- False Negative Rate (FNR) tells us what proportion of the positive class was incorrectly classified by the classifier (FNR = 1 - TPR)
- A higher TPR and a lower FNR are desirable, since we want to correctly classify the positive class
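Both rates for the positive class can be computed from counts of true positives and false negatives. The following is a minimal sketch. The function name, the label encoding (1 = sick, 0 = healthy), and the data are assumptions for illustration.

```python
def sensitivity_and_fnr(y_true, y_pred, positive=1):
    """Return (TPR, FNR), both measured over the actual positives only."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp + fn == 0:
        raise ValueError("no positive samples in y_true")
    return tp / (tp + fn), fn / (tp + fn)


# Illustrative labels: 1 = sick, 0 = healthy.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]

# 3 of the 4 sick people are detected and 1 is missed.
tpr, fnr = sensitivity_and_fnr(y_true, y_pred)
print(tpr, fnr)
```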


### Specificity
- Specificity tells us what proportion of the negative class was correctly classified
- In the same example as for Sensitivity, Specificity is the proportion of healthy people who were correctly identified by the model
- Also known as
  - True Negative Rate (TNR)


### False Positive Rate
- False Positive Rate (FPR) tells us what proportion of the negative class was incorrectly classified by the classifier (FPR = 1 - TNR)
- A higher TNR and a lower FPR are desirable, since we want to correctly classify the negative class
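The negative-class rates mirror the positive-class ones, using true negatives and false positives instead. This sketch uses hypothetical names and made-up labels (1 = sick, 0 = healthy).

```python
def specificity_and_fpr(y_true, y_pred, positive=1):
    """Return (TNR, FPR), both measured over the actual negatives only."""
    pairs = list(zip(y_true, y_pred))
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    if tn + fp == 0:
        raise ValueError("no negative samples in y_true")
    return tn / (tn + fp), fp / (tn + fp)


# Illustrative labels: 1 = sick, 0 = healthy.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 1, 0]

# 5 of the 6 healthy people are correctly identified and 1 is flagged as sick.
tnr, fpr = specificity_and_fpr(y_true, y_pred)
print(tnr, fpr)
```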


### AUC-ROC curve
#### 1)
- The Receiver Operating Characteristic (ROC) curve is an evaluation tool for binary classification problems
- It is a probability curve that plots the TPR against the FPR at various threshold values
- It separates the ‘signal’ from the ‘noise’
- The Area Under the Curve (AUC) measures the ability of a classifier to distinguish between classes and is used as a summary of the ROC curve
- The higher the AUC, the better the model is at distinguishing between the positive and negative classes

![image.png](attachment:7e88a772-efbe-4b1b-bec2-ac743d7e3566.png)
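To make the threshold sweep concrete, the sketch below builds the ROC points by hand and integrates them with the trapezoidal rule. It is an illustration in plain Python rather than a library routine. The function names, the 0/1 label encoding, and the scores are assumptions.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) pairs obtained by sweeping the decision threshold."""
    pos = sum(1 for t in y_true if t == 1)
    neg = len(y_true) - pos
    if pos == 0 or neg == 0:
        raise ValueError("need at least one positive and one negative sample")
    points = [(0.0, 0.0)]  # threshold above every score: nothing predicted positive
    # Lower the threshold step by step, one step per distinct score.
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for t, s in zip(y_true, scores) if s >= thr and t == 1)
        fp = sum(1 for t, s in zip(y_true, scores) if s >= thr and t == 0)
        points.append((fp / neg, tp / pos))
    return points


def auc_trapezoid(points):
    """Area under the piecewise-linear curve through the given points."""
    area = 0.0
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        area += (x1 - x0) * (y0 + y1) / 2.0
    return area


# Illustrative labels and predicted scores.
y_true = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]

points = roc_points(y_true, scores)
auc = auc_trapezoid(points)
print(points)
print(auc)
```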


#### 2)
- This is the ideal situation
- When the two curves (the score distributions of the positive and negative classes) don’t overlap at all, the model has an ideal measure of separability
- It is perfectly able to distinguish between the positive class and the negative class

![image.png](attachment:b4271265-f919-4a22-ac6b-34e4ce32cb36.png)


#### 3)
- When the two distributions overlap, we introduce type 1 errors (false positives) and type 2 errors (false negatives)
- Moving the threshold reduces one type of error at the cost of increasing the other
- When AUC is 0.7, there is a 70% chance that the model will rank a randomly chosen positive example above a randomly chosen negative one

![image.png](attachment:fa0db7a7-31aa-4fa6-9611-8e615dcf34ab.png)
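The probabilistic reading of AUC can be checked directly by counting, over all positive/negative pairs, how often the positive example receives the higher score. This is a brute-force sketch meant for small inputs. The function name and the scores are assumptions chosen so that the result is 0.7.

```python
def auc_by_ranking(y_true, scores):
    """Fraction of (positive, negative) pairs where the positive scores higher.

    Ties count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Illustrative data: 2 positives and 5 negatives give 10 pairs.
y_true = [1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.45, 0.8, 0.7, 0.5, 0.4, 0.3]

# The 0.9 positive outranks all 5 negatives; the 0.45 positive outranks only 2.
auc = auc_by_ranking(y_true, scores)
print(auc)
```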

#### 4)
- This is the worst situation
- When AUC is approximately 0.5, the model has no capacity to discriminate between the positive class and the negative class

![image.png](attachment:92627a64-fc8d-4073-9ff3-2bd7530260c3.png)

#### 5)
- When AUC is approximately 0, the model is actually inverting the classes
- It means the model predicts the negative class as the positive class and vice versa

![image.png](attachment:7cb76a01-037e-4567-ab3b-ad758433adbe.png)
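Because such a model ranks the classes in exactly the wrong order, reversing its scores recovers a useful classifier. The sketch below illustrates this with a pairwise-ranking AUC. The helper name and the data are assumptions for the example.

```python
def auc_by_ranking(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Illustrative data: every positive scores lower than every negative.
y_true = [1, 1, 0, 0]
scores = [0.1, 0.2, 0.8, 0.9]

auc_inverted = auc_by_ranking(y_true, scores)

# Reversing the score order (here: 1 - score) swaps the ranking of every pair.
flipped_scores = [1.0 - s for s in scores]
auc_flipped = auc_by_ranking(y_true, flipped_scores)

print(auc_inverted, auc_flipped)
```

In general, reversing the scores turns an AUC of `a` into `1 - a`, so an AUC near 0 carries as much ranking information as one near 1, only with the labels swapped.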