# Regression Model Metrics: Detailed Notes

## 1. Cost Function

The **cost function** measures model performance by quantifying the difference between actual and predicted values.

- **Purpose**: Guides the model to minimize error during training.
- **Common Cost Function in Regression**:  
  - **Mean Squared Error (MSE)**
    $$
    J(\theta) = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2
    $$
    Where \( y_i \) is the actual value, \( \hat{y}_i \) is the predicted value, \( n \) is the number of samples, and \( \theta \) denotes the model parameters that determine the predictions \( \hat{y}_i \).
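
To make the role of the cost function concrete, here is a minimal sketch that evaluates \( J(\theta) \) for a few candidate parameter values. The one-parameter model `y_hat = theta * x`, the helper name `cost`, and the data are assumptions chosen for illustration only.

```python
def cost(theta, xs, ys):
    """MSE cost J(theta) for the hypothetical model y_hat = theta * x."""
    n = len(xs)
    return sum((y - theta * x) ** 2 for x, y in zip(xs, ys)) / n

# Illustrative data generated from y = 2x, so theta = 2 fits perfectly.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]

# Evaluate the cost for a handful of candidate parameter values.
costs = {theta: cost(theta, xs, ys) for theta in (0.0, 1.0, 2.0, 3.0)}

# Training amounts to searching for the theta with the lowest cost.
best_theta = min(costs, key=costs.get)
print(costs)       # {0.0: 30.0, 1.0: 7.5, 2.0: 0.0, 3.0: 7.5}
print(best_theta)  # 2.0
```

A real training procedure would search the parameter space with an optimizer such as gradient descent rather than a fixed list of candidates; the grid here only shows that lower cost corresponds to a better fit.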

## 2. R-squared (R²) – Coefficient of Determination

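**R²** measures the proportion of the variance in the target variable that the model explains.

- **Range**: At most 1 (higher = better fit). For a least-squares fit with an intercept, evaluated on the training data, R² lies between 0 and 1; on held-out data, or for a model without an intercept, it can be negative.
- **Interpretation**:  
  - 0 = Model explains none of the variance (no better than always predicting the mean).
  - 1 = Model explains all the variance.
  - Negative = Model fits worse than always predicting the mean.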
- **Formula**:
  $$
  R^2 = 1 - \frac{SSR}{TSS}
  $$
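  Where \( SSR = \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 \) is the sum of squared residuals (also written RSS or SSE) and \( TSS = \sum_{i=1}^{n} (y_i - \bar{y})^2 \) is the total sum of squares, with \( \bar{y} \) the mean of the actual values.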
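
The R² formula can be sketched in a few lines of plain Python. The function name `r_squared` and the sample values are illustrative assumptions, not part of any particular library.

```python
def r_squared(y_true, y_pred):
    """R^2 = 1 - SSR / TSS, with SSR the sum of squared residuals."""
    n = len(y_true)
    mean_y = sum(y_true) / n
    ssr = sum((y - y_hat) ** 2 for y, y_hat in zip(y_true, y_pred))
    tss = sum((y - mean_y) ** 2 for y in y_true)
    if tss == 0:
        raise ValueError("R^2 is undefined when the target is constant")
    return 1 - ssr / tss

# Illustrative values, not taken from a real dataset.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.0, 8.0, 9.5]

r2 = r_squared(y_true, y_pred)             # SSR = 1.5, TSS = 20
r2_mean = r_squared(y_true, [6.0] * 4)     # always predicting the mean
r2_bad = r_squared(y_true, [9.0, 7.0, 5.0, 3.0])  # reversed predictions
```

With these numbers `r2` is about 0.925, predicting the mean gives exactly 0, and the reversed predictions give a negative value, which shows that R² is not bounded below by 0 in general.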

## 3. Adjusted R-squared

**Adjusted R²** corrects R-squared for the number of features used in the model, penalizing unnecessary complexity.

- **Why it matters**: For a least-squares fit, adding a feature never decreases R² on the training data, even if the feature is not useful. Adjusted R² decreases when a new feature improves the fit too little to justify the extra parameter.
- **Formula**:
  $$
  \overline{R}^2 = 1 - \frac{(1 - R^2) \times (n - 1)}{n - k - 1}
  $$
  Where \( n \) = number of observations, \( k \) = number of features.
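
As a rough illustration of the penalty, the sketch below applies the formula to two hypothetical models. The function name `adjusted_r_squared` and the R², \( n \), and \( k \) values are invented for the example.

```python
def adjusted_r_squared(r2, n, k):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - k - 1)."""
    if n - k - 1 <= 0:
        raise ValueError("need more observations than features plus one")
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

# Hypothetical scenario: a sixth feature nudges R^2 from 0.900 to 0.901.
adj_before = adjusted_r_squared(0.900, n=50, k=5)
adj_after = adjusted_r_squared(0.901, n=50, k=6)

print(round(adj_before, 4))  # 0.8886
print(round(adj_after, 4))   # 0.8872
```

Although plain R² went up slightly, adjusted R² went down, signalling that the extra feature did not earn its place in this made-up scenario.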

## 4. Other Important Metrics

### - Mean Absolute Error (MAE)
Average of absolute differences between actual and predicted values.
$$
MAE = \frac{1}{n} \sum_{i=1}^{n} |y_i - \hat{y}_i|
$$
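
A minimal sketch of the MAE computation, using a hypothetical helper `mean_absolute_error` and made-up values:

```python
def mean_absolute_error(y_true, y_pred):
    """MAE = (1/n) * sum(|y_i - y_hat_i|)."""
    n = len(y_true)
    return sum(abs(y - y_hat) for y, y_hat in zip(y_true, y_pred)) / n

# Illustrative values; the absolute errors are 0.5, 0.0, 1.0, 0.5.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.0, 8.0, 9.5]

mae = mean_absolute_error(y_true, y_pred)
print(mae)  # 0.5
```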

### - Mean Squared Error (MSE)
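Average of squared differences between actual and predicted values; this is the same quantity as the cost function \( J(\theta) \) in Section 1.
$$
MSE = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2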
$$

### - Root Mean Squared Error (RMSE)
Square root of MSE, which expresses the error in the same units as the target variable:
$$
RMSE = \sqrt{MSE}
$$
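
The relationship between MSE and RMSE can be sketched as follows. The helper names and the sample values are assumptions for illustration.

```python
import math

def mean_squared_error(y_true, y_pred):
    """MSE = (1/n) * sum((y_i - y_hat_i)^2), in squared target units."""
    n = len(y_true)
    return sum((y - y_hat) ** 2 for y, y_hat in zip(y_true, y_pred)) / n

def root_mean_squared_error(y_true, y_pred):
    """RMSE = sqrt(MSE), in the same units as the target."""
    return math.sqrt(mean_squared_error(y_true, y_pred))

# Illustrative values; the squared errors are 0.25, 0.0, 1.0, 0.25.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.0, 8.0, 9.5]

mse = mean_squared_error(y_true, y_pred)
rmse = root_mean_squared_error(y_true, y_pred)
print(mse)  # 0.375
```

If the target were measured in, say, metres, `mse` would be in square metres while `rmse` (about 0.61 here) would be back in metres.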

### - Residuals
The difference between observed and predicted values for each data point:
$$
\text{Residual}_i = y_i - \hat{y}_i
$$
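
Residuals are computed per data point, so the result is a list rather than a single number. A small sketch with made-up values:

```python
# Illustrative values, not taken from a real dataset.
y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.5, 5.0, 8.0, 9.5]

# One residual per observation: positive means the model under-predicted.
residuals = [y - y_hat for y, y_hat in zip(y_true, y_pred)]
mean_residual = sum(residuals) / len(residuals)

print(residuals)      # [0.5, 0.0, -1.0, -0.5]
print(mean_residual)  # -0.25
```

A mean residual far from zero, or a visible pattern when residuals are plotted against the predictions, can hint at systematic bias in the model; the metrics above all summarize these same residuals in different ways.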

## 5. Summary Table

| Metric               | Formula                                                              | Usage/Interpretation                                            |
|----------------------|----------------------------------------------------------------------|-----------------------------------------------------------------|
| MAE                  | \( \frac{1}{n} \sum_{i=1}^{n} \lvert y_i - \hat{y}_i \rvert \)       | Average magnitude of error; less sensitive to outliers than MSE |
| MSE                  | \( \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2 \)                 | Penalizes large errors heavily; expressed in squared units      |
| RMSE                 | \( \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2} \)          | Typical error magnitude in original units                       |
| R²                   | \( 1 - \frac{SSR}{TSS} \)                                            | Proportion of variance explained by the model                   |
| Adjusted R²          | \( 1 - \frac{(1 - R^2)(n - 1)}{n - k - 1} \)                         | R² corrected for the number of features                         |

> **Note:**  
> Although these metrics are often introduced alongside linear regression, they are broadly applicable for evaluating and comparing any type of regression model.
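
The different outlier sensitivity of MAE and RMSE noted in the table can be seen in a small experiment. This is a sketch with invented data; the helper names `mae` and `rmse` are assumptions for illustration.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(y - p) for y, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root mean squared error."""
    return math.sqrt(
        sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)
    )

y_true = [10.0, 10.0, 10.0, 10.0]
pred_clean = [11.0, 9.0, 11.0, 9.0]     # every error has magnitude 1
pred_outlier = [11.0, 9.0, 11.0, 20.0]  # one error of magnitude 10

mae_clean, rmse_clean = mae(y_true, pred_clean), rmse(y_true, pred_clean)
mae_out, rmse_out = mae(y_true, pred_outlier), rmse(y_true, pred_outlier)

print(mae_clean, rmse_clean)  # 1.0 1.0
print(mae_out)                # 3.25
```

With uniform errors the two metrics agree, but a single large miss raises RMSE to roughly 5.07 while MAE rises only to 3.25, because squaring gives the large error more weight.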

