# Regression Metrics Example with Ice Cream Sales Data

## Example Data

| Temperature (°C) | Actual Ice Cream Sales | Predicted Ice Cream Sales |
|------------------|------------------------|---------------------------|
| 20               | 100                    | 110                       |
| 22               | 150                    | 140                       |
| 25               | 200                    | 210                       |
| 30               | 300                    | 290                       |
| 35               | 400                    | 410                       |

### Mean Absolute Error (MAE)
MAE is the average of the absolute differences between actual and predicted values. It measures the average magnitude of the errors in a set of predictions, without considering their direction.

**Formula:**
$$ \text{MAE} = \frac{1}{n} \sum_{i=1}^n |y_i - \hat{y}_i| $$

**Calculation:**

$$
\text{MAE} = \frac{|100 - 110| + |150 - 140| + |200 - 210| + |300 - 290| + |400 - 410|}{5} = \frac{10 + 10 + 10 + 10 + 10}{5} = 10
$$
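The same arithmetic can be checked with a few lines of plain Python. This is a minimal sketch using the table above; the helper name `compute_mae` and the list names are illustrative assumptions, not part of any library.

```python
# Values copied from the example table.
actual = [100, 150, 200, 300, 400]
predicted = [110, 140, 210, 290, 410]


def compute_mae(y_true, y_pred):
    """Average of the absolute differences between actual and predicted values."""
    # Hypothetical helper for illustration; assumes equal-length, non-empty inputs.
    return sum(abs(y - p) for y, p in zip(y_true, y_pred)) / len(y_true)


mae = compute_mae(actual, predicted)
print(mae)  # 10.0
```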

### Mean Squared Error (MSE)
MSE is the average of the squared differences between actual and predicted values. Because each error is squared before averaging, larger errors contribute disproportionately to the result, and the value is expressed in squared units of the target.

**Formula:**
$$ \text{MSE} = \frac{1}{n} \sum_{i=1}^n (y_i - \hat{y}_i)^2 $$

**Calculation:**

$$
\text{MSE} = \frac{(100 - 110)^2 + (150 - 140)^2 + (200 - 210)^2 + (300 - 290)^2 + (400 - 410)^2}{5} = \frac{100 + 100 + 100 + 100 + 100}{5} = 100
$$
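As a quick check of the calculation, the sketch below computes MSE in plain Python from the table values. The helper name `compute_mse` is an assumption made for this example.

```python
# Values copied from the example table.
actual = [100, 150, 200, 300, 400]
predicted = [110, 140, 210, 290, 410]


def compute_mse(y_true, y_pred):
    """Average of the squared differences between actual and predicted values."""
    # Hypothetical helper for illustration; assumes equal-length, non-empty inputs.
    return sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)


mse = compute_mse(actual, predicted)
print(mse)  # 100.0
```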

### Root Mean Squared Error (RMSE)
RMSE is the square root of the MSE, that is, the square root of the average squared difference between actual and predicted values. Taking the root returns the metric to the same units as the original data, which makes it easier to interpret than MSE.

**Formula:**
$$ \text{RMSE} = \sqrt{\text{MSE}} $$

**Calculation:**

$$
\text{RMSE} = \sqrt{100} = 10
$$
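A short Python sketch of the same step follows, using only the standard library. The helper name `compute_rmse` is hypothetical; it recomputes the MSE internally so the block runs on its own.

```python
import math

# Values copied from the example table.
actual = [100, 150, 200, 300, 400]
predicted = [110, 140, 210, 290, 410]


def compute_rmse(y_true, y_pred):
    """Square root of the mean squared error."""
    # Hypothetical helper for illustration; assumes equal-length, non-empty inputs.
    mse = sum((y - p) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)
    return math.sqrt(mse)


rmse = compute_rmse(actual, predicted)
print(rmse)  # 10.0
```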

### R-squared (Coefficient of Determination)
R-squared is a statistical measure that represents the proportion of the variance in the dependent variable that is predictable from the independent variables.

**Formula:**
$$
R^2 = 1 - \frac{\sum_{i=1}^n (y_i - \hat{y}_i)^2}{\sum_{i=1}^n (y_i - \bar{y})^2}
$$

**Calculation:**

1. Calculate the mean of the actual sales:
$$
\bar{y} = \frac{100 + 150 + 200 + 300 + 400}{5} = 230
$$

2. Calculate the total sum of squares (TSS):
$$
\sum_{i=1}^n (y_i - \bar{y})^2 = (100 - 230)^2 + (150 - 230)^2 + (200 - 230)^2 + (300 - 230)^2 + (400 - 230)^2 = 16900 + 6400 + 900 + 4900 + 28900 = 58000
$$

3. Calculate the residual sum of squares (RSS):
$$
\sum_{i=1}^n (y_i - \hat{y}_i)^2 = (100 - 110)^2 + (150 - 140)^2 + (200 - 210)^2 + (300 - 290)^2 + (400 - 410)^2 = 100 + 100 + 100 + 100 + 100 = 500
$$

4. Calculate R-squared:
$$
R^2 = 1 - \frac{500}{58000} \approx 0.9914
$$
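The four steps above can be sketched in plain Python as follows. The function name `compute_r_squared` and its internal variable names are assumptions chosen to mirror the TSS and RSS terms in the formula.

```python
# Values copied from the example table.
actual = [100, 150, 200, 300, 400]
predicted = [110, 140, 210, 290, 410]


def compute_r_squared(y_true, y_pred):
    """R^2 = 1 - RSS / TSS, following the four steps in the text."""
    # Hypothetical helper for illustration. It is undefined when every actual
    # value is identical, because TSS would then be zero.
    mean_y = sum(y_true) / len(y_true)                       # step 1: mean
    tss = sum((y - mean_y) ** 2 for y in y_true)             # step 2: TSS
    rss = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))  # step 3: RSS
    return 1 - rss / tss                                     # step 4: R^2


r2 = compute_r_squared(actual, predicted)
print(round(r2, 4))  # 0.9914
```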

### Interpretation
- **MAE**: The average absolute error is 10 ice creams, so on average our predictions are off by 10 ice creams in either direction.
- **MSE**: The average of the squared errors is 100. Because the errors are squared, this value is in squared units (ice creams squared) and is harder to interpret directly than MAE or RMSE.
- **RMSE**: The root mean squared error is 10 ice creams, expressed on the original scale of the data. It equals the MAE here only because every prediction is off by the same amount (10). In general RMSE is at least as large as MAE, since squaring gives more weight to larger errors.
- **R-squared**: An R-squared value of approximately 0.9914 means that the model's predictions account for about 99.14% of the variance in ice cream sales in this sample. The remaining 0.86% is residual variance that the predictions do not capture.
