**Summary of "Neural Networks From Scratch - Lec 20 - Regression Losses - MAE, MAPE & MBE":**

1. **Regression**: Involves predicting continuous values (e.g., car prices).

2. **Loss Functions**: Quantify the difference between predicted (\( \hat{y} \)) and actual (\( y \)) values to adjust network parameters.

3. **Mean Bias Error (MBE)**:
   $$ 
   \text{MBE} = \frac{1}{n} \sum_{i=1}^{n} (\hat{y}_i - y_i) 
   $$
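   - **Advantage**: Simple to compute, and its sign shows the direction of the average bias (positive means over-prediction, given \( \hat{y}_i - y_i \)).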
   - **Disadvantage**: Can be misleading due to cancellation of positive and negative errors, potentially resulting in a zero error despite poor predictions.
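The cancellation effect is easy to see on a small example. The sketch below is illustrative only: the function name `mean_bias_error` and the car prices (in thousands) are made up for this note, not taken from the lecture.

```python
def mean_bias_error(y_pred, y_true):
    """Average signed error; positive means over-prediction on average."""
    return sum(p - t for p, t in zip(y_pred, y_true)) / len(y_true)


# Hypothetical car prices in thousands; every prediction is off by 10.
y_true = [20.0, 30.0, 40.0, 50.0]
y_pred = [30.0, 20.0, 50.0, 40.0]

mbe = mean_bias_error(y_pred, y_true)
print(mbe)  # 0.0, even though no single prediction is correct
```

The errors are +10, -10, +10, -10, so they sum to zero and MBE reports a perfect score.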

4. **Mean Absolute Error (MAE)**:
   $$ 
   \text{MAE} = \frac{1}{n} \sum_{i=1}^{n} |\hat{y}_i - y_i| 
   $$
   - **Advantage**: More reliable than MBE; every term is non-negative, so errors cannot cancel.
   - **Disadvantage**: Reported in the units of the target, so it is not scale-independent and cannot be compared across datasets with different scales.
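A minimal sketch of MAE on the same kind of made-up data, assuming plain Python lists as inputs. The name `mean_absolute_error` and the prices are hypothetical; the second call shows the scale dependence by expressing the same prices in single units instead of thousands.

```python
def mean_absolute_error(y_pred, y_true):
    """Average magnitude of the errors, in the units of the target."""
    return sum(abs(p - t) for p, t in zip(y_pred, y_true)) / len(y_true)


# Hypothetical car prices in thousands; every prediction is off by 10.
y_true = [20.0, 30.0, 40.0, 50.0]
y_pred = [30.0, 20.0, 50.0, 40.0]

mae_thousands = mean_absolute_error(y_pred, y_true)
print(mae_thousands)  # 10.0

# Same predictions, expressed in single units rather than thousands.
mae_units = mean_absolute_error(
    [p * 1000 for p in y_pred], [t * 1000 for t in y_true]
)
print(mae_units)  # 10000.0
```

The model is equally good in both cases, yet the MAE differs by a factor of 1000, which is why it cannot be compared across targets with different scales.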

5. **L1 Loss**: Another name for MAE, after the L1 norm (the sum of absolute values) of the error vector.

6. **Mean Absolute Percentage Error (MAPE)**:
   $$ 
   \text{MAPE} = \frac{1}{n} \sum_{i=1}^{n} \left| \frac{\hat{y}_i - y_i}{y_i} \right| \times 100 
   $$
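   - **Advantage**: Expresses error as a percentage of the actual value, so it is independent of the target's scale; because errors are not squared, it is also less sensitive to large errors than squared-error losses.
   - **Disadvantage**: Undefined when any actual target value is zero, and inflated when targets are close to zero.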
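The sketch below shows one way MAPE could be computed, including an explicit check for zero targets. The function name and data are hypothetical, and raising `ValueError` is a design choice made for this illustration; the lecture does not prescribe how to handle zeros.

```python
def mean_absolute_percentage_error(y_pred, y_true):
    """Average absolute error relative to the actual value, in percent."""
    if any(t == 0 for t in y_true):
        # Division by zero: MAPE is undefined for zero targets.
        raise ValueError("MAPE is undefined when an actual value is zero")
    ratios = [abs((p - t) / t) for p, t in zip(y_pred, y_true)]
    return 100 * sum(ratios) / len(ratios)


# Hypothetical targets on very different scales; each prediction is 10% off.
y_true = [20.0, 40.0, 50.0, 100.0]
y_pred = [22.0, 36.0, 55.0, 90.0]

mape = mean_absolute_percentage_error(y_pred, y_true)
print(round(mape, 6))  # roughly 10 percent
```

Rescaling both lists by the same factor leaves the result unchanged, which is the scale independence noted above.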

7. **Conclusion**: Loss functions are crucial for evaluating regression models; more functions will be covered in future videos.


**Summary of "Neural Networks From Scratch - Lec 21 - Regression Losses - MSE & RMSE":**

1. **Regression Loss Functions**: Focus on Mean Squared Error (MSE) and Root Mean Squared Error (RMSE), commonly used for regression tasks.

2. **Mean Squared Error (MSE)**:
   $$
   \text{MSE} = \frac{1}{n} \sum_{i=1}^{n} (\hat{y}_i - y_i)^2
   $$
   - **Advantage**:
     - Simple to compute and implement.
     - Smooth loss surface, making it easier to optimize.
     - Penalizes larger errors more heavily due to squaring.
   - **Disadvantage**:
     - Sensitive to outliers, as they can significantly increase the MSE.
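To illustrate how squaring amplifies large errors, here is a minimal sketch with made-up numbers. The name `mean_squared_error` and both prediction lists are assumptions for this example.

```python
def mean_squared_error(y_pred, y_true):
    """Average of the squared errors."""
    return sum((p - t) ** 2 for p, t in zip(y_pred, y_true)) / len(y_true)


y_true = [10.0, 20.0, 30.0, 40.0]

# Every prediction is off by 1.
y_pred_close = [11.0, 19.0, 31.0, 39.0]
# Same as above, except the last prediction is off by 10.
y_pred_outlier = [11.0, 19.0, 31.0, 50.0]

mse_close = mean_squared_error(y_pred_close, y_true)
mse_outlier = mean_squared_error(y_pred_outlier, y_true)
print(mse_close)    # 1.0
print(mse_outlier)  # 25.75
```

One error of 10 contributes 100 to the sum, so a single bad prediction raises the MSE from 1.0 to 25.75.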

3. **Root Mean Squared Error (RMSE)**:
   $$
   \text{RMSE} = \sqrt{\text{MSE}} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (\hat{y}_i - y_i)^2}
   $$
   - **Advantage**:
     - Provides error in the same units as the original data, making interpretation easier.
     - Retains the benefits of squaring errors, penalizing large deviations.
   - **Disadvantage**:
     - Still sensitive to outliers, similar to MSE.
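A short sketch of the unit argument, assuming hypothetical car prices in thousands. The function names are made up for this note.

```python
import math


def mean_squared_error(y_pred, y_true):
    """Average of the squared errors (units: target units squared)."""
    return sum((p - t) ** 2 for p, t in zip(y_pred, y_true)) / len(y_true)


def root_mean_squared_error(y_pred, y_true):
    """Square root of MSE (units: same as the target)."""
    return math.sqrt(mean_squared_error(y_pred, y_true))


# Hypothetical car prices in thousands; every prediction is off by 3.
y_true = [20.0, 30.0, 40.0, 50.0]
y_pred = [23.0, 27.0, 43.0, 47.0]

mse = mean_squared_error(y_pred, y_true)
rmse = root_mean_squared_error(y_pred, y_true)
print(mse)   # 9.0 (thousands squared)
print(rmse)  # 3.0 (thousands, matching the typical size of the errors)
```

The RMSE of 3.0 can be read directly as "off by about 3 thousand", whereas the MSE of 9.0 has no such direct reading.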

4. **Comparison**:
   - **MSE** is preferred for its optimization properties (a smooth, differentiable loss surface), although it remains sensitive to outliers during training.
   - **RMSE** is useful when interpretability is important, as it relates directly to the error scale.
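To make the optimization properties mentioned above more concrete, here is a short derivation of each loss's gradient with respect to a single prediction \( \hat{y}_j \). It is not taken from the lecture, and the MAE result is included only for contrast.

$$
\frac{\partial\,\text{MSE}}{\partial \hat{y}_j} = \frac{2}{n}(\hat{y}_j - y_j), \qquad
\frac{\partial\,\text{RMSE}}{\partial \hat{y}_j} = \frac{\hat{y}_j - y_j}{n \cdot \text{RMSE}}, \qquad
\frac{\partial\,\text{MAE}}{\partial \hat{y}_j} = \frac{1}{n}\,\text{sign}(\hat{y}_j - y_j)
$$

The MSE gradient is linear in the error and defined everywhere, which is what makes its loss surface smooth. It also grows with the error, so a single outlier can dominate a parameter update. The RMSE gradient is undefined when \( \text{RMSE} = 0 \), and the MAE gradient has constant magnitude and is undefined where \( \hat{y}_j = y_j \).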

5. **Conclusion**: Both loss functions have their use cases, and understanding their properties helps in choosing the right one for specific problems.
