## 3. What Comes Next: The Role of Loss Functions

Now that we've seen how neural networks use layers and weights to make predictions, the next question is: **how does the network know if its predictions are correct?** This is where the concept of a **loss function** comes in.

--- 
### What is a Loss Function?

A loss function is a way to measure how well the neural network's predictions match the actual target values. It provides a numerical value that indicates how much error there is in the predictions. The smaller the loss, the closer the network's predictions are to the actual targets.

![loss_understand.png](attachment:loss_understand.png)

There are many loss functions used for different cases, but today we will focus on the Mean Squared Error (MSE) loss, which is one of the simplest loss functions to understand. Here’s how it looks mathematically:

![loss_graph.png](attachment:loss_graph.png)

### Mathematical Representation of MSE

The Mean Squared Error (MSE) measures the average squared difference between the predicted values and the actual target values. It is calculated using the following formula:

$$
\text{MSE} = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2
$$


Where:

- $n$ is the number of data points.
- $y_i$ is the actual target value for the $i$-th data point.
- $\hat{y}_i$ is the predicted value for the $i$-th data point.

*In other words, MSE is the average of the squared differences between the predicted values and the actual target values.*

***Here is a [video](https://www.youtube.com/watch?v=VaOlkbKQFcY) that explains it in more detail.***
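
As a small worked example, assume two data points with targets $y = [2, 4]$ and predictions $\hat{y} = [1.5, 5]$ (numbers made up purely for illustration). Plugging them into the formula gives:

$$
\text{MSE} = \frac{(2 - 1.5)^2 + (4 - 5)^2}{2} = \frac{0.25 + 1}{2} = 0.625
$$

Note that the second point, whose error is twice as large, contributes four times as much to the loss: squaring makes large errors count more.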



---
#### Let's try to implement the MSE function!

In [None]:
import numpy as np

target_output = np.array([1.0, 0.0, 1.0, 0.0])

predicted_output = np.array([0.9, 0.1, 0.8, 0.2])

# TODO: calculate the Mean Squared Error
mse_loss = ...

print("Mean Squared Error", mse_loss)

Well done! Now you understand how a loss function works! PyTorch implements many different loss functions, including Mean Squared Error (MSE) and Cross-Entropy Loss. The latter is one of the most commonly used loss functions for classification tasks.

The cell below shows how these loss functions are used in PyTorch. **Try to fill in the missing parts!**

In [None]:
import torch
import torch.nn as nn

# Example using Mean Squared Error (MSE) Loss
mse_loss = ...

# Example using Cross-Entropy Loss
cross_entropy_loss = ...

predicted_output = torch.tensor([0.9, 0.1, 0.8, 0.2])
target_output = torch.tensor([0.9, 0.2, 0.79, 0.3])

result_mse = mse_loss(predicted_output, target_output)
print("MSE Loss", result_mse)

result_cross_entropy = cross_entropy_loss(predicted_output, target_output)
print("Cross-Entropy Loss", result_cross_entropy)

___
# ***Bonus***
### Understanding Cross-Entropy Loss

**Cross-Entropy Loss** is a crucial loss function used primarily for classification tasks. It measures the difference between the actual distribution of classes and the predicted distribution, making it ideal for tasks where the goal is to predict which category or class an input belongs to.

### How Cross-Entropy Loss Works

- **Target Representation**: For classification, the target output is often represented as a one-hot encoded vector. For example, if there are three classes and the correct class is the second one, the target vector would look like $[0, 1, 0]$.

- **Predicted Probabilities**: The model generates a vector of predicted probabilities, one per class, that sum to 1. For example, the prediction might be $[0.2, 0.7, 0.1]$.

- **Calculation**: Cross-Entropy Loss measures the difference between the true class (the one-hot vector) and the predicted probabilities. The formula is:

  $$
  L = -\sum_{i=1}^{n} y_i \cdot \log(\hat{y}_i)
  $$

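  Where:
  - $n$ is the number of classes (not the number of data points, as it was in the MSE formula).
  - $y_i$ is the actual target value for class $i$ (1 for the correct class, 0 otherwise).
  - $\hat{y}_i$ is the predicted probability for class $i$.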
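
To make the formula concrete, here is a minimal sketch in plain Python that applies it to the example vectors above. The helper name `cross_entropy` is hypothetical and chosen for illustration only; it is not a library function.

```python
import math

def cross_entropy(target, predicted):
    """Cross-entropy between a one-hot target and predicted probabilities.

    Terms where the target is 0 contribute nothing to the sum, so they are
    skipped (this also avoids evaluating log(0) for classes we do not care about).
    """
    return -sum(y * math.log(p) for y, p in zip(target, predicted) if y > 0)

target = [0, 1, 0]           # one-hot: the correct class is the second one
predicted = [0.2, 0.7, 0.1]  # predicted probabilities, summing to 1

loss = cross_entropy(target, predicted)
print("Cross-Entropy Loss", loss)  # equals -log(0.7), roughly 0.357
```

Because the target is one-hot, only the probability assigned to the correct class matters. Note that PyTorch's `nn.CrossEntropyLoss` expects raw scores (logits) rather than probabilities and applies a softmax internally, so feeding it the same numbers will not reproduce this value.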

### Why Use Cross-Entropy Loss?

- **Better for Classification**: Unlike Mean Squared Error, which measures squared differences, Cross-Entropy Loss directly compares the probability distributions, making it more suitable for classification problems. It heavily penalizes confident but incorrect predictions: the term $-\log(\hat{y}_i)$ grows without bound as the probability assigned to the correct class approaches 0, which pushes the model toward confident and accurate predictions.
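
The difference in penalties can be sketched numerically. The snippet below is an illustration under simplifying assumptions: it looks only at the probability the model assigns to the correct class, and the probability values are made up for the comparison.

```python
import math

# Hypothetical probabilities assigned to the correct class, from good to very bad
true_class_probs = [0.9, 0.5, 0.1, 0.01]

# Squared error on the correct-class component (target is 1)
squared_errors = [(1.0 - p) ** 2 for p in true_class_probs]

# Cross-entropy for a one-hot target reduces to -log(p_correct)
cross_entropies = [-math.log(p) for p in true_class_probs]

for p, se, ce in zip(true_class_probs, squared_errors, cross_entropies):
    print(f"p(correct)={p:<5} squared error={se:.3f}  cross-entropy={ce:.3f}")
```

The squared error can never exceed 1 here, while the cross-entropy keeps growing as the prediction gets worse, so very wrong predictions produce a much stronger training signal.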

### Summary

Cross-Entropy Loss is a powerful tool for training classification models because it directly measures how well the predicted probabilities match the actual classes. By minimizing this loss, a model learns to make more accurate predictions.
