# Loss Function

Loss functions, also known as cost functions or objective functions, play a crucial role in training neural networks. They quantify the discrepancy between a network's predicted outputs and the actual target values. This section covers loss functions commonly used in neural networks, with code examples in PyTorch.
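
To make the role of the loss in training concrete, here is a minimal sketch of a training loop on a toy problem. The data, the linear model, the learning rate, and the number of steps are all illustrative assumptions rather than part of the examples that follow.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)  # reproducible initial weights

# Hypothetical toy regression data: y = 2x + 1
x = torch.linspace(-1.0, 1.0, 20).unsqueeze(1)
y = 2.0 * x + 1.0

model = nn.Linear(1, 1)
criterion = nn.MSELoss()
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

losses = []
for _ in range(50):
    optimizer.zero_grad()          # clear gradients from the previous step
    loss = criterion(model(x), y)  # scalar measure of the current error
    loss.backward()                # gradients of the loss w.r.t. the parameters
    optimizer.step()               # update parameters against the gradient
    losses.append(loss.item())
```

The recorded `losses` should shrink over the steps, which is the sense in which the loss guides the parameters toward better predictions.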



During training, the loss provides the signal that drives optimization: its gradient with respect to the model's parameters tells the optimizer how to adjust them. The choice of loss function depends on the type of task and the nature of the data. Here are some common loss functions:

### 1. Mean Squared Error (MSE) Loss:

* Suitable for regression problems.
* Measures the average squared difference between predicted values and target values.
* Encourages the model to produce predictions close to the targets.

In [6]:
import torch
import torch.nn as nn

criterion = nn.MSELoss()
predicted = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
target = torch.tensor([0.8, 1.9, 3.2])

loss = criterion(predicted, target)
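
As a sanity check, the same value can be computed by hand as the mean of the squared differences, and the gradient that flows back to the predictions can be compared with the closed form. This is a small sketch reusing the tensors from the cell above; `manual` and `expected_grad` are names introduced here for illustration.

```python
import torch
import torch.nn as nn

predicted = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
target = torch.tensor([0.8, 1.9, 3.2])

loss = nn.MSELoss()(predicted, target)
manual = ((predicted - target) ** 2).mean()  # mean of squared differences

loss.backward()
# Closed-form gradient: d(loss)/d(predicted) = 2 * (predicted - target) / N
expected_grad = 2 * (predicted.detach() - target) / predicted.numel()
```

Both `loss` and `manual` come out at about 0.03, and `predicted.grad` matches `expected_grad`.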

### 2. Cross-Entropy Loss (Binary Cross-Entropy and Categorical Cross-Entropy):

* Suitable for classification problems.
* Measures the dissimilarity between predicted class probabilities and true class labels.
* Binary Cross-Entropy is used for binary classification, while Categorical Cross-Entropy is used for multi-class classification.

In [9]:
# Binary Cross-Entropy Loss
# nn.BCELoss expects probabilities in [0, 1], e.g. the output of a sigmoid
criterion_binary = nn.BCELoss()
predicted = torch.tensor([0.8, 0.2], requires_grad=True)
target = torch.tensor([1.0, 0.0])

loss_binary = criterion_binary(predicted, target)
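
When a model produces raw scores instead of probabilities, `nn.BCEWithLogitsLoss` combines the sigmoid and the binary cross-entropy in one call. The sketch below, with hypothetical logit values chosen for illustration, checks that it agrees with applying `torch.sigmoid` followed by `nn.BCELoss`.

```python
import torch
import torch.nn as nn

logits = torch.tensor([1.5, -1.0])  # hypothetical raw model outputs
target = torch.tensor([1.0, 0.0])

# Two equivalent routes to the same loss value
loss_from_probs = nn.BCELoss()(torch.sigmoid(logits), target)
loss_from_logits = nn.BCEWithLogitsLoss()(logits, target)
```

The fused version is generally the more numerically stable choice when the logits are large in magnitude.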

In [10]:
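# Categorical Cross-Entropy Loss
# nn.CrossEntropyLoss applies log-softmax internally, so it expects raw,
# unnormalized scores (logits) of shape (batch, classes), not probabilities.
criterion_categorical = nn.CrossEntropyLoss()
predicted = torch.tensor([[0.2, 0.8],[0.9, 0.1],[0.4, 0.6]], requires_grad=True)
target = torch.tensor([1, 0, 1], dtype=torch.long)  # class indices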

loss_categorical = criterion_categorical(predicted, target)
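
To show what `nn.CrossEntropyLoss` computes, the sketch below reproduces it in two ways: as `F.nll_loss` applied to `F.log_softmax`, and as the mean negative log-probability of the true class. The inputs are treated as logits; the variable names are introduced here for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

logits = torch.tensor([[0.2, 0.8], [0.9, 0.1], [0.4, 0.6]])
target = torch.tensor([1, 0, 1])

loss_ce = nn.CrossEntropyLoss()(logits, target)

# Equivalent decomposition: log-softmax followed by negative log-likelihood
log_probs = F.log_softmax(logits, dim=1)
loss_nll = F.nll_loss(log_probs, target)

# Same thing by hand: pick the log-probability of each true class and average
manual = -log_probs[torch.arange(len(target)), target].mean()
```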

### 3. Hinge Loss:

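* Suitable for maximum-margin classifiers such as support vector machines (SVMs), for binary classification with labels in {-1, +1}.
* Penalizes predictions that are misclassified or fall inside the margin, which pushes the decision boundary away from both classes.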

In [12]:
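# nn.HingeEmbeddingLoss targets distance-based similarity learning and does not
# compute the SVM hinge loss, so max(0, 1 - y * f(x)) is written out directly.
predicted = torch.tensor([0.9, -0.5], requires_grad=True)  # raw scores f(x)
target = torch.tensor([1.0, -1.0])  # labels in {-1, +1}

loss = torch.clamp(1 - target * predicted, min=0).mean()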

### 4. Huber Loss:

* A combination of L1 and L2 losses: quadratic for errors smaller than a threshold `delta`, linear for larger errors.
* Less sensitive to outliers compared to MSE.
* Suitable for regression tasks.

In [13]:
criterion = nn.HuberLoss()
predicted = torch.tensor([2.0, 6.0], requires_grad=True)
target = torch.tensor([1.0, 5.0])

loss = criterion(predicted, target)
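
The claim about outliers can be illustrated by comparing the two losses on data with a single large error. The values below are made up for the comparison, and `delta=1.0` is stated explicitly even though it is the default.

```python
import torch
import torch.nn as nn

target = torch.tensor([1.0, 2.0, 3.0, 4.0])
predicted = torch.tensor([1.1, 1.9, 3.2, 14.0])  # last prediction is off by 10

mse = nn.MSELoss()(predicted, target)               # squares the outlier error
huber = nn.HuberLoss(delta=1.0)(predicted, target)  # linear beyond delta
```

Here the MSE is about 25.0 while the Huber loss is about 2.38, because the outlier contributes its error linearly rather than quadratically.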

### 5. Kullback-Leibler Divergence (KL Divergence):

* Used in probabilistic models and variational autoencoders.
* Measures how one probability distribution diverges from another. It is asymmetric: KL(p || q) generally differs from KL(q || p).

In [16]:
import torch.nn.functional as F

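p = torch.tensor([0.25, 0.75])  # target distribution
q = torch.tensor([0.3, 0.7])    # approximating distribution

# F.kl_div takes log-probabilities as input and probabilities as target.
# reduction='sum' yields KL(p || q); the default 'mean' averages over elements instead.
loss = F.kl_div(torch.log(q), p, reduction='sum')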
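
For reference, KL(p || q) is the sum over outcomes of p * (log p - log q). The sketch below computes it by hand and compares it with `F.kl_div` called with `reduction="sum"`, which is the reduction that matches this definition for a single distribution. The names `manual` and `builtin` are introduced here for illustration.

```python
import torch
import torch.nn.functional as F

p = torch.tensor([0.25, 0.75])  # target distribution
q = torch.tensor([0.3, 0.7])    # approximating distribution

# Definition: KL(p || q) = sum_i p_i * (log p_i - log q_i)
manual = torch.sum(p * (torch.log(p) - torch.log(q)))

# F.kl_div expects log-probabilities as input and probabilities as target
builtin = F.kl_div(torch.log(q), p, reduction="sum")
```

Note the argument order: the first argument is the log of the approximating distribution, the second is the target distribution.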

### 6. Custom Loss Function:

* You can define custom loss functions tailored to specific tasks.
* This is often necessary for tasks with unique requirements.

In [17]:
def custom_loss(predicted, target):
    # Mean absolute error (equivalent to an L1 loss)
    error = torch.abs(predicted - target)
    return torch.mean(error)

predicted = torch.tensor([2.0, 6.0], requires_grad=True)
target = torch.tensor([1.5, 5.0])

loss = custom_loss(predicted, target)
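
A custom loss can also be packaged as an `nn.Module` so it is used like the built-in criteria. The following is a minimal sketch; the class name `MeanAbsoluteError` is hypothetical, and the comparison with `nn.L1Loss` is included only to confirm the two agree.

```python
import torch
import torch.nn as nn

class MeanAbsoluteError(nn.Module):
    """Hypothetical custom criterion: mean of absolute differences."""

    def forward(self, predicted, target):
        return torch.mean(torch.abs(predicted - target))

criterion = MeanAbsoluteError()
predicted = torch.tensor([2.0, 6.0], requires_grad=True)
target = torch.tensor([1.5, 5.0])

loss = criterion(predicted, target)
loss.backward()  # autograd differentiates the custom loss automatically

# Built-in equivalent, for comparison
reference = nn.L1Loss()(predicted.detach(), target)
```

Because the loss is built from differentiable tensor operations, no hand-written backward pass is needed.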