<h1>Building and Training a Simple Neural Network in PyTorch</h1>

Now that we have a grasp on tensors and autograd, let's build a simple neural network from scratch using PyTorch.<br>
We'll cover defining the <b>network architecture</b>, <b>preparing the dataset</b>, <b>defining the loss function</b> and <b>optimizer</b>, and <b>training the model</b>.

<h4>Step-by-Step Guide to Building a Neural Network</h4>
<ol>
<li><b>Define the Neural Network Architecture:</b> Create a class inheriting from nn.Module.</li>
<li><b>Prepare the Dataset:</b> Use PyTorch's Dataset and DataLoader classes.</li>
<li><b>Define the Loss Function and Optimizer:</b> Use built-in loss functions and optimizers.</li>
<li><b>Train the Model:</b> Implement the training loop.</li>
</ol>

<h4>1. Define the Neural Network Architecture</h4>
We'll define a simple feedforward neural network with one hidden layer.

In [14]:
import torch
import torch.nn as nn
import torch.optim as optim

# Define the neural network
class SimpleNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.hidden = nn.Linear(2, 5)  # Hidden layer: 2 input features -> 5 hidden units
        self.output = nn.Linear(5, 1)  # Output layer: 5 hidden units -> 1 output value

    def forward(self, x):
        x = torch.relu(self.hidden(x))  # Apply ReLU activation
        x = self.output(x)
        return x

# Instantiate the network
net = SimpleNet()
print(net)

SimpleNet(
  (hidden): Linear(in_features=2, out_features=5, bias=True)
  (output): Linear(in_features=5, out_features=1, bias=True)
)
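
To check that the layer sizes line up, it can help to push a dummy batch through the network and count its parameters. The sketch below redefines the same SimpleNet so that it runs on its own; the names batch, out, and n_params are illustrative and not part of the original notebook.

```python
import torch
import torch.nn as nn

class SimpleNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.hidden = nn.Linear(2, 5)
        self.output = nn.Linear(5, 1)

    def forward(self, x):
        return self.output(torch.relu(self.hidden(x)))

net = SimpleNet()

# A dummy batch of 4 samples with 2 features each (values are arbitrary).
batch = torch.randn(4, 2)
out = net(batch)
print(out.shape)  # torch.Size([4, 1]): one prediction per sample

# Trainable parameters: (2*5 + 5) for the hidden layer plus (5*1 + 1) for the output layer.
n_params = sum(p.numel() for p in net.parameters())
print(n_params)  # 21
```

The first dimension of the input is the batch size, so the same network accepts batches of any length as long as each sample has 2 features.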


<h4>2. Prepare the Dataset</h4>
We'll create a simple synthetic dataset for training.

In [20]:
from torch.utils.data import Dataset, DataLoader

# Create a synthetic dataset
class SimpleDataset(Dataset):
    def __init__(self):
        self.data = torch.tensor([[1.0, 2.0], [2.0, 3.0], [3.0, 4.0], [4.0, 5.0]])  # shape (4, 2): 4 samples, 2 features
        self.targets = torch.tensor([[1.0], [2.0], [3.0], [4.0]])  # shape (4, 1): one target per sample

    def __len__(self):
        return len(self.data)

    def __getitem__(self, idx):
        return self.data[idx], self.targets[idx]

# Instantiate the dataset and dataloader
dataset = SimpleDataset()
dataloader = DataLoader(dataset, batch_size=2, shuffle=True)  # Yields batches of 2 samples, reshuffled every epoch

for data, target in dataloader:
    print("Data:", data)
    print("Target:", target)


Data: tensor([[2., 3.],
        [1., 2.]])
Target: tensor([[2.],
        [1.]])
Data: tensor([[4., 5.],
        [3., 4.]])
Target: tensor([[4.],
        [3.]])
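
For data that already lives in tensors, a custom Dataset subclass is not strictly required. As a sketch of an alternative (not what the notebook above uses), torch.utils.data.TensorDataset wraps the same two tensors and yields the same (data, target) pairs. The variable names here are illustrative.

```python
import torch
from torch.utils.data import TensorDataset, DataLoader

data = torch.tensor([[1.0, 2.0], [2.0, 3.0], [3.0, 4.0], [4.0, 5.0]])  # shape (4, 2)
targets = torch.tensor([[1.0], [2.0], [3.0], [4.0]])                   # shape (4, 1)

tensor_dataset = TensorDataset(data, targets)
loader = DataLoader(tensor_dataset, batch_size=2, shuffle=True)

print(len(tensor_dataset))  # 4 samples
print(len(loader))          # 2 batches per epoch (4 samples / batch size 2)

for x, y in loader:
    # Each batch holds 2 samples; the order changes between epochs because shuffle=True.
    print(x.shape, y.shape)
```

A custom Dataset class remains the better fit when samples must be loaded or transformed lazily, for example when reading files from disk.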


<h4>3. Define the Loss Function and Optimizer</h4>
We'll use Mean Squared Error (MSE) as the loss function and Stochastic Gradient Descent (SGD) as the optimizer.

In [25]:
# Define the loss function and optimizer
criterion = nn.MSELoss()                            # Mean Squared Error (MSE)
optimizer = optim.SGD(net.parameters(), lr=0.01)    # Stochastic Gradient Descent (SGD) 
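
To make these two objects less of a black box, here is a small sketch of what each one computes, using made-up numbers rather than the network above. nn.MSELoss with its default reduction='mean' averages the squared differences, and one plain SGD step (no momentum or weight decay) moves each parameter by -lr * grad. The tensors pred, target, and w are hypothetical.

```python
import torch
import torch.nn as nn
import torch.optim as optim

# --- What MSELoss computes ---
pred = torch.tensor([[1.5], [2.0]])
target = torch.tensor([[1.0], [2.0]])

criterion = nn.MSELoss()
loss_builtin = criterion(pred, target)
loss_manual = ((pred - target) ** 2).mean()  # (0.5**2 + 0.0**2) / 2
print(loss_builtin.item(), loss_manual.item())  # 0.125 0.125

# --- What one SGD step does ---
w = torch.tensor([1.0], requires_grad=True)
optimizer = optim.SGD([w], lr=0.01)

loss = ((2 * w - 1) ** 2).sum()  # d(loss)/dw = 4 * (2w - 1), which is 4 at w = 1
optimizer.zero_grad()
loss.backward()
grad = w.grad.clone()            # keep a copy of the gradient for inspection
optimizer.step()                 # w <- w - lr * grad = 1 - 0.01 * 4

print(grad.item())  # 4.0
print(w.item())     # approximately 0.96
```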

<h4>4. Train the Model</h4>
Implement the training loop to train the model over multiple epochs.

In [30]:
# Training loop
num_epochs = 1000

for epoch in range(num_epochs):
    for data, target in dataloader:
        # Zero gradients
        optimizer.zero_grad()     # Reset gradients; backward() adds to existing .grad values otherwise.
        
        # Forward pass
        output = net(data)
        
        # Compute loss
        loss = criterion(output, target)
        
        # Backward pass (compute gradients)
        loss.backward()
        
        # Update weights
        optimizer.step()
    
    if (epoch + 1) % 100 == 0:           # Every 100th epoch, print the loss of that epoch's last batch
        print(f'Epoch [{epoch + 1}/{num_epochs}], Loss: {loss.item():.4f}')


Epoch [100/1000], Loss: 0.0169
Epoch [200/1000], Loss: 0.0013
Epoch [300/1000], Loss: 0.0002
Epoch [400/1000], Loss: 0.0002
Epoch [500/1000], Loss: 0.0000
Epoch [600/1000], Loss: 0.0000
Epoch [700/1000], Loss: 0.0000
Epoch [800/1000], Loss: 0.0000
Epoch [900/1000], Loss: 0.0000
Epoch [1000/1000], Loss: 0.0000
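
The loss printed above is the loss of the last mini-batch in each reported epoch, so it can be noisy. A common variation, sketched below under the same setup, is to average the loss over all batches in the epoch and, once training is done, to run predictions inside torch.no_grad() so that no computation graph is built. The fixed seed, the nn.Sequential shorthand for the same architecture, the shorter epoch count, and the names running_loss, epoch_loss, and predictions are assumptions made so that the snippet runs on its own.

```python
import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import TensorDataset, DataLoader

torch.manual_seed(0)  # assumption: fixed seed, only for repeatable runs

data = torch.tensor([[1.0, 2.0], [2.0, 3.0], [3.0, 4.0], [4.0, 5.0]])
targets = torch.tensor([[1.0], [2.0], [3.0], [4.0]])
dataloader = DataLoader(TensorDataset(data, targets), batch_size=2, shuffle=True)

# Same architecture as SimpleNet, written with nn.Sequential for brevity.
net = nn.Sequential(nn.Linear(2, 5), nn.ReLU(), nn.Linear(5, 1))
criterion = nn.MSELoss()
optimizer = optim.SGD(net.parameters(), lr=0.01)

num_epochs = 200
for epoch in range(num_epochs):
    running_loss = 0.0
    for batch_data, batch_target in dataloader:
        optimizer.zero_grad()
        loss = criterion(net(batch_data), batch_target)
        loss.backward()
        optimizer.step()
        # Weight each batch loss by its size so the epoch average is exact.
        running_loss += loss.item() * batch_data.size(0)
    epoch_loss = running_loss / len(dataloader.dataset)
    if (epoch + 1) % 50 == 0:
        print(f'Epoch [{epoch + 1}/{num_epochs}], Average loss: {epoch_loss:.4f}')

# Inference: switch to eval mode and disable gradient tracking.
net.eval()
with torch.no_grad():
    predictions = net(data)

print(predictions.shape)          # torch.Size([4, 1])
print(predictions.requires_grad)  # False
```

Calling net.eval() makes no difference for this particular network, since it has no dropout or batch-norm layers, but it is a cheap habit that avoids surprises once such layers are added.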
