In [1]:
# Import necessary modules
import torch
import torch.nn.functional as F

# Define the model
class SimpleNet(torch.nn.Module):
    def __init__(self):
        super(SimpleNet, self).__init__()
        self.fc1 = torch.nn.Linear(5, 3)  # first linear transformation
        self.fc2 = torch.nn.Linear(3, 1)  # second linear transformation

    def forward(self, x):
        x = self.fc1(x)
        x = F.relu(x)  # first activation layer
        x = self.fc2(x)
        output = torch.sigmoid(x)  # second activation layer (sigmoid to output probabilities)
        return output
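
A quick way to check that a network like this is wired correctly is to push a small random batch through it and look at the output. The sketch below repeats the same architecture so that it runs on its own; the batch size of 4 and the random inputs are assumptions made purely for illustration.

```python
import torch
import torch.nn.functional as F


class SimpleNet(torch.nn.Module):
    """Same two-layer architecture as the cell above, repeated so this runs standalone."""

    def __init__(self):
        super().__init__()
        self.fc1 = torch.nn.Linear(5, 3)
        self.fc2 = torch.nn.Linear(3, 1)

    def forward(self, x):
        x = F.relu(self.fc1(x))
        return torch.sigmoid(self.fc2(x))


model = SimpleNet()
batch = torch.randn(4, 5)  # hypothetical batch: 4 samples, 5 features each

with torch.no_grad():  # no gradients needed for a shape check
    probs = model(batch)

print(probs.shape)  # torch.Size([4, 1])
print(probs.min().item(), probs.max().item())  # sigmoid keeps values between 0 and 1
```

The input needs its 5 features in the last dimension, because `torch.nn.Linear(5, 3)` operates on the last dimension of its input.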


As I progress through Part 2 of the fastai course, I've discovered the value of occasionally retracing my steps to check my grasp of the fundamental concepts. From Part 1, I recall Chapter 4 of the book being particularly daunting, as it covered a broad range of essential topics early on. However, that comprehensive approach was necessary to establish a strong foundation for everything I would learn afterwards.

Whenever I revisit those earlier chapters and analyze the code line by line, I'm often pleasantly surprised by the intuition I've developed in certain areas. At the same time, I'm confronted with aspects that I still don't understand right away. Quick drills to reinforce these concepts have proven an effective way to stay sharp and fill those gaps, even if they feel somewhat repetitive. It's like eating vegetables as a child – you may not want to, but it's ultimately beneficial and necessary for growth. I believe that through a similar positive feedback loop, I will acquire a taste for this process and derive great rewards from it.

So, grab a fork and let's dig in.

## Predicting Bank Loan Default from Synthetic Data

For this exercise I'm going to create synthetic data to predict whether a bank loan will default. I've chosen the following five features:

- Loan Amount 
- Term 
- Interest Rate 
- Borrower's Income 
- Borrower's Credit Score

## Generate Synthetic Data

In [2]:
torch.set_printoptions(precision=4, sci_mode=False)

We're first going to randomly generate data for 1000 samples with `torch.normal`, to which we'll pass a *mean*, a *standard deviation*, and a *size*.

The size could be either a tuple or a list, but for this exercise we'll just use a tuple with a single value.

Note that `term` uses `torch.randint` rather than `torch.normal`, because loan terms are generally a whole number of months.
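
To make the `size` argument and the integer sampling concrete, here is a minimal sketch. The seed and the sample counts are assumptions added only so the example is repeatable. One detail worth knowing is that `torch.randint(low, high, ...)` never returns `high` itself, so an upper bound of 61 is what covers terms of up to 60 months.

```python
import torch

torch.manual_seed(0)  # assumption: fixed seed, only so this sketch is repeatable

# `size` accepts a tuple or a list; both give a 1-D tensor of 4 draws.
a = torch.normal(0.0, 1.0, size=(4,))
b = torch.normal(0.0, 1.0, size=[4])
print(a.shape, b.shape)  # torch.Size([4]) torch.Size([4])

# torch.randint samples integers from the half-open range [low, high).
months = torch.randint(12, 61, size=(10_000,))
print(months.dtype)  # torch.int64
print(months.min().item(), months.max().item())  # always within 12..60
```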

In [7]:
n_samples = 1000
loan_amount = torch.normal(5000., 1500, size=(n_samples,))  # average loan amount is $5000
term = torch.randint(12, 61, size=(n_samples,))  # loan term varies between 1 and 5 years (upper bound is exclusive, so 61 allows 60 months)
interest_rate = torch.normal(0.05, 0.01, size=(n_samples,))  # average interest rate is 5%
income = torch.normal(50000, 10000, size=(n_samples,))  # average income is $50,000
credit_score = torch.normal(600, 50, size=(n_samples,))  # average credit score is 600
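
Because these values are random draws, it can be reassuring to confirm that the sample statistics land near the parameters we asked for. The following is a rough sketch of such a check; the seed and the `features` dictionary are assumptions introduced for illustration and are not part of the original notebook. It also counts non-positive values, since a normal distribution is unbounded and can occasionally produce unrealistic amounts.

```python
import torch

torch.manual_seed(42)  # assumption: seed added only for repeatability
n_samples = 1000

# Hypothetical container for a few of the features, used only for this check.
features = {
    "loan_amount": torch.normal(5000.0, 1500.0, size=(n_samples,)),
    "interest_rate": torch.normal(0.05, 0.01, size=(n_samples,)),
    "credit_score": torch.normal(600.0, 50.0, size=(n_samples,)),
}

for name, values in features.items():
    mean = values.mean().item()
    std = values.std().item()
    n_non_positive = (values <= 0).sum().item()
    print(f"{name:>14}: mean={mean:.4f}  std={std:.4f}  non-positive={n_non_positive}")
```

With 1000 samples the means should sit close to 5000, 0.05, and 600, though not exactly on them.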

Next we'll stack the feature tensors into a single tensor and create a target variable in which each loan has a 10% chance of defaulting, converting both to *float32* in the process.

In [22]:
X = torch.stack([loan_amount, term, interest_rate, income, credit_score]).float()
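
It is worth checking the shape that `torch.stack` produces, since its default is to stack along a new first dimension. The sketch below uses placeholder columns of zeros (an assumption, purely to show shapes) and compares the default layout with `dim=1`. Whether the notebook transposes `X` later is not shown in this section, so treat this as a shape check and not as a correction.

```python
import torch

n_samples = 1000
# Stand-in feature columns (all zeros), used only to illustrate shapes.
cols = [torch.zeros(n_samples) for _ in range(5)]

by_feature = torch.stack(cols)        # default dim=0: one row per feature
by_sample = torch.stack(cols, dim=1)  # dim=1: one row per sample
print(by_feature.shape)  # torch.Size([5, 1000])
print(by_sample.shape)   # torch.Size([1000, 5])

# A Linear(5, 3) layer expects the 5 features in the last dimension.
layer = torch.nn.Linear(5, 3)
print(layer(by_sample).shape)  # torch.Size([1000, 3])

# Transposing the default layout gives the same per-sample layout.
print(torch.equal(by_feature.T, by_sample))  # True
```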

In [24]:
y = torch.distributions.categorical.Categorical(torch.tensor([0.9, 0.1])).sample((n_samples,)).float()
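
The `Categorical` distribution here has two classes, with probability 0.9 for class 0 (repaid) and 0.1 for class 1 (default), so each label is an independent draw and the observed default rate will be close to 10% without being exactly 10%. These labels are also sampled independently of the features. A small sketch, with a seed added as an assumption for repeatability, makes this visible and shows `torch.bernoulli` as one possible alternative for binary labels:

```python
import torch

torch.manual_seed(0)  # assumption: fixed seed, only so this sketch is repeatable
n_samples = 1000

probs = torch.tensor([0.9, 0.1])  # P(class 0), P(class 1 = default)
y = torch.distributions.Categorical(probs).sample((n_samples,)).float()

print(y.shape, y.dtype)  # torch.Size([1000]) torch.float32
print(y.unique())        # the labels present in the sample (0. and 1.)
print("observed default rate:", y.mean().item())

# Alternative sketch: Bernoulli draws with p=0.1 give float 0/1 labels directly.
y_alt = torch.bernoulli(torch.full((n_samples,), 0.1))
print("alternative default rate:", y_alt.mean().item())
```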