# How to train the Baseline Models for the SENSORIUM+ track

### This notebook shows how to
- instantiate the dataloaders for the Sensorium+ track
- instantiate a PyTorch model
- instantiate a trainer function
- train two baselines for this competition track
- save the model weights (pretrained weights are already available in './model_checkpoints/pretrained/')

### Imports

In [1]:
import torch
import numpy as np
import pandas as pd

import matplotlib.pyplot as plt
import seaborn as sns

import warnings
warnings.filterwarnings('ignore')

from nnfabrik.builder import get_data, get_model, get_trainer

### Instantiate DataLoader for Sensorium+

The only difference from the Sensorium track is that here we include the behavioral variables and the eye position
by setting include_behavior=True and include_eye_position=True.
This appends the behavioral variables to the input images and passes the eye position to
the shifter network of the model.
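
To make the effect of include_behavior=True more concrete, here is a minimal NumPy sketch of how per-trial behavioral variables could be appended to the images as extra channels. The array shapes, the number of behavioral variables (three), and the helper name `append_behavior_channels` are assumptions for illustration, not the loader's actual code.

```python
import numpy as np

def append_behavior_channels(images, behavior):
    """Broadcast each behavioral scalar to a full image plane and stack it as a channel.

    images:   (batch, channels, height, width)
    behavior: (batch, n_behavior)
    """
    batch, _, height, width = images.shape
    # (batch, n_behavior) -> (batch, n_behavior, height, width), constant within each plane
    planes = np.broadcast_to(behavior[:, :, None, None],
                             (batch, behavior.shape[1], height, width))
    return np.concatenate([images, planes], axis=1)

rng = np.random.default_rng(0)
images = rng.normal(size=(4, 1, 36, 64))   # hypothetical grayscale batch
behavior = rng.normal(size=(4, 3))         # hypothetical: three behavioral variables
stacked = append_behavior_channels(images, behavior)
print(stacked.shape)  # (4, 4, 36, 64)
```

Under this assumption the model's first convolution sees one image channel plus one constant-valued channel per behavioral variable.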


In [2]:
# loading the SENSORIUM+ dataset
filenames = ['../data/static27204-5-13-GrayImageNet-94c6ff995dac583098847cfecd43e7b6.zip', ]


dataset_fn = 'sensorium.datasets.static_loaders'
dataset_config = {'paths': filenames,
                 'normalize': True,
                 'include_behavior': True,
                 'include_eye_position': True,
                 'batch_size': 128,
                 'scale':.25,
                 }

dataloaders = get_data(dataset_fn, dataset_config)

# Instantiate State-of-the-Art (SOTA) Model

Because the behavioral variables and the eye position are available in this track, we enable the shifter network
by setting shifter=True in the model configuration.

In [3]:
model_fn = 'sensorium.models.stacked_core_full_gauss_readout'
model_config = {'pad_input': False,
  'stack': -1,
  'layers': 4,
  'input_kern': 9,
  'gamma_input': 6.3831,
  'gamma_readout': 0.0076,
  'hidden_kern': 7,
  'hidden_channels': 64,
  'depth_separable': True,
  'grid_mean_predictor': {'type': 'cortex',
   'input_dimensions': 2,
   'hidden_layers': 1,
   'hidden_features': 30,
   'final_tanh': True},
  'init_sigma': 0.1,
  'init_mu_range': 0.3,
  'gauss_type': 'full',
  'shifter': True,
}

model = get_model(model_fn=model_fn,
                  model_config=model_config,
                  dataloaders=dataloaders,
                  seed=42,)
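
As a rough illustration of what a shifter network does, the sketch below maps eye position to a bounded (dx, dy) offset with a small tanh MLP and adds that offset to each neuron's readout position. The layer sizes, the random weights, and the names `shifter_mlp` and `readout_pos` are hypothetical; the model built above may implement this differently.

```python
import numpy as np

def shifter_mlp(eye_pos, w1, b1, w2, b2):
    """Map eye position (batch, 2) to a bounded (dx, dy) shift via a small tanh MLP."""
    hidden = np.tanh(eye_pos @ w1 + b1)
    return np.tanh(hidden @ w2 + b2)   # final tanh keeps shifts within [-1, 1]

rng = np.random.default_rng(0)
w1, b1 = rng.normal(size=(2, 5)), np.zeros(5)   # hypothetical hidden width of 5
w2, b2 = rng.normal(size=(5, 2)), np.zeros(2)

eye_pos = rng.normal(size=(8, 2))                    # hypothetical batch of pupil centers
readout_pos = rng.uniform(-0.5, 0.5, size=(10, 2))   # hypothetical positions of 10 neurons

shift = shifter_mlp(eye_pos, w1, b1, w2, b2)                # (8, 2): one shift per trial
shifted_pos = readout_pos[None, :, :] + shift[:, None, :]   # (8, 10, 2)
print(shifted_pos.shape)  # (8, 10, 2)
```

The same shift is applied to all neurons within a trial, which matches the idea that an eye movement displaces every receptive field together.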

## Configure Trainer

In [5]:
trainer_fn = "sensorium.training.standard_trainer"

trainer_config = {'max_iter': 200,
                 'verbose': False,
                 'lr_decay_steps': 4,
                 'avg_loss': False,
                 'lr_init': 0.009,
                 }

trainer = get_trainer(trainer_fn=trainer_fn, 
                     trainer_config=trainer_config)
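
To get a feel for lr_init and lr_decay_steps, the sketch below lists the learning rates that would be visited if the rate were multiplied by a constant factor at every decay. The factor of 0.3 is an assumption (it is not set in this notebook), and the exact number of stages depends on the trainer's implementation.

```python
lr_init = 0.009        # from trainer_config
lr_decay_steps = 4     # from trainer_config
decay_factor = 0.3     # assumption: not specified in this notebook

# One learning rate per stage, each a constant factor smaller than the previous one
schedule = [lr_init * decay_factor ** k for k in range(lr_decay_steps)]
for stage, lr in enumerate(schedule):
    print(f"stage {stage}: lr = {lr:.6f}")
```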

# Run model training

In [6]:
validation_score, trainer_output, state_dict = trainer(model, dataloaders, seed=42)

Epoch 1: 100%|██████████| 35/35 [00:33<00:00,  1.04it/s]
Epoch 2: 100%|██████████| 35/35 [00:05<00:00,  6.81it/s]
Epoch 3: 100%|██████████| 35/35 [00:05<00:00,  6.99it/s]
Epoch 4: 100%|██████████| 35/35 [00:04<00:00,  7.11it/s]
Epoch 5: 100%|██████████| 35/35 [00:05<00:00,  6.98it/s]
Epoch 6: 100%|██████████| 35/35 [00:04<00:00,  7.06it/s]
Epoch 7: 100%|██████████| 35/35 [00:04<00:00,  7.11it/s]
Epoch 8: 100%|██████████| 35/35 [00:04<00:00,  7.05it/s]
Epoch 9: 100%|██████████| 35/35 [00:05<00:00,  6.94it/s]
Epoch 10: 100%|██████████| 35/35 [00:05<00:00,  6.98it/s]
Epoch 11: 100%|██████████| 35/35 [00:04<00:00,  7.06it/s]
Epoch 12: 100%|██████████| 35/35 [00:04<00:00,  7.11it/s]
Epoch 13: 100%|██████████| 35/35 [00:04<00:00,  7.03it/s]
Epoch 14: 100%|██████████| 35/35 [00:04<00:00,  7.12it/s]
Epoch 15: 100%|██████████| 35/35 [00:04<00:00,  7.10it/s]
Epoch 16: 100%|██████████| 35/35 [00:04<00:00,  7.13it/s]
Epoch 17: 100%|██████████| 35/35 [00:04<00:00,  7.13it/s]
Epoch 18: 100%|████████

## Save model checkpoints

In [7]:
torch.save(model.state_dict(), './model_checkpoints/sensorium_p_sota_model.pth')

In [8]:
validation_score

0.39031154
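
The notebook does not state how validation_score is computed. Assuming it is a per-neuron Pearson correlation between predicted and observed responses, averaged over neurons, a minimal NumPy sketch looks like this (the function name and the synthetic data are hypothetical):

```python
import numpy as np

def mean_neuron_correlation(predicted, observed, eps=1e-12):
    """Pearson correlation per neuron (column), averaged over neurons.

    Both arrays have shape (n_trials, n_neurons).
    """
    p = predicted - predicted.mean(axis=0)
    o = observed - observed.mean(axis=0)
    denom = np.sqrt((p ** 2).sum(axis=0) * (o ** 2).sum(axis=0)) + eps
    return ((p * o).sum(axis=0) / denom).mean()

rng = np.random.default_rng(0)
observed = rng.poisson(2.0, size=(500, 20)).astype(float)          # synthetic responses
predicted = observed + rng.normal(scale=2.0, size=observed.shape)  # noisy "predictions"
score = mean_neuron_correlation(predicted, observed)
print(float(score))
```

A score of 1 would mean perfect linear agreement for every neuron; noisier predictions pull the average toward 0.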

## Load Model Checkpoints

In [9]:
# model.load_state_dict(torch.load("./model_checkpoints/pretrained/sensorium_p_sota_model.pth"));
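
The save and load steps can be checked with a small round trip. This sketch uses a toy nn.Linear as a stand-in for the trained model and a temporary file with a hypothetical name; the point is only that load_state_dict needs a model instance with the same architecture as the one that was saved.

```python
import os
import tempfile

import torch
from torch import nn

toy_model = nn.Linear(4, 2)   # hypothetical stand-in for the trained model
restored = nn.Linear(4, 2)    # fresh instance with the same architecture

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "example_model.pth")   # hypothetical file name
    torch.save(toy_model.state_dict(), path)
    # map_location="cpu" lets a checkpoint saved on a GPU load on a CPU-only machine
    state_dict = torch.load(path, map_location="cpu")
    restored.load_state_dict(state_dict)

restored.eval()   # switch to inference mode before evaluating
same = all(torch.equal(a, b)
           for a, b in zip(toy_model.state_dict().values(),
                           restored.state_dict().values()))
print(same)  # True
```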

---

# Train a simple LN model

In [10]:
# This removes all nonlinearities from the CNN, essentially creating an LN model: a linear core plus readout, followed by a nonlinearity

model_fn = 'sensorium.models.stacked_core_full_gauss_readout'
model_config = {'pad_input': False,
              'stack': -1,
              'layers': 3,
              'input_kern': 9,
              'gamma_input': 6.3831,
              'gamma_readout': 0.0076,
              'hidden_kern': 7,
              'hidden_channels': 64,
              'grid_mean_predictor': {'type': 'cortex',
                                      'input_dimensions': 2,
                                      'hidden_layers': 1,
                                      'hidden_features': 30,
                                      'final_tanh': True},
              'depth_separable': True,
              'init_sigma': 0.1,
              'init_mu_range': 0.3,
              'gauss_type': 'full',
              'linear': True,
              'shifter': True,
               }
model = get_model(model_fn=model_fn,
                  model_config=model_config,
                  dataloaders=dataloaders,
                  seed=42,)
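
Why removing the nonlinearities yields an LN model can be seen in a small NumPy sketch: a stack of linear layers is equivalent to a single linear filter, so only the final nonlinearity remains. Dense matrices stand in for the convolutions here, and ELU + 1 is an assumed choice of output nonlinearity; both are illustrative rather than taken from the model code.

```python
import numpy as np

def elu_plus_one(x):
    """ELU(x) + 1: a smooth, strictly positive output nonlinearity (assumed here)."""
    return np.where(x > 0, x + 1.0, np.exp(np.minimum(x, 0.0)))

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 16))   # 5 hypothetical flattened stimuli
w1, w2, w3 = (rng.normal(size=(16, 16)) * 0.1 for _ in range(3))   # three linear "layers"
readout = rng.normal(size=(16, 4))   # linear readout to 4 neurons

# Applying the layers one after another ...
stacked = x @ w1 @ w2 @ w3 @ readout
# ... gives the same result as one pre-multiplied linear filter.
single_filter = w1 @ w2 @ w3 @ readout
collapsed = x @ single_filter

rates = elu_plus_one(collapsed)   # the single remaining nonlinearity
print(np.allclose(stacked, collapsed), bool((rates > 0).all()))  # True True
```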

In [11]:
validation_score, trainer_output, state_dict = trainer(model, dataloaders, seed=42)

Epoch 1: 100%|██████████| 35/35 [00:04<00:00,  7.62it/s]
Epoch 2: 100%|██████████| 35/35 [00:04<00:00,  7.65it/s]
Epoch 3: 100%|██████████| 35/35 [00:04<00:00,  7.68it/s]
Epoch 4: 100%|██████████| 35/35 [00:04<00:00,  7.65it/s]
Epoch 5: 100%|██████████| 35/35 [00:04<00:00,  7.65it/s]
Epoch 6: 100%|██████████| 35/35 [00:04<00:00,  7.63it/s]
Epoch 7: 100%|██████████| 35/35 [00:04<00:00,  7.62it/s]
Epoch 8: 100%|██████████| 35/35 [00:04<00:00,  7.55it/s]
Epoch 9: 100%|██████████| 35/35 [00:04<00:00,  7.62it/s]
Epoch 10: 100%|██████████| 35/35 [00:04<00:00,  7.52it/s]
Epoch 11: 100%|██████████| 35/35 [00:04<00:00,  7.47it/s]
Epoch 12: 100%|██████████| 35/35 [00:04<00:00,  7.51it/s]
Epoch 13: 100%|██████████| 35/35 [00:04<00:00,  7.57it/s]
Epoch 14: 100%|██████████| 35/35 [00:04<00:00,  7.60it/s]
Epoch 15: 100%|██████████| 35/35 [00:04<00:00,  7.47it/s]
Epoch 16: 100%|██████████| 35/35 [00:04<00:00,  7.59it/s]
Epoch 17: 100%|██████████| 35/35 [00:04<00:00,  7.58it/s]
Epoch 18: 100%|████████

In [12]:
torch.save(model.state_dict(), './model_checkpoints/sensorium_p_ln_model.pth')

In [13]:
validation_score

0.26657087

In [14]:
# model.load_state_dict(torch.load("./model_checkpoints/pretrained/sensorium_p_ln_model.pth"));

---