# How to train the Baseline Models for the SENSORIUM track

### This notebook shows how to
- instantiate the dataloaders for the SENSORIUM track
- instantiate a PyTorch model
- instantiate a trainer function
- train the two baseline models for this competition track
- save the model weights (pretrained weights are already available in './model_checkpoints/pretrained/')

### Imports

In [1]:
import torch
import numpy as np
import pandas as pd

import matplotlib.pyplot as plt
import seaborn as sns

import warnings
warnings.filterwarnings('ignore')

from nnfabrik.builder import get_data, get_model, get_trainer

### Instantiate DataLoader

In [2]:
# loading the SENSORIUM dataset
filenames = ['../data/static26872-17-20-GrayImageNet-94c6ff995dac583098847cfecd43e7b6.zip', ]

dataset_fn = 'sensorium.datasets.static_loaders'
dataset_config = {'paths': filenames,
                 'normalize': True,
                 'include_behavior': False,
                 'include_eye_position': False,
                 'batch_size': 128,
                 'scale':0.25,
                 }

dataloaders = get_data(dataset_fn, dataset_config)
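The object returned by `get_data` is not inspected in this notebook. As a rough sketch, assuming it is a nested dictionary of the form tier -> session key -> loader (an assumption about the return structure, not something shown above), the number of batches per loader could be summarized as follows. The helper name `summarize_loaders` and the toy data are hypothetical; plain lists stand in for real dataloaders, since both support `len()`.

```python
def summarize_loaders(loaders):
    """Return {tier: {session_key: number_of_batches}} for a nested dict of loaders.

    Assumes every leaf object supports len(), as a PyTorch DataLoader
    over a sized dataset does.
    """
    summary = {}
    for tier, per_session in loaders.items():
        summary[tier] = {key: len(loader) for key, loader in per_session.items()}
    return summary


# Hypothetical stand-in: lists play the role of dataloaders here.
toy_loaders = {
    "train": {"session_a": [None] * 35},
    "validation": {"session_a": [None] * 4},
}

toy_summary = summarize_loaders(toy_loaders)
for tier, counts in toy_summary.items():
    print(tier, counts)
```

If the assumption about the structure holds, calling `summarize_loaders(dataloaders)` on the real object would give a quick sanity check before training.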

# Instantiate the State-of-the-Art (SOTA) Model

In [3]:
model_fn = 'sensorium.models.stacked_core_full_gauss_readout'
model_config = {'pad_input': False,
  'stack': -1,
  'layers': 4,
  'input_kern': 9,
  'gamma_input': 6.3831,
  'gamma_readout': 0.0076,
  'hidden_kern': 7,
  'hidden_channels': 64,
  'depth_separable': True,
  'grid_mean_predictor': {'type': 'cortex',
   'input_dimensions': 2,
   'hidden_layers': 1,
   'hidden_features': 30,
   'final_tanh': True},
  'init_sigma': 0.1,
  'init_mu_range': 0.3,
  'gauss_type': 'full',
  'shifter': False,
}

model = get_model(model_fn=model_fn,
                  model_config=model_config,
                  dataloaders=dataloaders,
                  seed=42,)

## Configure Trainer

In [4]:
trainer_fn = "sensorium.training.standard_trainer"

trainer_config = {'max_iter': 200,
                 'verbose': False,
                 'lr_decay_steps': 4,
                 'avg_loss': False,
                 'lr_init': 0.009,
                 }

trainer = get_trainer(trainer_fn=trainer_fn, 
                     trainer_config=trainer_config)
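The trainer returned here is later called with only `model`, `dataloaders`, and `seed`, so the settings in `trainer_config` must already be bound to it. A minimal sketch of that pattern, assuming the builder binds the config as keyword arguments (for example with `functools.partial`); `toy_trainer` and `toy_config` are hypothetical stand-ins and not part of the sensorium package:

```python
from functools import partial


def toy_trainer(model, dataloaders, seed, max_iter=100, lr_init=0.005, verbose=True):
    """Hypothetical trainer: it does no training and only echoes its settings."""
    return {"seed": seed, "max_iter": max_iter, "lr_init": lr_init, "verbose": verbose}


toy_config = {"max_iter": 200, "lr_init": 0.009, "verbose": False}

# Bind the configuration once; the result only needs model, dataloaders, and seed.
bound_trainer = partial(toy_trainer, **toy_config)

result = bound_trainer("placeholder_model", {}, seed=42)
print(result)
```

Binding the configuration up front keeps the training call short and lets the same configured trainer be reused for several models, as this notebook does for the SOTA and LN baselines.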

## Run model training

In [5]:
validation_score, trainer_output, state_dict = trainer(model, dataloaders, seed=42)

Epoch 1: 100%|██████████| 35/35 [00:13<00:00,  2.59it/s]
Epoch 2: 100%|██████████| 35/35 [00:09<00:00,  3.67it/s]
Epoch 3: 100%|██████████| 35/35 [00:09<00:00,  3.67it/s]
Epoch 4: 100%|██████████| 35/35 [00:09<00:00,  3.65it/s]
Epoch 5: 100%|██████████| 35/35 [00:09<00:00,  3.59it/s]
Epoch 6: 100%|██████████| 35/35 [00:11<00:00,  3.15it/s]
Epoch 7: 100%|██████████| 35/35 [00:12<00:00,  2.80it/s]
Epoch 8: 100%|██████████| 35/35 [00:09<00:00,  3.84it/s]
Epoch 9: 100%|██████████| 35/35 [00:10<00:00,  3.49it/s]
Epoch 10: 100%|██████████| 35/35 [00:10<00:00,  3.35it/s]
Epoch 11: 100%|██████████| 35/35 [00:14<00:00,  2.48it/s]
Epoch 12: 100%|██████████| 35/35 [00:09<00:00,  3.54it/s]
Epoch 13: 100%|██████████| 35/35 [00:10<00:00,  3.24it/s]
Epoch 14: 100%|██████████| 35/35 [00:13<00:00,  2.59it/s]
Epoch 15: 100%|██████████| 35/35 [00:13<00:00,  2.62it/s]
Epoch 16: 100%|██████████| 35/35 [00:13<00:00,  2.56it/s]
Epoch 17: 100%|██████████| 35/35 [00:19<00:00,  1.83it/s]
Epoch 18: 100%|████████

## Save model checkpoints after training is complete

In [6]:
torch.save(model.state_dict(), './model_checkpoints/sensorium_sota_model.pth')
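Writing a checkpoint fails if the target directory does not exist yet. A small sketch of one way to guard against that, using only the standard library; the helper name `prepare_checkpoint_path` is hypothetical, and a temporary directory is used so the example leaves no files behind:

```python
import tempfile
from pathlib import Path


def prepare_checkpoint_path(path):
    """Create the parent directory of `path` if needed and return it as a Path."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    return path


# Demonstrate inside a temporary directory instead of the real checkpoint folder.
with tempfile.TemporaryDirectory() as tmp:
    target = prepare_checkpoint_path(Path(tmp) / "model_checkpoints" / "example_model.pth")
    parent_exists = target.parent.is_dir()

print(parent_exists)
```

With such a helper, the save call could be written as `torch.save(model.state_dict(), prepare_checkpoint_path('./model_checkpoints/sensorium_sota_model.pth'))`, since `torch.save` accepts path-like objects.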

## Load Model Checkpoints

In [7]:
# Optional: overwrite the freshly trained weights with the pretrained checkpoint
model.load_state_dict(torch.load("./model_checkpoints/pretrained/sensorium_sota_model.pth"));

---

# Train a simple LN model

Our LN model has the same architecture as our CNN model (a convolutional core followed by a Gaussian readout),
but with all nonlinearities removed except the final ELU+1 nonlinearity.
This effectively turns the CNN into a fully linear model followed by a single output nonlinearity.
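To illustrate why removing the intermediate nonlinearities yields a linear-nonlinear (LN) model, the toy sketch below shows that two stacked linear maps collapse into a single linear map, which is then passed through ELU+1. This is plain Python with made-up 2-dimensional weights, not the sensorium implementation, and it ignores biases and normalization layers.

```python
import math


def elu_plus_one(x):
    """ELU(x) + 1: x + 1 for x > 0, exp(x) otherwise (positive up to float underflow)."""
    return x + 1.0 if x > 0 else math.exp(x)


def matvec(W, v):
    """Multiply matrix W (list of rows) by vector v."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def matmul(A, B):
    """Multiply matrix A (m x k) by matrix B (k x n)."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


# Made-up weights for two stacked linear "layers" and a toy input.
W1 = [[1.0, -2.0], [0.5, 3.0]]
W2 = [[2.0, 1.0]]
x = [0.3, -0.7]

stacked = matvec(W2, matvec(W1, x))      # apply the layers one after another
collapsed = matvec(matmul(W2, W1), x)    # apply the single equivalent linear map

# The only nonlinearity left is the output ELU+1.
rate = [elu_plus_one(v) for v in collapsed]
print(stacked, collapsed, rate)
```

Because the stacked and collapsed outputs agree, depth adds no expressive power without intermediate nonlinearities; the model reduces to one linear filter per neuron followed by the output nonlinearity.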


In [8]:
model_fn = 'sensorium.models.stacked_core_full_gauss_readout'
model_config = {'pad_input': False,
              'stack': -1,
              'layers': 3,
              'input_kern': 9,
              'gamma_input': 6.3831,
              'gamma_readout': 0.0076,
              'hidden_kern': 7,
              'hidden_channels': 64,
              'grid_mean_predictor': {'type': 'cortex',
                                      'input_dimensions': 2,
                                      'hidden_layers': 1,
                                      'hidden_features': 30,
                                      'final_tanh': True},
              'depth_separable': True,
              'init_sigma': 0.1,
              'init_mu_range': 0.3,
              'gauss_type': 'full',
              'linear': True
               }
model = get_model(model_fn=model_fn,
                  model_config=model_config,
                  dataloaders=dataloaders,
                  seed=42,)

In [9]:
validation_score, trainer_output, state_dict = trainer(model, dataloaders, seed=42)

Epoch 1: 100%|██████████| 35/35 [00:18<00:00,  1.88it/s]
Epoch 2: 100%|██████████| 35/35 [00:18<00:00,  1.85it/s]
Epoch 3: 100%|██████████| 35/35 [00:18<00:00,  1.86it/s]
Epoch 4: 100%|██████████| 35/35 [00:18<00:00,  1.85it/s]
Epoch 5: 100%|██████████| 35/35 [00:19<00:00,  1.83it/s]
Epoch 6: 100%|██████████| 35/35 [00:19<00:00,  1.79it/s]
Epoch 7: 100%|██████████| 35/35 [00:18<00:00,  1.85it/s]
Epoch 8: 100%|██████████| 35/35 [00:19<00:00,  1.81it/s]
Epoch 9: 100%|██████████| 35/35 [00:19<00:00,  1.83it/s]
Epoch 10: 100%|██████████| 35/35 [00:19<00:00,  1.79it/s]
Epoch 11: 100%|██████████| 35/35 [00:18<00:00,  1.86it/s]
Epoch 12: 100%|██████████| 35/35 [00:19<00:00,  1.82it/s]
Epoch 13: 100%|██████████| 35/35 [00:19<00:00,  1.82it/s]
Epoch 14: 100%|██████████| 35/35 [00:19<00:00,  1.80it/s]
Epoch 15: 100%|██████████| 35/35 [00:19<00:00,  1.83it/s]
Epoch 16: 100%|██████████| 35/35 [00:19<00:00,  1.81it/s]
Epoch 17: 100%|██████████| 35/35 [00:19<00:00,  1.82it/s]
Epoch 18: 100%|████████

In [10]:
# Save the LN model weights after training is complete
torch.save(model.state_dict(), './model_checkpoints/sensorium_ln_model.pth')

In [11]:
# Optional: overwrite the freshly trained weights with the pretrained LN checkpoint
model.load_state_dict(torch.load("./model_checkpoints/pretrained/sensorium_ln_model.pth"));

---