# Example for training Spiking CNN on subset of NMNIST digits

## The problem:
Train a digit classifier (0-9) on a subset (1000 training and 100 testing samples) of the NMNIST digit spikes recorded with a DVS camera. To train on the full NMNIST dataset, just change the training list.

## Load proper paths for SLAYER Pytorch source modules

In [2]:
import sys, os
CURRENT_TEST_DIR = os.getcwd()
sys.path.append(CURRENT_TEST_DIR + "/../slayerPytorch/src")

## Load required modules

SLAYER modules are available as `snn`
* The `spike-layer` module will be available as `snn.layer`.
* The `yaml-parameter` module will be available as `snn.params`.
* The `spike-loss` module will be available as `snn.loss`.
* The `spike-classifier` module will be available as `snn.predict`.
* The `spike-IO` module will be available as `snn.io`.


In [3]:
from datetime import datetime
import tqdm
import numpy as np
import matplotlib.pyplot as plt
import torch
from torch.utils.data import Dataset, DataLoader
import slayerSNN as snn
from learningStats import learningStats
from IPython.display import HTML
import zipfile
from torch.utils.tensorboard import SummaryWriter

## Read SNN configuration from yaml file
See the file for all the configuration parameters. This configuration file is used to describe the SNN. Here we ignore the network configuration described in the yaml file; the layers are defined in code later in this notebook.

In [4]:
netParams = snn.params('network.yaml')

In [5]:
netParams['training']

{'error': {'type': 'NumSpikes',
  'probSlidingWin': 20,
  'tgtSpikeRegion': {'start': 0, 'stop': 300},
  'tgtSpikeCount': {True: 60, False: 10}},
 'path': {'in': 'NMNISTsmall/',
  'train': 'NMNISTsmall/train1K.txt',
  'test': 'NMNISTsmall/test100.txt'}}
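
To make the `tgtSpikeCount` entry concrete, here is a minimal sketch of the per-class target spike counts for one sample. It assumes, as the configuration suggests, that the `NumSpikes` error pushes the correct class towards 60 output spikes and every other class towards 10 within the target region. The helper names `target_spike_counts` and `count_errors` are illustrative and are not part of `slayerSNN`; the library's actual loss scaling is not reproduced here.

```python
# Values copied from netParams['training']['error'] shown above.
TGT_SPIKE_COUNT = {True: 60, False: 10}

def target_spike_counts(class_label, num_classes=10, tgt=TGT_SPIKE_COUNT):
    """Return the desired number of output spikes for each class neuron."""
    return [tgt[c == class_label] for c in range(num_classes)]

def count_errors(output_counts, class_label):
    """Difference between observed and desired spike counts, per class."""
    desired = target_spike_counts(class_label, num_classes=len(output_counts))
    return [out - des for out, des in zip(output_counts, desired)]

desired = target_spike_counts(class_label=3)
print(desired)
```

A network whose output neurons all fire 10 times would, under this assumption, only be penalized on the neuron of the true class.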

## Extract NMNISTsmall dataset
This is a subset of the NMNIST dataset containing the first 1000 training samples and the first 100 testing samples. The original NMNIST dataset consists of the full MNIST samples converted into spikes using a DVS sensor moved in three repeatable saccadic motions. For details and full dataset download links, refer to [https://www.garrickorchard.com/datasets/n-mnist](https://www.garrickorchard.com/datasets/n-mnist)

In [8]:
# extract data from zip only necessary if data is not already extracted
with zipfile.ZipFile('NMNISTsmall.zip') as zip_file:
    for member in zip_file.namelist():
        if not os.path.exists('./' + member):
            zip_file.extract(member, './')

## Define the dataset class
The dataset definition follows the standard PyTorch dataset definition.
Internally, it uses the `snn.io` module to read the spikes and returns them in the correct tensor format (CHWT).
* `datasetPath`: the path where the spike files are stored.
* `sampleFile`: the file that contains a list of sample indices and their corresponding classes.
* `samplingTime`: the sampling time (in ms) to bin the spikes.
* `sampleLength`: the length of the sample (in ms).

Note: This is a simple dataset class. A dataset that utilizes the folder hierarchy or xml list is easy to create.
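
The CHWT layout (channel, height, width, time) can be illustrated with a small, library-free sketch. It bins a list of `(x, y, polarity, t_ms)` events into a dense nested list, marking each occupied time bin with 1. This is a simplified stand-in for what the `snn.io` reader does, not its implementation: the function name `bin_events`, the event tuple layout and the value written per bin are all assumptions made for the illustration.

```python
def bin_events(events, shape, sampling_time):
    """Bin (x, y, polarity, t_ms) events into a C x H x W x T nested list."""
    channels, height, width, n_bins = shape
    tensor = [[[[0.0] * n_bins for _ in range(width)]
               for _ in range(height)]
              for _ in range(channels)]
    for x, y, p, t in events:
        t_bin = int(t // sampling_time)
        inside = (0 <= p < channels and 0 <= y < height
                  and 0 <= x < width and 0 <= t_bin < n_bins)
        if inside:  # events outside the sensor or the sample window are dropped
            tensor[p][y][x][t_bin] = 1.0
    return tensor

# Toy 4x4 sensor with 2 polarities, 10 ms sample, 1 ms bins.
events = [(1, 2, 0, 0.5), (3, 0, 1, 7.2), (1, 2, 0, 0.9), (0, 0, 1, 99.0)]
spikes = bin_events(events, shape=(2, 4, 4, 10), sampling_time=1.0)
print(spikes[0][2][1][0], spikes[1][0][3][7])
```

In the notebook the same role is played by the `(2, 34, 34, nTimeBins)` tensor, where `nTimeBins = int(sampleLength / samplingTime)`.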

In [9]:
# Dataset definition
class nmnistDataset(Dataset):
    def __init__(self, datasetPath, sampleFile, samplingTime, sampleLength):
        self.path = datasetPath 
        self.samples = np.loadtxt(sampleFile).astype('int')
        self.samplingTime = samplingTime
        self.nTimeBins    = int(sampleLength / samplingTime)

    def __getitem__(self, index):
        inputIndex  = self.samples[index, 0]
        classLabel  = self.samples[index, 1]

        inputSpikes = snn.io.read2Dspikes(
                        self.path + str(inputIndex.item()) + '.bs2'
                        ).toSpikeTensor(torch.zeros((2,34,34,self.nTimeBins)),
                        samplingTime=self.samplingTime)
        desiredClass = torch.zeros((10, 1, 1, 1))
        desiredClass[classLabel,...] = 1
        return inputSpikes, desiredClass, classLabel

    def __len__(self):
        return self.samples.shape[0]

In [10]:
# Dataset definition
class nmnistDatasetNoisy(nmnistDataset):
    def __init__(self, datasetPath, sampleFile, samplingTime, sampleLength, percentNoise=0.1):
        super().__init__(datasetPath, sampleFile, samplingTime, sampleLength)
        self.percentNoise = percentNoise
        # generate binary (boolean) noise with percentNoise 1s
#         self.noise = torch.cuda.FloatTensor(len(self.samples), 2, 34, 34, self.nTimeBins).uniform_() < percentNoise
        
    def __getitem__(self, index):
        inputSpikes, desiredClass, classLabel = super().__getitem__(index)
        # draw fresh uniform noise on every access; True with probability percentNoise
        noise = torch.rand(2, 34, 34, self.nTimeBins) < self.percentNoise
        inputSpikes[noise] = 1-inputSpikes[noise] # invert 1s and 0s where noise=True
        return inputSpikes, desiredClass, classLabel
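
The bit-flip noise used by `nmnistDatasetNoisy` can be sketched without PyTorch. The example below inverts each entry of a flattened binary spike train independently with probability `percent_noise`, which is the same idea as `inputSpikes[noise] = 1-inputSpikes[noise]`. The function name `flip_bits` and the toy data are illustrative assumptions.

```python
import random

def flip_bits(spikes, percent_noise, rng):
    """Invert each binary entry independently with probability percent_noise."""
    return [1 - s if rng.random() < percent_noise else s for s in spikes]

rng = random.Random(0)            # fixed seed so the sketch is repeatable
clean = [0] * 9000 + [1] * 1000   # sparse binary spike train, flattened
noisy = flip_bits(clean, 0.05, rng)
flipped = sum(c != n for c, n in zip(clean, noisy))
print(f"flipped {flipped / len(clean):.1%} of the entries")
```

Because real spike data is mostly zeros, most flips turn an empty bin into a spurious spike rather than deleting a real one.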

## Visualize the spike data

In [11]:
trainingSet = nmnistDataset(datasetPath =netParams['training']['path']['in'], 
                            sampleFile  =netParams['training']['path']['train'],
                            samplingTime=netParams['simulation']['Ts'],
                            sampleLength=netParams['simulation']['tSample'])

In [12]:
len(trainingSet.samples)

1000

In [13]:
input, target, label = trainingSet[0]
anim = snn.io.animTD(snn.io.spikeArrayToEvent(input.reshape((2, 34, 34, -1)).cpu().data.numpy()))
HTML(anim.to_jshtml())

## Noisy data

In [14]:
trainingSetNoisy = nmnistDatasetNoisy(datasetPath =netParams['training']['path']['in'], 
                            sampleFile  =netParams['training']['path']['train'],
                            samplingTime=netParams['simulation']['Ts'],
                            sampleLength=netParams['simulation']['tSample'],
                            percentNoise = 0.05)

In [15]:
input2, target2, label2 = trainingSetNoisy[0]
anim = snn.io.animTD(snn.io.spikeArrayToEvent(input2.reshape((2, 34, 34, -1)).cpu().data.numpy()))
HTML(anim.to_jshtml())

The data looks far too noisy, but that is only because the animation collects several frames of the real data before showing them, so the noise sums over those frames.
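
A short calculation shows why accumulated frames look so much noisier. If each time bin of an empty pixel is flipped independently with probability p, the chance that the pixel shows at least one noise event after k accumulated bins is 1 - (1 - p)^k. The number of bins the animation accumulates is not stated in this notebook, so the values of `k` below are hypothetical.

```python
def fraction_with_noise(percent_noise, frames_accumulated):
    """Probability that an empty pixel gets at least one noise event
    when `frames_accumulated` independent time bins are summed."""
    return 1.0 - (1.0 - percent_noise) ** frames_accumulated

for k in (1, 5, 10, 20):  # k is a hypothetical number of accumulated bins
    print(k, round(fraction_with_noise(0.05, k), 3))
```

The fraction grows quickly with k, so a 5% per-bin noise level covers a much larger share of the pixels in each displayed frame.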

In [16]:
# Delete the rogue temp-file
try:
    os.remove('None0000000.png')
except FileNotFoundError:
    pass

## Define the network
The network definition follows similar style as standard PyTorch network definition, but it utilizes snn modules.

In [25]:
class SpikingNetwork(torch.nn.Module):
    def __init__(self, netParams=netParams):
        super(SpikingNetwork, self).__init__()
        # initialize slayer
        self.slayer = snn.layer(netParams['neuron'], netParams['simulation'])
        # self.slayer = slayer
        
    def forward(self, spikeInput):
        x = spikeInput
        for layer in self.layers:
            x = self.slayer.spike(layer(self.slayer.psp(x)))
        return x
    
    def setLayers(self, layers):
        if isinstance(layers, list):
            self.layers = torch.nn.ModuleList(layers)
    
    def readyTraining(self, device, deviceIds):
        # Create network instance.
        # net = Network(netParams).to(device)
        # Split the network to run over multiple GPUs
        net = torch.nn.DataParallel(self.to(device), device_ids=deviceIds)
        
        # Create snn loss instance.
        error = snn.loss(netParams).to(device)

        # Define optimizer module.
        optimizer = torch.optim.Adam(net.parameters(), lr = 0.01, amsgrad = True)

        # Dataset and dataLoader instances.
        trainingSet = nmnistDatasetNoisy(datasetPath =netParams['training']['path']['in'], 
                                    sampleFile  =netParams['training']['path']['train'],
                                    samplingTime=netParams['simulation']['Ts'],
                                    sampleLength=netParams['simulation']['tSample'],
                                    percentNoise=0.05)
        trainLoader = DataLoader(dataset=trainingSet, batch_size=8, shuffle=True, num_workers=4)

        testingSet = nmnistDatasetNoisy(datasetPath  =netParams['training']['path']['in'], 
                                    sampleFile  =netParams['training']['path']['test'],
                                    samplingTime=netParams['simulation']['Ts'],
                                    sampleLength=netParams['simulation']['tSample'],
                                    percentNoise=0.05)
        testLoader = DataLoader(dataset=testingSet, batch_size=8, shuffle=True, num_workers=4)

        # Learning stats instance.
        stats = learningStats()
        
        self.netVars = {
            'net': net,
            'error': error,
            'optimizer': optimizer,
            'trainingSet': trainingSet,
            'trainLoader': trainLoader,
            'testingSet': testingSet,
            'testLoader': testLoader,
            'stats': stats
        }
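
    def train(self, device, epochcount=1, writer=None):
        # Create the writer lazily: a default argument of SummaryWriter() would be
        # constructed once, when the class is defined, and shared by every call.
        if writer is None:
            writer = SummaryWriter()
        for epoch in tqdm.trange(epochcount, desc='epoch'):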
            # Reset training stats.
            stats = self.netVars['stats']
            net = self.netVars['net']
            trainLoader = self.netVars['trainLoader']
            testLoader = self.netVars['testLoader']
            error = self.netVars['error']
            optimizer = self.netVars['optimizer']

            stats.training.reset()
            tSt = datetime.now()

            # Training loop.
            for i, (input, target, label) in enumerate(tqdm.tqdm(trainLoader, desc='batch'), 0):
                # Move the input and target to correct GPU.
                input  = input.to(device)
                target = target.to(device) 

                # Forward pass of the network.
                output = net.forward(input)

                # Gather the training stats.
                stats.training.correctSamples += torch.sum( snn.predict.getClass(output) == label ).data.item()
                stats.training.numSamples     += len(label)

                # Calculate loss.
                loss = error.numSpikes(output, target)

                # Reset gradients to zero.
                optimizer.zero_grad()

                # Backward pass of the network.
                loss.backward()

                # Update weights.
                optimizer.step()

                # Gather training loss stats.
                stats.training.lossSum += loss.cpu().data.item()

                # Display training stats. (Suitable for normal python implementation)
                # stats.print(epoch, i, (datetime.now() - tSt).total_seconds())

            # Update training stats.
            stats.training.update()
            # Reset testing stats.
            stats.testing.reset()

            # Testing loop.
            # Same steps as Training loops except loss backpropagation and weight update.
            for i, (input, target, label) in enumerate(testLoader, 0):
                input  = input.to(device)
                target = target.to(device) 

                output = net.forward(input)

                stats.testing.correctSamples += torch.sum( snn.predict.getClass(output) == label ).data.item()
                stats.testing.numSamples     += len(label)

                loss = error.numSpikes(output, target)
                stats.testing.lossSum += loss.cpu().data.item()
                # stats.print(epoch, i)

            # Update testing stats.
            stats.testing.update()
            writer.add_scalar('Accuracy/train', stats.training.accuracy(), epoch)
            writer.add_scalar('Accuracy/test',  stats.testing.accuracy(),  epoch)
            writer.add_scalar('Loss/train',     stats.training.loss(),     epoch)
            writer.add_scalar('Loss/test',      stats.testing.loss(),      epoch)

            # if epoch%10==0:  stats.print(epoch, timeElapsed=(datetime.now() - tSt).total_seconds())
        writer.close()
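
The accuracy bookkeeping in the loop above compares `snn.predict.getClass(output)` with the labels. A minimal sketch of that idea, assuming the predicted class is the output neuron that fired the most spikes, is given here in plain Python. The names `get_class` and `accuracy` are illustrative and operate on per-class spike counts rather than on SLAYER tensors.

```python
def get_class(spike_counts):
    """Index of the output neuron with the most spikes (ties go to the lowest index)."""
    return max(range(len(spike_counts)), key=lambda c: spike_counts[c])

def accuracy(batch_counts, labels):
    """Fraction of samples whose most active output neuron matches the label."""
    correct = sum(get_class(counts) == label
                  for counts, label in zip(batch_counts, labels))
    return correct / len(labels)

# Toy batch: three samples, three classes.
batch_counts = [[3, 9, 1], [12, 2, 2], [0, 4, 5]]
labels = [1, 0, 1]
print(accuracy(batch_counts, labels))
```

In the training loop the same two quantities are accumulated as `correctSamples` and `numSamples` across all batches of an epoch.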

## Create several networks with different architectures

In [26]:
Networks = []

In [27]:
Networks.append(SpikingNetwork())
slayer = Networks[0].slayer
Networks[0].setLayers([
            slayer.conv(2, 16, 5, padding=1),
            slayer.conv(16, 32, 3, padding=1),
            slayer.conv(32, 64, 3, padding=1),
            slayer.pool(2),
            slayer.pool(2),
            slayer.dense((8,8,64), 10)
        ])

In [28]:
Networks.append(SpikingNetwork())
slayer = Networks[1].slayer
Networks[1].setLayers([
            slayer.conv(2, 6, 5, padding=1),
            slayer.conv(6, 12, 3, padding=1),
            slayer.conv(12, 12, 3, padding=1),
            slayer.pool(4),
            slayer.dense((8,8,12), 10)
        ])

In [29]:
Networks.append(SpikingNetwork())
slayer = Networks[2].slayer
Networks[2].setLayers([
            slayer.conv(2, 6, 5, padding=1),
            slayer.conv(6, 12, 3, padding=1),
            slayer.pool(4),
            slayer.dense((8,8,12), 10)
        ])
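
All three networks end in a dense layer that expects an 8x8 feature map. The sketch below traces one spatial dimension of the 34x34 NMNIST input through each layer list to check that. It assumes stride-1 convolutions and pooling with a stride equal to the kernel size; those defaults are assumptions about `slayer.conv` and `slayer.pool`, and the helper names are illustrative.

```python
def conv_out(size, kernel, padding=0, stride=1):
    """Standard convolution output size along one spatial dimension."""
    return (size + 2 * padding - kernel) // stride + 1

def pool_out(size, kernel):
    """Pooling with stride equal to the kernel size (assumed here)."""
    return size // kernel

def final_size(layers, size=34):
    """Trace one spatial dimension of the 34x34 input through the layers."""
    for kind, kernel, padding in layers:
        if kind == 'conv':
            size = conv_out(size, kernel, padding)
        else:
            size = pool_out(size, kernel)
    return size

# (kind, kernel, padding) for the spatial layers of Networks[0..2].
networks = {
    0: [('conv', 5, 1), ('conv', 3, 1), ('conv', 3, 1), ('pool', 2, 0), ('pool', 2, 0)],
    1: [('conv', 5, 1), ('conv', 3, 1), ('conv', 3, 1), ('pool', 4, 0)],
    2: [('conv', 5, 1), ('conv', 3, 1), ('pool', 4, 0)],
}
for idx, layers in networks.items():
    print(idx, final_size(layers))
```

The first 5x5 convolution with padding 1 shrinks 34 to 32, the 3x3 convolutions with padding 1 keep 32, and a total pooling factor of 4 gives 8, matching the `(8,8,C)` input of each dense layer.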

## Train the network
Train each network for 100 epochs.
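
As a quick sanity check on the size of each epoch, the number of batches follows from the dataset sizes and the `batch_size=8` used in `readyTraining`. This sketch assumes the `DataLoader` keeps the last partial batch, which is its default behaviour; the helper name is illustrative.

```python
import math

def batches_per_epoch(num_samples, batch_size):
    """Number of DataLoader batches when the last partial batch is kept."""
    return math.ceil(num_samples / batch_size)

train_batches = batches_per_epoch(1000, 8)  # 1000 training samples
test_batches = batches_per_epoch(100, 8)    # 100 testing samples
print(train_batches, test_batches)
```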

In [30]:
# first move to a new directory to not overwrite anything
dt_string = datetime.now().strftime("%d-%m-%Y_%Hh%Mm%S")
dirname = dt_string
i = 0
while os.path.exists(dirname):
    i += 1  # Python has no ++ operator: ++i is just +(+i) and never increments
    dirname = dt_string + '_' + str(i)
os.mkdir(dirname)

In [31]:
# Define the cuda device to run the code on.
# To use multiple GPUs, list them in deviceIds (e.g. 'cuda:2' with deviceIds = [2, 3]).
device = torch.device('cuda:0')  # should be the first GPU of deviceIds
deviceIds = [0]

In [None]:
for i, Network in enumerate(tqdm.tqdm(Networks, desc='network')):
    Network.readyTraining(device, deviceIds)
    Network.train(device, epochcount=100, writer=SummaryWriter(dirname+"/modelset3_noise/"+str(i), comment='model'+str(i)))

network:   0%|          | 0/3 [00:00<?, ?it/s]
epoch:   0%|          | 0/100 [00:00<?, ?it/s]
batch:   0%|          | 0/125 [00:00<?, ?it/s]
batch:   1%|          | 1/125 [00:04<08:46,  4.24s/it]
batch:   2%|▏         | 2/125 [00:08<08:32,  4.16s/it]
...
batch:  99%|█████████▉| 124/125 [07:03<00:03,  3.41s/it]
batch: 100%|██████████| 125/125 [07:07<00:00,  3.42s/it]
epoch:  17%|█▋        | 17/100 [2:06:00<10:11:44, 442.22s/it]
batch:   0%|          | 0/125 [00:00<?, ?it/s]
...
epoch:  18%|█▊        | 18/100 [2:13:35<10:09:47, 446.19s/it]
batch:   0%|          | 0/125 [00:00<?, ?it/s]
...

batch:  30%|███       | 38/125 [02:02<04:40,  3.22s/it][A[A

batch:  31%|███       | 39/125 [02:05<04:37,  3.22s/it][A[A

batch:  32%|███▏      | 40/125 [02:08<04:33,  3.22s/it][A[A

batch:  33%|███▎      | 41/125 [02:12<04:30,  3.22s/it][A[A

batch:  34%|███▎      | 42/125 [02:15<04:27,  3.22s/it][A[A

batch:  34%|███▍      | 43/125 [02:18<04:23,  3.22s/it][A[A

batch:  35%|███▌      | 44/125 [02:21<04:20,  3.22s/it][A[A

batch:  36%|███▌      | 45/125 [02:25<04:17,  3.22s/it][A[A

batch:  37%|███▋      | 46/125 [02:28<04:14,  3.22s/it]

batch:  26%|██▋       | 33/125 [01:46<04:56,  3.23s/it][A[A

batch:  27%|██▋       | 34/125 [01:49<04:53,  3.22s/it][A[A

batch:  28%|██▊       | 35/125 [01:52<04:50,  3.23s/it][A[A

batch:  29%|██▉       | 36/125 [01:56<04:46,  3.22s/it][A[A

batch:  30%|██▉       | 37/125 [01:59<04:43,  3.22s/it][A[A

batch:  30%|███       | 38/125 [02:02<04:40,  3.22s/it][A[A

batch:  31%|███       | 39/125 [02:05<04:36,  3.22s/it][A[A

batch:  32%|███▏      | 40/125 [02:09<04:33,  3.22s/it][A[A

batch:  33%|███▎      | 41/125 [02:12<04:30,  3.22s/it][A[A

batch:  34%|███▎      | 42/125 [02:15<04:27,  3.22s/it][A[A

batch:  34%|███▍      | 43/125 [02:18<04:24,  3.22s/it][A[A

batch:  35%|███▌      | 44/125 [02:21<04:21,  3.23s/it][A[A

batch:  36%|███▌      | 45/125 [02:25<04:18,  3.23s/it][A[A

batch:  37%|███▋      | 46/125 [02:28<04:14,  3.23s/it][A[A

batch:  38%|███▊      | 47/125 [02:31<04:11,  3.23s/it][A[A

batch:  38%|███▊      | 48/125 [02:34<04:08,  3.23s/it]

batch:  28%|██▊       | 35/125 [01:52<04:50,  3.22s/it][A[A

batch:  29%|██▉       | 36/125 [01:56<04:47,  3.23s/it][A[A

batch:  30%|██▉       | 37/125 [01:59<04:43,  3.23s/it][A[A

batch:  30%|███       | 38/125 [02:02<04:40,  3.23s/it][A[A

batch:  31%|███       | 39/125 [02:05<04:37,  3.23s/it][A[A

batch:  32%|███▏      | 40/125 [02:09<04:33,  3.22s/it][A[A

batch:  33%|███▎      | 41/125 [02:12<04:30,  3.23s/it][A[A

batch:  34%|███▎      | 42/125 [02:15<04:27,  3.22s/it][A[A

batch:  34%|███▍      | 43/125 [02:18<04:24,  3.22s/it][A[A

batch:  35%|███▌      | 44/125 [02:21<04:20,  3.22s/it][A[A

batch:  36%|███▌      | 45/125 [02:25<04:17,  3.22s/it][A[A

batch:  37%|███▋      | 46/125 [02:28<04:14,  3.23s/it][A[A

batch:  38%|███▊      | 47/125 [02:31<04:11,  3.23s/it][A[A

batch:  38%|███▊      | 48/125 [02:34<04:08,  3.22s/it][A[A

batch:  39%|███▉      | 49/125 [02:38<04:05,  3.22s/it][A[A

batch:  40%|████      | 50/125 [02:41<04:02,  3.23s/it]

batch:  30%|██▉       | 37/125 [01:59<04:43,  3.22s/it][A[A

batch:  30%|███       | 38/125 [02:02<04:40,  3.22s/it][A[A

batch:  31%|███       | 39/125 [02:05<04:36,  3.22s/it][A[A

batch:  32%|███▏      | 40/125 [02:09<04:33,  3.22s/it][A[A

batch:  33%|███▎      | 41/125 [02:12<04:30,  3.22s/it][A[A

batch:  34%|███▎      | 42/125 [02:15<04:27,  3.22s/it][A[A

batch:  34%|███▍      | 43/125 [02:18<04:24,  3.22s/it][A[A

batch:  35%|███▌      | 44/125 [02:21<04:21,  3.22s/it][A[A

batch:  36%|███▌      | 45/125 [02:25<04:17,  3.22s/it][A[A

batch:  37%|███▋      | 46/125 [02:28<04:14,  3.22s/it][A[A

batch:  38%|███▊      | 47/125 [02:31<04:11,  3.22s/it][A[A

batch:  38%|███▊      | 48/125 [02:34<04:07,  3.22s/it][A[A

batch:  39%|███▉      | 49/125 [02:38<04:05,  3.22s/it][A[A

batch:  40%|████      | 50/125 [02:41<04:01,  3.22s/it][A[A

batch:  41%|████      | 51/125 [02:44<03:58,  3.22s/it][A[A

batch:  42%|████▏     | 52/125 [02:47<03:55,  3.22s/it]

batch:  31%|███       | 39/125 [02:05<04:37,  3.22s/it][A[A

batch:  32%|███▏      | 40/125 [02:08<04:33,  3.22s/it][A[A

batch:  33%|███▎      | 41/125 [02:12<04:30,  3.22s/it][A[A

batch:  34%|███▎      | 42/125 [02:15<04:27,  3.22s/it][A[A

batch:  34%|███▍      | 43/125 [02:18<04:24,  3.22s/it][A[A

batch:  35%|███▌      | 44/125 [02:21<04:20,  3.22s/it][A[A

batch:  36%|███▌      | 45/125 [02:24<04:17,  3.22s/it][A[A

batch:  37%|███▋      | 46/125 [02:28<04:14,  3.22s/it][A[A

batch:  38%|███▊      | 47/125 [02:31<04:11,  3.22s/it][A[A

batch:  38%|███▊      | 48/125 [02:34<04:07,  3.22s/it][A[A

batch:  39%|███▉      | 49/125 [02:37<04:04,  3.22s/it][A[A

batch:  40%|████      | 50/125 [02:41<04:01,  3.22s/it][A[A

batch:  41%|████      | 51/125 [02:44<03:58,  3.22s/it][A[A

batch:  42%|████▏     | 52/125 [02:47<03:55,  3.22s/it][A[A

batch:  42%|████▏     | 53/125 [02:50<03:51,  3.22s/it][A[A

batch:  43%|████▎     | 54/125 [02:53<03:48,  3.22s/it]

batch:  33%|███▎      | 41/125 [02:12<04:30,  3.22s/it][A[A

batch:  34%|███▎      | 42/125 [02:15<04:26,  3.22s/it][A[A

batch:  34%|███▍      | 43/125 [02:18<04:23,  3.22s/it][A[A

batch:  35%|███▌      | 44/125 [02:21<04:20,  3.22s/it][A[A

batch:  36%|███▌      | 45/125 [02:24<04:17,  3.22s/it][A[A

batch:  37%|███▋      | 46/125 [02:28<04:14,  3.22s/it][A[A

batch:  38%|███▊      | 47/125 [02:31<04:11,  3.22s/it][A[A

batch:  38%|███▊      | 48/125 [02:34<04:07,  3.22s/it][A[A

batch:  39%|███▉      | 49/125 [02:37<04:04,  3.22s/it][A[A

batch:  40%|████      | 50/125 [02:41<04:01,  3.22s/it][A[A

batch:  41%|████      | 51/125 [02:44<03:58,  3.22s/it][A[A

batch:  42%|████▏     | 52/125 [02:47<03:55,  3.22s/it][A[A

batch:  42%|████▏     | 53/125 [02:50<03:51,  3.22s/it][A[A

batch:  43%|████▎     | 54/125 [02:53<03:48,  3.22s/it][A[A

batch:  44%|████▍     | 55/125 [02:57<03:45,  3.22s/it][A[A

batch:  45%|████▍     | 56/125 [03:00<03:41,  3.22s/it]

## ~~Plot the Results~~