# **Homework 1: COVID-19 Cases Prediction (Regression)**

### [Homework tutorial](https://www.youtube.com/watch?v=iMzxjBDMvac)

Objectives:
* Solve a regression problem with deep neural networks (DNN).
* Understand basic DNN training tips.
* Familiarize yourself with PyTorch.

If you have any questions, please contact the TAs via TA hours, NTU COOL, or by email at mlta-2023-spring@googlegroups.com

In [1]:
# check gpu type
!nvidia-smi

Fri Nov 10 22:58:46 2023       
+---------------------------------------------------------------------------------------+
| NVIDIA-SMI 537.13                 Driver Version: 537.13       CUDA Version: 12.2     |
|-----------------------------------------+----------------------+----------------------+
| GPU  Name                     TCC/WDDM  | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp   Perf          Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                                         |                      |               MIG M. |
|   0  NVIDIA GeForce RTX 4060 ...  WDDM  | 00000000:01:00.0  On |                  N/A |
| N/A   38C    P8               2W /  93W |     94MiB /  8188MiB |      0%      Default |
|                                         |                      |                  N/A |
+-----------------------------------------+----------------------+----------------------+
                                                                    

# Import packages

In [2]:
# Numerical Operations
import math
import numpy as np

# Reading/Writing Data
import pandas as pd
import os
import csv

# Feature selecting
import sklearn
from sklearn.feature_selection import SelectKBest,f_regression

# For Progress Bar
from tqdm import tqdm

# Pytorch
import torch
import torch.nn as nn
from torch.utils.data import Dataset, DataLoader, random_split

# For plotting learning curve
from torch.utils.tensorboard import SummaryWriter

In [3]:
# playground
# dir() lists the attributes and methods of an object
# dir(torch)
# dir(torch.cuda.is_available)
# help(torch.cuda.is_available)

# Download data
If the Google Drive links below do not work, you can use the dropbox link below or download data from [Kaggle](https://www.kaggle.com/competitions/ml2023spring-hw1/overview), and upload data manually to the workspace.

In [4]:
# google drive link
# !gdown --id '1BjXalPZxq9mybPKNjF3h5L3NcF7XKTS-' --output covid_train.csv
# !gdown --id '1B55t74Jg2E5FCsKCsUEkPKIuqaY7UIi1' --output covid_test.csv

# dropbox link
# wget fails here: it is a Linux tool and is not available on this Windows machine; will retry on a Mac later
# It runs fine on Colab
# !wget -O covid_train.csv https://www.dropbox.com/s/lmy1riadzoy0ahw/covid.train.csv?dl=0
# !wget -O covid_test.csv https://www.dropbox.com/s/zalbw42lu4nmhr2/covid.test.csv?dl=0
# /kaggle/input/hw1-covid-19

File_Path_Train = './covid_train.csv'
File_Path_Test = './covid_test.csv'

# Kaggle
# File_Path_Train = '/kaggle/input/hw1-covid-19/covid_train.csv'
# File_Path_Test = '/kaggle/input/hw1-covid-19/covid_test.csv'

# Common
ds_train = pd.read_csv(File_Path_Train)
ds_test = pd.read_csv(File_Path_Test)

### Data checking
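First, inspect the data. According to the assignment, the raw data should contain infection figures for 35 US states over the past 3 days:

- First, try to understand the data
- If any data needs padding, do the padding first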

In [5]:
# ds_train.head()

In [6]:
# ds_train.describe()

In [7]:
# List all columns
# for idx,name in enumerate(ds_train.columns):
#   print(idx,name)

In [8]:
# ds_train.shape[0]

In [9]:
# print('--------------------------- STATE ----------------------------------')
# print('id:',ds_train.columns.get_loc('id'))
# print('AL:',ds_train.columns.get_loc('AL'))
# print('WI:',ds_train.columns.get_loc('WI'))
# print('--------------------------- DAY 1 ----------------------------------')
# print('cli:',ds_train.columns.get_loc('cli'))
# print('tested_positive:',ds_train.columns.get_loc('tested_positive'))
# print('--------------------------- DAY 2 ----------------------------------')
# print('cli.1:',ds_train.columns.get_loc('cli.1'))
# print('tested_positive.1:',ds_train.columns.get_loc('tested_positive.1'))
# print('--------------------------- DAY 3 ----------------------------------')
# print('cli.2:',ds_train.columns.get_loc('cli.2'))
# print('tested_positive.2:',ds_train.columns.get_loc('tested_positive.2'))

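#### Does it contain 35 states?
No. There are 34 state columns, not 35: col[1] through col[34], AL to WI.
> PS: the United States has 50 states in total: https://zh.wikipedia.org/zh-hans/%E7%BE%8E%E5%9B%BD%E5%B7%9E%E4%BB%BD

#### Does it contain 3 days of data?
Yes. There are three days of data, with 18 columns per day.
- Day 1 spans col[35] to col[52], from cli to tested_positive
- Day 2 uses the .1 suffix and spans col[53] to col[70], from cli.1 to tested_positive.1
- Day 3 uses the .2 suffix and spans col[71] to col[88], from cli.2 to tested_positive.2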
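
The column layout noted above can be checked with a little index arithmetic. The sketch below is only an illustration: `N_STATES`, `N_FEATURES_PER_DAY`, and `day_block` are names introduced here, and the counts come from the notes above rather than from reading the CSV.

```python
# Hypothetical helper names; the counts are taken from the notes above.
N_STATES = 34             # state columns occupy col[1]..col[34]
N_FEATURES_PER_DAY = 18   # cli ... tested_positive

def day_block(day):
    """Return (first_col, last_col), both inclusive, for day 0, 1 or 2."""
    first = 1 + N_STATES + day * N_FEATURES_PER_DAY  # skip id and the state columns
    last = first + N_FEATURES_PER_DAY - 1
    return first, last

for day in range(3):
    first, last = day_block(day)
    print(f"day {day + 1}: col[{first}] .. col[{last}]")
```

If the real file has a different number of state columns, only `N_STATES` needs to change.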

#### Column categories
|no|category|
|-----------|-----------|
|1|COVID-like illness|
|2|Behavior indicators|
|3|Belief indicators|
|4|Mental indicator|
|5|Environmental indicator|
|6|Tested Positive Cases|

- ![col-feature-desc](./hw-col-feature-desc.jpg)
- ![col-feature-desc-2](./hw-col-feature-desc-2.jpg)
- ![col-feature-desc-3](./hw-col-feature-desc-3.jpg)

#### Column name explanations
All column names are abbreviations and hard to read, so here is the survey documentation that explains them:
https://cmu-delphi.github.io/delphi-epidata/api/covidcast-signals/fb-survey.html

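In that documentation, a leading *w* in a signal name marks a survey-weighted estimate; the names without it are the unweighted versions.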

A brief excerpt follows:

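|no|col name|description|classification|
|-----------|-----------|-----------|-----------|
|1|cli|COVID-like illness|COVID-like illness|
|2|ili|Influenza-like illness|COVID-like illness|
|3|wnohh_cmnty_cli|Community COVID-like illness, not including the respondent's household (proportion)|COVID-like illness|
|4|wbelief_masking_effective|Believe that wearing a mask is an effective preventive measure|Belief indicators|
|5|wbelief_distancing_effective|Believe that keeping distance is an effective preventive measure|Belief indicators|
|6|wcovid_vaccinated_friends|Most of their friends and family have received a COVID-19 vaccine|Behavior indicators|
|7|wlarge_event_indoors|Percentage who attended an event with more than 10 people|Behavior indicators|
|8|wothers_masked_public|Estimated percentage of respondents who say most or all other people wear masks in public|Environmental indicator|
|9|wothers_distanced_public|Estimated percentage of respondents who say most or all other people keep at least 6 feet (1.8 m) away from them in public|Environmental indicator|
|10|wshop_indoors|Estimated percentage of respondents who went to an indoor market, grocery store, or pharmacy|Behavior indicators|
|11|wrestaurant_indoors|Estimated percentage of respondents who went to an indoor "bar, restaurant, or cafe"|Behavior indicators|
|12|wworried_catch_covid|Estimated percentage of respondents who are very or moderately worried about COVID-19|Mental indicator|
|13|hh_cmnty_cli|Respondents reporting COVID-like illness in their local community, including their household|COVID-like illness|
|14|nohh_cmnty_cli|Respondents reporting COVID-like illness in their local community, not including their household|COVID-like illness|
|15|wearing_mask_7d|People who wore a mask in public most or all of the time in the past 7 days|Environmental indicator|
|16|public_transit|Used public transit|Behavior indicators|
|17|worried_finances|Respondents who are very or somewhat worried about their household finances for the next month|Mental indicator|
|18|tested_positive|Cases that tested positive|Tested Positive Cases|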
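
To work with a whole category at once, the grouping in the table can be written down as a dictionary and expanded into the per-day column names. This is a minimal sketch: `CATEGORIES` and `columns_for` are hypothetical names introduced here, and the `.1` / `.2` suffixes follow the column naming noted earlier for days 2 and 3.

```python
# Category -> base column names, copied from the table above.
CATEGORIES = {
    "COVID-like illness": ["cli", "ili", "wnohh_cmnty_cli", "hh_cmnty_cli", "nohh_cmnty_cli"],
    "Belief indicators": ["wbelief_masking_effective", "wbelief_distancing_effective"],
    "Behavior indicators": ["wcovid_vaccinated_friends", "wlarge_event_indoors",
                            "wshop_indoors", "wrestaurant_indoors", "public_transit"],
    "Mental indicator": ["wworried_catch_covid", "worried_finances"],
    "Environmental indicator": ["wothers_masked_public", "wothers_distanced_public",
                                "wearing_mask_7d"],
    "Tested Positive Cases": ["tested_positive"],
}

def columns_for(category, days=3):
    """Expand a category into per-day column names (day 1 has no suffix)."""
    names = []
    for day in range(days):
        suffix = "" if day == 0 else f".{day}"
        names.extend(base + suffix for base in CATEGORIES[category])
    return names

print(columns_for("Belief indicators"))
```

With the data loaded, something like `ds_train[columns_for("Belief indicators")]` would then select those columns, assuming the CSV uses exactly these names.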

In [10]:
# print('--------------------------- COVID-like illness ----------------------------------')
# print(ds_train.loc[:0,['id','cli','ili','wnohh_cmnty_cli','hh_cmnty_cli','nohh_cmnty_cli']])
# print('--------------------------- Behavior indicators ----------------------------------')
# print(ds_train.loc[:0,['id','wcovid_vaccinated_friends','wlarge_event_indoors','wshop_indoors','wrestaurant_indoors','public_transit']])
# print('--------------------------- Belief indicators ----------------------------------')
# print(ds_train.loc[:0,['id','wbelief_masking_effective','wbelief_distancing_effective']])
# print('--------------------------- Mental indicator ----------------------------------')
# print(ds_train.loc[:0,['id','wworried_catch_covid','worried_finances']])
# print('--------------------------- Enviromental indicator ----------------------------------')
# print(ds_train.loc[:0,['id','wothers_masked_public','wothers_distanced_public','wearing_mask_7d']])
# print('--------------------------- Tested Positive Cases ----------------------------------')
# print(ds_train.loc[:0,['id','tested_positive',]])

# Configurations
`config` contains hyper-parameters for training and the path to save your model.

In [11]:
device = 'cuda' if torch.cuda.is_available() else 'cpu'

# 'select_features': np.r_[1:34,35:37,40:45,47:50,52,53:55,58:63,65:68,70,71:73,76:81,83:86], # Select columns without id, Mental indicator
config = {
    'model_use_business': False,# When designing the model, whether layer widths are derived from the feature counts (True) or set to powers of two (False)
    'seed': 19871201,      # Your seed number, you can pick your lucky number. :)
    'select_all': False,   # Whether to use all features.
    'select_features': [35,36,37,47,48,52,53,54,55,65,66,70,71,72,73,83,84],
    'valid_ratio': 0.2,   # validation_size = train_size * valid_ratio
    'n_epochs': 5000,     # Number of epochs.
    'batch_size': 256,
    'learning_rate': 1e-3,
    'early_stop': 600,    # If model has not improved for this many consecutive epochs, stop training.
    'weight_decay': 1e-6,
    'save_path': './models/model.ckpt'  # Your model will be saved here.
}
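
As a quick sanity check on `valid_ratio` and `batch_size`, the split sizes and the number of batches per epoch can be worked out by hand. The sketch below assumes 3009 rows in the training CSV, which is inferred from the train and validation sizes printed in the training cell (2408 + 601); `split_sizes` and `batches_per_epoch` are hypothetical helper names.

```python
import math

def split_sizes(n_rows, valid_ratio):
    """Mirror the arithmetic in train_valid_split: validation size is truncated."""
    n_valid = int(valid_ratio * n_rows)
    return n_rows - n_valid, n_valid

def batches_per_epoch(n_train, batch_size):
    """The DataLoader keeps the last partial batch by default, hence the ceiling."""
    return math.ceil(n_train / batch_size)

n_train, n_valid = split_sizes(3009, 0.2)   # 3009 is an assumed row count
print(n_train, n_valid, batches_per_epoch(n_train, 256))
```

These values line up with the `(2408, 89)` / `(601, 89)` shapes and the `10/10` progress bars in the output further down.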

## Notes on some experiment results

### Epoch [1000/1000]: Train loss: 0.8504, Valid loss: 0.8970
Changed the model's layer widths to powers of two; the result seems mediocre.
```python
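# Decouple the layer widths from the domain logic; the domain logic is expressed through input_dim, i.e. the feature selection.
# The Linear layer widths below are set to powers of two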
layer_1_nodes = 64
layer_2_nodes = 32
layer_3_nodes = 16

class My_Model(nn.Module):
    def __init__(self, input_dim):
        super(My_Model, self).__init__()
        # TODO: modify model's structure, be aware of dimensions.
        self.layers = nn.Sequential(
            nn.Linear(input_dim, layer_1_nodes),
            nn.ReLU(),
            nn.Linear(layer_1_nodes, layer_2_nodes),
            nn.ReLU(),
            nn.Linear(layer_2_nodes, layer_3_nodes),
            nn.ReLU(),
            nn.Linear(layer_3_nodes, 1) # final output: a single predicted positive value
        )

    def forward(self, x):
        x = self.layers(x)
        x = x.squeeze(1) # (B, 1) -> (B)
        return x
```

### Epoch [1000/1000]: Train loss: 0.8320, Valid loss: 0.8807

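For feature selection, the filter was updated to drop both the belief and the mental indicators. The feature list was also moved into config, which requires changing the training code.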
```python
config = {
    'seed': 19871201,      # Your seed number, you can pick your lucky number. :)
    'select_all': False,   # Whether to use all features.
    'select_features': np.r_[1:34,35:37,40:45,47:50,52,53:55,58:63,65:68,70,71:73,76:81,83:86], # Select columns without id, Mental indicator
    'valid_ratio': 0.2,   # validation_size = train_size * valid_ratio
    'n_epochs': 1000,     # Number of epochs.
    'batch_size': 34*10,
    'learning_rate': 1e-3,
    'early_stop': 600,    # If model has not improved for this many consecutive epochs, stop training.
    'weight_decay': 1e-6,
    'save_path': './models/model.ckpt'  # Your model will be saved here.
}

x_train, x_valid, x_test, y_train, y_valid = select_feat(train_data, valid_data, test_data, config['select_all'], config['select_features'])
```

The optimizer uses Adam with L2 regularization:
```python
optimizer = torch.optim.Adam(model.parameters(), lr=config['learning_rate'], betas=(0.9, 0.999), eps=1e-08, weight_decay=config['weight_decay'], amsgrad=False)
```
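
To make the role of `weight_decay` concrete, here is a scalar sketch of a single Adam update in which the L2 penalty is folded into the gradient. It follows the update rule described in the PyTorch documentation for `torch.optim.Adam`, but it is a hand-written illustration for one parameter, not the library's implementation; `adam_step` is a hypothetical name.

```python
import math

def adam_step(param, grad, m, v, t, lr=1e-3, betas=(0.9, 0.999), eps=1e-8, weight_decay=1e-6):
    """One Adam update for a single scalar parameter, with L2 weight decay."""
    beta1, beta2 = betas
    grad = grad + weight_decay * param          # L2 penalty added to the gradient
    m = beta1 * m + (1 - beta1) * grad          # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * grad * grad   # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                # bias correction for step t (1-based)
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (math.sqrt(v_hat) + eps)
    return param, m, v

param, m, v = adam_step(param=1.0, grad=0.5, m=0.0, v=0.0, t=1)
print(param)
```

On the very first step the bias-corrected ratio is close to the sign of the gradient, so the parameter moves by roughly `lr` regardless of the gradient's size.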

### Epoch [1000/1000]: Train loss: 1.1560, Valid loss: 1.3151

```python
# define feature
state_count = 34
feature_count = 15
feature_type_count = 4 # the mental indicator was dropped
neuron_nodes = state_count + feature_count*3 # 34 states plus 15 useful feature columns * 3 days

nn.Sequential(
    nn.Linear(input_dim, neuron_nodes),
    nn.ReLU(),
    nn.Linear(neuron_nodes, feature_count),
    nn.ReLU(),
    nn.Linear(feature_count, feature_type_count),
    nn.ReLU(),
    nn.Linear(feature_type_count, 1) # final output: a single predicted positive value
)

feat_idx = np.r_[1:34,35:45,47:50,52,53:63,65:68,70,71:81,83:86] # Select columns without id, Mental indicator
```

### Epoch [1000/1000]: Train loss: 4.8335, Valid loss: 3.4793

```python
class My_Model(nn.Module):
    def __init__(self, input_dim):
        super(My_Model, self).__init__()
        # TODO: modify model's structure, be aware of dimensions.
        self.layers = nn.Sequential(
            nn.Linear(input_dim, 17),
            nn.ReLU(),
            nn.ReLU(),
            nn.Linear(17, 5),
            nn.ReLU(),
            nn.ReLU(),
            nn.Linear(5, 1)
        )

    def forward(self, x):
        x = self.layers(x)
        x = x.squeeze(1) # (B, 1) -> (B)
        return x

```

### Results for the baseline network below
After changing the batch size:


Epoch [1000/1000]: Train loss: 5.5759, Valid loss: 2.4200

Epoch [4569/5000]: Train loss: 1.2975, Valid loss: 1.9382

``` python
class My_Model(nn.Module):
    def __init__(self, input_dim):
        super(My_Model, self).__init__()
        # TODO: modify model's structure, be aware of dimensions.
        self.layers = nn.Sequential(
            nn.Linear(input_dim, 16),
            nn.ReLU(),
            nn.Linear(16, 8),
            nn.ReLU(),
            nn.Linear(8, 1)
        )

    def forward(self, x):
        x = self.layers(x)
        x = x.squeeze(1) # (B, 1) -> (B)
        return x
```

# Some Utility Functions

You do not need to modify this part.

In [12]:
def same_seed(seed):
    '''Fixes random number generator seeds for reproducibility.'''
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False
    np.random.seed(seed)
    torch.manual_seed(seed)
    if torch.cuda.is_available():
        torch.cuda.manual_seed_all(seed)

def train_valid_split(data_set, valid_ratio, seed):
    '''Split provided training data into training set and validation set'''
    valid_set_size = int(valid_ratio * len(data_set))
    train_set_size = len(data_set) - valid_set_size
    train_set, valid_set = random_split(data_set, [train_set_size, valid_set_size], generator=torch.Generator().manual_seed(seed))
    return np.array(train_set), np.array(valid_set)

def predict(test_loader, model, device):
    model.eval() # Set your model to evaluation mode. The mode has to be switched back with model.train() later; toggling back and forth is a bit tedious.
    preds = []
    for x in tqdm(test_loader):
        x = x.to(device)
        with torch.no_grad():
            pred = model(x)
            preds.append(pred.detach().cpu())
    preds = torch.cat(preds, dim=0).numpy()
    return preds

# Dataset

In [13]:
class COVID19Dataset(Dataset):
    '''
    x: Features.
    y: Targets, if none, do prediction.
    '''
    def __init__(self, x, y=None):
        if y is None:
            self.y = y
        else:
            self.y = torch.FloatTensor(y)
        self.x = torch.FloatTensor(x)

    def __getitem__(self, idx):
        if self.y is None:
            return self.x[idx]
        else:
            return self.x[idx], self.y[idx]

    def __len__(self):
        return len(self.x)

# Neural Network Model
Try out different model architectures by modifying the class below.

In [14]:
# @title
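# Defining an nn (neural network) module usually means overriding __init__ and forward
# forward is the feed-forward pass

# Decouple the layer widths from the domain logic; the domain logic is expressed through input_dim, i.e. the feature selection
# The Linear layer widths below are set to powers of two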
layer_1_nodes = 16
layer_2_nodes = 8
layer_3_nodes = 4

if config['model_use_business']:
  # define feature
  state_count = 34
  previous_count = 2
  feature_count = 13
  feature_type_count = 3 # mental and belief indicators were dropped
  neuron_nodes = feature_count*3 # 13 useful feature columns * 3 days (state columns not included)

  layer_1_nodes = neuron_nodes + previous_count
  layer_2_nodes = feature_count+1 + previous_count
  layer_3_nodes = feature_type_count+ previous_count

class My_Model(nn.Module):
    def __init__(self, input_dim):
        super(My_Model, self).__init__()
        # TODO: modify model's structure, be aware of dimensions.
        self.layers = nn.Sequential(
            nn.Linear(input_dim, layer_1_nodes),
            nn.ReLU(),
            nn.Linear(layer_1_nodes, layer_2_nodes),
            nn.ReLU(),
            nn.Linear(layer_2_nodes, layer_3_nodes),
            nn.ReLU(),
            nn.Linear(layer_3_nodes, 1) # final output: a single predicted positive value
        )

    def forward(self, x):
        x = self.layers(x)
        x = x.squeeze(1) # (B, 1) -> (B)
        return x
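
To compare the two width schemes above, it helps to count trainable parameters. The sketch below does this in plain Python for a stack of `nn.Linear` layers (weights plus biases). The input width of 17 is an assumption based on the 17 entries in `config['select_features']`, and `count_parameters` is a hypothetical helper.

```python
def count_parameters(widths):
    """Weights plus biases of a fully connected stack with the given layer widths."""
    return sum(n_in * n_out + n_out for n_in, n_out in zip(widths, widths[1:]))

# Power-of-two widths (model_use_business = False), assuming 17 input features.
power_of_two = [17, 16, 8, 4, 1]

# Feature-derived widths (model_use_business = True):
# layer 1 = 13*3 + 2, layer 2 = 13 + 1 + 2, layer 3 = 3 + 2.
feature_based = [17, 13 * 3 + 2, 13 + 1 + 2, 3 + 2, 1]

print(count_parameters(power_of_two), count_parameters(feature_based))
```

With roughly 2,400 training rows, both variants are small networks; the feature-derived one has about three times as many parameters.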

# Feature Selection
Choose features you deem useful by modifying the function below.

In [15]:
# np.r_[35:37,40:45,47:50,52,53:55,58:63,65:68,70,71:73,76:81,83:86], # earlier hand-picked selection


train_data_for_select = pd.read_csv(File_Path_Train).values
# train_data_for_select, valid_data = train_valid_split(train_data_for_select, config['valid_ratio'], config['seed'])
train_data_for_select_X,train_data_for_select_Y = train_data_for_select[:,:-1],train_data_for_select[:,-1]


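# With k=17, the idea is to keep the positive-case values of the first two days plus the 5 most correlated factors from each of the 3 days. The columns returned are:
# cli   ili   wnohh_cmnty_cli   hh_cmnty_cli   nohh_cmnty_cli   tested_positive
# cli.1 ili.1 wnohh_cmnty_cli.1 hh_cmnty_cli.1 nohh_cmnty_cli.1 tested_positive.1
# cli.2 ili.2 wnohh_cmnty_cli.2 hh_cmnty_cli.2 nohh_cmnty_cli.2

# With k=2, only two features are kept, and the selector returns:
# tested_positive, tested_positive.1

# Judging by the output, the selection looks very good

def select_feat_sklearn(x, y):
    '''Fit a SelectKBest selector with f_regression scores and return the fitted selector.'''
    k = 17
    selector = SelectKBest(score_func=f_regression, k=k).fit(x, y)
    return selector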

# print(train_data_for_select_X[:1,:])
# train_data_for_select_X.shape
# train_data_for_select_Y.shape
# print(train_data_for_select_Y)

features = select_feat_sklearn(train_data_for_select_X,train_data_for_select_Y)
# Column indices of the selected features (the input is a NumPy array, so there are no real column names to read).
array = features.get_support(indices=True)

# [35 36 37 47 48 52 53 54 55 65 66 70 71 72 73 83 84]
    
print(array)
# idx = np.argsort(features.scores_)[::1]
# print(features[:3])


[35 36 37 47 48 52 53 54 55 65 66 70 71 72 73 83 84]
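
For intuition about why these columns win, `f_regression` scores each feature by its linear correlation with the target, converted to an F statistic, and `SelectKBest` keeps the `k` highest scores. The following is a simplified pure-Python re-implementation of that idea on toy data, not scikit-learn's code; `f_score` and `top_k_columns` are hypothetical names and the numbers are made up.

```python
def f_score(x, y):
    """F statistic of a single feature: r^2 / (1 - r^2) * (n - 2)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    r2 = sxy * sxy / (sxx * syy)          # squared Pearson correlation
    return r2 / (1 - r2) * (n - 2)

def top_k_columns(columns, y, k):
    """Indices of the k best-scoring columns, returned in ascending order."""
    scores = [f_score(col, y) for col in columns]
    ranked = sorted(range(len(columns)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])

# Toy data: column 0 tracks the target closely, the others much less so.
y = [1, 2, 3, 4, 5, 6]
columns = [
    [1.1, 1.9, 3.2, 3.9, 5.1, 6.0],
    [3, 1, 4, 1, 5, 9],
    [2, 7, 1, 8, 2, 8],
]
print(top_k_columns(columns, y, k=2))
```

This also explains the `k=2` result noted above: the previous days' `tested_positive` values are the columns most linearly related to the target.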


In [16]:
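# select_all defaults to True, meaning all columns are used as features.
# If some columns turn out to be weakly related during training, exclude them by setting select_all to False and passing the column indices by hand.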
def select_feat(train_data, valid_data, test_data, select_all=True, select_features=[]):
    '''Selects useful features to perform regression'''
    y_train, y_valid = train_data[:,-1], valid_data[:,-1]
    raw_x_train, raw_x_valid, raw_x_test = train_data[:,:-1], valid_data[:,:-1], test_data

    if select_all:
        feat_idx = list(range(raw_x_train.shape[1]))
    else:
        feat_idx = select_features

    return raw_x_train[:,feat_idx], raw_x_valid[:,feat_idx], raw_x_test[:,feat_idx], y_train, y_valid

# Training Loop

In [17]:
same_seed(config['seed'])

train_data, test_data = pd.read_csv(File_Path_Train).values, pd.read_csv(File_Path_Test).values
train_data, valid_data = train_valid_split(train_data, config['valid_ratio'], config['seed'])

# Print out the data size.
print(f"""train_data size: {train_data.shape}
valid_data size: {valid_data.shape}
test_data size: {test_data.shape}""")

# Select features
x_train, x_valid, x_test, y_train, y_valid = select_feat(train_data, valid_data, test_data, config['select_all'], config['select_features'])

print('x_train')
print(x_train)

# Print out the number of features.
print(f'number of features: {x_train.shape[1]}')

train_dataset, valid_dataset, test_dataset = COVID19Dataset(x_train, y_train), \
                                            COVID19Dataset(x_valid, y_valid), \
                                            COVID19Dataset(x_test)

# Pytorch data loader loads pytorch dataset into batches.
train_loader = DataLoader(train_dataset, batch_size=config['batch_size'], shuffle=True, pin_memory=True)
valid_loader = DataLoader(valid_dataset, batch_size=config['batch_size'], shuffle=True, pin_memory=True)
test_loader = DataLoader(test_dataset, batch_size=config['batch_size'], shuffle=False, pin_memory=True)

train_data size: (2408, 89)
valid_data size: (601, 89)
test_data size: (997, 88)
x_train
[[ 1.3138593  1.2748362 15.7057702 ... 15.8342843 22.0118552 16.1409036]
 [ 1.6800285  1.7248551 16.6408925 ... 16.2795037 21.1383018 16.1498569]
 [ 0.8818757  0.9345295  9.7551765 ...  8.7525655 12.8101195  8.8501928]
 ...
 [ 1.0946756  1.1118771 11.7432324 ... 12.0424304 15.3299617 10.9991829]
 [ 3.6352241  3.7769054 26.5714705 ... 23.7412583 32.9247972 25.8964017]
 [ 3.6850859  3.7616652 33.6158757 ... 31.6886024 37.331582  31.3212202]]
number of features: 17


In [18]:
def trainer(train_loader, valid_loader, model, config, device):

    criterion = nn.MSELoss(reduction='mean') # Define your loss function, do not modify this.

    # Define your optimization algorithm.
    # TODO: Please check https://pytorch.org/docs/stable/optim.html to get more available algorithms.
    # TODO: L2 regularization (optimizer(weight decay...) or implement by your self).
    # optimizer = torch.optim.SGD(model.parameters(), lr=config['learning_rate'], momentum=0.9)
    optimizer = torch.optim.Adam(model.parameters(), lr=config['learning_rate'], betas=(0.9, 0.999), eps=1e-08, weight_decay=config['weight_decay'], amsgrad=False)
    writer = SummaryWriter() # Writer of TensorBoard.

    if not os.path.isdir('./models'):
        os.mkdir('./models') # Create directory of saving models.

    n_epochs, best_loss, step, early_stop_count = config['n_epochs'], math.inf, 0, 0

    for epoch in range(n_epochs):
        model.train() # Set your model to train mode.
        loss_record = []

        # tqdm is a package to visualize your training progress.
        train_pbar = tqdm(train_loader, position=0, leave=True)

        for x, y in train_pbar:
            optimizer.zero_grad()               # Set gradient to zero.
            x, y = x.to(device), y.to(device)   # Move your data to device.
            pred = model(x)
            loss = criterion(pred, y)
            loss.backward()                     # Compute gradient(backpropagation).
            optimizer.step()                    # Update parameters.
            step += 1
            loss_record.append(loss.detach().item())

            # Display current epoch number and loss on tqdm progress bar.
            train_pbar.set_description(f'Epoch [{epoch+1}/{n_epochs}]')
            train_pbar.set_postfix({'loss': loss.detach().item()})

        mean_train_loss = sum(loss_record)/len(loss_record)
        # writer.add_scalar('Loss/train', mean_train_loss, step)

        model.eval() # Set your model to evaluation mode.
        loss_record = []
        for x, y in valid_loader:
            x, y = x.to(device), y.to(device)
            with torch.no_grad():
                pred = model(x)
                loss = criterion(pred, y)

            loss_record.append(loss.item())

        mean_valid_loss = sum(loss_record)/len(loss_record)
        if(epoch%100 == 0):
            print(f'Epoch [{epoch+1}/{n_epochs}]: Train loss: {mean_train_loss:.4f}, Valid loss: {mean_valid_loss:.4f}')
        # writer.add_scalar('Loss/valid', mean_valid_loss, step)

        if mean_valid_loss < best_loss:
            best_loss = mean_valid_loss
            torch.save(model.state_dict(), config['save_path'] + str(best_loss)) # Save your best model
            print('Saving model with loss {:.3f}...'.format(best_loss))
            early_stop_count = 0
        else:
            early_stop_count += 1

        if early_stop_count >= config['early_stop']:
            print('\nModel is not improving, so we halt the training session.')
            print('Model current loss {:.3f}'.format(best_loss))
            return
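
The early-stopping rule inside `trainer` can be replayed on a plain list of validation losses, which makes it easy to see when training would halt for a given patience. This is a standalone sketch: `run_early_stopping` is a hypothetical helper, and `patience=3` is chosen only for illustration (the config uses 600).

```python
import math

def run_early_stopping(valid_losses, patience):
    """Replay the stopping rule on per-epoch validation losses.

    Returns (epoch_at_which_training_ends, best_loss); epochs are 1-based.
    """
    best_loss, stale_epochs = math.inf, 0
    for epoch, loss in enumerate(valid_losses, start=1):
        if loss < best_loss:
            best_loss, stale_epochs = loss, 0   # improvement: reset the counter
        else:
            stale_epochs += 1                   # no improvement this epoch
        if stale_epochs >= patience:
            return epoch, best_loss
    return len(valid_losses), best_loss

# First six validation losses from the training log in this notebook.
losses = [376.6558, 399.6446, 389.9184, 402.2886, 384.9726, 394.2755]
stop_epoch, best = run_early_stopping(losses, patience=3)
print(stop_epoch, best)
```

Because the validation loss is noisy from epoch to epoch, a small patience stops almost immediately, which is one reason to keep it large for this task.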

# Dataloader
Read data from files and set up training, validation, and testing sets. You do not need to modify this part.

# Start training!

In [None]:
model = My_Model(input_dim=x_train.shape[1]).to(device) # put your model and data on the same computation device.
trainer(train_loader, valid_loader, model, config, device)

Epoch [1/5000]: 100%|██████████| 10/10 [00:03<00:00,  2.67it/s, loss=312]


Epoch [1/5000]: Train loss: 383.3497, Valid loss: 376.6558
Saving model with loss 376.656...


Epoch [2/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.02it/s, loss=317]


Epoch [2/5000]: Train loss: 383.2822, Valid loss: 399.6446


Epoch [3/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.02it/s, loss=458]


Epoch [3/5000]: Train loss: 391.3567, Valid loss: 389.9184


Epoch [4/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.06it/s, loss=381]


Epoch [4/5000]: Train loss: 386.4864, Valid loss: 402.2886


Epoch [5/5000]: 100%|██████████| 10/10 [00:00<00:00, 212.91it/s, loss=384]


Epoch [5/5000]: Train loss: 386.3413, Valid loss: 384.9726


Epoch [6/5000]: 100%|██████████| 10/10 [00:00<00:00, 292.76it/s, loss=393]


Epoch [6/5000]: Train loss: 386.5272, Valid loss: 394.2755


Epoch [7/5000]: 100%|██████████| 10/10 [00:00<00:00, 215.44it/s, loss=458]


Epoch [7/5000]: Train loss: 390.0890, Valid loss: 393.4238


Epoch [8/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.29it/s, loss=384]


Epoch [8/5000]: Train loss: 385.3380, Valid loss: 389.9320


Epoch [9/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.63it/s, loss=426]


Epoch [9/5000]: Train loss: 387.5667, Valid loss: 380.7877


Epoch [10/5000]: 100%|██████████| 10/10 [00:00<00:00, 202.17it/s, loss=390]


Epoch [10/5000]: Train loss: 385.0837, Valid loss: 397.5418


Epoch [11/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.03it/s, loss=429]


Epoch [11/5000]: Train loss: 387.1082, Valid loss: 373.2898
Saving model with loss 373.290...


Epoch [12/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=389]


Epoch [12/5000]: Train loss: 384.3653, Valid loss: 378.9699


Epoch [13/5000]: 100%|██████████| 10/10 [00:00<00:00, 302.16it/s, loss=427]


Epoch [13/5000]: Train loss: 386.3131, Valid loss: 382.5679


Epoch [14/5000]: 100%|██████████| 10/10 [00:00<00:00, 262.61it/s, loss=449]


Epoch [14/5000]: Train loss: 387.2889, Valid loss: 389.3155


Epoch [15/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.93it/s, loss=324]


Epoch [15/5000]: Train loss: 379.6003, Valid loss: 382.7310


Epoch [16/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.71it/s, loss=355]


Epoch [16/5000]: Train loss: 381.0709, Valid loss: 388.1866


Epoch [17/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.10it/s, loss=304]


Epoch [17/5000]: Train loss: 377.7501, Valid loss: 381.4165


Epoch [18/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.11it/s, loss=367]


Epoch [18/5000]: Train loss: 381.1838, Valid loss: 396.6018


Epoch [19/5000]: 100%|██████████| 10/10 [00:00<00:00, 308.34it/s, loss=296]


Epoch [19/5000]: Train loss: 376.6424, Valid loss: 384.2643


Epoch [20/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.04it/s, loss=420]


Epoch [20/5000]: Train loss: 383.7232, Valid loss: 382.7307


Epoch [21/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.60it/s, loss=374]


Epoch [21/5000]: Train loss: 380.6776, Valid loss: 383.9329


Epoch [22/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.82it/s, loss=418]


Epoch [22/5000]: Train loss: 382.9220, Valid loss: 366.5770
Saving model with loss 366.577...


Epoch [23/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.20it/s, loss=367]


Epoch [23/5000]: Train loss: 379.6095, Valid loss: 370.2920


Epoch [24/5000]: 100%|██████████| 10/10 [00:00<00:00, 249.36it/s, loss=386]


Epoch [24/5000]: Train loss: 380.4464, Valid loss: 383.8401


Epoch [25/5000]: 100%|██████████| 10/10 [00:00<00:00, 245.88it/s, loss=365]


Epoch [25/5000]: Train loss: 378.8775, Valid loss: 380.5204


Epoch [26/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.72it/s, loss=320]


Epoch [26/5000]: Train loss: 375.8696, Valid loss: 388.5535


Epoch [27/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.35it/s, loss=381]


Epoch [27/5000]: Train loss: 379.2037, Valid loss: 374.0441


Epoch [28/5000]: 100%|██████████| 10/10 [00:00<00:00, 275.48it/s, loss=407]


Epoch [28/5000]: Train loss: 380.3872, Valid loss: 381.8938


Epoch [29/5000]: 100%|██████████| 10/10 [00:00<00:00, 237.34it/s, loss=490]


Epoch [29/5000]: Train loss: 385.0009, Valid loss: 373.4389


Epoch [30/5000]: 100%|██████████| 10/10 [00:00<00:00, 185.59it/s, loss=402]


Epoch [30/5000]: Train loss: 379.4925, Valid loss: 379.4751


Epoch [31/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.92it/s, loss=375]


Epoch [31/5000]: Train loss: 377.5716, Valid loss: 361.4354
Saving model with loss 361.435...


Epoch [32/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.21it/s, loss=461]


Epoch [32/5000]: Train loss: 382.3510, Valid loss: 368.5357


Epoch [33/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.64it/s, loss=407]


Epoch [33/5000]: Train loss: 378.8229, Valid loss: 389.5262


Epoch [34/5000]: 100%|██████████| 10/10 [00:00<00:00, 70.32it/s, loss=403]


Epoch [34/5000]: Train loss: 378.3030, Valid loss: 388.3046


Epoch [35/5000]: 100%|██████████| 10/10 [00:00<00:00, 301.35it/s, loss=262]


Epoch [35/5000]: Train loss: 369.6217, Valid loss: 389.6921


Epoch [36/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.38it/s, loss=393]


Epoch [36/5000]: Train loss: 377.0825, Valid loss: 372.1512


Epoch [37/5000]: 100%|██████████| 10/10 [00:00<00:00, 319.68it/s, loss=336]


Epoch [37/5000]: Train loss: 373.4066, Valid loss: 376.4251


Epoch [38/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.81it/s, loss=390]


Epoch [38/5000]: Train loss: 376.2586, Valid loss: 377.5220


Epoch [39/5000]: 100%|██████████| 10/10 [00:00<00:00, 255.70it/s, loss=441]


Epoch [39/5000]: Train loss: 378.9932, Valid loss: 379.9924


Epoch [40/5000]: 100%|██████████| 10/10 [00:00<00:00, 292.66it/s, loss=330]


Epoch [40/5000]: Train loss: 372.0816, Valid loss: 392.2930


Epoch [41/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.38it/s, loss=407]


Epoch [41/5000]: Train loss: 376.3860, Valid loss: 369.2705


Epoch [42/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.22it/s, loss=435]


Epoch [42/5000]: Train loss: 377.6968, Valid loss: 393.6333


Epoch [43/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.70it/s, loss=313]


Epoch [43/5000]: Train loss: 370.1356, Valid loss: 389.1812


Epoch [44/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.51it/s, loss=452]


Epoch [44/5000]: Train loss: 378.1296, Valid loss: 368.0308


Epoch [45/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.18it/s, loss=363]


Epoch [45/5000]: Train loss: 372.4947, Valid loss: 392.4685


Epoch [46/5000]: 100%|██████████| 10/10 [00:00<00:00, 241.74it/s, loss=356]


Epoch [46/5000]: Train loss: 371.7697, Valid loss: 392.3937


Epoch [47/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.11it/s, loss=421]


Epoch [47/5000]: Train loss: 375.3687, Valid loss: 386.0233


Epoch [48/5000]: 100%|██████████| 10/10 [00:00<00:00, 276.01it/s, loss=344]


Epoch [48/5000]: Train loss: 370.4892, Valid loss: 369.0329


Epoch [49/5000]: 100%|██████████| 10/10 [00:00<00:00, 261.79it/s, loss=349]


Epoch [49/5000]: Train loss: 370.4634, Valid loss: 375.3754


Epoch [50/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.35it/s, loss=349]


Epoch [50/5000]: Train loss: 370.1350, Valid loss: 387.2258


Epoch [51/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.44it/s, loss=389]


Epoch [51/5000]: Train loss: 372.2363, Valid loss: 378.1797


Epoch [52/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.20it/s, loss=382]


Epoch [52/5000]: Train loss: 371.5001, Valid loss: 373.4012


Epoch [53/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.64it/s, loss=342]


Epoch [53/5000]: Train loss: 368.8315, Valid loss: 362.1770


Epoch [54/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.27it/s, loss=328]


Epoch [54/5000]: Train loss: 367.6565, Valid loss: 390.7021


Epoch [55/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.12it/s, loss=363]


Epoch [55/5000]: Train loss: 369.4151, Valid loss: 363.6025


Epoch [56/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.41it/s, loss=386]


Epoch [56/5000]: Train loss: 370.4853, Valid loss: 359.1085
Saving model with loss 359.109...


Epoch [57/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.32it/s, loss=447]


Epoch [57/5000]: Train loss: 373.8257, Valid loss: 374.5177


Epoch [58/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.77it/s, loss=479]


Epoch [58/5000]: Train loss: 375.4138, Valid loss: 379.5147


Epoch [59/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.14it/s, loss=341]


Epoch [59/5000]: Train loss: 366.9044, Valid loss: 394.5166


Epoch [60/5000]: 100%|██████████| 10/10 [00:00<00:00, 354.79it/s, loss=311]


Epoch [60/5000]: Train loss: 364.8379, Valid loss: 369.0917


Epoch [61/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.60it/s, loss=280]


Epoch [61/5000]: Train loss: 362.6979, Valid loss: 371.5320


Epoch [62/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.24it/s, loss=374]


Epoch [62/5000]: Train loss: 367.9672, Valid loss: 360.2490


Epoch [63/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.53it/s, loss=374]


Epoch [63/5000]: Train loss: 367.6545, Valid loss: 373.1875


Epoch [64/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.84it/s, loss=409]


Epoch [64/5000]: Train loss: 369.4506, Valid loss: 372.8156


Epoch [65/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.18it/s, loss=338]


Epoch [65/5000]: Train loss: 364.8899, Valid loss: 377.0067


Epoch [66/5000]: 100%|██████████| 10/10 [00:00<00:00, 261.02it/s, loss=361]


Epoch [66/5000]: Train loss: 365.9787, Valid loss: 364.0429


Epoch [67/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.40it/s, loss=458]


Epoch [67/5000]: Train loss: 371.4257, Valid loss: 372.3297


Epoch [68/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.24it/s, loss=336]


Epoch [68/5000]: Train loss: 363.8857, Valid loss: 380.3225


Epoch [69/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.64it/s, loss=414]


Epoch [69/5000]: Train loss: 368.2327, Valid loss: 356.0246
Saving model with loss 356.025...


Epoch [70/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.79it/s, loss=334]


Epoch [70/5000]: Train loss: 363.1904, Valid loss: 359.3259


Epoch [71/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.26it/s, loss=341]


Epoch [71/5000]: Train loss: 363.2686, Valid loss: 367.1188


Epoch [72/5000]: 100%|██████████| 10/10 [00:00<00:00, 260.55it/s, loss=396]


Epoch [72/5000]: Train loss: 366.2377, Valid loss: 363.1954


Epoch [73/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.29it/s, loss=391]


Epoch [73/5000]: Train loss: 365.6274, Valid loss: 360.2153


Epoch [74/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.44it/s, loss=415]


Epoch [74/5000]: Train loss: 366.7405, Valid loss: 362.0332


Epoch [75/5000]: 100%|██████████| 10/10 [00:00<00:00, 231.01it/s, loss=309]


Epoch [75/5000]: Train loss: 360.1852, Valid loss: 383.3637


Epoch [76/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.35it/s, loss=407]


Epoch [76/5000]: Train loss: 365.7134, Valid loss: 358.9988


Epoch [77/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.99it/s, loss=390]


Epoch [77/5000]: Train loss: 364.3681, Valid loss: 350.6112
Saving model with loss 350.611...


Epoch [78/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.42it/s, loss=364]


Epoch [78/5000]: Train loss: 362.5625, Valid loss: 366.9027


Epoch [79/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.78it/s, loss=419]


Epoch [79/5000]: Train loss: 365.4772, Valid loss: 368.2704


Epoch [80/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.99it/s, loss=334]


Epoch [80/5000]: Train loss: 360.1698, Valid loss: 361.9412


Epoch [81/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.67it/s, loss=345]


Epoch [81/5000]: Train loss: 360.4972, Valid loss: 363.8802


Epoch [82/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.24it/s, loss=377]


Epoch [82/5000]: Train loss: 362.1040, Valid loss: 372.7327


Epoch [83/5000]: 100%|██████████| 10/10 [00:00<00:00, 241.63it/s, loss=338]


Epoch [83/5000]: Train loss: 359.4860, Valid loss: 363.8286


Epoch [84/5000]: 100%|██████████| 10/10 [00:00<00:00, 249.84it/s, loss=464]


Epoch [84/5000]: Train loss: 366.6888, Valid loss: 360.9469


Epoch [85/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.67it/s, loss=354]


Epoch [85/5000]: Train loss: 359.8693, Valid loss: 353.4858


Epoch [86/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.20it/s, loss=327]


Epoch [86/5000]: Train loss: 357.9309, Valid loss: 360.5047


Epoch [87/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.02it/s, loss=340]


Epoch [87/5000]: Train loss: 358.4066, Valid loss: 351.5207


Epoch [88/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.74it/s, loss=370]


Epoch [88/5000]: Train loss: 359.8790, Valid loss: 365.2755


Epoch [89/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.66it/s, loss=407]


Epoch [89/5000]: Train loss: 361.7916, Valid loss: 366.8937


Epoch [90/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.62it/s, loss=317]


Epoch [90/5000]: Train loss: 356.1542, Valid loss: 373.2526


Epoch [91/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.61it/s, loss=362]


Epoch [91/5000]: Train loss: 358.5424, Valid loss: 358.5810


Epoch [92/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.30it/s, loss=344]


Epoch [92/5000]: Train loss: 357.1877, Valid loss: 343.7834
Saving model with loss 343.783...


Epoch [93/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.32it/s, loss=323]


Epoch [93/5000]: Train loss: 355.6307, Valid loss: 352.7290


Epoch [94/5000]: 100%|██████████| 10/10 [00:00<00:00, 385.62it/s, loss=430]


Epoch [94/5000]: Train loss: 361.7033, Valid loss: 356.7037


Epoch [95/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.52it/s, loss=347]


Epoch [95/5000]: Train loss: 356.4485, Valid loss: 355.2186


Epoch [96/5000]: 100%|██████████| 10/10 [00:00<00:00, 329.28it/s, loss=389]


Epoch [96/5000]: Train loss: 358.6484, Valid loss: 369.4070


Epoch [97/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.58it/s, loss=340]


Epoch [97/5000]: Train loss: 355.4351, Valid loss: 363.8490


Epoch [98/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.46it/s, loss=349]


Epoch [98/5000]: Train loss: 355.6876, Valid loss: 380.7497


Epoch [99/5000]: 100%|██████████| 10/10 [00:00<00:00, 340.11it/s, loss=459]


Epoch [99/5000]: Train loss: 361.9194, Valid loss: 351.3710


Epoch [100/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.51it/s, loss=395]


Epoch [100/5000]: Train loss: 357.8051, Valid loss: 358.0236


Epoch [101/5000]: 100%|██████████| 10/10 [00:00<00:00, 340.06it/s, loss=367]


Epoch [101/5000]: Train loss: 355.8606, Valid loss: 356.5120


Epoch [102/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.24it/s, loss=343]


Epoch [102/5000]: Train loss: 354.1784, Valid loss: 360.7235


Epoch [103/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.57it/s, loss=411]


Epoch [103/5000]: Train loss: 357.9055, Valid loss: 346.5036


Epoch [104/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.28it/s, loss=359]


Epoch [104/5000]: Train loss: 354.5340, Valid loss: 362.8422


Epoch [105/5000]: 100%|██████████| 10/10 [00:00<00:00, 347.22it/s, loss=338]


Epoch [105/5000]: Train loss: 352.9837, Valid loss: 362.4549


Epoch [106/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.23it/s, loss=401]


Epoch [106/5000]: Train loss: 356.4221, Valid loss: 361.8116


Epoch [107/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.75it/s, loss=379]


Epoch [107/5000]: Train loss: 354.8262, Valid loss: 355.4485


Epoch [108/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.92it/s, loss=373]


Epoch [108/5000]: Train loss: 354.1892, Valid loss: 342.4162
Saving model with loss 342.416...


Epoch [109/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.41it/s, loss=274]


Epoch [109/5000]: Train loss: 348.0193, Valid loss: 360.3957


Epoch [110/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.76it/s, loss=364]


Epoch [110/5000]: Train loss: 353.0559, Valid loss: 343.2833


Epoch [111/5000]: 100%|██████████| 10/10 [00:00<00:00, 351.66it/s, loss=339]


Epoch [111/5000]: Train loss: 351.2851, Valid loss: 352.6570


Epoch [112/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.30it/s, loss=339]


Epoch [112/5000]: Train loss: 350.9842, Valid loss: 359.9644


Epoch [113/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.10it/s, loss=400]


Epoch [113/5000]: Train loss: 354.2943, Valid loss: 340.4182
Saving model with loss 340.418...


Epoch [114/5000]: 100%|██████████| 10/10 [00:00<00:00, 340.11it/s, loss=314]


Epoch [114/5000]: Train loss: 348.9216, Valid loss: 347.3161


Epoch [115/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.92it/s, loss=342]


Epoch [115/5000]: Train loss: 350.3103, Valid loss: 354.6696


Epoch [116/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.71it/s, loss=432]


Epoch [116/5000]: Train loss: 355.3236, Valid loss: 368.0235


Epoch [117/5000]: 100%|██████████| 10/10 [00:00<00:00, 351.94it/s, loss=346]


Epoch [117/5000]: Train loss: 349.9610, Valid loss: 361.5738


Epoch [118/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.12it/s, loss=318]


Epoch [118/5000]: Train loss: 347.9878, Valid loss: 346.8391


Epoch [119/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.56it/s, loss=287]


Epoch [119/5000]: Train loss: 345.8632, Valid loss: 352.8250


Epoch [120/5000]: 100%|██████████| 10/10 [00:00<00:00, 88.08it/s, loss=352]


Epoch [120/5000]: Train loss: 349.4621, Valid loss: 349.1163


Epoch [121/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=334]


Epoch [121/5000]: Train loss: 348.0485, Valid loss: 354.3440


Epoch [122/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.11it/s, loss=363]


Epoch [122/5000]: Train loss: 349.5247, Valid loss: 353.2297


Epoch [123/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.98it/s, loss=325]


Epoch [123/5000]: Train loss: 346.9353, Valid loss: 349.5632


Epoch [124/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.12it/s, loss=362]


Epoch [124/5000]: Train loss: 348.8406, Valid loss: 360.0482


Epoch [125/5000]: 100%|██████████| 10/10 [00:00<00:00, 351.70it/s, loss=303]


Epoch [125/5000]: Train loss: 345.0790, Valid loss: 356.1332


Epoch [126/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.65it/s, loss=398]


Epoch [126/5000]: Train loss: 350.4457, Valid loss: 343.7810


Epoch [127/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.68it/s, loss=356]


Epoch [127/5000]: Train loss: 347.6714, Valid loss: 352.6531


Epoch [128/5000]: 100%|██████████| 10/10 [00:00<00:00, 364.38it/s, loss=356]


Epoch [128/5000]: Train loss: 347.3406, Valid loss: 347.7947


Epoch [129/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.99it/s, loss=392]


Epoch [129/5000]: Train loss: 349.2336, Valid loss: 354.0813


Epoch [130/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.44it/s, loss=358]


Epoch [130/5000]: Train loss: 346.9218, Valid loss: 343.6390


Epoch [131/5000]: 100%|██████████| 10/10 [00:00<00:00, 355.87it/s, loss=354]


Epoch [131/5000]: Train loss: 346.3627, Valid loss: 344.9606


Epoch [132/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.53it/s, loss=367]


Epoch [132/5000]: Train loss: 346.8549, Valid loss: 351.8972


Epoch [133/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.48it/s, loss=320]


Epoch [133/5000]: Train loss: 343.8109, Valid loss: 335.9709
Saving model with loss 335.971...


Epoch [134/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.93it/s, loss=353]


Epoch [134/5000]: Train loss: 345.4393, Valid loss: 357.3481


Epoch [135/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.10it/s, loss=281]


Epoch [135/5000]: Train loss: 340.8859, Valid loss: 336.1461


Epoch [136/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.35it/s, loss=348]


Epoch [136/5000]: Train loss: 344.5965, Valid loss: 352.1670


Epoch [137/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.67it/s, loss=350]


Epoch [137/5000]: Train loss: 344.4097, Valid loss: 344.1339


Epoch [138/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.33it/s, loss=347]


Epoch [138/5000]: Train loss: 343.9678, Valid loss: 352.8519


Epoch [139/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.78it/s, loss=313]


Epoch [139/5000]: Train loss: 341.6420, Valid loss: 337.9616


Epoch [140/5000]: 100%|██████████| 10/10 [00:00<00:00, 364.33it/s, loss=330]


Epoch [140/5000]: Train loss: 342.3677, Valid loss: 329.8398
Saving model with loss 329.840...


Epoch [141/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.31it/s, loss=340]


Epoch [141/5000]: Train loss: 342.6855, Valid loss: 353.4670


Epoch [142/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.49it/s, loss=392]


Epoch [142/5000]: Train loss: 345.4865, Valid loss: 358.4775


Epoch [143/5000]: 100%|██████████| 10/10 [00:00<00:00, 356.71it/s, loss=386]


Epoch [143/5000]: Train loss: 344.8718, Valid loss: 348.2481


Epoch [144/5000]: 100%|██████████| 10/10 [00:00<00:00, 357.04it/s, loss=300]


Epoch [144/5000]: Train loss: 339.4505, Valid loss: 341.2921


Epoch [145/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.55it/s, loss=275]


Epoch [145/5000]: Train loss: 337.7154, Valid loss: 350.0662


Epoch [146/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.09it/s, loss=320]


Epoch [146/5000]: Train loss: 340.0871, Valid loss: 337.4432


Epoch [147/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.88it/s, loss=334]


Epoch [147/5000]: Train loss: 340.6403, Valid loss: 340.0579


Epoch [148/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.30it/s, loss=388]


Epoch [148/5000]: Train loss: 343.5605, Valid loss: 353.5679


Epoch [149/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.75it/s, loss=330]


Epoch [149/5000]: Train loss: 339.8235, Valid loss: 340.9479


Epoch [150/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.55it/s, loss=358]


Epoch [150/5000]: Train loss: 341.2092, Valid loss: 335.5438


Epoch [151/5000]: 100%|██████████| 10/10 [00:00<00:00, 356.83it/s, loss=295]


Epoch [151/5000]: Train loss: 337.1590, Valid loss: 359.8557


Epoch [152/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.64it/s, loss=339]


Epoch [152/5000]: Train loss: 339.5220, Valid loss: 342.9696


Epoch [153/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.53it/s, loss=361]


Epoch [153/5000]: Train loss: 340.5466, Valid loss: 340.6034


Epoch [154/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.63it/s, loss=357]


Epoch [154/5000]: Train loss: 339.9992, Valid loss: 330.6166


Epoch [155/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.84it/s, loss=332]


Epoch [155/5000]: Train loss: 338.2305, Valid loss: 344.9365


Epoch [156/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.32it/s, loss=356]


Epoch [156/5000]: Train loss: 339.3609, Valid loss: 347.0977


Epoch [157/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.14it/s, loss=293]


Epoch [157/5000]: Train loss: 335.3867, Valid loss: 337.7714


Epoch [158/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.48it/s, loss=356]


Epoch [158/5000]: Train loss: 338.8131, Valid loss: 337.8035


Epoch [159/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.39it/s, loss=324]


Epoch [159/5000]: Train loss: 336.6285, Valid loss: 319.5141
Saving model with loss 319.514...


Epoch [160/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.76it/s, loss=332]


Epoch [160/5000]: Train loss: 336.8236, Valid loss: 348.9071


Epoch [161/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.09it/s, loss=379]


Epoch [161/5000]: Train loss: 339.3769, Valid loss: 338.6786


Epoch [162/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.76it/s, loss=372]


Epoch [162/5000]: Train loss: 338.6399, Valid loss: 338.9555


Epoch [163/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.50it/s, loss=312]


Epoch [163/5000]: Train loss: 334.8141, Valid loss: 327.1353


Epoch [164/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.42it/s, loss=351]


Epoch [164/5000]: Train loss: 336.8339, Valid loss: 341.3180


Epoch [165/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.96it/s, loss=320]


Epoch [165/5000]: Train loss: 334.7427, Valid loss: 342.9595


Epoch [166/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.54it/s, loss=293]


Epoch [166/5000]: Train loss: 332.8337, Valid loss: 335.7440


Epoch [167/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.75it/s, loss=353]


Epoch [167/5000]: Train loss: 336.1406, Valid loss: 354.0524


Epoch [168/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.38it/s, loss=299]


Epoch [168/5000]: Train loss: 332.6575, Valid loss: 346.3041


Epoch [169/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.47it/s, loss=395]


Epoch [169/5000]: Train loss: 338.0932, Valid loss: 330.3525


Epoch [170/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.08it/s, loss=326]


Epoch [170/5000]: Train loss: 333.6939, Valid loss: 329.7732


Epoch [171/5000]: 100%|██████████| 10/10 [00:00<00:00, 309.88it/s, loss=338]


Epoch [171/5000]: Train loss: 334.1305, Valid loss: 322.3678


Epoch [172/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.33it/s, loss=290]


Epoch [172/5000]: Train loss: 331.0184, Valid loss: 335.2805


Epoch [173/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.18it/s, loss=363]


Epoch [173/5000]: Train loss: 335.0801, Valid loss: 333.0873


Epoch [174/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.09it/s, loss=360]


Epoch [174/5000]: Train loss: 334.5829, Valid loss: 346.8747


Epoch [175/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.04it/s, loss=393]


Epoch [175/5000]: Train loss: 336.2764, Valid loss: 344.8821


Epoch [176/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.04it/s, loss=324]


Epoch [176/5000]: Train loss: 331.8924, Valid loss: 329.9054


Epoch [177/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.98it/s, loss=382]


Epoch [177/5000]: Train loss: 335.0822, Valid loss: 340.6321


Epoch [178/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.27it/s, loss=336]


Epoch [178/5000]: Train loss: 332.0582, Valid loss: 335.6771


Epoch [179/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.14it/s, loss=299]


Epoch [179/5000]: Train loss: 329.5860, Valid loss: 330.6571


Epoch [180/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.39it/s, loss=301]


Epoch [180/5000]: Train loss: 329.4463, Valid loss: 328.3264


Epoch [181/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.64it/s, loss=311]


Epoch [181/5000]: Train loss: 329.7444, Valid loss: 335.0110


Epoch [182/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.10it/s, loss=415]


Epoch [182/5000]: Train loss: 335.6371, Valid loss: 328.8649


Epoch [183/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.74it/s, loss=363]


Epoch [183/5000]: Train loss: 332.2680, Valid loss: 341.5683


Epoch [184/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.85it/s, loss=378]


Epoch [184/5000]: Train loss: 332.8920, Valid loss: 333.8140


Epoch [185/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.48it/s, loss=397]


Epoch [185/5000]: Train loss: 333.7536, Valid loss: 337.9960


Epoch [186/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.31it/s, loss=276]


Epoch [186/5000]: Train loss: 326.2797, Valid loss: 328.1822


Epoch [187/5000]: 100%|██████████| 10/10 [00:00<00:00, 349.55it/s, loss=314]


Epoch [187/5000]: Train loss: 328.2964, Valid loss: 346.3702


Epoch [188/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.00it/s, loss=300]


Epoch [188/5000]: Train loss: 327.1829, Valid loss: 320.8061


Epoch [189/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.44it/s, loss=307]


Epoch [189/5000]: Train loss: 327.3313, Valid loss: 345.6728


Epoch [190/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.64it/s, loss=306]


Epoch [190/5000]: Train loss: 326.9652, Valid loss: 323.7768


Epoch [191/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.46it/s, loss=299]


Epoch [191/5000]: Train loss: 326.2952, Valid loss: 325.5836


Epoch [192/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.20it/s, loss=371]


Epoch [192/5000]: Train loss: 330.3103, Valid loss: 332.5745


Epoch [193/5000]: 100%|██████████| 10/10 [00:00<00:00, 361.33it/s, loss=314]


Epoch [193/5000]: Train loss: 326.6483, Valid loss: 327.4669


Epoch [194/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.36it/s, loss=355]


Epoch [194/5000]: Train loss: 328.8147, Valid loss: 329.2232


Epoch [195/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.54it/s, loss=372]


Epoch [195/5000]: Train loss: 329.5299, Valid loss: 333.4960


Epoch [196/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.82it/s, loss=395]


Epoch [196/5000]: Train loss: 330.6118, Valid loss: 334.8170


Epoch [197/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.02it/s, loss=283]


Epoch [197/5000]: Train loss: 323.7186, Valid loss: 310.8869
Saving model with loss 310.887...


Epoch [198/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.78it/s, loss=275]


Epoch [198/5000]: Train loss: 322.9728, Valid loss: 325.4134


Epoch [199/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.08it/s, loss=411]


Epoch [199/5000]: Train loss: 330.7623, Valid loss: 325.2969


Epoch [200/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.11it/s, loss=331]


Epoch [200/5000]: Train loss: 325.7496, Valid loss: 334.3189


Epoch [201/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.96it/s, loss=360]


Epoch [201/5000]: Train loss: 327.1653, Valid loss: 343.3436


Epoch [202/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.66it/s, loss=315]


Epoch [202/5000]: Train loss: 324.2146, Valid loss: 336.9509


Epoch [203/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.61it/s, loss=317]


Epoch [203/5000]: Train loss: 324.0697, Valid loss: 332.3524


Epoch [204/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.33it/s, loss=242]


Epoch [204/5000]: Train loss: 319.3296, Valid loss: 317.4128


Epoch [205/5000]: 100%|██████████| 10/10 [00:00<00:00, 339.70it/s, loss=307]


Epoch [205/5000]: Train loss: 322.9341, Valid loss: 308.5706
Saving model with loss 308.571...


Epoch [206/5000]: 100%|██████████| 10/10 [00:00<00:00, 86.09it/s, loss=276]


Epoch [206/5000]: Train loss: 320.8286, Valid loss: 324.4318


Epoch [207/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.53it/s, loss=370]


Epoch [207/5000]: Train loss: 326.1645, Valid loss: 325.8155


Epoch [208/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.19it/s, loss=327]


Epoch [208/5000]: Train loss: 323.3044, Valid loss: 327.4812


Epoch [209/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.19it/s, loss=262]


Epoch [209/5000]: Train loss: 319.2233, Valid loss: 322.1879


Epoch [210/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.68it/s, loss=334]


Epoch [210/5000]: Train loss: 323.2105, Valid loss: 340.4641


Epoch [211/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=334]


Epoch [211/5000]: Train loss: 322.9174, Valid loss: 310.0868


Epoch [212/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.05it/s, loss=276]


Epoch [212/5000]: Train loss: 319.2092, Valid loss: 318.0659


Epoch [213/5000]: 100%|██████████| 10/10 [00:00<00:00, 308.08it/s, loss=333]


Epoch [213/5000]: Train loss: 322.3505, Valid loss: 313.0043


Epoch [214/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.79it/s, loss=313]


Epoch [214/5000]: Train loss: 320.8670, Valid loss: 320.9800


Epoch [215/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.01it/s, loss=283]


Epoch [215/5000]: Train loss: 318.8459, Valid loss: 332.7319


Epoch [216/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=251]


Epoch [216/5000]: Train loss: 316.6625, Valid loss: 323.6412


Epoch [217/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.02it/s, loss=335]


Epoch [217/5000]: Train loss: 321.3661, Valid loss: 322.9315


Epoch [218/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.44it/s, loss=271]


Epoch [218/5000]: Train loss: 317.3004, Valid loss: 329.0780


Epoch [219/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.04it/s, loss=374]


Epoch [219/5000]: Train loss: 323.1739, Valid loss: 323.0629


Epoch [220/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.36it/s, loss=314]


Epoch [220/5000]: Train loss: 319.3416, Valid loss: 323.8104


Epoch [221/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.04it/s, loss=317]


Epoch [221/5000]: Train loss: 319.2268, Valid loss: 315.6566


Epoch [222/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.05it/s, loss=342]


Epoch [222/5000]: Train loss: 320.4452, Valid loss: 352.3267


Epoch [223/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.71it/s, loss=331]


Epoch [223/5000]: Train loss: 319.5507, Valid loss: 311.3776


Epoch [224/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.46it/s, loss=338]


Epoch [224/5000]: Train loss: 319.7081, Valid loss: 322.2794


Epoch [225/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.60it/s, loss=308]


Epoch [225/5000]: Train loss: 317.6158, Valid loss: 327.5812


Epoch [226/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.55it/s, loss=347]


Epoch [226/5000]: Train loss: 319.7116, Valid loss: 325.5103


Epoch [227/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.69it/s, loss=353]


Epoch [227/5000]: Train loss: 319.7622, Valid loss: 327.1315


Epoch [228/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.29it/s, loss=317]


Epoch [228/5000]: Train loss: 317.4025, Valid loss: 327.7579


Epoch [229/5000]: 100%|██████████| 10/10 [00:00<00:00, 268.17it/s, loss=325]


Epoch [229/5000]: Train loss: 317.5616, Valid loss: 305.8078
Saving model with loss 305.808...


Epoch [230/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.03it/s, loss=302]


Epoch [230/5000]: Train loss: 315.9706, Valid loss: 306.9286


Epoch [231/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.04it/s, loss=334]


Epoch [231/5000]: Train loss: 317.5768, Valid loss: 310.6856


Epoch [232/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.13it/s, loss=240]


Epoch [232/5000]: Train loss: 311.7152, Valid loss: 320.1844


Epoch [233/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.63it/s, loss=320]


Epoch [233/5000]: Train loss: 316.2123, Valid loss: 321.5645


Epoch [234/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.65it/s, loss=280]


Epoch [234/5000]: Train loss: 313.6003, Valid loss: 322.8732


Epoch [235/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.36it/s, loss=371]


Epoch [235/5000]: Train loss: 318.7374, Valid loss: 317.3734


Epoch [236/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.10it/s, loss=319]


Epoch [236/5000]: Train loss: 315.3651, Valid loss: 316.7733


Epoch [237/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.59it/s, loss=303]


Epoch [237/5000]: Train loss: 314.1517, Valid loss: 315.9869


Epoch [238/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.04it/s, loss=342]


Epoch [238/5000]: Train loss: 316.1828, Valid loss: 303.3604
Saving model with loss 303.360...


Epoch [239/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.61it/s, loss=347]


Epoch [239/5000]: Train loss: 316.2294, Valid loss: 315.2527


Epoch [240/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.53it/s, loss=287]


Epoch [240/5000]: Train loss: 312.3854, Valid loss: 322.6056


Epoch [241/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.88it/s, loss=280]


Epoch [241/5000]: Train loss: 311.7375, Valid loss: 319.1156


Epoch [242/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.14it/s, loss=322]


Epoch [242/5000]: Train loss: 313.9485, Valid loss: 313.1327


Epoch [243/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.09it/s, loss=348]


Epoch [243/5000]: Train loss: 315.2218, Valid loss: 307.2828


Epoch [244/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.41it/s, loss=258]


Epoch [244/5000]: Train loss: 309.6664, Valid loss: 320.4822


Epoch [245/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.78it/s, loss=232]


Epoch [245/5000]: Train loss: 307.8215, Valid loss: 325.0313


Epoch [246/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.31it/s, loss=331]


Epoch [246/5000]: Train loss: 313.4293, Valid loss: 298.5134
Saving model with loss 298.513...


Epoch [247/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.63it/s, loss=319]


Epoch [247/5000]: Train loss: 312.4911, Valid loss: 315.4267


Epoch [248/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.75it/s, loss=332]


Epoch [248/5000]: Train loss: 312.9605, Valid loss: 310.8431


Epoch [249/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.15it/s, loss=262]


Epoch [249/5000]: Train loss: 308.5575, Valid loss: 309.8115


Epoch [250/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=333]


Epoch [250/5000]: Train loss: 312.5416, Valid loss: 294.7376
Saving model with loss 294.738...


Epoch [251/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.24it/s, loss=287]


Epoch [251/5000]: Train loss: 309.5327, Valid loss: 303.2691


Epoch [252/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.21it/s, loss=276]


Epoch [252/5000]: Train loss: 308.6388, Valid loss: 316.0752


Epoch [253/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.02it/s, loss=327]


Epoch [253/5000]: Train loss: 311.3652, Valid loss: 311.7107


Epoch [254/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.30it/s, loss=298]


Epoch [254/5000]: Train loss: 309.3895, Valid loss: 314.6273


Epoch [255/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.80it/s, loss=323]


Epoch [255/5000]: Train loss: 310.6374, Valid loss: 320.9039


Epoch [256/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.62it/s, loss=270]


Epoch [256/5000]: Train loss: 307.2346, Valid loss: 320.3586


Epoch [257/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.03it/s, loss=353]


Epoch [257/5000]: Train loss: 311.8831, Valid loss: 310.2250


Epoch [258/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.91it/s, loss=316]


Epoch [258/5000]: Train loss: 309.4359, Valid loss: 311.7204


Epoch [259/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.77it/s, loss=296]


Epoch [259/5000]: Train loss: 307.9927, Valid loss: 309.7307


Epoch [260/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.01it/s, loss=274]


Epoch [260/5000]: Train loss: 306.4295, Valid loss: 327.6260


Epoch [261/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.71it/s, loss=365]


Epoch [261/5000]: Train loss: 311.5619, Valid loss: 299.5292


Epoch [262/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.22it/s, loss=228]


Epoch [262/5000]: Train loss: 303.1439, Valid loss: 310.5898


Epoch [263/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.52it/s, loss=302]


Epoch [263/5000]: Train loss: 307.3030, Valid loss: 319.1591


Epoch [264/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.19it/s, loss=280]


Epoch [264/5000]: Train loss: 305.7488, Valid loss: 308.4283


Epoch [265/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.66it/s, loss=319]


Epoch [265/5000]: Train loss: 307.8151, Valid loss: 301.8065


Epoch [266/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.57it/s, loss=335]


Epoch [266/5000]: Train loss: 308.4638, Valid loss: 305.9397


Epoch [267/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.16it/s, loss=316]


Epoch [267/5000]: Train loss: 307.1319, Valid loss: 315.8419


Epoch [268/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.23it/s, loss=333]


Epoch [268/5000]: Train loss: 307.8322, Valid loss: 300.5474


Epoch [269/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.91it/s, loss=305]


Epoch [269/5000]: Train loss: 305.9504, Valid loss: 307.9128


Epoch [270/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.77it/s, loss=330]


Epoch [270/5000]: Train loss: 307.1561, Valid loss: 324.6633


Epoch [271/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.18it/s, loss=323]


Epoch [271/5000]: Train loss: 306.5095, Valid loss: 310.9797


Epoch [272/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.47it/s, loss=337]


Epoch [272/5000]: Train loss: 307.0696, Valid loss: 320.9130


Epoch [273/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.55it/s, loss=358]


Epoch [273/5000]: Train loss: 308.0520, Valid loss: 316.9168


Epoch [274/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.90it/s, loss=310]


Epoch [274/5000]: Train loss: 304.9461, Valid loss: 314.2672


Epoch [275/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.68it/s, loss=320]


Epoch [275/5000]: Train loss: 305.2645, Valid loss: 305.7164


Epoch [276/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.21it/s, loss=304]


Epoch [276/5000]: Train loss: 304.0802, Valid loss: 305.8494


Epoch [277/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.73it/s, loss=298]


Epoch [277/5000]: Train loss: 303.4369, Valid loss: 294.0483
Saving model with loss 294.048...


Epoch [278/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.05it/s, loss=359]


Epoch [278/5000]: Train loss: 306.8397, Valid loss: 296.4722


Epoch [279/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.46it/s, loss=383]


Epoch [279/5000]: Train loss: 308.0021, Valid loss: 305.4527


Epoch [280/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.09it/s, loss=294]


Epoch [280/5000]: Train loss: 302.4716, Valid loss: 284.5960
Saving model with loss 284.596...


Epoch [281/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.87it/s, loss=343]


Epoch [281/5000]: Train loss: 305.0787, Valid loss: 303.9294


Epoch [282/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.10it/s, loss=375]


Epoch [282/5000]: Train loss: 306.7694, Valid loss: 288.8377


Epoch [283/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.91it/s, loss=346]


Epoch [283/5000]: Train loss: 304.7571, Valid loss: 315.8541


Epoch [284/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.41it/s, loss=317]


Epoch [284/5000]: Train loss: 302.7740, Valid loss: 303.8396


Epoch [285/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.71it/s, loss=280]


Epoch [285/5000]: Train loss: 300.3493, Valid loss: 301.6301


Epoch [286/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.87it/s, loss=343]


Epoch [286/5000]: Train loss: 303.8152, Valid loss: 299.3451


Epoch [287/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.55it/s, loss=311]


Epoch [287/5000]: Train loss: 301.6747, Valid loss: 305.0469


Epoch [288/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.25it/s, loss=250]


Epoch [288/5000]: Train loss: 297.7809, Valid loss: 296.2563


Epoch [289/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.48it/s, loss=296]


Epoch [289/5000]: Train loss: 300.2569, Valid loss: 298.2811


Epoch [290/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.20it/s, loss=327]


Epoch [290/5000]: Train loss: 301.8493, Valid loss: 295.5561


Epoch [291/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.44it/s, loss=313]


Epoch [291/5000]: Train loss: 300.7507, Valid loss: 302.1255


Epoch [292/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.14it/s, loss=315]


Epoch [292/5000]: Train loss: 300.6534, Valid loss: 306.6256


Epoch [293/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.95it/s, loss=263]


Epoch [293/5000]: Train loss: 297.3001, Valid loss: 319.9355


Epoch [294/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.73it/s, loss=252]


Epoch [294/5000]: Train loss: 296.3699, Valid loss: 294.9963


Epoch [295/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.28it/s, loss=265]


Epoch [295/5000]: Train loss: 296.9284, Valid loss: 314.9796


Epoch [296/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.51it/s, loss=362]


Epoch [296/5000]: Train loss: 302.4099, Valid loss: 301.7891


Epoch [297/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.85it/s, loss=311]


Epoch [297/5000]: Train loss: 299.1635, Valid loss: 299.2718


Epoch [298/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.37it/s, loss=324]


Epoch [298/5000]: Train loss: 299.6761, Valid loss: 294.5987


Epoch [299/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.61it/s, loss=345]


Epoch [299/5000]: Train loss: 300.6816, Valid loss: 301.1112


Epoch [300/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.21it/s, loss=279]


Epoch [300/5000]: Train loss: 296.5031, Valid loss: 307.0847


Epoch [301/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.83it/s, loss=352]


Epoch [301/5000]: Train loss: 300.5612, Valid loss: 291.6136


Epoch [302/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.08it/s, loss=327]


Epoch [302/5000]: Train loss: 298.8112, Valid loss: 300.0263


Epoch [303/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.06it/s, loss=227]


Epoch [303/5000]: Train loss: 292.6409, Valid loss: 295.6272


Epoch [304/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.19it/s, loss=254]


Epoch [304/5000]: Train loss: 294.0140, Valid loss: 306.9301


Epoch [305/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.59it/s, loss=279]


Epoch [305/5000]: Train loss: 295.2283, Valid loss: 305.8347


Epoch [306/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.62it/s, loss=264]


Epoch [306/5000]: Train loss: 294.0710, Valid loss: 316.5207


Epoch [307/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.13it/s, loss=287]


Epoch [307/5000]: Train loss: 295.2139, Valid loss: 298.2373


Epoch [308/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.62it/s, loss=276]


Epoch [308/5000]: Train loss: 294.2992, Valid loss: 309.9677


Epoch [309/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.55it/s, loss=278]


Epoch [309/5000]: Train loss: 294.1574, Valid loss: 295.2441


Epoch [310/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.63it/s, loss=279]


Epoch [310/5000]: Train loss: 292.8770, Valid loss: 289.1947


Epoch [311/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.13it/s, loss=292]


Epoch [311/5000]: Train loss: 277.0659, Valid loss: 239.9378
Saving model with loss 239.938...


Epoch [312/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.28it/s, loss=102]


Epoch [312/5000]: Train loss: 182.5812, Valid loss: 88.3758
Saving model with loss 88.376...


Epoch [313/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.45it/s, loss=33.7]


Epoch [313/5000]: Train loss: 38.6676, Valid loss: 38.3030
Saving model with loss 38.303...


Epoch [314/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.66it/s, loss=17.4]


Epoch [314/5000]: Train loss: 30.4800, Valid loss: 17.2119
Saving model with loss 17.212...


Epoch [315/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.67it/s, loss=21.2]


Epoch [315/5000]: Train loss: 19.9576, Valid loss: 21.4435


Epoch [316/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.71it/s, loss=19.6]


Epoch [316/5000]: Train loss: 18.2491, Valid loss: 16.7151
Saving model with loss 16.715...


Epoch [317/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=15.9]


Epoch [317/5000]: Train loss: 17.2788, Valid loss: 16.6117
Saving model with loss 16.612...


Epoch [318/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.03it/s, loss=15.9]


Epoch [318/5000]: Train loss: 16.6741, Valid loss: 16.6918


Epoch [319/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=14.3]


Epoch [319/5000]: Train loss: 16.4465, Valid loss: 16.6968


Epoch [320/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.09it/s, loss=15.4]


Epoch [320/5000]: Train loss: 16.3021, Valid loss: 15.7380
Saving model with loss 15.738...


Epoch [321/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.83it/s, loss=11.7]


Epoch [321/5000]: Train loss: 15.9305, Valid loss: 15.8413


Epoch [322/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.77it/s, loss=14.9]


Epoch [322/5000]: Train loss: 15.9663, Valid loss: 16.0177


Epoch [323/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.71it/s, loss=14.4]


Epoch [323/5000]: Train loss: 15.7775, Valid loss: 15.7713


Epoch [324/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.17it/s, loss=14.8]


Epoch [324/5000]: Train loss: 15.6527, Valid loss: 15.5182
Saving model with loss 15.518...


Epoch [325/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.38it/s, loss=15.5]


Epoch [325/5000]: Train loss: 15.5537, Valid loss: 15.2677
Saving model with loss 15.268...


Epoch [326/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.46it/s, loss=15.7]


Epoch [326/5000]: Train loss: 15.4044, Valid loss: 15.2143
Saving model with loss 15.214...


Epoch [327/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.49it/s, loss=17.8]


Epoch [327/5000]: Train loss: 15.3871, Valid loss: 14.4604
Saving model with loss 14.460...


Epoch [328/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.42it/s, loss=15.7]


Epoch [328/5000]: Train loss: 15.1093, Valid loss: 15.2861


Epoch [329/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.08it/s, loss=18]


Epoch [329/5000]: Train loss: 15.0894, Valid loss: 15.0778


Epoch [330/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.56it/s, loss=15.4]


Epoch [330/5000]: Train loss: 14.7894, Valid loss: 14.9235


Epoch [331/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.30it/s, loss=14.7]


Epoch [331/5000]: Train loss: 14.5916, Valid loss: 14.7112


Epoch [332/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.73it/s, loss=11.8]


Epoch [332/5000]: Train loss: 14.2732, Valid loss: 13.9879
Saving model with loss 13.988...


Epoch [333/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.36it/s, loss=12.9]


Epoch [333/5000]: Train loss: 14.1875, Valid loss: 13.5220
Saving model with loss 13.522...


Epoch [334/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.33it/s, loss=14.8]


Epoch [334/5000]: Train loss: 14.1330, Valid loss: 13.7263


Epoch [335/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.40it/s, loss=13]


Epoch [335/5000]: Train loss: 13.8013, Valid loss: 13.5241


Epoch [336/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.33it/s, loss=14.2]


Epoch [336/5000]: Train loss: 13.5711, Valid loss: 13.5742


Epoch [337/5000]: 100%|██████████| 10/10 [00:00<00:00, 292.71it/s, loss=13.2]


Epoch [337/5000]: Train loss: 13.1476, Valid loss: 12.9402
Saving model with loss 12.940...


Epoch [338/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.94it/s, loss=12.5]


Epoch [338/5000]: Train loss: 12.7245, Valid loss: 12.4726
Saving model with loss 12.473...


Epoch [339/5000]: 100%|██████████| 10/10 [00:00<00:00, 253.88it/s, loss=13.6]


Epoch [339/5000]: Train loss: 12.3504, Valid loss: 11.5957
Saving model with loss 11.596...


Epoch [340/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.04it/s, loss=14.3]


Epoch [340/5000]: Train loss: 11.8939, Valid loss: 10.8841
Saving model with loss 10.884...


Epoch [341/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.47it/s, loss=9.04]


Epoch [341/5000]: Train loss: 11.1100, Valid loss: 11.1812


Epoch [342/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.27it/s, loss=10.8]


Epoch [342/5000]: Train loss: 10.6866, Valid loss: 10.2825
Saving model with loss 10.283...


Epoch [343/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.72it/s, loss=8.16]


Epoch [343/5000]: Train loss: 9.9857, Valid loss: 9.6564
Saving model with loss 9.656...


Epoch [344/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.61it/s, loss=8.33]


Epoch [344/5000]: Train loss: 9.4508, Valid loss: 9.1042
Saving model with loss 9.104...


Epoch [345/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.28it/s, loss=9.94]


Epoch [345/5000]: Train loss: 9.0401, Valid loss: 8.7225
Saving model with loss 8.722...


Epoch [346/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.32it/s, loss=7.84]


Epoch [346/5000]: Train loss: 8.3842, Valid loss: 8.1610
Saving model with loss 8.161...


Epoch [347/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.56it/s, loss=7.9]


Epoch [347/5000]: Train loss: 7.8332, Valid loss: 7.6274
Saving model with loss 7.627...


Epoch [348/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.44it/s, loss=7.1]


Epoch [348/5000]: Train loss: 7.2596, Valid loss: 7.0602
Saving model with loss 7.060...


Epoch [349/5000]: 100%|██████████| 10/10 [00:00<00:00, 248.69it/s, loss=8.16]


Epoch [349/5000]: Train loss: 6.8024, Valid loss: 6.5465
Saving model with loss 6.547...


Epoch [350/5000]: 100%|██████████| 10/10 [00:00<00:00, 225.26it/s, loss=6.52]


Epoch [350/5000]: Train loss: 6.2145, Valid loss: 5.9752
Saving model with loss 5.975...


Epoch [351/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.27it/s, loss=6.55]


Epoch [351/5000]: Train loss: 5.7356, Valid loss: 5.5057
Saving model with loss 5.506...


Epoch [352/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.98it/s, loss=5.29]


Epoch [352/5000]: Train loss: 5.2047, Valid loss: 4.9505
Saving model with loss 4.951...


Epoch [353/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.06it/s, loss=4.15]


Epoch [353/5000]: Train loss: 4.7216, Valid loss: 4.2835
Saving model with loss 4.283...


Epoch [354/5000]: 100%|██████████| 10/10 [00:00<00:00, 235.26it/s, loss=4.43]


Epoch [354/5000]: Train loss: 4.3598, Valid loss: 4.3328


Epoch [355/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.11it/s, loss=4.27]


Epoch [355/5000]: Train loss: 4.0153, Valid loss: 4.1028
Saving model with loss 4.103...


Epoch [356/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.57it/s, loss=3.36]


Epoch [356/5000]: Train loss: 3.6693, Valid loss: 3.5938
Saving model with loss 3.594...


Epoch [357/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.34it/s, loss=3.31]


Epoch [357/5000]: Train loss: 3.4126, Valid loss: 3.4099
Saving model with loss 3.410...


Epoch [358/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.80it/s, loss=3.33]


Epoch [358/5000]: Train loss: 3.2011, Valid loss: 3.1698
Saving model with loss 3.170...


Epoch [359/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.39it/s, loss=3.42]


Epoch [359/5000]: Train loss: 3.0393, Valid loss: 3.1787


Epoch [360/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.77it/s, loss=3.01]


Epoch [360/5000]: Train loss: 2.8635, Valid loss: 3.0370
Saving model with loss 3.037...


Epoch [361/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.08it/s, loss=2.77]


Epoch [361/5000]: Train loss: 2.7397, Valid loss: 2.8428
Saving model with loss 2.843...


Epoch [362/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.66it/s, loss=2.62]


Epoch [362/5000]: Train loss: 2.6366, Valid loss: 2.7608
Saving model with loss 2.761...


Epoch [363/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.10it/s, loss=2.75]


Epoch [363/5000]: Train loss: 2.5711, Valid loss: 2.7866


Epoch [364/5000]: 100%|██████████| 10/10 [00:00<00:00, 241.09it/s, loss=2.27]


Epoch [364/5000]: Train loss: 2.4928, Valid loss: 2.6608
Saving model with loss 2.661...


Epoch [365/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.98it/s, loss=2.23]


Epoch [365/5000]: Train loss: 2.4240, Valid loss: 2.6756


Epoch [366/5000]: 100%|██████████| 10/10 [00:00<00:00, 214.49it/s, loss=2.68]


Epoch [366/5000]: Train loss: 2.4029, Valid loss: 2.4224
Saving model with loss 2.422...


Epoch [367/5000]: 100%|██████████| 10/10 [00:00<00:00, 192.97it/s, loss=1.85]


Epoch [367/5000]: Train loss: 2.3184, Valid loss: 2.5763


Epoch [368/5000]: 100%|██████████| 10/10 [00:00<00:00, 183.11it/s, loss=2.7]


Epoch [368/5000]: Train loss: 2.3412, Valid loss: 2.4643


Epoch [369/5000]: 100%|██████████| 10/10 [00:00<00:00, 196.42it/s, loss=2.73]


Epoch [369/5000]: Train loss: 2.3183, Valid loss: 2.5105


Epoch [370/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.96it/s, loss=2.02]


Epoch [370/5000]: Train loss: 2.2481, Valid loss: 2.2726
Saving model with loss 2.273...


Epoch [371/5000]: 100%|██████████| 10/10 [00:00<00:00, 208.54it/s, loss=2.41]


Epoch [371/5000]: Train loss: 2.2403, Valid loss: 2.4181


Epoch [372/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.48it/s, loss=1.84]


Epoch [372/5000]: Train loss: 2.1861, Valid loss: 2.5049


Epoch [373/5000]: 100%|██████████| 10/10 [00:00<00:00, 209.89it/s, loss=1.86]


Epoch [373/5000]: Train loss: 2.1641, Valid loss: 2.3236


Epoch [374/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.88it/s, loss=1.98]


Epoch [374/5000]: Train loss: 2.1496, Valid loss: 2.5340


Epoch [375/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.83it/s, loss=1.86]


Epoch [375/5000]: Train loss: 2.1235, Valid loss: 2.3995


Epoch [376/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.39it/s, loss=2.34]


Epoch [376/5000]: Train loss: 2.1197, Valid loss: 2.3528


Epoch [377/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.16it/s, loss=2.27]


Epoch [377/5000]: Train loss: 2.0942, Valid loss: 2.3641


Epoch [378/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.21it/s, loss=2.13]


Epoch [378/5000]: Train loss: 2.0673, Valid loss: 2.1710
Saving model with loss 2.171...


Epoch [379/5000]: 100%|██████████| 10/10 [00:00<00:00, 64.09it/s, loss=1.65]


Epoch [379/5000]: Train loss: 2.0188, Valid loss: 2.2637


Epoch [380/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.28it/s, loss=2.04]


Epoch [380/5000]: Train loss: 2.0343, Valid loss: 2.2288


Epoch [381/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.53it/s, loss=1.91]


Epoch [381/5000]: Train loss: 2.0072, Valid loss: 2.4093


Epoch [382/5000]: 100%|██████████| 10/10 [00:00<00:00, 204.77it/s, loss=1.99]


Epoch [382/5000]: Train loss: 1.9944, Valid loss: 2.1461
Saving model with loss 2.146...


Epoch [383/5000]: 100%|██████████| 10/10 [00:00<00:00, 235.78it/s, loss=2.21]


Epoch [383/5000]: Train loss: 1.9911, Valid loss: 2.2399


Epoch [384/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.72it/s, loss=1.82]


Epoch [384/5000]: Train loss: 1.9567, Valid loss: 2.1991


Epoch [385/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.03it/s, loss=1.83]


Epoch [385/5000]: Train loss: 1.9536, Valid loss: 2.1404
Saving model with loss 2.140...


Epoch [386/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.12it/s, loss=2.05]


Epoch [386/5000]: Train loss: 1.9346, Valid loss: 2.0298
Saving model with loss 2.030...


Epoch [387/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.49it/s, loss=1.95]


Epoch [387/5000]: Train loss: 1.9098, Valid loss: 2.1450


Epoch [388/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.84it/s, loss=1.95]


Epoch [388/5000]: Train loss: 1.8947, Valid loss: 1.9888
Saving model with loss 1.989...


Epoch [389/5000]: 100%|██████████| 10/10 [00:00<00:00, 206.83it/s, loss=1.55]


Epoch [389/5000]: Train loss: 1.8623, Valid loss: 2.0158


Epoch [390/5000]: 100%|██████████| 10/10 [00:00<00:00, 204.73it/s, loss=1.65]


Epoch [390/5000]: Train loss: 1.8512, Valid loss: 2.0621


Epoch [391/5000]: 100%|██████████| 10/10 [00:00<00:00, 210.97it/s, loss=2]


Epoch [391/5000]: Train loss: 1.8607, Valid loss: 1.9849
Saving model with loss 1.985...


Epoch [392/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.97it/s, loss=2.26]


Epoch [392/5000]: Train loss: 1.8603, Valid loss: 2.0290


Epoch [393/5000]: 100%|██████████| 10/10 [00:00<00:00, 225.17it/s, loss=1.66]


Epoch [393/5000]: Train loss: 1.8263, Valid loss: 2.0372


Epoch [394/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.27it/s, loss=1.84]


Epoch [394/5000]: Train loss: 1.8283, Valid loss: 1.9395
Saving model with loss 1.940...


Epoch [395/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.21it/s, loss=2]


Epoch [395/5000]: Train loss: 1.8169, Valid loss: 2.0047


Epoch [396/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.41it/s, loss=2.06]


Epoch [396/5000]: Train loss: 1.8082, Valid loss: 2.0932


Epoch [397/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.56it/s, loss=1.76]


Epoch [397/5000]: Train loss: 1.7726, Valid loss: 1.9219
Saving model with loss 1.922...


Epoch [398/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.89it/s, loss=1.8]


Epoch [398/5000]: Train loss: 1.7695, Valid loss: 1.8650
Saving model with loss 1.865...


Epoch [399/5000]: 100%|██████████| 10/10 [00:00<00:00, 209.99it/s, loss=2]


Epoch [399/5000]: Train loss: 1.7672, Valid loss: 1.9245


Epoch [400/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.79it/s, loss=1.94]


Epoch [400/5000]: Train loss: 1.7561, Valid loss: 1.8272
Saving model with loss 1.827...


Epoch [401/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.04it/s, loss=1.83]


Epoch [401/5000]: Train loss: 1.7571, Valid loss: 1.8828


Epoch [402/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.25it/s, loss=1.78]


Epoch [402/5000]: Train loss: 1.7306, Valid loss: 1.8882


Epoch [403/5000]: 100%|██████████| 10/10 [00:00<00:00, 217.80it/s, loss=1.87]


Epoch [403/5000]: Train loss: 1.7243, Valid loss: 1.8477


Epoch [404/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.04it/s, loss=1.5]


Epoch [404/5000]: Train loss: 1.6898, Valid loss: 1.7721
Saving model with loss 1.772...


Epoch [405/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.90it/s, loss=2]


Epoch [405/5000]: Train loss: 1.7101, Valid loss: 1.8896


Epoch [406/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.74it/s, loss=1.79]


Epoch [406/5000]: Train loss: 1.6913, Valid loss: 1.9171


Epoch [407/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.94it/s, loss=1.86]


Epoch [407/5000]: Train loss: 1.6858, Valid loss: 1.8529


Epoch [408/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.32it/s, loss=1.27]


Epoch [408/5000]: Train loss: 1.6564, Valid loss: 1.9166


Epoch [409/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.04it/s, loss=1.47]


Epoch [409/5000]: Train loss: 1.6481, Valid loss: 1.8771


Epoch [410/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.54it/s, loss=2.01]


Epoch [410/5000]: Train loss: 1.6728, Valid loss: 1.8428


Epoch [411/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=2.07]


Epoch [411/5000]: Train loss: 1.6776, Valid loss: 1.9310


Epoch [412/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.98it/s, loss=1.58]


Epoch [412/5000]: Train loss: 1.6298, Valid loss: 1.7559
Saving model with loss 1.756...


Epoch [413/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.90it/s, loss=1.91]


Epoch [413/5000]: Train loss: 1.6448, Valid loss: 1.7353
Saving model with loss 1.735...


Epoch [414/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.74it/s, loss=2.05]


Epoch [414/5000]: Train loss: 1.6416, Valid loss: 1.8435


Epoch [415/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.89it/s, loss=2.11]


Epoch [415/5000]: Train loss: 1.6388, Valid loss: 1.8382


Epoch [416/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.14it/s, loss=1.66]


Epoch [416/5000]: Train loss: 1.6024, Valid loss: 1.8415


Epoch [417/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.93it/s, loss=1.6]


Epoch [417/5000]: Train loss: 1.5937, Valid loss: 1.6510
Saving model with loss 1.651...


Epoch [418/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.85it/s, loss=1.89]


Epoch [418/5000]: Train loss: 1.6050, Valid loss: 1.6853


Epoch [419/5000]: 100%|██████████| 10/10 [00:00<00:00, 212.88it/s, loss=1.72]


Epoch [419/5000]: Train loss: 1.5966, Valid loss: 1.6729


Epoch [420/5000]: 100%|██████████| 10/10 [00:00<00:00, 209.08it/s, loss=1.47]


Epoch [420/5000]: Train loss: 1.5682, Valid loss: 1.7481


Epoch [421/5000]: 100%|██████████| 10/10 [00:00<00:00, 214.40it/s, loss=1.36]


Epoch [421/5000]: Train loss: 1.5529, Valid loss: 1.7626


Epoch [422/5000]: 100%|██████████| 10/10 [00:00<00:00, 200.96it/s, loss=1.19]


Epoch [422/5000]: Train loss: 1.5454, Valid loss: 1.6791


Epoch [423/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.93it/s, loss=1.5]


Epoch [423/5000]: Train loss: 1.5626, Valid loss: 1.6353
Saving model with loss 1.635...


Epoch [424/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.40it/s, loss=1.46]


Epoch [424/5000]: Train loss: 1.5449, Valid loss: 1.7554


Epoch [425/5000]: 100%|██████████| 10/10 [00:00<00:00, 239.99it/s, loss=1.4]


Epoch [425/5000]: Train loss: 1.5320, Valid loss: 1.7260


Epoch [426/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.73it/s, loss=1.77]


Epoch [426/5000]: Train loss: 1.5478, Valid loss: 1.6192
Saving model with loss 1.619...


Epoch [427/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.66it/s, loss=1.48]


Epoch [427/5000]: Train loss: 1.5429, Valid loss: 1.5923
Saving model with loss 1.592...


Epoch [428/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.22it/s, loss=1.52]


Epoch [428/5000]: Train loss: 1.5190, Valid loss: 1.6271


Epoch [429/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.55it/s, loss=1.49]


Epoch [429/5000]: Train loss: 1.5260, Valid loss: 1.6008


Epoch [430/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.23it/s, loss=1.21]


Epoch [430/5000]: Train loss: 1.5008, Valid loss: 1.5259
Saving model with loss 1.526...


Epoch [431/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.61it/s, loss=1.29]


Epoch [431/5000]: Train loss: 1.4977, Valid loss: 1.6206


Epoch [432/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.43it/s, loss=1.48]


Epoch [432/5000]: Train loss: 1.5184, Valid loss: 1.5845


Epoch [433/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.58it/s, loss=1.54]


Epoch [433/5000]: Train loss: 1.5057, Valid loss: 1.6721


Epoch [434/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.30it/s, loss=1.38]


Epoch [434/5000]: Train loss: 1.4820, Valid loss: 1.6200


Epoch [435/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.79it/s, loss=1.41]


Epoch [435/5000]: Train loss: 1.4828, Valid loss: 1.6225


Epoch [436/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.43it/s, loss=1.43]


Epoch [436/5000]: Train loss: 1.4873, Valid loss: 1.6408


Epoch [437/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.77it/s, loss=1.73]


Epoch [437/5000]: Train loss: 1.4908, Valid loss: 1.6004


Epoch [438/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.96it/s, loss=1.22]


Epoch [438/5000]: Train loss: 1.4562, Valid loss: 1.4887
Saving model with loss 1.489...


Epoch [439/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.93it/s, loss=1.5]


Epoch [439/5000]: Train loss: 1.4690, Valid loss: 1.5159


Epoch [440/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.30it/s, loss=1.4]


Epoch [440/5000]: Train loss: 1.4577, Valid loss: 1.5513


Epoch [441/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.18it/s, loss=1.41]


Epoch [441/5000]: Train loss: 1.4551, Valid loss: 1.5102


Epoch [442/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.65it/s, loss=1.28]


Epoch [442/5000]: Train loss: 1.4436, Valid loss: 1.4933


Epoch [443/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.15it/s, loss=1.32]


Epoch [443/5000]: Train loss: 1.4412, Valid loss: 1.5955


Epoch [444/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.58it/s, loss=1.62]


Epoch [444/5000]: Train loss: 1.4545, Valid loss: 1.6233


Epoch [445/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.38it/s, loss=1.46]


Epoch [445/5000]: Train loss: 1.4439, Valid loss: 1.5774


Epoch [446/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.94it/s, loss=1.69]


Epoch [446/5000]: Train loss: 1.4524, Valid loss: 1.4772
Saving model with loss 1.477...


Epoch [447/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.30it/s, loss=1.17]


Epoch [447/5000]: Train loss: 1.4177, Valid loss: 1.5330


Epoch [448/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.82it/s, loss=1.74]


Epoch [448/5000]: Train loss: 1.4515, Valid loss: 1.5267


Epoch [449/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.21it/s, loss=1.6]


Epoch [449/5000]: Train loss: 1.4338, Valid loss: 1.5073


Epoch [450/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.49it/s, loss=1.8]


Epoch [450/5000]: Train loss: 1.4495, Valid loss: 1.5850


Epoch [451/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.64it/s, loss=1.19]


Epoch [451/5000]: Train loss: 1.4190, Valid loss: 1.5178


Epoch [452/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.64it/s, loss=1.8]


Epoch [452/5000]: Train loss: 1.4529, Valid loss: 1.4988


Epoch [453/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.32it/s, loss=1.15]


Epoch [453/5000]: Train loss: 1.4258, Valid loss: 1.6344


Epoch [454/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.69it/s, loss=1.15]


Epoch [454/5000]: Train loss: 1.4013, Valid loss: 1.5358


Epoch [455/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.76it/s, loss=1.68]


Epoch [455/5000]: Train loss: 1.4195, Valid loss: 1.4958


Epoch [456/5000]: 100%|██████████| 10/10 [00:00<00:00, 217.28it/s, loss=1.27]


Epoch [456/5000]: Train loss: 1.3905, Valid loss: 1.5008


Epoch [457/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.74it/s, loss=1.48]


Epoch [457/5000]: Train loss: 1.3997, Valid loss: 1.4660
Saving model with loss 1.466...


Epoch [458/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.04it/s, loss=1.42]


Epoch [458/5000]: Train loss: 1.3920, Valid loss: 1.5582


Epoch [459/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.74it/s, loss=1.76]


Epoch [459/5000]: Train loss: 1.4103, Valid loss: 1.4452
Saving model with loss 1.445...


Epoch [460/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.52it/s, loss=1.44]


Epoch [460/5000]: Train loss: 1.3910, Valid loss: 1.5181


Epoch [461/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.02it/s, loss=1.24]


Epoch [461/5000]: Train loss: 1.4019, Valid loss: 1.5576


Epoch [462/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.33it/s, loss=1.72]


Epoch [462/5000]: Train loss: 1.4111, Valid loss: 1.5106


Epoch [463/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.22it/s, loss=1.15]


Epoch [463/5000]: Train loss: 1.3878, Valid loss: 1.5638


Epoch [464/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.35it/s, loss=1.56]


Epoch [464/5000]: Train loss: 1.3890, Valid loss: 1.4294
Saving model with loss 1.429...


Epoch [465/5000]: 100%|██████████| 10/10 [00:00<00:00, 62.92it/s, loss=1.44]


Epoch [465/5000]: Train loss: 1.3756, Valid loss: 1.4723


Epoch [466/5000]: 100%|██████████| 10/10 [00:00<00:00, 245.95it/s, loss=1.2]


Epoch [466/5000]: Train loss: 1.3627, Valid loss: 1.4173
Saving model with loss 1.417...


Epoch [467/5000]: 100%|██████████| 10/10 [00:00<00:00, 223.19it/s, loss=1.25]


Epoch [467/5000]: Train loss: 1.3526, Valid loss: 1.4895


Epoch [468/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.46it/s, loss=1.46]


Epoch [468/5000]: Train loss: 1.3903, Valid loss: 1.5226


Epoch [469/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.91it/s, loss=1.43]


Epoch [469/5000]: Train loss: 1.3682, Valid loss: 1.3877
Saving model with loss 1.388...


Epoch [470/5000]: 100%|██████████| 10/10 [00:00<00:00, 240.68it/s, loss=1.23]


Epoch [470/5000]: Train loss: 1.3536, Valid loss: 1.4630


Epoch [471/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.09it/s, loss=1.41]


Epoch [471/5000]: Train loss: 1.3747, Valid loss: 1.4226


Epoch [472/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.11it/s, loss=1.32]


Epoch [472/5000]: Train loss: 1.3458, Valid loss: 1.3465
Saving model with loss 1.347...


Epoch [473/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.06it/s, loss=1.23]


Epoch [473/5000]: Train loss: 1.3362, Valid loss: 1.4130


Epoch [474/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.93it/s, loss=1.35]


Epoch [474/5000]: Train loss: 1.3440, Valid loss: 1.4392


Epoch [475/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.97it/s, loss=1.23]


Epoch [475/5000]: Train loss: 1.3433, Valid loss: 1.3861


Epoch [476/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.29it/s, loss=1.68]


Epoch [476/5000]: Train loss: 1.3570, Valid loss: 1.4248


Epoch [477/5000]: 100%|██████████| 10/10 [00:00<00:00, 213.36it/s, loss=1.65]


Epoch [477/5000]: Train loss: 1.3505, Valid loss: 1.3846


Epoch [478/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.13it/s, loss=1.58]


Epoch [478/5000]: Train loss: 1.3458, Valid loss: 1.3940


Epoch [479/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.72it/s, loss=1.49]


Epoch [479/5000]: Train loss: 1.3426, Valid loss: 1.3567


Epoch [480/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.46it/s, loss=1.31]


Epoch [480/5000]: Train loss: 1.3235, Valid loss: 1.3553


Epoch [481/5000]: 100%|██████████| 10/10 [00:00<00:00, 279.11it/s, loss=1.18]


Epoch [481/5000]: Train loss: 1.3131, Valid loss: 1.4671


Epoch [482/5000]: 100%|██████████| 10/10 [00:00<00:00, 200.08it/s, loss=1.27]


Epoch [482/5000]: Train loss: 1.3257, Valid loss: 1.3927


Epoch [483/5000]: 100%|██████████| 10/10 [00:00<00:00, 200.83it/s, loss=1.17]


Epoch [483/5000]: Train loss: 1.3102, Valid loss: 1.3603


Epoch [484/5000]: 100%|██████████| 10/10 [00:00<00:00, 210.03it/s, loss=1.22]


Epoch [484/5000]: Train loss: 1.3081, Valid loss: 1.3701


Epoch [485/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.12it/s, loss=1.15]


Epoch [485/5000]: Train loss: 1.3028, Valid loss: 1.4811


Epoch [486/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.48it/s, loss=1.1]


Epoch [486/5000]: Train loss: 1.2980, Valid loss: 1.3148
Saving model with loss 1.315...


Epoch [487/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.20it/s, loss=1.09]


Epoch [487/5000]: Train loss: 1.2960, Valid loss: 1.3448


Epoch [488/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.20it/s, loss=1.23]


Epoch [488/5000]: Train loss: 1.3071, Valid loss: 1.3913


Epoch [489/5000]: 100%|██████████| 10/10 [00:00<00:00, 213.50it/s, loss=1.25]


Epoch [489/5000]: Train loss: 1.3052, Valid loss: 1.3049
Saving model with loss 1.305...


Epoch [490/5000]: 100%|██████████| 10/10 [00:00<00:00, 190.48it/s, loss=1.54]


Epoch [490/5000]: Train loss: 1.3205, Valid loss: 1.4145


Epoch [491/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.95it/s, loss=1.25]


Epoch [491/5000]: Train loss: 1.2993, Valid loss: 1.4578


Epoch [492/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.54it/s, loss=1.21]


Epoch [492/5000]: Train loss: 1.2899, Valid loss: 1.3383


Epoch [493/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.82it/s, loss=1.64]


Epoch [493/5000]: Train loss: 1.3293, Valid loss: 1.4531


Epoch [494/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.04it/s, loss=1.65]


Epoch [494/5000]: Train loss: 1.3229, Valid loss: 1.3030
Saving model with loss 1.303...


Epoch [495/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.98it/s, loss=1.5]


Epoch [495/5000]: Train loss: 1.3037, Valid loss: 1.4788


Epoch [496/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.47it/s, loss=1.89]


Epoch [496/5000]: Train loss: 1.3248, Valid loss: 1.3048


Epoch [497/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.66it/s, loss=1.32]


Epoch [497/5000]: Train loss: 1.2937, Valid loss: 1.3191


Epoch [498/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.06it/s, loss=1.37]


Epoch [498/5000]: Train loss: 1.2941, Valid loss: 1.3676


Epoch [499/5000]: 100%|██████████| 10/10 [00:00<00:00, 247.91it/s, loss=1.16]


Epoch [499/5000]: Train loss: 1.2768, Valid loss: 1.2830
Saving model with loss 1.283...


Epoch [500/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.33it/s, loss=1.4]


Epoch [500/5000]: Train loss: 1.2963, Valid loss: 1.3030


Epoch [501/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.90it/s, loss=1.23]


Epoch [501/5000]: Train loss: 1.2726, Valid loss: 1.3205


Epoch [502/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.96it/s, loss=1.3]


Epoch [502/5000]: Train loss: 1.2760, Valid loss: 1.3278


Epoch [503/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.12it/s, loss=1.29]


Epoch [503/5000]: Train loss: 1.2749, Valid loss: 1.4106


Epoch [504/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.59it/s, loss=1.13]


Epoch [504/5000]: Train loss: 1.2624, Valid loss: 1.2668
Saving model with loss 1.267...


Epoch [505/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.68it/s, loss=1.24]


Epoch [505/5000]: Train loss: 1.2703, Valid loss: 1.3675


Epoch [506/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.75it/s, loss=1.48]


Epoch [506/5000]: Train loss: 1.2824, Valid loss: 1.3451


Epoch [507/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.39it/s, loss=1.19]


Epoch [507/5000]: Train loss: 1.2612, Valid loss: 1.3399


Epoch [508/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.94it/s, loss=1.76]


Epoch [508/5000]: Train loss: 1.3030, Valid loss: 1.3524


Epoch [509/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.01it/s, loss=1.08]


Epoch [509/5000]: Train loss: 1.2550, Valid loss: 1.3083


Epoch [510/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.91it/s, loss=1.09]


Epoch [510/5000]: Train loss: 1.2501, Valid loss: 1.3044


Epoch [511/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.29it/s, loss=1.15]


Epoch [511/5000]: Train loss: 1.2505, Valid loss: 1.3756


Epoch [512/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.00it/s, loss=1.61]


Epoch [512/5000]: Train loss: 1.2805, Valid loss: 1.3727


Epoch [513/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.36it/s, loss=1.05]


Epoch [513/5000]: Train loss: 1.2407, Valid loss: 1.4008


Epoch [514/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.14it/s, loss=1.58]


Epoch [514/5000]: Train loss: 1.2836, Valid loss: 1.3261


Epoch [515/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.27it/s, loss=1.57]


Epoch [515/5000]: Train loss: 1.2686, Valid loss: 1.3198


Epoch [516/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.82it/s, loss=1.06]


Epoch [516/5000]: Train loss: 1.2377, Valid loss: 1.2947


Epoch [517/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.02it/s, loss=1.33]


Epoch [517/5000]: Train loss: 1.2557, Valid loss: 1.3014


Epoch [518/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.03it/s, loss=0.925]


Epoch [518/5000]: Train loss: 1.2298, Valid loss: 1.2715


Epoch [519/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.65it/s, loss=1.46]


Epoch [519/5000]: Train loss: 1.2587, Valid loss: 1.3023


Epoch [520/5000]: 100%|██████████| 10/10 [00:00<00:00, 240.19it/s, loss=1.21]


Epoch [520/5000]: Train loss: 1.2473, Valid loss: 1.2930


Epoch [521/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.33it/s, loss=0.822]


Epoch [521/5000]: Train loss: 1.2197, Valid loss: 1.2717


Epoch [522/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.50it/s, loss=1.07]


Epoch [522/5000]: Train loss: 1.2351, Valid loss: 1.2960


Epoch [523/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.99it/s, loss=1.17]


Epoch [523/5000]: Train loss: 1.2347, Valid loss: 1.2350
Saving model with loss 1.235...


Epoch [524/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.24it/s, loss=1.03]


Epoch [524/5000]: Train loss: 1.2260, Valid loss: 1.3081


Epoch [525/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.65it/s, loss=1.24]


Epoch [525/5000]: Train loss: 1.2388, Valid loss: 1.2983


Epoch [526/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.38it/s, loss=1.01]


Epoch [526/5000]: Train loss: 1.2204, Valid loss: 1.3229


Epoch [527/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.69it/s, loss=1.51]


Epoch [527/5000]: Train loss: 1.2522, Valid loss: 1.3111


Epoch [528/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.29it/s, loss=1.26]


Epoch [528/5000]: Train loss: 1.2339, Valid loss: 1.2261
Saving model with loss 1.226...


Epoch [529/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.57it/s, loss=1.64]


Epoch [529/5000]: Train loss: 1.2513, Valid loss: 1.2710


Epoch [530/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.38it/s, loss=1.11]


Epoch [530/5000]: Train loss: 1.2180, Valid loss: 1.2593


Epoch [531/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.71it/s, loss=1.06]


Epoch [531/5000]: Train loss: 1.2126, Valid loss: 1.2933


Epoch [532/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.22it/s, loss=1.05]


Epoch [532/5000]: Train loss: 1.2137, Valid loss: 1.3040


Epoch [533/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.50it/s, loss=1.35]


Epoch [533/5000]: Train loss: 1.2285, Valid loss: 1.2862


Epoch [534/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.17it/s, loss=1.29]


Epoch [534/5000]: Train loss: 1.2259, Valid loss: 1.1916
Saving model with loss 1.192...


Epoch [535/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.92it/s, loss=1.34]


Epoch [535/5000]: Train loss: 1.2276, Valid loss: 1.1955


Epoch [536/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.88it/s, loss=1.14]


Epoch [536/5000]: Train loss: 1.2213, Valid loss: 1.3177


Epoch [537/5000]: 100%|██████████| 10/10 [00:00<00:00, 228.07it/s, loss=0.888]


Epoch [537/5000]: Train loss: 1.2133, Valid loss: 1.1664
Saving model with loss 1.166...


Epoch [538/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.98it/s, loss=1.47]


Epoch [538/5000]: Train loss: 1.2321, Valid loss: 1.2400


Epoch [539/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.11it/s, loss=1.16]


Epoch [539/5000]: Train loss: 1.2178, Valid loss: 1.2606


Epoch [540/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.88it/s, loss=1.77]


Epoch [540/5000]: Train loss: 1.2429, Valid loss: 1.2918


Epoch [541/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.15it/s, loss=1.36]


Epoch [541/5000]: Train loss: 1.2179, Valid loss: 1.2815


Epoch [542/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.02it/s, loss=1.08]


Epoch [542/5000]: Train loss: 1.2061, Valid loss: 1.2395


Epoch [543/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.03it/s, loss=1.52]


Epoch [543/5000]: Train loss: 1.2240, Valid loss: 1.2205


Epoch [544/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.06it/s, loss=0.97]


Epoch [544/5000]: Train loss: 1.1954, Valid loss: 1.2726


Epoch [545/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.98it/s, loss=1.68]


Epoch [545/5000]: Train loss: 1.2307, Valid loss: 1.2401


Epoch [546/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.11it/s, loss=0.881]


Epoch [546/5000]: Train loss: 1.1832, Valid loss: 1.1816


Epoch [547/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.92it/s, loss=1.29]


Epoch [547/5000]: Train loss: 1.2118, Valid loss: 1.2043


Epoch [548/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.30it/s, loss=1.22]


Epoch [548/5000]: Train loss: 1.2140, Valid loss: 1.2612


Epoch [549/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.22it/s, loss=1.21]


Epoch [549/5000]: Train loss: 1.2029, Valid loss: 1.2786


Epoch [550/5000]: 100%|██████████| 10/10 [00:00<00:00, 220.12it/s, loss=1.2]


Epoch [550/5000]: Train loss: 1.2111, Valid loss: 1.2655


Epoch [551/5000]: 100%|██████████| 10/10 [00:00<00:00, 69.42it/s, loss=1.28]


Epoch [551/5000]: Train loss: 1.1994, Valid loss: 1.2220


Epoch [552/5000]: 100%|██████████| 10/10 [00:00<00:00, 171.08it/s, loss=1.62]


Epoch [552/5000]: Train loss: 1.2204, Valid loss: 1.1965


Epoch [553/5000]: 100%|██████████| 10/10 [00:00<00:00, 180.31it/s, loss=1.45]


Epoch [553/5000]: Train loss: 1.2102, Valid loss: 1.2790


Epoch [554/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.07it/s, loss=1.12]


Epoch [554/5000]: Train loss: 1.1899, Valid loss: 1.2136


Epoch [555/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.34it/s, loss=1.5]


Epoch [555/5000]: Train loss: 1.2139, Valid loss: 1.2570


Epoch [556/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.87it/s, loss=1.34]


Epoch [556/5000]: Train loss: 1.2095, Valid loss: 1.2840


Epoch [557/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.48it/s, loss=1.21]


Epoch [557/5000]: Train loss: 1.1963, Valid loss: 1.2076


Epoch [558/5000]: 100%|██████████| 10/10 [00:00<00:00, 192.93it/s, loss=1.17]


Epoch [558/5000]: Train loss: 1.1882, Valid loss: 1.2640


Epoch [559/5000]: 100%|██████████| 10/10 [00:00<00:00, 162.79it/s, loss=1.23]


Epoch [559/5000]: Train loss: 1.1847, Valid loss: 1.2149


Epoch [560/5000]: 100%|██████████| 10/10 [00:00<00:00, 214.65it/s, loss=0.989]


Epoch [560/5000]: Train loss: 1.1692, Valid loss: 1.1813


Epoch [561/5000]: 100%|██████████| 10/10 [00:00<00:00, 199.24it/s, loss=1.23]


Epoch [561/5000]: Train loss: 1.1849, Valid loss: 1.2288


Epoch [562/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.83it/s, loss=0.992]


Epoch [562/5000]: Train loss: 1.1798, Valid loss: 1.1938


Epoch [563/5000]: 100%|██████████| 10/10 [00:00<00:00, 189.49it/s, loss=0.994]


Epoch [563/5000]: Train loss: 1.1799, Valid loss: 1.1812


Epoch [564/5000]: 100%|██████████| 10/10 [00:00<00:00, 260.17it/s, loss=0.888]


Epoch [564/5000]: Train loss: 1.1608, Valid loss: 1.2172


Epoch [565/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.86it/s, loss=1.41]


Epoch [565/5000]: Train loss: 1.1917, Valid loss: 1.2133


Epoch [566/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.78it/s, loss=1.24]


Epoch [566/5000]: Train loss: 1.1776, Valid loss: 1.1908


Epoch [567/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.14it/s, loss=1.04]


Epoch [567/5000]: Train loss: 1.1669, Valid loss: 1.2537


Epoch [568/5000]: 100%|██████████| 10/10 [00:00<00:00, 171.19it/s, loss=0.935]


Epoch [568/5000]: Train loss: 1.1600, Valid loss: 1.2038


Epoch [569/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.99it/s, loss=1.05]


Epoch [569/5000]: Train loss: 1.1643, Valid loss: 1.1927


Epoch [570/5000]: 100%|██████████| 10/10 [00:00<00:00, 241.03it/s, loss=1.13]


Epoch [570/5000]: Train loss: 1.1838, Valid loss: 1.2034


Epoch [571/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.49it/s, loss=0.822]


Epoch [571/5000]: Train loss: 1.1681, Valid loss: 1.1946


Epoch [572/5000]: 100%|██████████| 10/10 [00:00<00:00, 210.56it/s, loss=1.09]


Epoch [572/5000]: Train loss: 1.1710, Valid loss: 1.2561


Epoch [573/5000]: 100%|██████████| 10/10 [00:00<00:00, 227.09it/s, loss=1.27]


Epoch [573/5000]: Train loss: 1.1780, Valid loss: 1.2006


Epoch [574/5000]: 100%|██████████| 10/10 [00:00<00:00, 204.61it/s, loss=1.28]


Epoch [574/5000]: Train loss: 1.1710, Valid loss: 1.2691


Epoch [575/5000]: 100%|██████████| 10/10 [00:00<00:00, 214.78it/s, loss=1.29]


Epoch [575/5000]: Train loss: 1.2044, Valid loss: 1.2463


Epoch [576/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=0.989]


Epoch [576/5000]: Train loss: 1.1592, Valid loss: 1.2095


Epoch [577/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.77it/s, loss=1.01]


Epoch [577/5000]: Train loss: 1.1549, Valid loss: 1.1174
Saving model with loss 1.117...


Epoch [578/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.62it/s, loss=1.52]


Epoch [578/5000]: Train loss: 1.1829, Valid loss: 1.2662


Epoch [579/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.78it/s, loss=1.17]


Epoch [579/5000]: Train loss: 1.1592, Valid loss: 1.2104


Epoch [580/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.34it/s, loss=1.84]


Epoch [580/5000]: Train loss: 1.2023, Valid loss: 1.1912


Epoch [581/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.37it/s, loss=1.16]


Epoch [581/5000]: Train loss: 1.1683, Valid loss: 1.0904
Saving model with loss 1.090...


Epoch [582/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.58it/s, loss=1.05]


Epoch [582/5000]: Train loss: 1.1579, Valid loss: 1.2182


Epoch [583/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=1.28]


Epoch [583/5000]: Train loss: 1.1715, Valid loss: 1.1723


Epoch [584/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.66it/s, loss=1.54]


Epoch [584/5000]: Train loss: 1.1784, Valid loss: 1.1218


Epoch [585/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.69it/s, loss=1.12]


Epoch [585/5000]: Train loss: 1.1530, Valid loss: 1.1849


Epoch [586/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.46it/s, loss=1.28]


Epoch [586/5000]: Train loss: 1.1586, Valid loss: 1.1925


Epoch [587/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.64it/s, loss=1.01]


Epoch [587/5000]: Train loss: 1.1428, Valid loss: 1.1844


Epoch [588/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.88it/s, loss=1.18]


Epoch [588/5000]: Train loss: 1.1539, Valid loss: 1.1831


Epoch [589/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.52it/s, loss=0.986]


Epoch [589/5000]: Train loss: 1.1403, Valid loss: 1.1414


Epoch [590/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.91it/s, loss=1.06]


Epoch [590/5000]: Train loss: 1.1407, Valid loss: 1.1716


Epoch [591/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.05it/s, loss=1.18]


Epoch [591/5000]: Train loss: 1.1470, Valid loss: 1.1155


Epoch [592/5000]: 100%|██████████| 10/10 [00:00<00:00, 241.66it/s, loss=1.19]


Epoch [592/5000]: Train loss: 1.1455, Valid loss: 1.1908


Epoch [593/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.90it/s, loss=0.704]


Epoch [593/5000]: Train loss: 1.1282, Valid loss: 1.2125


Epoch [594/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.56it/s, loss=1.7]


Epoch [594/5000]: Train loss: 1.1909, Valid loss: 1.2187


Epoch [595/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.10it/s, loss=1.19]


Epoch [595/5000]: Train loss: 1.1599, Valid loss: 1.3283


Epoch [596/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.85it/s, loss=1.16]


Epoch [596/5000]: Train loss: 1.1678, Valid loss: 1.0851
Saving model with loss 1.085...


Epoch [597/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.69it/s, loss=1.03]


Epoch [597/5000]: Train loss: 1.1461, Valid loss: 1.1379


Epoch [598/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.97it/s, loss=1.08]


Epoch [598/5000]: Train loss: 1.1399, Valid loss: 1.1861


Epoch [599/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.05it/s, loss=1.15]


Epoch [599/5000]: Train loss: 1.1608, Valid loss: 1.1737


Epoch [600/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.92it/s, loss=1.03]


Epoch [600/5000]: Train loss: 1.1313, Valid loss: 1.1339


Epoch [601/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.39it/s, loss=1.06]


Epoch [601/5000]: Train loss: 1.1327, Valid loss: 1.1573


Epoch [602/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.20it/s, loss=1.13]


Epoch [602/5000]: Train loss: 1.1354, Valid loss: 1.1050


Epoch [603/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.54it/s, loss=1.09]


Epoch [603/5000]: Train loss: 1.1412, Valid loss: 1.1198


Epoch [604/5000]: 100%|██████████| 10/10 [00:00<00:00, 240.64it/s, loss=1.34]


Epoch [604/5000]: Train loss: 1.1444, Valid loss: 1.1138


Epoch [605/5000]: 100%|██████████| 10/10 [00:00<00:00, 223.00it/s, loss=1.38]


Epoch [605/5000]: Train loss: 1.1561, Valid loss: 1.1623


Epoch [606/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.65it/s, loss=1.51]


Epoch [606/5000]: Train loss: 1.1541, Valid loss: 1.1999


Epoch [607/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.00it/s, loss=0.972]


Epoch [607/5000]: Train loss: 1.1286, Valid loss: 1.1676


Epoch [608/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.96it/s, loss=0.884]


Epoch [608/5000]: Train loss: 1.1230, Valid loss: 1.2043


Epoch [609/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.11it/s, loss=1.05]


Epoch [609/5000]: Train loss: 1.1399, Valid loss: 1.1947


Epoch [610/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.67it/s, loss=1.1]


Epoch [610/5000]: Train loss: 1.1264, Valid loss: 1.1000


Epoch [611/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.26it/s, loss=0.883]


Epoch [611/5000]: Train loss: 1.1167, Valid loss: 1.1895


Epoch [612/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.30it/s, loss=1.17]


Epoch [612/5000]: Train loss: 1.1381, Valid loss: 1.1699


Epoch [613/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.14it/s, loss=0.976]


Epoch [613/5000]: Train loss: 1.1198, Valid loss: 1.1561


Epoch [614/5000]: 100%|██████████| 10/10 [00:00<00:00, 279.08it/s, loss=1.07]


Epoch [614/5000]: Train loss: 1.1280, Valid loss: 1.1912


Epoch [615/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.24it/s, loss=1.32]


Epoch [615/5000]: Train loss: 1.1572, Valid loss: 1.2227


Epoch [616/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.65it/s, loss=1.38]


Epoch [616/5000]: Train loss: 1.1358, Valid loss: 1.0848
Saving model with loss 1.085...


Epoch [617/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.39it/s, loss=1.21]


Epoch [617/5000]: Train loss: 1.1284, Valid loss: 1.1353


Epoch [618/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.05it/s, loss=1.41]


Epoch [618/5000]: Train loss: 1.1396, Valid loss: 1.0877


Epoch [619/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.91it/s, loss=1.57]


Epoch [619/5000]: Train loss: 1.1452, Valid loss: 1.1426


Epoch [620/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.98it/s, loss=1.03]


Epoch [620/5000]: Train loss: 1.1121, Valid loss: 1.1264


Epoch [621/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.26it/s, loss=1.2]


Epoch [621/5000]: Train loss: 1.1255, Valid loss: 1.1271


Epoch [622/5000]: 100%|██████████| 10/10 [00:00<00:00, 240.82it/s, loss=1.32]


Epoch [622/5000]: Train loss: 1.1284, Valid loss: 1.2385


Epoch [623/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.47it/s, loss=1.16]


Epoch [623/5000]: Train loss: 1.1190, Valid loss: 1.1041


Epoch [624/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.75it/s, loss=0.87]


Epoch [624/5000]: Train loss: 1.1033, Valid loss: 1.1518


Epoch [625/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.70it/s, loss=0.954]


Epoch [625/5000]: Train loss: 1.1054, Valid loss: 1.1586


Epoch [626/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.69it/s, loss=1.08]


Epoch [626/5000]: Train loss: 1.1225, Valid loss: 1.1308


Epoch [627/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.01it/s, loss=0.92]


Epoch [627/5000]: Train loss: 1.0998, Valid loss: 1.1352


Epoch [628/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.90it/s, loss=1.15]


Epoch [628/5000]: Train loss: 1.1140, Valid loss: 1.1869


Epoch [629/5000]: 100%|██████████| 10/10 [00:00<00:00, 213.44it/s, loss=0.935]


Epoch [629/5000]: Train loss: 1.1029, Valid loss: 1.1351


Epoch [630/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.93it/s, loss=1.22]


Epoch [630/5000]: Train loss: 1.1291, Valid loss: 1.1829


Epoch [631/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.09it/s, loss=1.01]


Epoch [631/5000]: Train loss: 1.1060, Valid loss: 1.1582


Epoch [632/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.00it/s, loss=1.38]


Epoch [632/5000]: Train loss: 1.1361, Valid loss: 1.1995


Epoch [633/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.22it/s, loss=1.29]


Epoch [633/5000]: Train loss: 1.1382, Valid loss: 1.1554


Epoch [634/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.91it/s, loss=1.02]


Epoch [634/5000]: Train loss: 1.1107, Valid loss: 1.1978


Epoch [635/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.79it/s, loss=1.25]


Epoch [635/5000]: Train loss: 1.1464, Valid loss: 1.1755


Epoch [636/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.98it/s, loss=0.986]


Epoch [636/5000]: Train loss: 1.1227, Valid loss: 1.1598


Epoch [637/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.23it/s, loss=1.21]


Epoch [637/5000]: Train loss: 1.1065, Valid loss: 1.2282


Epoch [638/5000]: 100%|██████████| 10/10 [00:00<00:00, 79.71it/s, loss=0.972]


Epoch [638/5000]: Train loss: 1.0956, Valid loss: 1.1599


Epoch [639/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.71it/s, loss=0.656]


Epoch [639/5000]: Train loss: 1.0724, Valid loss: 1.0598
Saving model with loss 1.060...


Epoch [640/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.59it/s, loss=1.26]


Epoch [640/5000]: Train loss: 1.1136, Valid loss: 1.1200


Epoch [641/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.36it/s, loss=1.11]


Epoch [641/5000]: Train loss: 1.1051, Valid loss: 1.1366


Epoch [642/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.12it/s, loss=1.24]


Epoch [642/5000]: Train loss: 1.1070, Valid loss: 1.1362


Epoch [643/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.11it/s, loss=1.04]


Epoch [643/5000]: Train loss: 1.0921, Valid loss: 1.1070


Epoch [644/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.95it/s, loss=0.948]


Epoch [644/5000]: Train loss: 1.0891, Valid loss: 1.1422


Epoch [645/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.98it/s, loss=0.937]


Epoch [645/5000]: Train loss: 1.0927, Valid loss: 1.1714


Epoch [646/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.85it/s, loss=1.54]


Epoch [646/5000]: Train loss: 1.1252, Valid loss: 1.1633


Epoch [647/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.79it/s, loss=1.01]


Epoch [647/5000]: Train loss: 1.0906, Valid loss: 1.0819


Epoch [648/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.40it/s, loss=1.37]


Epoch [648/5000]: Train loss: 1.1118, Valid loss: 1.1224


Epoch [649/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.41it/s, loss=1.5]


Epoch [649/5000]: Train loss: 1.1188, Valid loss: 1.1484


Epoch [650/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.58it/s, loss=0.938]


Epoch [650/5000]: Train loss: 1.1043, Valid loss: 1.0841


Epoch [651/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.71it/s, loss=0.951]


Epoch [651/5000]: Train loss: 1.0910, Valid loss: 1.0773


Epoch [652/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.27it/s, loss=0.996]


Epoch [652/5000]: Train loss: 1.1006, Valid loss: 1.1576


Epoch [653/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.41it/s, loss=1.11]


Epoch [653/5000]: Train loss: 1.0986, Valid loss: 1.0963


Epoch [654/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.81it/s, loss=1.2]


Epoch [654/5000]: Train loss: 1.0997, Valid loss: 1.0756


Epoch [655/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.60it/s, loss=0.804]


Epoch [655/5000]: Train loss: 1.0789, Valid loss: 1.1465


Epoch [656/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.99it/s, loss=1.41]


Epoch [656/5000]: Train loss: 1.1148, Valid loss: 1.1289


Epoch [657/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.38it/s, loss=0.937]


Epoch [657/5000]: Train loss: 1.0790, Valid loss: 1.0317
Saving model with loss 1.032...


Epoch [658/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.81it/s, loss=1.17]


Epoch [658/5000]: Train loss: 1.0906, Valid loss: 1.1831


Epoch [659/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.20it/s, loss=0.949]


Epoch [659/5000]: Train loss: 1.0751, Valid loss: 1.0964


Epoch [660/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.45it/s, loss=1.42]


Epoch [660/5000]: Train loss: 1.1041, Valid loss: 1.0970


Epoch [661/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.12it/s, loss=1.16]


Epoch [661/5000]: Train loss: 1.0986, Valid loss: 1.1274


Epoch [662/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.34it/s, loss=1.01]


Epoch [662/5000]: Train loss: 1.0871, Valid loss: 1.1146


Epoch [663/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.93it/s, loss=0.801]


Epoch [663/5000]: Train loss: 1.0650, Valid loss: 1.0633


Epoch [664/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.97it/s, loss=1.19]


Epoch [664/5000]: Train loss: 1.0915, Valid loss: 1.1780


Epoch [665/5000]: 100%|██████████| 10/10 [00:00<00:00, 239.00it/s, loss=0.866]


Epoch [665/5000]: Train loss: 1.0706, Valid loss: 1.0816


Epoch [666/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.98it/s, loss=1.19]


Epoch [666/5000]: Train loss: 1.1002, Valid loss: 1.1086


Epoch [667/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.49it/s, loss=1.01]


Epoch [667/5000]: Train loss: 1.0859, Valid loss: 1.0889


Epoch [668/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.13it/s, loss=0.763]


Epoch [668/5000]: Train loss: 1.0681, Valid loss: 1.0990


Epoch [669/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.01it/s, loss=1]


Epoch [669/5000]: Train loss: 1.0703, Valid loss: 1.1557


Epoch [670/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.62it/s, loss=1.36]


Epoch [670/5000]: Train loss: 1.0929, Valid loss: 1.1170


Epoch [671/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.81it/s, loss=1.11]


Epoch [671/5000]: Train loss: 1.0832, Valid loss: 1.0455


Epoch [672/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.23it/s, loss=1.26]


Epoch [672/5000]: Train loss: 1.0909, Valid loss: 1.1049


Epoch [673/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.69it/s, loss=0.868]


Epoch [673/5000]: Train loss: 1.0625, Valid loss: 1.0254
Saving model with loss 1.025...


Epoch [674/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.63it/s, loss=1.13]


Epoch [674/5000]: Train loss: 1.1064, Valid loss: 1.1193


Epoch [675/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.20it/s, loss=1.13]


Epoch [675/5000]: Train loss: 1.1141, Valid loss: 1.0899


Epoch [676/5000]: 100%|██████████| 10/10 [00:00<00:00, 180.29it/s, loss=1.2]


Epoch [676/5000]: Train loss: 1.1083, Valid loss: 1.0830


Epoch [677/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.10it/s, loss=1.05]


Epoch [677/5000]: Train loss: 1.0760, Valid loss: 1.0986


Epoch [678/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.40it/s, loss=1.1]


Epoch [678/5000]: Train loss: 1.0727, Valid loss: 1.0908


Epoch [679/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.67it/s, loss=0.88]


Epoch [679/5000]: Train loss: 1.0620, Valid loss: 1.0442


Epoch [680/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.15it/s, loss=0.979]


Epoch [680/5000]: Train loss: 1.0683, Valid loss: 1.0529


Epoch [681/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.22it/s, loss=1.03]


Epoch [681/5000]: Train loss: 1.0675, Valid loss: 1.0498


Epoch [682/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.74it/s, loss=1.04]


Epoch [682/5000]: Train loss: 1.0645, Valid loss: 1.1328


Epoch [683/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.12it/s, loss=0.888]


Epoch [683/5000]: Train loss: 1.0603, Valid loss: 1.0456


Epoch [684/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.28it/s, loss=1.19]


Epoch [684/5000]: Train loss: 1.0730, Valid loss: 1.1271


Epoch [685/5000]: 100%|██████████| 10/10 [00:00<00:00, 235.03it/s, loss=0.961]


Epoch [685/5000]: Train loss: 1.0585, Valid loss: 1.1193


Epoch [686/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.18it/s, loss=1.54]


Epoch [686/5000]: Train loss: 1.0915, Valid loss: 1.0390


Epoch [687/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.75it/s, loss=1.24]


Epoch [687/5000]: Train loss: 1.0884, Valid loss: 1.0824


Epoch [688/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.71it/s, loss=1.06]


Epoch [688/5000]: Train loss: 1.0703, Valid loss: 1.0859


Epoch [689/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.02it/s, loss=0.804]


Epoch [689/5000]: Train loss: 1.0466, Valid loss: 1.0758


Epoch [690/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.87it/s, loss=1.17]


Epoch [690/5000]: Train loss: 1.0767, Valid loss: 1.1172


Epoch [691/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.51it/s, loss=1.55]


Epoch [691/5000]: Train loss: 1.0936, Valid loss: 1.0826


Epoch [692/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.99it/s, loss=1.04]


Epoch [692/5000]: Train loss: 1.0647, Valid loss: 1.0795


Epoch [693/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.64it/s, loss=0.92]


Epoch [693/5000]: Train loss: 1.0556, Valid loss: 1.1282


Epoch [694/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.11it/s, loss=1.05]


Epoch [694/5000]: Train loss: 1.0595, Valid loss: 1.0502


Epoch [695/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.34it/s, loss=0.89]


Epoch [695/5000]: Train loss: 1.0507, Valid loss: 1.1361


Epoch [696/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.68it/s, loss=0.964]


Epoch [696/5000]: Train loss: 1.0571, Valid loss: 1.0738


Epoch [697/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.07it/s, loss=0.798]


Epoch [697/5000]: Train loss: 1.0447, Valid loss: 1.0731


Epoch [698/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.25it/s, loss=0.65]


Epoch [698/5000]: Train loss: 1.0312, Valid loss: 1.0443


Epoch [699/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.05it/s, loss=0.949]


Epoch [699/5000]: Train loss: 1.0499, Valid loss: 1.0873


Epoch [700/5000]: 100%|██████████| 10/10 [00:00<00:00, 292.27it/s, loss=0.79]


Epoch [700/5000]: Train loss: 1.0379, Valid loss: 1.0625


Epoch [701/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.88it/s, loss=1.08]


Epoch [701/5000]: Train loss: 1.0684, Valid loss: 1.0268


Epoch [702/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.48it/s, loss=0.843]


Epoch [702/5000]: Train loss: 1.0478, Valid loss: 1.0755


Epoch [703/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.66it/s, loss=0.839]


Epoch [703/5000]: Train loss: 1.0411, Valid loss: 1.1758


Epoch [704/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.36it/s, loss=1.19]


Epoch [704/5000]: Train loss: 1.0628, Valid loss: 1.0813


Epoch [705/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.71it/s, loss=1.2]


Epoch [705/5000]: Train loss: 1.0595, Valid loss: 1.0690


Epoch [706/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.13it/s, loss=0.87]


Epoch [706/5000]: Train loss: 1.0407, Valid loss: 1.0491


Epoch [707/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.70it/s, loss=1.14]


Epoch [707/5000]: Train loss: 1.0642, Valid loss: 1.0929


Epoch [708/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.74it/s, loss=1.15]


Epoch [708/5000]: Train loss: 1.0581, Valid loss: 1.0133
Saving model with loss 1.013...


Epoch [709/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.38it/s, loss=0.907]


Epoch [709/5000]: Train loss: 1.0433, Valid loss: 1.0283


Epoch [710/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.99it/s, loss=0.899]


Epoch [710/5000]: Train loss: 1.0421, Valid loss: 1.1194


Epoch [711/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.65it/s, loss=0.938]


Epoch [711/5000]: Train loss: 1.0389, Valid loss: 1.0375


Epoch [712/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.88it/s, loss=1.28]


Epoch [712/5000]: Train loss: 1.0662, Valid loss: 1.0890


Epoch [713/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.50it/s, loss=1.36]


Epoch [713/5000]: Train loss: 1.0887, Valid loss: 1.0047
Saving model with loss 1.005...


Epoch [714/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.20it/s, loss=0.77]


Epoch [714/5000]: Train loss: 1.0404, Valid loss: 1.1269


Epoch [715/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.23it/s, loss=1.01]


Epoch [715/5000]: Train loss: 1.0492, Valid loss: 1.0101


Epoch [716/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.44it/s, loss=0.846]


Epoch [716/5000]: Train loss: 1.0330, Valid loss: 1.0985


Epoch [717/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.05it/s, loss=1.16]


Epoch [717/5000]: Train loss: 1.0557, Valid loss: 1.0734


Epoch [718/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.58it/s, loss=0.934]


Epoch [718/5000]: Train loss: 1.0434, Valid loss: 1.0365


Epoch [719/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.23it/s, loss=1.04]


Epoch [719/5000]: Train loss: 1.0429, Valid loss: 1.0476


Epoch [720/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.50it/s, loss=1.08]


Epoch [720/5000]: Train loss: 1.0439, Valid loss: 1.0299


Epoch [721/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.42it/s, loss=1.04]


Epoch [721/5000]: Train loss: 1.0451, Valid loss: 1.0589


Epoch [722/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.60it/s, loss=1.4]


Epoch [722/5000]: Train loss: 1.0629, Valid loss: 1.0224


Epoch [723/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.04it/s, loss=1.03]


Epoch [723/5000]: Train loss: 1.0431, Valid loss: 1.0795


Epoch [724/5000]: 100%|██████████| 10/10 [00:00<00:00, 73.89it/s, loss=0.968]


Epoch [724/5000]: Train loss: 1.0431, Valid loss: 1.1366


Epoch [725/5000]: 100%|██████████| 10/10 [00:00<00:00, 239.88it/s, loss=0.895]


Epoch [725/5000]: Train loss: 1.0485, Valid loss: 1.0961


Epoch [726/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.72it/s, loss=0.857]


Epoch [726/5000]: Train loss: 1.0318, Valid loss: 1.0312


Epoch [727/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.86it/s, loss=1.16]


Epoch [727/5000]: Train loss: 1.0439, Valid loss: 1.1098


Epoch [728/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.69it/s, loss=0.934]


Epoch [728/5000]: Train loss: 1.0356, Valid loss: 0.9930
Saving model with loss 0.993...


Epoch [729/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.74it/s, loss=1.06]


Epoch [729/5000]: Train loss: 1.0381, Valid loss: 0.9893
Saving model with loss 0.989...


Epoch [730/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.28it/s, loss=0.936]


Epoch [730/5000]: Train loss: 1.0330, Valid loss: 1.0826


Epoch [731/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.99it/s, loss=0.948]


Epoch [731/5000]: Train loss: 1.0365, Valid loss: 1.0602


Epoch [732/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.97it/s, loss=0.885]


Epoch [732/5000]: Train loss: 1.0288, Valid loss: 1.0177


Epoch [733/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.46it/s, loss=1.25]


Epoch [733/5000]: Train loss: 1.0487, Valid loss: 1.0022


Epoch [734/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.58it/s, loss=1.29]


Epoch [734/5000]: Train loss: 1.0493, Valid loss: 1.1398


Epoch [735/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.77it/s, loss=0.943]


Epoch [735/5000]: Train loss: 1.0280, Valid loss: 1.0681


Epoch [736/5000]: 100%|██████████| 10/10 [00:00<00:00, 224.41it/s, loss=0.984]


Epoch [736/5000]: Train loss: 1.0376, Valid loss: 1.0671


Epoch [737/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.70it/s, loss=1.21]


Epoch [737/5000]: Train loss: 1.0521, Valid loss: 1.0316


Epoch [738/5000]: 100%|██████████| 10/10 [00:00<00:00, 240.16it/s, loss=1.13]


Epoch [738/5000]: Train loss: 1.0466, Valid loss: 1.1154


Epoch [739/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.80it/s, loss=1.17]


Epoch [739/5000]: Train loss: 1.0438, Valid loss: 1.0430


Epoch [740/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.66it/s, loss=1.17]


Epoch [740/5000]: Train loss: 1.0394, Valid loss: 1.0008


Epoch [741/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.40it/s, loss=0.918]


Epoch [741/5000]: Train loss: 1.0229, Valid loss: 1.0140


Epoch [742/5000]: 100%|██████████| 10/10 [00:00<00:00, 253.05it/s, loss=0.867]


Epoch [742/5000]: Train loss: 1.0229, Valid loss: 1.0907


Epoch [743/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.31it/s, loss=1.13]


Epoch [743/5000]: Train loss: 1.0420, Valid loss: 1.0823


Epoch [744/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.63it/s, loss=1.19]


Epoch [744/5000]: Train loss: 1.0513, Valid loss: 1.0228


Epoch [745/5000]: 100%|██████████| 10/10 [00:00<00:00, 204.73it/s, loss=1.29]


Epoch [745/5000]: Train loss: 1.0455, Valid loss: 0.9934


Epoch [746/5000]: 100%|██████████| 10/10 [00:00<00:00, 239.36it/s, loss=1.17]


Epoch [746/5000]: Train loss: 1.0428, Valid loss: 1.0623


Epoch [747/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.13it/s, loss=0.99]


Epoch [747/5000]: Train loss: 1.0428, Valid loss: 1.0112


Epoch [748/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.25it/s, loss=1.17]


Epoch [748/5000]: Train loss: 1.0528, Valid loss: 1.1411


Epoch [749/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.85it/s, loss=1.03]


Epoch [749/5000]: Train loss: 1.0511, Valid loss: 1.0389


Epoch [750/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.32it/s, loss=0.925]


Epoch [750/5000]: Train loss: 1.0280, Valid loss: 1.0387


Epoch [751/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.30it/s, loss=1.2]


Epoch [751/5000]: Train loss: 1.0430, Valid loss: 1.0517


Epoch [752/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.97it/s, loss=1.18]


Epoch [752/5000]: Train loss: 1.0355, Valid loss: 1.0287


Epoch [753/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.81it/s, loss=0.88]


Epoch [753/5000]: Train loss: 1.0216, Valid loss: 1.0744


Epoch [754/5000]: 100%|██████████| 10/10 [00:00<00:00, 228.03it/s, loss=1.26]


Epoch [754/5000]: Train loss: 1.0403, Valid loss: 1.0516


Epoch [755/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.70it/s, loss=1.05]


Epoch [755/5000]: Train loss: 1.0365, Valid loss: 1.0575


Epoch [756/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.11it/s, loss=0.803]


Epoch [756/5000]: Train loss: 1.0179, Valid loss: 1.0496


Epoch [757/5000]: 100%|██████████| 10/10 [00:00<00:00, 214.52it/s, loss=0.891]


Epoch [757/5000]: Train loss: 1.0226, Valid loss: 0.9842
Saving model with loss 0.984...


Epoch [758/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.71it/s, loss=0.973]


Epoch [758/5000]: Train loss: 1.0234, Valid loss: 1.0002


Epoch [759/5000]: 100%|██████████| 10/10 [00:00<00:00, 223.92it/s, loss=1.12]


Epoch [759/5000]: Train loss: 1.0359, Valid loss: 1.0529


Epoch [760/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.57it/s, loss=1.02]


Epoch [760/5000]: Train loss: 1.0361, Valid loss: 1.0794


Epoch [761/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.31it/s, loss=0.97]


Epoch [761/5000]: Train loss: 1.0268, Valid loss: 1.0955


Epoch [762/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.35it/s, loss=0.906]


Epoch [762/5000]: Train loss: 1.0492, Valid loss: 1.1537


Epoch [763/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.48it/s, loss=0.974]


Epoch [763/5000]: Train loss: 1.0680, Valid loss: 1.0675


Epoch [764/5000]: 100%|██████████| 10/10 [00:00<00:00, 213.44it/s, loss=0.9]


Epoch [764/5000]: Train loss: 1.0303, Valid loss: 1.0890


Epoch [765/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.26it/s, loss=1.04]


Epoch [765/5000]: Train loss: 1.0403, Valid loss: 1.0300


Epoch [766/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.33it/s, loss=0.963]


Epoch [766/5000]: Train loss: 1.0177, Valid loss: 1.0795


Epoch [767/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.83it/s, loss=1.03]


Epoch [767/5000]: Train loss: 1.0216, Valid loss: 0.9987


Epoch [768/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.97it/s, loss=0.844]


Epoch [768/5000]: Train loss: 1.0060, Valid loss: 1.0385


Epoch [769/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.08it/s, loss=0.888]


Epoch [769/5000]: Train loss: 1.0100, Valid loss: 1.0432


Epoch [770/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.80it/s, loss=0.969]


Epoch [770/5000]: Train loss: 1.0120, Valid loss: 1.0187


Epoch [771/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.59it/s, loss=1.08]


Epoch [771/5000]: Train loss: 1.0231, Valid loss: 1.0682


Epoch [772/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.49it/s, loss=1.15]


Epoch [772/5000]: Train loss: 1.0211, Valid loss: 1.0504


Epoch [773/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.10it/s, loss=0.934]


Epoch [773/5000]: Train loss: 1.0210, Valid loss: 1.0256


Epoch [774/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.43it/s, loss=0.96]


Epoch [774/5000]: Train loss: 1.0234, Valid loss: 1.1563


Epoch [775/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.58it/s, loss=0.771]


Epoch [775/5000]: Train loss: 1.0218, Valid loss: 1.0386


Epoch [776/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.22it/s, loss=1.02]


Epoch [776/5000]: Train loss: 1.0123, Valid loss: 1.0176


Epoch [777/5000]: 100%|██████████| 10/10 [00:00<00:00, 235.71it/s, loss=0.738]


Epoch [777/5000]: Train loss: 1.0107, Valid loss: 1.0394


Epoch [778/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.23it/s, loss=0.926]


Epoch [778/5000]: Train loss: 1.0339, Valid loss: 1.0087


Epoch [779/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.90it/s, loss=0.952]


Epoch [779/5000]: Train loss: 1.0167, Valid loss: 1.0379


Epoch [780/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.09it/s, loss=1.33]


Epoch [780/5000]: Train loss: 1.0298, Valid loss: 1.0142


Epoch [781/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.02it/s, loss=1.27]


Epoch [781/5000]: Train loss: 1.0263, Valid loss: 1.0482


Epoch [782/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.85it/s, loss=0.86]


Epoch [782/5000]: Train loss: 1.0068, Valid loss: 1.0316


Epoch [783/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.31it/s, loss=1.15]


Epoch [783/5000]: Train loss: 1.0193, Valid loss: 1.0101


Epoch [784/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.17it/s, loss=0.929]


Epoch [784/5000]: Train loss: 1.0097, Valid loss: 1.0428


Epoch [785/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.47it/s, loss=1.1]


Epoch [785/5000]: Train loss: 1.0151, Valid loss: 1.0194


Epoch [786/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.63it/s, loss=1.02]


Epoch [786/5000]: Train loss: 1.0099, Valid loss: 1.0212


Epoch [787/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.35it/s, loss=0.952]


Epoch [787/5000]: Train loss: 1.0129, Valid loss: 1.0066


Epoch [788/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.51it/s, loss=0.815]


Epoch [788/5000]: Train loss: 1.0132, Valid loss: 1.0322


Epoch [789/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.09it/s, loss=1.18]


Epoch [789/5000]: Train loss: 1.0310, Valid loss: 1.0837


Epoch [790/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.81it/s, loss=1.35]


Epoch [790/5000]: Train loss: 1.0384, Valid loss: 1.0342


Epoch [791/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.06it/s, loss=0.801]


Epoch [791/5000]: Train loss: 1.0022, Valid loss: 1.0042


Epoch [792/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.64it/s, loss=1.12]


Epoch [792/5000]: Train loss: 1.0226, Valid loss: 1.0293


Epoch [793/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.77it/s, loss=0.944]


Epoch [793/5000]: Train loss: 1.0071, Valid loss: 1.0629


Epoch [794/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.70it/s, loss=1.59]


Epoch [794/5000]: Train loss: 1.0442, Valid loss: 1.0195


Epoch [795/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.58it/s, loss=0.826]


Epoch [795/5000]: Train loss: 1.0042, Valid loss: 0.9872


Epoch [796/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.07it/s, loss=0.921]


Epoch [796/5000]: Train loss: 1.0065, Valid loss: 1.0092


Epoch [797/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.94it/s, loss=0.978]


Epoch [797/5000]: Train loss: 1.0024, Valid loss: 0.9769
Saving model with loss 0.977...


Epoch [798/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.87it/s, loss=0.801]


Epoch [798/5000]: Train loss: 0.9964, Valid loss: 1.0721


Epoch [799/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.44it/s, loss=1.21]


Epoch [799/5000]: Train loss: 1.0291, Valid loss: 1.0320


Epoch [800/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.72it/s, loss=1.18]


Epoch [800/5000]: Train loss: 1.0213, Valid loss: 1.0430


Epoch [801/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.58it/s, loss=0.939]


Epoch [801/5000]: Train loss: 1.0077, Valid loss: 1.0306


Epoch [802/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.11it/s, loss=1.28]


Epoch [802/5000]: Train loss: 1.0300, Valid loss: 1.0236


Epoch [803/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.66it/s, loss=1.09]


Epoch [803/5000]: Train loss: 1.0093, Valid loss: 1.0137


Epoch [804/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.59it/s, loss=1.15]


Epoch [804/5000]: Train loss: 1.0217, Valid loss: 0.9918


Epoch [805/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.74it/s, loss=0.938]


Epoch [805/5000]: Train loss: 1.0187, Valid loss: 1.0551


Epoch [806/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.16it/s, loss=1.14]


Epoch [806/5000]: Train loss: 1.0473, Valid loss: 0.9722
Saving model with loss 0.972...


Epoch [807/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=0.908]


Epoch [807/5000]: Train loss: 1.0179, Valid loss: 0.9521
Saving model with loss 0.952...


Epoch [808/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.29it/s, loss=1.01]


Epoch [808/5000]: Train loss: 1.0228, Valid loss: 1.0203


Epoch [809/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.97it/s, loss=0.928]


Epoch [809/5000]: Train loss: 1.0158, Valid loss: 1.0932


Epoch [810/5000]: 100%|██████████| 10/10 [00:00<00:00, 79.08it/s, loss=1.02]


Epoch [810/5000]: Train loss: 1.0067, Valid loss: 1.0581


Epoch [811/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.38it/s, loss=0.853]


Epoch [811/5000]: Train loss: 0.9950, Valid loss: 0.9986


Epoch [812/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.21it/s, loss=0.837]


Epoch [812/5000]: Train loss: 0.9855, Valid loss: 1.0046


Epoch [813/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.12it/s, loss=1]


Epoch [813/5000]: Train loss: 1.0044, Valid loss: 0.9761


Epoch [814/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.61it/s, loss=1.25]


Epoch [814/5000]: Train loss: 1.0177, Valid loss: 1.0659


Epoch [815/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.09it/s, loss=1.01]


Epoch [815/5000]: Train loss: 1.0080, Valid loss: 0.9508
Saving model with loss 0.951...


Epoch [816/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.21it/s, loss=1.05]


Epoch [816/5000]: Train loss: 1.0140, Valid loss: 0.9797


Epoch [817/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.86it/s, loss=0.98]


Epoch [817/5000]: Train loss: 0.9991, Valid loss: 1.0318


Epoch [818/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=1.01]


Epoch [818/5000]: Train loss: 0.9974, Valid loss: 0.9750


Epoch [819/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.88it/s, loss=0.825]


Epoch [819/5000]: Train loss: 0.9907, Valid loss: 0.9705


Epoch [820/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.73it/s, loss=1.14]


Epoch [820/5000]: Train loss: 1.0084, Valid loss: 0.9926


Epoch [821/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.11it/s, loss=0.993]


Epoch [821/5000]: Train loss: 1.0099, Valid loss: 1.0307


Epoch [822/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.08it/s, loss=1.44]


Epoch [822/5000]: Train loss: 1.0213, Valid loss: 1.0865


Epoch [823/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=0.731]


Epoch [823/5000]: Train loss: 0.9792, Valid loss: 1.0234


Epoch [824/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.59it/s, loss=0.725]


Epoch [824/5000]: Train loss: 0.9788, Valid loss: 0.9950


Epoch [825/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.06it/s, loss=0.806]


Epoch [825/5000]: Train loss: 0.9869, Valid loss: 1.0416


Epoch [826/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.96it/s, loss=0.863]


Epoch [826/5000]: Train loss: 0.9969, Valid loss: 1.1024


Epoch [827/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.45it/s, loss=0.802]


Epoch [827/5000]: Train loss: 0.9946, Valid loss: 1.1107


Epoch [828/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.08it/s, loss=1.1]


Epoch [828/5000]: Train loss: 1.0154, Valid loss: 1.0389


Epoch [829/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.75it/s, loss=1.27]


Epoch [829/5000]: Train loss: 1.0142, Valid loss: 1.0049


Epoch [830/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.86it/s, loss=1.08]


Epoch [830/5000]: Train loss: 1.0017, Valid loss: 1.0280


Epoch [831/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.40it/s, loss=1.11]


Epoch [831/5000]: Train loss: 1.0007, Valid loss: 1.0061


Epoch [832/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.79it/s, loss=0.95]


Epoch [832/5000]: Train loss: 0.9900, Valid loss: 0.9903


Epoch [833/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.90it/s, loss=0.865]


Epoch [833/5000]: Train loss: 0.9841, Valid loss: 0.9859


Epoch [834/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.47it/s, loss=0.926]


Epoch [834/5000]: Train loss: 0.9885, Valid loss: 0.9722


Epoch [835/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.91it/s, loss=0.98]


Epoch [835/5000]: Train loss: 0.9896, Valid loss: 0.9815


Epoch [836/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.71it/s, loss=0.987]


Epoch [836/5000]: Train loss: 0.9870, Valid loss: 0.9981


Epoch [837/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.48it/s, loss=1.02]


Epoch [837/5000]: Train loss: 0.9922, Valid loss: 1.0836


Epoch [838/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.72it/s, loss=0.752]


Epoch [838/5000]: Train loss: 0.9815, Valid loss: 1.0325


Epoch [839/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.38it/s, loss=1.02]


Epoch [839/5000]: Train loss: 1.0013, Valid loss: 0.9694


Epoch [840/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.50it/s, loss=0.899]


Epoch [840/5000]: Train loss: 0.9854, Valid loss: 1.0335


Epoch [841/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.32it/s, loss=1.06]


Epoch [841/5000]: Train loss: 0.9946, Valid loss: 0.9912


Epoch [842/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.69it/s, loss=0.791]


Epoch [842/5000]: Train loss: 0.9757, Valid loss: 0.9723


Epoch [843/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.43it/s, loss=0.937]


Epoch [843/5000]: Train loss: 0.9843, Valid loss: 1.0359


Epoch [844/5000]: 100%|██████████| 10/10 [00:00<00:00, 311.07it/s, loss=0.994]


Epoch [844/5000]: Train loss: 0.9922, Valid loss: 0.9854


Epoch [845/5000]: 100%|██████████| 10/10 [00:00<00:00, 302.89it/s, loss=0.883]


Epoch [845/5000]: Train loss: 0.9814, Valid loss: 0.9606


Epoch [846/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.87it/s, loss=1.25]


Epoch [846/5000]: Train loss: 1.0257, Valid loss: 0.9942


Epoch [847/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.59it/s, loss=0.977]


Epoch [847/5000]: Train loss: 1.0095, Valid loss: 1.0173


Epoch [848/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.59it/s, loss=0.978]


Epoch [848/5000]: Train loss: 0.9992, Valid loss: 1.0086


Epoch [849/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.63it/s, loss=1.09]


Epoch [849/5000]: Train loss: 1.0056, Valid loss: 0.9459
Saving model with loss 0.946...


Epoch [850/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.12it/s, loss=1.19]


Epoch [850/5000]: Train loss: 1.0142, Valid loss: 1.0585


Epoch [851/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.40it/s, loss=1.23]


Epoch [851/5000]: Train loss: 1.0268, Valid loss: 1.0387


Epoch [852/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.15it/s, loss=1.29]


Epoch [852/5000]: Train loss: 1.0092, Valid loss: 1.0000


Epoch [853/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.69it/s, loss=1.02]


Epoch [853/5000]: Train loss: 0.9947, Valid loss: 1.0518


Epoch [854/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.54it/s, loss=0.785]


Epoch [854/5000]: Train loss: 0.9875, Valid loss: 0.9764


Epoch [855/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.37it/s, loss=1.12]


Epoch [855/5000]: Train loss: 0.9927, Valid loss: 0.9424
Saving model with loss 0.942...


Epoch [856/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.27it/s, loss=1.06]


Epoch [856/5000]: Train loss: 0.9971, Valid loss: 1.0279


Epoch [857/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.67it/s, loss=1.33]


Epoch [857/5000]: Train loss: 1.0146, Valid loss: 0.9418
Saving model with loss 0.942...


Epoch [858/5000]: 100%|██████████| 10/10 [00:00<00:00, 293.92it/s, loss=1.39]


Epoch [858/5000]: Train loss: 1.0099, Valid loss: 0.9568


Epoch [859/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.01it/s, loss=1.02]


Epoch [859/5000]: Train loss: 0.9872, Valid loss: 1.0450


Epoch [860/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.38it/s, loss=1]


Epoch [860/5000]: Train loss: 0.9993, Valid loss: 0.9309
Saving model with loss 0.931...


Epoch [861/5000]: 100%|██████████| 10/10 [00:00<00:00, 308.33it/s, loss=1.06]


Epoch [861/5000]: Train loss: 0.9921, Valid loss: 0.9533


Epoch [862/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.65it/s, loss=1.17]


Epoch [862/5000]: Train loss: 0.9940, Valid loss: 0.9541


Epoch [863/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.48it/s, loss=1.07]


Epoch [863/5000]: Train loss: 0.9873, Valid loss: 0.9758


Epoch [864/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.44it/s, loss=0.863]


Epoch [864/5000]: Train loss: 0.9808, Valid loss: 1.0453


Epoch [865/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.10it/s, loss=0.933]


Epoch [865/5000]: Train loss: 0.9813, Valid loss: 0.9981


Epoch [866/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.90it/s, loss=1.17]


Epoch [866/5000]: Train loss: 0.9909, Valid loss: 0.9995


Epoch [867/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.29it/s, loss=0.863]


Epoch [867/5000]: Train loss: 0.9755, Valid loss: 0.9685


Epoch [868/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.49it/s, loss=0.935]


Epoch [868/5000]: Train loss: 0.9802, Valid loss: 0.9548


Epoch [869/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.34it/s, loss=0.636]


Epoch [869/5000]: Train loss: 0.9586, Valid loss: 0.9804


Epoch [870/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.46it/s, loss=0.83]


Epoch [870/5000]: Train loss: 0.9700, Valid loss: 1.1053


Epoch [871/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.01it/s, loss=0.794]


Epoch [871/5000]: Train loss: 0.9811, Valid loss: 1.0023


Epoch [872/5000]: 100%|██████████| 10/10 [00:00<00:00, 293.01it/s, loss=1.05]


Epoch [872/5000]: Train loss: 0.9872, Valid loss: 1.0016


Epoch [873/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.45it/s, loss=0.918]


Epoch [873/5000]: Train loss: 0.9776, Valid loss: 0.9586


Epoch [874/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.38it/s, loss=1.1]


Epoch [874/5000]: Train loss: 0.9861, Valid loss: 0.9535


Epoch [875/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.66it/s, loss=0.855]


Epoch [875/5000]: Train loss: 0.9687, Valid loss: 1.0718


Epoch [876/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.36it/s, loss=0.914]


Epoch [876/5000]: Train loss: 0.9790, Valid loss: 1.0008


Epoch [877/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.35it/s, loss=1.26]


Epoch [877/5000]: Train loss: 1.0004, Valid loss: 0.9839


Epoch [878/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.99it/s, loss=0.803]


Epoch [878/5000]: Train loss: 0.9676, Valid loss: 0.9515


Epoch [879/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.91it/s, loss=0.904]


Epoch [879/5000]: Train loss: 0.9840, Valid loss: 0.9686


Epoch [880/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.46it/s, loss=0.736]


Epoch [880/5000]: Train loss: 0.9638, Valid loss: 1.0243


Epoch [881/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=0.893]


Epoch [881/5000]: Train loss: 0.9812, Valid loss: 0.9990


Epoch [882/5000]: 100%|██████████| 10/10 [00:00<00:00, 301.59it/s, loss=0.834]


Epoch [882/5000]: Train loss: 0.9704, Valid loss: 1.0278


Epoch [883/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.66it/s, loss=0.867]


Epoch [883/5000]: Train loss: 0.9702, Valid loss: 1.0016


Epoch [884/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.69it/s, loss=0.889]


Epoch [884/5000]: Train loss: 0.9710, Valid loss: 1.0503


Epoch [885/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.10it/s, loss=0.945]


Epoch [885/5000]: Train loss: 0.9847, Valid loss: 0.9647


Epoch [886/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.14it/s, loss=0.816]


Epoch [886/5000]: Train loss: 0.9653, Valid loss: 0.9810


Epoch [887/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.40it/s, loss=0.902]


Epoch [887/5000]: Train loss: 0.9796, Valid loss: 0.9350


Epoch [888/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.55it/s, loss=0.739]


Epoch [888/5000]: Train loss: 0.9607, Valid loss: 0.9087
Saving model with loss 0.909...


Epoch [889/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.04it/s, loss=1.01]


Epoch [889/5000]: Train loss: 0.9762, Valid loss: 1.0217


Epoch [890/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.63it/s, loss=0.946]


Epoch [890/5000]: Train loss: 0.9776, Valid loss: 0.9668


Epoch [891/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.47it/s, loss=0.887]


Epoch [891/5000]: Train loss: 0.9792, Valid loss: 0.9810


Epoch [892/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.61it/s, loss=0.704]


Epoch [892/5000]: Train loss: 0.9570, Valid loss: 0.9854


Epoch [893/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.81it/s, loss=0.79]


Epoch [893/5000]: Train loss: 0.9674, Valid loss: 1.0883


Epoch [894/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.82it/s, loss=0.879]


Epoch [894/5000]: Train loss: 0.9651, Valid loss: 0.9756


Epoch [895/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.92it/s, loss=1.18]


Epoch [895/5000]: Train loss: 0.9864, Valid loss: 1.0352


Epoch [896/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.31it/s, loss=0.896]


Epoch [896/5000]: Train loss: 0.9663, Valid loss: 0.9455


Epoch [897/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.35it/s, loss=0.797]


Epoch [897/5000]: Train loss: 0.9639, Valid loss: 0.9685


Epoch [898/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.50it/s, loss=0.781]


Epoch [898/5000]: Train loss: 0.9637, Valid loss: 1.0158


Epoch [899/5000]: 100%|██████████| 10/10 [00:00<00:00, 349.76it/s, loss=0.931]


Epoch [899/5000]: Train loss: 0.9717, Valid loss: 0.9747


Epoch [900/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.14it/s, loss=0.994]


Epoch [900/5000]: Train loss: 0.9722, Valid loss: 0.9855


Epoch [901/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.46it/s, loss=0.989]


Epoch [901/5000]: Train loss: 0.9699, Valid loss: 1.0191


Epoch [902/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.56it/s, loss=1.08]


Epoch [902/5000]: Train loss: 0.9928, Valid loss: 0.9943


Epoch [903/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.72it/s, loss=1.09]


Epoch [903/5000]: Train loss: 0.9808, Valid loss: 0.9244


Epoch [904/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.95it/s, loss=1.15]


Epoch [904/5000]: Train loss: 0.9820, Valid loss: 0.9789


Epoch [905/5000]: 100%|██████████| 10/10 [00:00<00:00, 341.50it/s, loss=0.852]


Epoch [905/5000]: Train loss: 0.9646, Valid loss: 0.9396


Epoch [906/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.58it/s, loss=1.06]


Epoch [906/5000]: Train loss: 0.9728, Valid loss: 0.9917


Epoch [907/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.50it/s, loss=1.16]


Epoch [907/5000]: Train loss: 0.9786, Valid loss: 0.9307


Epoch [908/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.70it/s, loss=0.911]


Epoch [908/5000]: Train loss: 0.9660, Valid loss: 0.9866


Epoch [909/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.09it/s, loss=0.701]


Epoch [909/5000]: Train loss: 0.9686, Valid loss: 0.9763


Epoch [910/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.11it/s, loss=0.967]


Epoch [910/5000]: Train loss: 0.9898, Valid loss: 1.0529


Epoch [911/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.49it/s, loss=1.14]


Epoch [911/5000]: Train loss: 1.0327, Valid loss: 0.9659


Epoch [912/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.61it/s, loss=0.868]


Epoch [912/5000]: Train loss: 0.9648, Valid loss: 0.9637


Epoch [913/5000]: 100%|██████████| 10/10 [00:00<00:00, 247.45it/s, loss=0.957]


Epoch [913/5000]: Train loss: 0.9674, Valid loss: 0.9495


Epoch [914/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.33it/s, loss=1.03]


Epoch [914/5000]: Train loss: 0.9709, Valid loss: 0.9414


Epoch [915/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.29it/s, loss=1.35]


Epoch [915/5000]: Train loss: 0.9940, Valid loss: 1.0056


Epoch [916/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.73it/s, loss=0.967]


Epoch [916/5000]: Train loss: 0.9759, Valid loss: 0.9618


Epoch [917/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.04it/s, loss=0.99]


Epoch [917/5000]: Train loss: 0.9843, Valid loss: 1.0209


Epoch [918/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.98it/s, loss=1.25]


Epoch [918/5000]: Train loss: 1.0022, Valid loss: 1.0029


Epoch [919/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.63it/s, loss=0.813]


Epoch [919/5000]: Train loss: 0.9792, Valid loss: 0.9779


Epoch [920/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.92it/s, loss=0.863]


Epoch [920/5000]: Train loss: 0.9770, Valid loss: 1.0277


Epoch [921/5000]: 100%|██████████| 10/10 [00:00<00:00, 213.31it/s, loss=1.53]


Epoch [921/5000]: Train loss: 1.0064, Valid loss: 1.0829


Epoch [922/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.82it/s, loss=1.08]


Epoch [922/5000]: Train loss: 0.9751, Valid loss: 0.9953


Epoch [923/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.81it/s, loss=1.36]


Epoch [923/5000]: Train loss: 0.9893, Valid loss: 0.9370


Epoch [924/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.83it/s, loss=0.877]


Epoch [924/5000]: Train loss: 0.9661, Valid loss: 0.9947


Epoch [925/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.68it/s, loss=1.07]


Epoch [925/5000]: Train loss: 0.9801, Valid loss: 0.9391


Epoch [926/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.41it/s, loss=1.14]


Epoch [926/5000]: Train loss: 0.9769, Valid loss: 0.9364


Epoch [927/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.94it/s, loss=0.829]


Epoch [927/5000]: Train loss: 0.9849, Valid loss: 0.9300


Epoch [928/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.24it/s, loss=0.945]


Epoch [928/5000]: Train loss: 0.9753, Valid loss: 0.9513


Epoch [929/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.57it/s, loss=0.905]


Epoch [929/5000]: Train loss: 0.9949, Valid loss: 0.9396


Epoch [930/5000]: 100%|██████████| 10/10 [00:00<00:00, 309.23it/s, loss=0.892]


Epoch [930/5000]: Train loss: 0.9646, Valid loss: 0.9407


Epoch [931/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.01it/s, loss=0.949]


Epoch [931/5000]: Train loss: 0.9637, Valid loss: 0.9632


Epoch [932/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.31it/s, loss=1]


Epoch [932/5000]: Train loss: 0.9639, Valid loss: 1.0287


Epoch [933/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.07it/s, loss=0.797]


Epoch [933/5000]: Train loss: 0.9697, Valid loss: 0.9085
Saving model with loss 0.908...


Epoch [934/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.16it/s, loss=0.936]


Epoch [934/5000]: Train loss: 0.9648, Valid loss: 0.9756


Epoch [935/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.53it/s, loss=0.683]


Epoch [935/5000]: Train loss: 0.9475, Valid loss: 1.0025


Epoch [936/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.26it/s, loss=1.17]


Epoch [936/5000]: Train loss: 0.9786, Valid loss: 0.9326


Epoch [937/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.00it/s, loss=0.694]


Epoch [937/5000]: Train loss: 0.9574, Valid loss: 0.9967


Epoch [938/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.62it/s, loss=1.07]


Epoch [938/5000]: Train loss: 0.9703, Valid loss: 0.9052
Saving model with loss 0.905...


Epoch [939/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.81it/s, loss=1.04]


Epoch [939/5000]: Train loss: 0.9775, Valid loss: 0.9473


Epoch [940/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.67it/s, loss=0.899]


Epoch [940/5000]: Train loss: 0.9632, Valid loss: 0.9601


Epoch [941/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.75it/s, loss=1.19]


Epoch [941/5000]: Train loss: 0.9806, Valid loss: 0.9560


Epoch [942/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.34it/s, loss=0.879]


Epoch [942/5000]: Train loss: 0.9611, Valid loss: 0.9552


Epoch [943/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.56it/s, loss=0.903]


Epoch [943/5000]: Train loss: 0.9631, Valid loss: 0.9680


Epoch [944/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.02it/s, loss=0.966]


Epoch [944/5000]: Train loss: 0.9690, Valid loss: 0.9444


Epoch [945/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.09it/s, loss=0.844]


Epoch [945/5000]: Train loss: 0.9647, Valid loss: 0.9219


Epoch [946/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.67it/s, loss=0.908]


Epoch [946/5000]: Train loss: 0.9763, Valid loss: 1.0580


Epoch [947/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.03it/s, loss=0.683]


Epoch [947/5000]: Train loss: 0.9896, Valid loss: 1.0121


Epoch [948/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.07it/s, loss=1.08]


Epoch [948/5000]: Train loss: 0.9763, Valid loss: 0.9646


Epoch [949/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.57it/s, loss=1.03]


Epoch [949/5000]: Train loss: 0.9798, Valid loss: 0.9225


Epoch [950/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.92it/s, loss=1.33]


Epoch [950/5000]: Train loss: 0.9961, Valid loss: 0.9246


Epoch [951/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.75it/s, loss=1.06]


Epoch [951/5000]: Train loss: 0.9760, Valid loss: 0.9681


Epoch [952/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.97it/s, loss=0.813]


Epoch [952/5000]: Train loss: 0.9576, Valid loss: 0.9943


Epoch [953/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.76it/s, loss=1.19]


Epoch [953/5000]: Train loss: 0.9855, Valid loss: 0.8871
Saving model with loss 0.887...


Epoch [954/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.57it/s, loss=1.13]


Epoch [954/5000]: Train loss: 0.9719, Valid loss: 0.9368


Epoch [955/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.04it/s, loss=0.827]


Epoch [955/5000]: Train loss: 0.9533, Valid loss: 0.9440


Epoch [956/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.73it/s, loss=1.11]


Epoch [956/5000]: Train loss: 0.9686, Valid loss: 1.0034


Epoch [957/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.29it/s, loss=1.25]


Epoch [957/5000]: Train loss: 0.9811, Valid loss: 0.9544


Epoch [958/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.01it/s, loss=0.803]


Epoch [958/5000]: Train loss: 0.9544, Valid loss: 1.0097


Epoch [959/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.42it/s, loss=0.803]


Epoch [959/5000]: Train loss: 0.9531, Valid loss: 0.9469


Epoch [960/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.92it/s, loss=0.912]


Epoch [960/5000]: Train loss: 0.9627, Valid loss: 0.9494


Epoch [961/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.52it/s, loss=1.1]


Epoch [961/5000]: Train loss: 0.9773, Valid loss: 0.9433


Epoch [962/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.35it/s, loss=0.916]


Epoch [962/5000]: Train loss: 0.9702, Valid loss: 0.9518


Epoch [963/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.79it/s, loss=0.89]


Epoch [963/5000]: Train loss: 0.9527, Valid loss: 0.9632


Epoch [964/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.65it/s, loss=1]


Epoch [964/5000]: Train loss: 0.9584, Valid loss: 0.9483


Epoch [965/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.15it/s, loss=0.881]


Epoch [965/5000]: Train loss: 0.9592, Valid loss: 0.9205


Epoch [966/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.15it/s, loss=0.839]


Epoch [966/5000]: Train loss: 0.9698, Valid loss: 1.0057


Epoch [967/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.49it/s, loss=0.855]


Epoch [967/5000]: Train loss: 1.0097, Valid loss: 1.0573


Epoch [968/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.30it/s, loss=1.17]


Epoch [968/5000]: Train loss: 1.0087, Valid loss: 0.9832


Epoch [969/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.84it/s, loss=0.781]


Epoch [969/5000]: Train loss: 0.9711, Valid loss: 0.9511


Epoch [970/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.41it/s, loss=0.872]


Epoch [970/5000]: Train loss: 0.9760, Valid loss: 0.9622


Epoch [971/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.63it/s, loss=0.678]


Epoch [971/5000]: Train loss: 0.9469, Valid loss: 0.9870


Epoch [972/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.37it/s, loss=0.808]


Epoch [972/5000]: Train loss: 0.9718, Valid loss: 0.9700


Epoch [973/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.92it/s, loss=1.04]


Epoch [973/5000]: Train loss: 0.9619, Valid loss: 0.9911


Epoch [974/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.97it/s, loss=1.1]


Epoch [974/5000]: Train loss: 0.9667, Valid loss: 0.9455


Epoch [975/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.30it/s, loss=1.02]


Epoch [975/5000]: Train loss: 0.9856, Valid loss: 1.0027


Epoch [976/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.11it/s, loss=0.919]


Epoch [976/5000]: Train loss: 0.9983, Valid loss: 0.9794


Epoch [977/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.96it/s, loss=0.651]


Epoch [977/5000]: Train loss: 0.9513, Valid loss: 1.0109


Epoch [978/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.95it/s, loss=0.76]


Epoch [978/5000]: Train loss: 0.9490, Valid loss: 1.0318


Epoch [979/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.07it/s, loss=0.905]


Epoch [979/5000]: Train loss: 0.9602, Valid loss: 0.9477


Epoch [980/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.12it/s, loss=1.04]


Epoch [980/5000]: Train loss: 0.9659, Valid loss: 1.0263


Epoch [981/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.64it/s, loss=0.861]


Epoch [981/5000]: Train loss: 0.9773, Valid loss: 0.9592


Epoch [982/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.57it/s, loss=0.98]


Epoch [982/5000]: Train loss: 0.9571, Valid loss: 0.9831


Epoch [983/5000]: 100%|██████████| 10/10 [00:00<00:00, 81.88it/s, loss=0.854]


Epoch [983/5000]: Train loss: 0.9548, Valid loss: 0.9541


Epoch [984/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.68it/s, loss=1.13]


Epoch [984/5000]: Train loss: 0.9651, Valid loss: 0.9157


Epoch [985/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.06it/s, loss=1.08]


Epoch [985/5000]: Train loss: 0.9603, Valid loss: 0.9298


Epoch [986/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.50it/s, loss=1.18]


Epoch [986/5000]: Train loss: 0.9667, Valid loss: 1.0148


Epoch [987/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.47it/s, loss=0.739]


Epoch [987/5000]: Train loss: 0.9392, Valid loss: 0.9721


Epoch [988/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.45it/s, loss=0.728]


Epoch [988/5000]: Train loss: 0.9415, Valid loss: 0.9757


Epoch [989/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.06it/s, loss=0.671]


Epoch [989/5000]: Train loss: 0.9407, Valid loss: 0.9474


Epoch [990/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.44it/s, loss=0.953]


Epoch [990/5000]: Train loss: 0.9617, Valid loss: 1.0068


Epoch [991/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.08it/s, loss=1.27]


Epoch [991/5000]: Train loss: 0.9749, Valid loss: 0.9065


Epoch [992/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.74it/s, loss=0.893]


Epoch [992/5000]: Train loss: 0.9505, Valid loss: 1.0587


Epoch [993/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.07it/s, loss=0.864]


Epoch [993/5000]: Train loss: 0.9457, Valid loss: 0.9671


Epoch [994/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.81it/s, loss=0.647]


Epoch [994/5000]: Train loss: 0.9396, Valid loss: 0.9994


Epoch [995/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.50it/s, loss=0.956]


Epoch [995/5000]: Train loss: 0.9490, Valid loss: 0.9580


Epoch [996/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.59it/s, loss=0.725]


Epoch [996/5000]: Train loss: 0.9405, Valid loss: 0.9285


Epoch [997/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.96it/s, loss=0.676]


Epoch [997/5000]: Train loss: 0.9400, Valid loss: 0.9123


Epoch [998/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.55it/s, loss=0.847]


Epoch [998/5000]: Train loss: 0.9409, Valid loss: 0.9446


Epoch [999/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.41it/s, loss=0.802]


Epoch [999/5000]: Train loss: 0.9410, Valid loss: 0.9579


Epoch [1000/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.73it/s, loss=0.853]


Epoch [1000/5000]: Train loss: 0.9450, Valid loss: 0.9596


Epoch [1001/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.43it/s, loss=0.981]


Epoch [1001/5000]: Train loss: 0.9640, Valid loss: 0.9751


Epoch [1002/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.24it/s, loss=1.06]


Epoch [1002/5000]: Train loss: 0.9617, Valid loss: 0.9949


Epoch [1003/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.26it/s, loss=0.958]


Epoch [1003/5000]: Train loss: 0.9659, Valid loss: 0.9361


Epoch [1004/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.50it/s, loss=0.969]


Epoch [1004/5000]: Train loss: 0.9504, Valid loss: 0.9388


Epoch [1005/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.13it/s, loss=0.805]


Epoch [1005/5000]: Train loss: 0.9497, Valid loss: 1.0188


Epoch [1006/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.05it/s, loss=0.873]


Epoch [1006/5000]: Train loss: 0.9648, Valid loss: 0.8826
Saving model with loss 0.883...


Epoch [1007/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.46it/s, loss=0.781]


Epoch [1007/5000]: Train loss: 0.9590, Valid loss: 0.9532


Epoch [1008/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=1.43]


Epoch [1008/5000]: Train loss: 0.9994, Valid loss: 0.9462


Epoch [1009/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.30it/s, loss=1.35]


Epoch [1009/5000]: Train loss: 0.9853, Valid loss: 0.9243


Epoch [1010/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.91it/s, loss=0.894]


Epoch [1010/5000]: Train loss: 0.9523, Valid loss: 0.9331


Epoch [1011/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.41it/s, loss=0.902]


Epoch [1011/5000]: Train loss: 0.9475, Valid loss: 0.9709


Epoch [1012/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.05it/s, loss=0.959]


Epoch [1012/5000]: Train loss: 0.9607, Valid loss: 0.9213


Epoch [1013/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.74it/s, loss=1.4]


Epoch [1013/5000]: Train loss: 0.9782, Valid loss: 0.9408


Epoch [1014/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.58it/s, loss=0.848]


Epoch [1014/5000]: Train loss: 0.9445, Valid loss: 0.9475


Epoch [1015/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.04it/s, loss=0.828]


Epoch [1015/5000]: Train loss: 0.9407, Valid loss: 0.9081


Epoch [1016/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.01it/s, loss=0.934]


Epoch [1016/5000]: Train loss: 0.9461, Valid loss: 0.9606


Epoch [1017/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.93it/s, loss=0.958]


Epoch [1017/5000]: Train loss: 0.9525, Valid loss: 0.9429


Epoch [1018/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.80it/s, loss=0.882]


Epoch [1018/5000]: Train loss: 0.9451, Valid loss: 0.9433


Epoch [1019/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.93it/s, loss=0.779]


Epoch [1019/5000]: Train loss: 0.9363, Valid loss: 0.9715


Epoch [1020/5000]: 100%|██████████| 10/10 [00:00<00:00, 276.47it/s, loss=0.737]


Epoch [1020/5000]: Train loss: 0.9413, Valid loss: 0.9493


Epoch [1021/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.80it/s, loss=0.846]


Epoch [1021/5000]: Train loss: 0.9411, Valid loss: 0.9274


Epoch [1022/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.71it/s, loss=1.06]


Epoch [1022/5000]: Train loss: 0.9544, Valid loss: 0.9896


Epoch [1023/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.73it/s, loss=1.02]


Epoch [1023/5000]: Train loss: 0.9522, Valid loss: 0.9519


Epoch [1024/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.33it/s, loss=0.994]


Epoch [1024/5000]: Train loss: 0.9476, Valid loss: 0.9051


Epoch [1025/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.32it/s, loss=1.08]


Epoch [1025/5000]: Train loss: 0.9623, Valid loss: 0.9837


Epoch [1026/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.64it/s, loss=0.803]


Epoch [1026/5000]: Train loss: 0.9444, Valid loss: 0.9847


Epoch [1027/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.60it/s, loss=0.741]


Epoch [1027/5000]: Train loss: 0.9439, Valid loss: 0.9805


Epoch [1028/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.97it/s, loss=0.961]


Epoch [1028/5000]: Train loss: 0.9521, Valid loss: 0.9657


Epoch [1029/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.63it/s, loss=0.784]


Epoch [1029/5000]: Train loss: 0.9379, Valid loss: 1.0041


Epoch [1030/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.37it/s, loss=0.756]


Epoch [1030/5000]: Train loss: 0.9442, Valid loss: 0.9285


Epoch [1031/5000]: 100%|██████████| 10/10 [00:00<00:00, 254.77it/s, loss=0.908]


Epoch [1031/5000]: Train loss: 0.9495, Valid loss: 0.8935


Epoch [1032/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.03it/s, loss=0.796]


Epoch [1032/5000]: Train loss: 0.9381, Valid loss: 0.9629


Epoch [1033/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.58it/s, loss=1.11]


Epoch [1033/5000]: Train loss: 0.9644, Valid loss: 0.8943


Epoch [1034/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.40it/s, loss=1.25]


Epoch [1034/5000]: Train loss: 0.9671, Valid loss: 1.0203


Epoch [1035/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.18it/s, loss=0.7]


Epoch [1035/5000]: Train loss: 0.9379, Valid loss: 0.9329


Epoch [1036/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.60it/s, loss=0.964]


Epoch [1036/5000]: Train loss: 0.9465, Valid loss: 0.9463


Epoch [1037/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.81it/s, loss=0.924]


Epoch [1037/5000]: Train loss: 0.9424, Valid loss: 0.8945


Epoch [1038/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.19it/s, loss=0.8]


Epoch [1038/5000]: Train loss: 0.9343, Valid loss: 0.9669


Epoch [1039/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.14it/s, loss=1.21]


Epoch [1039/5000]: Train loss: 0.9693, Valid loss: 0.9859


Epoch [1040/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.59it/s, loss=1.19]


Epoch [1040/5000]: Train loss: 0.9754, Valid loss: 0.9737


Epoch [1041/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.98it/s, loss=1.06]


Epoch [1041/5000]: Train loss: 0.9637, Valid loss: 0.9238


Epoch [1042/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.41it/s, loss=0.97]


Epoch [1042/5000]: Train loss: 0.9539, Valid loss: 0.9382


Epoch [1043/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.04it/s, loss=0.931]


Epoch [1043/5000]: Train loss: 0.9520, Valid loss: 0.9579


Epoch [1044/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.52it/s, loss=0.699]


Epoch [1044/5000]: Train loss: 0.9350, Valid loss: 0.9258


Epoch [1045/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.87it/s, loss=0.878]


Epoch [1045/5000]: Train loss: 0.9365, Valid loss: 0.9651


Epoch [1046/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=1.03]


Epoch [1046/5000]: Train loss: 0.9553, Valid loss: 0.9713


Epoch [1047/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.07it/s, loss=1.08]


Epoch [1047/5000]: Train loss: 0.9536, Valid loss: 0.8645
Saving model with loss 0.864...


Epoch [1048/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.07it/s, loss=0.943]


Epoch [1048/5000]: Train loss: 0.9429, Valid loss: 0.8935


Epoch [1049/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.41it/s, loss=1.28]


Epoch [1049/5000]: Train loss: 0.9730, Valid loss: 0.9301


Epoch [1050/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.37it/s, loss=0.704]


Epoch [1050/5000]: Train loss: 0.9348, Valid loss: 0.9698


Epoch [1051/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.46it/s, loss=1.16]


Epoch [1051/5000]: Train loss: 0.9649, Valid loss: 0.9607


Epoch [1052/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.56it/s, loss=0.769]


Epoch [1052/5000]: Train loss: 0.9371, Valid loss: 0.9511


Epoch [1053/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.63it/s, loss=1.08]


Epoch [1053/5000]: Train loss: 0.9633, Valid loss: 0.9018


Epoch [1054/5000]: 100%|██████████| 10/10 [00:00<00:00, 308.68it/s, loss=0.873]


Epoch [1054/5000]: Train loss: 0.9542, Valid loss: 0.9898


Epoch [1055/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.36it/s, loss=0.762]


Epoch [1055/5000]: Train loss: 0.9353, Valid loss: 0.9368


Epoch [1056/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.11it/s, loss=0.847]


Epoch [1056/5000]: Train loss: 0.9387, Valid loss: 0.9200


Epoch [1057/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.07it/s, loss=0.999]


Epoch [1057/5000]: Train loss: 0.9427, Valid loss: 0.9037


Epoch [1058/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.33it/s, loss=1.37]


Epoch [1058/5000]: Train loss: 0.9720, Valid loss: 0.9663


Epoch [1059/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.97it/s, loss=0.963]


Epoch [1059/5000]: Train loss: 0.9477, Valid loss: 0.9246


Epoch [1060/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.63it/s, loss=0.985]


Epoch [1060/5000]: Train loss: 0.9504, Valid loss: 0.8858


Epoch [1061/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.37it/s, loss=0.716]


Epoch [1061/5000]: Train loss: 0.9295, Valid loss: 0.9705


Epoch [1062/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.75it/s, loss=0.691]


Epoch [1062/5000]: Train loss: 0.9313, Valid loss: 0.9687


Epoch [1063/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.70it/s, loss=1.13]


Epoch [1063/5000]: Train loss: 0.9519, Valid loss: 0.9260


Epoch [1064/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.73it/s, loss=1.43]


Epoch [1064/5000]: Train loss: 0.9888, Valid loss: 0.9856


Epoch [1065/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.63it/s, loss=0.815]


Epoch [1065/5000]: Train loss: 0.9433, Valid loss: 0.9240


Epoch [1066/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=0.836]


Epoch [1066/5000]: Train loss: 0.9341, Valid loss: 0.8749


Epoch [1067/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.34it/s, loss=1.12]


Epoch [1067/5000]: Train loss: 0.9502, Valid loss: 0.9419


Epoch [1068/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.38it/s, loss=0.735]


Epoch [1068/5000]: Train loss: 0.9300, Valid loss: 0.9283


Epoch [1069/5000]: 100%|██████████| 10/10 [00:00<00:00, 63.59it/s, loss=0.792]


Epoch [1069/5000]: Train loss: 0.9308, Valid loss: 0.9966


Epoch [1070/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.17it/s, loss=0.761]


Epoch [1070/5000]: Train loss: 0.9406, Valid loss: 0.9063


Epoch [1071/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.40it/s, loss=1.19]


Epoch [1071/5000]: Train loss: 0.9597, Valid loss: 0.9618


Epoch [1072/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.79it/s, loss=0.884]


Epoch [1072/5000]: Train loss: 0.9436, Valid loss: 0.9592


Epoch [1073/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.74it/s, loss=1.27]


Epoch [1073/5000]: Train loss: 0.9613, Valid loss: 0.9376


Epoch [1074/5000]: 100%|██████████| 10/10 [00:00<00:00, 269.25it/s, loss=0.98]


Epoch [1074/5000]: Train loss: 0.9518, Valid loss: 0.9636


Epoch [1075/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.54it/s, loss=1.02]


Epoch [1075/5000]: Train loss: 0.9452, Valid loss: 0.9385


Epoch [1076/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.75it/s, loss=1.14]


Epoch [1076/5000]: Train loss: 0.9511, Valid loss: 0.9252


Epoch [1077/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.19it/s, loss=0.837]


Epoch [1077/5000]: Train loss: 0.9676, Valid loss: 0.9519


Epoch [1078/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.63it/s, loss=0.749]


Epoch [1078/5000]: Train loss: 0.9669, Valid loss: 0.9458


Epoch [1079/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.58it/s, loss=0.613]


Epoch [1079/5000]: Train loss: 0.9189, Valid loss: 0.9426


Epoch [1080/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.46it/s, loss=1.01]


Epoch [1080/5000]: Train loss: 0.9445, Valid loss: 0.8858


Epoch [1081/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.89it/s, loss=0.714]


Epoch [1081/5000]: Train loss: 0.9262, Valid loss: 0.9941


Epoch [1082/5000]: 100%|██████████| 10/10 [00:00<00:00, 205.69it/s, loss=0.67]


Epoch [1082/5000]: Train loss: 0.9311, Valid loss: 0.9411


Epoch [1083/5000]: 100%|██████████| 10/10 [00:00<00:00, 200.62it/s, loss=0.872]


Epoch [1083/5000]: Train loss: 0.9392, Valid loss: 0.8883


Epoch [1084/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.67it/s, loss=0.972]


Epoch [1084/5000]: Train loss: 0.9429, Valid loss: 0.9531


Epoch [1085/5000]: 100%|██████████| 10/10 [00:00<00:00, 217.69it/s, loss=1.29]


Epoch [1085/5000]: Train loss: 0.9694, Valid loss: 0.9350


Epoch [1086/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.27it/s, loss=0.732]


Epoch [1086/5000]: Train loss: 0.9423, Valid loss: 0.9639


Epoch [1087/5000]: 100%|██████████| 10/10 [00:00<00:00, 236.20it/s, loss=1.26]


Epoch [1087/5000]: Train loss: 0.9663, Valid loss: 0.9458


Epoch [1088/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.97it/s, loss=0.844]


Epoch [1088/5000]: Train loss: 0.9333, Valid loss: 0.9009


Epoch [1089/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.25it/s, loss=1.05]


Epoch [1089/5000]: Train loss: 0.9460, Valid loss: 0.9276


Epoch [1090/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.63it/s, loss=1.07]


Epoch [1090/5000]: Train loss: 0.9497, Valid loss: 0.8982


Epoch [1091/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.27it/s, loss=0.807]


Epoch [1091/5000]: Train loss: 0.9282, Valid loss: 0.9179


Epoch [1092/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.67it/s, loss=0.682]


Epoch [1092/5000]: Train loss: 0.9214, Valid loss: 0.8882


Epoch [1093/5000]: 100%|██████████| 10/10 [00:00<00:00, 202.02it/s, loss=0.945]


Epoch [1093/5000]: Train loss: 0.9387, Valid loss: 0.9054


Epoch [1094/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.53it/s, loss=0.777]


Epoch [1094/5000]: Train loss: 0.9311, Valid loss: 0.9138


Epoch [1095/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.61it/s, loss=1.21]


Epoch [1095/5000]: Train loss: 0.9565, Valid loss: 0.9907


Epoch [1096/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.85it/s, loss=0.885]


Epoch [1096/5000]: Train loss: 0.9361, Valid loss: 0.9951


Epoch [1097/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.14it/s, loss=0.802]


Epoch [1097/5000]: Train loss: 0.9253, Valid loss: 1.0005


Epoch [1098/5000]: 100%|██████████| 10/10 [00:00<00:00, 241.90it/s, loss=0.845]


Epoch [1098/5000]: Train loss: 0.9308, Valid loss: 0.9089


Epoch [1099/5000]: 100%|██████████| 10/10 [00:00<00:00, 209.85it/s, loss=0.78]


Epoch [1099/5000]: Train loss: 0.9285, Valid loss: 0.9856


Epoch [1100/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.75it/s, loss=1.24]


Epoch [1100/5000]: Train loss: 0.9619, Valid loss: 0.9696


Epoch [1101/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.11it/s, loss=0.945]


Epoch [1101/5000]: Train loss: 0.9448, Valid loss: 0.9160


Epoch [1102/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.68it/s, loss=0.817]


Epoch [1102/5000]: Train loss: 0.9322, Valid loss: 0.9678


Epoch [1103/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.73it/s, loss=0.856]


Epoch [1103/5000]: Train loss: 0.9341, Valid loss: 0.9854


Epoch [1104/5000]: 100%|██████████| 10/10 [00:00<00:00, 210.45it/s, loss=1]


Epoch [1104/5000]: Train loss: 0.9579, Valid loss: 0.9341


Epoch [1105/5000]: 100%|██████████| 10/10 [00:00<00:00, 152.92it/s, loss=0.999]


Epoch [1105/5000]: Train loss: 0.9450, Valid loss: 0.8745


Epoch [1106/5000]: 100%|██████████| 10/10 [00:00<00:00, 248.97it/s, loss=0.653]


Epoch [1106/5000]: Train loss: 0.9258, Valid loss: 0.9424


Epoch [1107/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.06it/s, loss=0.902]


Epoch [1107/5000]: Train loss: 0.9428, Valid loss: 0.9257


Epoch [1108/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.82it/s, loss=1.2]


Epoch [1108/5000]: Train loss: 0.9642, Valid loss: 0.9331


Epoch [1109/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.33it/s, loss=0.855]


Epoch [1109/5000]: Train loss: 0.9280, Valid loss: 0.9220


Epoch [1110/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.80it/s, loss=1.23]


Epoch [1110/5000]: Train loss: 0.9577, Valid loss: 0.9105


Epoch [1111/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.08it/s, loss=0.761]


Epoch [1111/5000]: Train loss: 0.9259, Valid loss: 0.8732


Epoch [1112/5000]: 100%|██████████| 10/10 [00:00<00:00, 261.53it/s, loss=0.942]


Epoch [1112/5000]: Train loss: 0.9363, Valid loss: 0.9725


Epoch [1113/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.54it/s, loss=0.992]


Epoch [1113/5000]: Train loss: 0.9364, Valid loss: 0.9284


Epoch [1114/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.15it/s, loss=0.91]


Epoch [1114/5000]: Train loss: 0.9379, Valid loss: 0.9284


Epoch [1115/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.42it/s, loss=1.03]


Epoch [1115/5000]: Train loss: 0.9371, Valid loss: 0.9206


Epoch [1116/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.11it/s, loss=0.875]


Epoch [1116/5000]: Train loss: 0.9291, Valid loss: 0.9520


Epoch [1117/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.99it/s, loss=0.802]


Epoch [1117/5000]: Train loss: 0.9331, Valid loss: 0.9061


Epoch [1118/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.47it/s, loss=1.1]


Epoch [1118/5000]: Train loss: 0.9420, Valid loss: 0.9321


Epoch [1119/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.96it/s, loss=0.774]


Epoch [1119/5000]: Train loss: 0.9315, Valid loss: 0.8805


Epoch [1120/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.26it/s, loss=0.946]


Epoch [1120/5000]: Train loss: 0.9334, Valid loss: 0.9143


Epoch [1121/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.38it/s, loss=0.874]


Epoch [1121/5000]: Train loss: 0.9328, Valid loss: 0.9314


Epoch [1122/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.72it/s, loss=0.789]


Epoch [1122/5000]: Train loss: 0.9247, Valid loss: 0.9144


Epoch [1123/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.00it/s, loss=0.905]


Epoch [1123/5000]: Train loss: 0.9331, Valid loss: 0.9419


Epoch [1124/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.10it/s, loss=1.02]


Epoch [1124/5000]: Train loss: 0.9510, Valid loss: 0.9335


Epoch [1125/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.56it/s, loss=1.14]


Epoch [1125/5000]: Train loss: 0.9769, Valid loss: 0.9552


Epoch [1126/5000]: 100%|██████████| 10/10 [00:00<00:00, 260.54it/s, loss=0.805]


Epoch [1126/5000]: Train loss: 0.9365, Valid loss: 0.8978


Epoch [1127/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.89it/s, loss=1.24]


Epoch [1127/5000]: Train loss: 0.9560, Valid loss: 0.8770


Epoch [1128/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.56it/s, loss=1.16]


Epoch [1128/5000]: Train loss: 0.9521, Valid loss: 0.8935


Epoch [1129/5000]: 100%|██████████| 10/10 [00:00<00:00, 246.78it/s, loss=0.867]


Epoch [1129/5000]: Train loss: 0.9316, Valid loss: 0.8594
Saving model with loss 0.859...


Epoch [1130/5000]: 100%|██████████| 10/10 [00:00<00:00, 237.85it/s, loss=1.08]


Epoch [1130/5000]: Train loss: 0.9411, Valid loss: 0.9090


Epoch [1131/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.81it/s, loss=0.796]


Epoch [1131/5000]: Train loss: 0.9236, Valid loss: 0.9616


Epoch [1132/5000]: 100%|██████████| 10/10 [00:00<00:00, 240.46it/s, loss=0.924]


Epoch [1132/5000]: Train loss: 0.9377, Valid loss: 1.0112


Epoch [1133/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.36it/s, loss=0.828]


Epoch [1133/5000]: Train loss: 0.9228, Valid loss: 0.8564
Saving model with loss 0.856...


Epoch [1134/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.26it/s, loss=0.719]


Epoch [1134/5000]: Train loss: 0.9169, Valid loss: 0.9070


Epoch [1135/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.66it/s, loss=1.21]


Epoch [1135/5000]: Train loss: 0.9452, Valid loss: 0.9153


Epoch [1136/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.36it/s, loss=0.773]


Epoch [1136/5000]: Train loss: 0.9240, Valid loss: 0.9146


Epoch [1137/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.85it/s, loss=0.95]


Epoch [1137/5000]: Train loss: 0.9314, Valid loss: 0.8783


Epoch [1138/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.11it/s, loss=0.919]


Epoch [1138/5000]: Train loss: 0.9335, Valid loss: 0.9401


Epoch [1139/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.26it/s, loss=0.844]


Epoch [1139/5000]: Train loss: 0.9445, Valid loss: 0.9913


Epoch [1140/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.34it/s, loss=1.09]


Epoch [1140/5000]: Train loss: 0.9699, Valid loss: 0.8994


Epoch [1141/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.79it/s, loss=0.935]


Epoch [1141/5000]: Train loss: 0.9413, Valid loss: 0.9195


Epoch [1142/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.71it/s, loss=0.85]


Epoch [1142/5000]: Train loss: 0.9516, Valid loss: 0.8907


Epoch [1143/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.67it/s, loss=0.927]


Epoch [1143/5000]: Train loss: 0.9373, Valid loss: 0.9384


Epoch [1144/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.65it/s, loss=1.2]


Epoch [1144/5000]: Train loss: 0.9496, Valid loss: 0.9678


Epoch [1145/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.66it/s, loss=1.1]


Epoch [1145/5000]: Train loss: 0.9494, Valid loss: 0.8541
Saving model with loss 0.854...


Epoch [1146/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.13it/s, loss=0.757]


Epoch [1146/5000]: Train loss: 0.9275, Valid loss: 0.9111


Epoch [1147/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.60it/s, loss=1.04]


Epoch [1147/5000]: Train loss: 0.9380, Valid loss: 0.9249


Epoch [1148/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.94it/s, loss=1.24]


Epoch [1148/5000]: Train loss: 0.9490, Valid loss: 0.8901


Epoch [1149/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.90it/s, loss=0.773]


Epoch [1149/5000]: Train loss: 0.9206, Valid loss: 0.9396


Epoch [1150/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.49it/s, loss=1.11]


Epoch [1150/5000]: Train loss: 0.9433, Valid loss: 0.8959


Epoch [1151/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.81it/s, loss=0.955]


Epoch [1151/5000]: Train loss: 0.9481, Valid loss: 0.9341


Epoch [1152/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.74it/s, loss=1.23]


Epoch [1152/5000]: Train loss: 0.9710, Valid loss: 0.9875


Epoch [1153/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.37it/s, loss=1.58]


Epoch [1153/5000]: Train loss: 0.9825, Valid loss: 0.9159


Epoch [1154/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.08it/s, loss=0.74]


Epoch [1154/5000]: Train loss: 0.9258, Valid loss: 0.9004


Epoch [1155/5000]: 100%|██████████| 10/10 [00:00<00:00, 78.52it/s, loss=0.829]


Epoch [1155/5000]: Train loss: 0.9227, Valid loss: 0.9870


Epoch [1156/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.51it/s, loss=0.79]


Epoch [1156/5000]: Train loss: 0.9256, Valid loss: 0.9133


Epoch [1157/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.75it/s, loss=0.823]


Epoch [1157/5000]: Train loss: 0.9271, Valid loss: 0.9055


Epoch [1158/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.71it/s, loss=1.1]


Epoch [1158/5000]: Train loss: 0.9429, Valid loss: 0.9374


Epoch [1159/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.25it/s, loss=0.855]


Epoch [1159/5000]: Train loss: 0.9244, Valid loss: 0.8813


Epoch [1160/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.12it/s, loss=0.938]


Epoch [1160/5000]: Train loss: 0.9249, Valid loss: 0.9745


Epoch [1161/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.07it/s, loss=1.06]


Epoch [1161/5000]: Train loss: 0.9380, Valid loss: 0.8849


Epoch [1162/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.52it/s, loss=0.791]


Epoch [1162/5000]: Train loss: 0.9244, Valid loss: 0.9296


Epoch [1163/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.96it/s, loss=0.845]


Epoch [1163/5000]: Train loss: 0.9235, Valid loss: 0.9285


Epoch [1164/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.23it/s, loss=0.927]


Epoch [1164/5000]: Train loss: 0.9282, Valid loss: 0.9202


Epoch [1165/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.97it/s, loss=0.843]


Epoch [1165/5000]: Train loss: 0.9215, Valid loss: 0.8964


Epoch [1166/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.65it/s, loss=0.994]


Epoch [1166/5000]: Train loss: 0.9344, Valid loss: 0.9402


Epoch [1167/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.87it/s, loss=0.808]


Epoch [1167/5000]: Train loss: 0.9198, Valid loss: 0.9224


Epoch [1168/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.98it/s, loss=0.884]


Epoch [1168/5000]: Train loss: 0.9356, Valid loss: 0.9669


Epoch [1169/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.24it/s, loss=0.761]


Epoch [1169/5000]: Train loss: 0.9562, Valid loss: 0.9502


Epoch [1170/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.41it/s, loss=0.865]


Epoch [1170/5000]: Train loss: 0.9432, Valid loss: 0.9502


Epoch [1171/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.78it/s, loss=0.679]


Epoch [1171/5000]: Train loss: 0.9112, Valid loss: 0.9000


Epoch [1172/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.68it/s, loss=0.918]


Epoch [1172/5000]: Train loss: 0.9303, Valid loss: 0.9512


Epoch [1173/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.71it/s, loss=0.96]


Epoch [1173/5000]: Train loss: 0.9298, Valid loss: 0.9016


Epoch [1174/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.98it/s, loss=0.93]


Epoch [1174/5000]: Train loss: 0.9246, Valid loss: 0.9009


Epoch [1175/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.09it/s, loss=0.938]


Epoch [1175/5000]: Train loss: 0.9452, Valid loss: 0.9730


Epoch [1176/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.88it/s, loss=0.722]


Epoch [1176/5000]: Train loss: 0.9496, Valid loss: 0.9322


Epoch [1177/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.21it/s, loss=1.11]


Epoch [1177/5000]: Train loss: 0.9495, Valid loss: 0.8544


Epoch [1178/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.11it/s, loss=0.907]


Epoch [1178/5000]: Train loss: 0.9288, Valid loss: 0.9174


Epoch [1179/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.72it/s, loss=0.671]


Epoch [1179/5000]: Train loss: 0.9144, Valid loss: 0.9306


Epoch [1180/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.99it/s, loss=0.95]


Epoch [1180/5000]: Train loss: 0.9300, Valid loss: 0.9727


Epoch [1181/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.60it/s, loss=0.939]


Epoch [1181/5000]: Train loss: 0.9381, Valid loss: 0.9460


Epoch [1182/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.70it/s, loss=1.13]


Epoch [1182/5000]: Train loss: 0.9474, Valid loss: 0.9387


Epoch [1183/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.91it/s, loss=1.06]


Epoch [1183/5000]: Train loss: 0.9398, Valid loss: 0.9994


Epoch [1184/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.97it/s, loss=1.08]


Epoch [1184/5000]: Train loss: 0.9347, Valid loss: 0.9462


Epoch [1185/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.01it/s, loss=0.816]


Epoch [1185/5000]: Train loss: 0.9330, Valid loss: 0.9385


Epoch [1186/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.90it/s, loss=1.01]


Epoch [1186/5000]: Train loss: 0.9435, Valid loss: 0.9344


Epoch [1187/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.68it/s, loss=0.783]


Epoch [1187/5000]: Train loss: 0.9343, Valid loss: 0.9361


Epoch [1188/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.67it/s, loss=1.04]


Epoch [1188/5000]: Train loss: 0.9390, Valid loss: 0.8943


Epoch [1189/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.56it/s, loss=0.846]


Epoch [1189/5000]: Train loss: 0.9264, Valid loss: 0.8634


Epoch [1190/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.09it/s, loss=0.618]


Epoch [1190/5000]: Train loss: 0.9086, Valid loss: 0.9044


Epoch [1191/5000]: 100%|██████████| 10/10 [00:00<00:00, 302.97it/s, loss=0.722]


Epoch [1191/5000]: Train loss: 0.9139, Valid loss: 0.9257


Epoch [1192/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.19it/s, loss=0.828]


Epoch [1192/5000]: Train loss: 0.9221, Valid loss: 0.9521


Epoch [1193/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.74it/s, loss=0.785]


Epoch [1193/5000]: Train loss: 0.9176, Valid loss: 0.9073


Epoch [1194/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.50it/s, loss=0.807]


Epoch [1194/5000]: Train loss: 0.9172, Valid loss: 0.9255


Epoch [1195/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.49it/s, loss=0.97]


Epoch [1195/5000]: Train loss: 0.9355, Valid loss: 0.8795


Epoch [1196/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.65it/s, loss=1.14]


Epoch [1196/5000]: Train loss: 0.9403, Valid loss: 0.8765


Epoch [1197/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.77it/s, loss=0.868]


Epoch [1197/5000]: Train loss: 0.9224, Valid loss: 0.9983


Epoch [1198/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.60it/s, loss=0.798]


Epoch [1198/5000]: Train loss: 0.9214, Valid loss: 0.8904


Epoch [1199/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.68it/s, loss=1.06]


Epoch [1199/5000]: Train loss: 0.9401, Valid loss: 0.9361


Epoch [1200/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.53it/s, loss=0.831]


Epoch [1200/5000]: Train loss: 0.9263, Valid loss: 0.9157


Epoch [1201/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=0.866]


Epoch [1201/5000]: Train loss: 0.9183, Valid loss: 0.9065


Epoch [1202/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.41it/s, loss=0.755]


Epoch [1202/5000]: Train loss: 0.9132, Valid loss: 0.9284


Epoch [1203/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.47it/s, loss=0.954]


Epoch [1203/5000]: Train loss: 0.9250, Valid loss: 0.9424


Epoch [1204/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.31it/s, loss=0.9]


Epoch [1204/5000]: Train loss: 0.9235, Valid loss: 0.8785


Epoch [1205/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.89it/s, loss=1.44]


Epoch [1205/5000]: Train loss: 0.9576, Valid loss: 0.8842


Epoch [1206/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.29it/s, loss=0.978]


Epoch [1206/5000]: Train loss: 0.9355, Valid loss: 0.8938


Epoch [1207/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.24it/s, loss=0.929]


Epoch [1207/5000]: Train loss: 0.9343, Valid loss: 0.9847


Epoch [1208/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.00it/s, loss=0.658]


Epoch [1208/5000]: Train loss: 0.9076, Valid loss: 0.9587


Epoch [1209/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.71it/s, loss=0.884]


Epoch [1209/5000]: Train loss: 0.9213, Valid loss: 0.9022


Epoch [1210/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.48it/s, loss=0.999]


Epoch [1210/5000]: Train loss: 0.9285, Valid loss: 0.8387
Saving model with loss 0.839...


Epoch [1211/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.82it/s, loss=1.05]


Epoch [1211/5000]: Train loss: 0.9333, Valid loss: 0.9481


Epoch [1212/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=0.874]


Epoch [1212/5000]: Train loss: 0.9191, Valid loss: 0.9800


Epoch [1213/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.91it/s, loss=0.896]


Epoch [1213/5000]: Train loss: 0.9224, Valid loss: 0.9339


Epoch [1214/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=0.924]


Epoch [1214/5000]: Train loss: 0.9222, Valid loss: 0.9094


Epoch [1215/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.77it/s, loss=0.937]


Epoch [1215/5000]: Train loss: 0.9285, Valid loss: 0.8936


Epoch [1216/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.15it/s, loss=0.702]


Epoch [1216/5000]: Train loss: 0.9132, Valid loss: 0.8572


Epoch [1217/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.91it/s, loss=0.632]


Epoch [1217/5000]: Train loss: 0.9043, Valid loss: 0.8938


Epoch [1218/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.70it/s, loss=0.99]


Epoch [1218/5000]: Train loss: 0.9261, Valid loss: 0.8972


Epoch [1219/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.88it/s, loss=1.16]


Epoch [1219/5000]: Train loss: 0.9396, Valid loss: 0.9150


Epoch [1220/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.92it/s, loss=1.05]


Epoch [1220/5000]: Train loss: 0.9333, Valid loss: 0.8971


Epoch [1221/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.10it/s, loss=0.881]


Epoch [1221/5000]: Train loss: 0.9320, Valid loss: 0.9081


Epoch [1222/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.65it/s, loss=1.55]


Epoch [1222/5000]: Train loss: 0.9673, Valid loss: 0.9287


Epoch [1223/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=0.942]


Epoch [1223/5000]: Train loss: 0.9276, Valid loss: 0.8926


Epoch [1224/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.15it/s, loss=1.02]


Epoch [1224/5000]: Train loss: 0.9448, Valid loss: 0.8869


Epoch [1225/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.09it/s, loss=0.976]


Epoch [1225/5000]: Train loss: 0.9316, Valid loss: 0.8695


Epoch [1226/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.79it/s, loss=1.02]


Epoch [1226/5000]: Train loss: 0.9277, Valid loss: 0.9411


Epoch [1227/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.84it/s, loss=0.755]


Epoch [1227/5000]: Train loss: 0.9093, Valid loss: 0.9490


Epoch [1228/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.20it/s, loss=0.55]


Epoch [1228/5000]: Train loss: 0.9074, Valid loss: 0.9438


Epoch [1229/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.35it/s, loss=0.941]


Epoch [1229/5000]: Train loss: 0.9273, Valid loss: 0.8732


Epoch [1230/5000]: 100%|██████████| 10/10 [00:00<00:00, 228.45it/s, loss=0.827]


Epoch [1230/5000]: Train loss: 0.9224, Valid loss: 0.9355


Epoch [1231/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.00it/s, loss=0.868]


Epoch [1231/5000]: Train loss: 0.9244, Valid loss: 0.8637


Epoch [1232/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.42it/s, loss=0.748]


Epoch [1232/5000]: Train loss: 0.9160, Valid loss: 0.9006


Epoch [1233/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.34it/s, loss=1.15]


Epoch [1233/5000]: Train loss: 0.9389, Valid loss: 0.9361


Epoch [1234/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.59it/s, loss=1.01]


Epoch [1234/5000]: Train loss: 0.9408, Valid loss: 0.8945


Epoch [1235/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.31it/s, loss=0.825]


Epoch [1235/5000]: Train loss: 0.9161, Valid loss: 0.9401


Epoch [1236/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.31it/s, loss=1.03]


Epoch [1236/5000]: Train loss: 0.9277, Valid loss: 0.8864


Epoch [1237/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.57it/s, loss=0.73]


Epoch [1237/5000]: Train loss: 0.9075, Valid loss: 0.9250


Epoch [1238/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.68it/s, loss=1.02]


Epoch [1238/5000]: Train loss: 0.9248, Valid loss: 0.8825


Epoch [1239/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.48it/s, loss=0.967]


Epoch [1239/5000]: Train loss: 0.9265, Valid loss: 0.8825


Epoch [1240/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.64it/s, loss=1.08]


Epoch [1240/5000]: Train loss: 0.9358, Valid loss: 0.9097


Epoch [1241/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.51it/s, loss=0.851]


Epoch [1241/5000]: Train loss: 0.9186, Valid loss: 0.9073


Epoch [1242/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.00it/s, loss=0.847]


Epoch [1242/5000]: Train loss: 0.9274, Valid loss: 0.9415


Epoch [1243/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.67it/s, loss=0.948]


Epoch [1243/5000]: Train loss: 0.9273, Valid loss: 0.8917


Epoch [1244/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.77it/s, loss=1.04]


Epoch [1244/5000]: Train loss: 0.9335, Valid loss: 0.8717


Epoch [1245/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.79it/s, loss=0.952]


Epoch [1245/5000]: Train loss: 0.9224, Valid loss: 0.9097


Epoch [1246/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.10it/s, loss=0.83]


Epoch [1246/5000]: Train loss: 0.9148, Valid loss: 0.9042


Epoch [1247/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.73it/s, loss=0.66]


Epoch [1247/5000]: Train loss: 0.9062, Valid loss: 0.9181


Epoch [1248/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.03it/s, loss=0.892]


Epoch [1248/5000]: Train loss: 0.9142, Valid loss: 0.8710


Epoch [1249/5000]: 100%|██████████| 10/10 [00:00<00:00, 279.87it/s, loss=0.811]


Epoch [1249/5000]: Train loss: 0.9271, Valid loss: 0.9212


Epoch [1250/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.70it/s, loss=1.02]


Epoch [1250/5000]: Train loss: 0.9581, Valid loss: 0.9215


Epoch [1251/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.82it/s, loss=0.732]


Epoch [1251/5000]: Train loss: 0.9140, Valid loss: 0.8872


Epoch [1252/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.59it/s, loss=0.989]


Epoch [1252/5000]: Train loss: 0.9276, Valid loss: 0.8892


Epoch [1253/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.73it/s, loss=0.609]


Epoch [1253/5000]: Train loss: 0.9148, Valid loss: 0.8882


Epoch [1254/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.11it/s, loss=0.784]


Epoch [1254/5000]: Train loss: 0.9289, Valid loss: 0.9484


Epoch [1255/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.59it/s, loss=0.899]


Epoch [1255/5000]: Train loss: 0.9438, Valid loss: 0.9294


Epoch [1256/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.80it/s, loss=0.933]


Epoch [1256/5000]: Train loss: 0.9409, Valid loss: 0.8900


Epoch [1257/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.11it/s, loss=1.2]


Epoch [1257/5000]: Train loss: 0.9519, Valid loss: 0.8883


Epoch [1258/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.16it/s, loss=0.832]


Epoch [1258/5000]: Train loss: 0.9140, Valid loss: 0.8641


Epoch [1259/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.11it/s, loss=0.82]


Epoch [1259/5000]: Train loss: 0.9182, Valid loss: 0.8282
Saving model with loss 0.828...


Epoch [1260/5000]: 100%|██████████| 10/10 [00:00<00:00, 237.35it/s, loss=1.23]


Epoch [1260/5000]: Train loss: 0.9347, Valid loss: 0.8758


Epoch [1261/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.64it/s, loss=0.839]


Epoch [1261/5000]: Train loss: 0.9287, Valid loss: 0.9055


Epoch [1262/5000]: 100%|██████████| 10/10 [00:00<00:00, 340.99it/s, loss=0.683]


Epoch [1262/5000]: Train loss: 0.9125, Valid loss: 0.8565


Epoch [1263/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.90it/s, loss=0.803]


Epoch [1263/5000]: Train loss: 0.9207, Valid loss: 0.8909


Epoch [1264/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.64it/s, loss=0.945]


Epoch [1264/5000]: Train loss: 0.9294, Valid loss: 0.9952


Epoch [1265/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.38it/s, loss=0.768]


Epoch [1265/5000]: Train loss: 0.9576, Valid loss: 1.0237


Epoch [1266/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.93it/s, loss=1.06]


Epoch [1266/5000]: Train loss: 0.9632, Valid loss: 0.9220


Epoch [1267/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.78it/s, loss=1.25]


Epoch [1267/5000]: Train loss: 0.9430, Valid loss: 0.9006


Epoch [1268/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.72it/s, loss=0.999]


Epoch [1268/5000]: Train loss: 0.9235, Valid loss: 0.9049


Epoch [1269/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.67it/s, loss=0.925]


Epoch [1269/5000]: Train loss: 0.9252, Valid loss: 0.8973


Epoch [1270/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.22it/s, loss=0.704]


Epoch [1270/5000]: Train loss: 0.9097, Valid loss: 0.9052


Epoch [1271/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.57it/s, loss=0.885]


Epoch [1271/5000]: Train loss: 0.9170, Valid loss: 0.8983


Epoch [1272/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=1.08]


Epoch [1272/5000]: Train loss: 0.9248, Valid loss: 0.8960


Epoch [1273/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.86it/s, loss=0.81]


Epoch [1273/5000]: Train loss: 0.9146, Valid loss: 0.9136


Epoch [1274/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.02it/s, loss=0.936]


Epoch [1274/5000]: Train loss: 0.9203, Valid loss: 0.8575


Epoch [1275/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.75it/s, loss=0.853]


Epoch [1275/5000]: Train loss: 0.9248, Valid loss: 0.8696


Epoch [1276/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.18it/s, loss=0.821]


Epoch [1276/5000]: Train loss: 0.9137, Valid loss: 0.9189


Epoch [1277/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.38it/s, loss=1.1]


Epoch [1277/5000]: Train loss: 0.9277, Valid loss: 0.8920


Epoch [1278/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.17it/s, loss=1.1]


Epoch [1278/5000]: Train loss: 0.9315, Valid loss: 0.8641


Epoch [1279/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.53it/s, loss=0.903]


Epoch [1279/5000]: Train loss: 0.9205, Valid loss: 0.8372


Epoch [1280/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.10it/s, loss=0.951]


Epoch [1280/5000]: Train loss: 0.9260, Valid loss: 0.9883


Epoch [1281/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.02it/s, loss=0.865]


Epoch [1281/5000]: Train loss: 0.9151, Valid loss: 0.9171


Epoch [1282/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.16it/s, loss=0.678]


Epoch [1282/5000]: Train loss: 0.9047, Valid loss: 0.9093


Epoch [1283/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.04it/s, loss=0.882]


Epoch [1283/5000]: Train loss: 0.9139, Valid loss: 0.9110


Epoch [1284/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.33it/s, loss=0.899]


Epoch [1284/5000]: Train loss: 0.9255, Valid loss: 0.8885


Epoch [1285/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.35it/s, loss=1.14]


Epoch [1285/5000]: Train loss: 0.9448, Valid loss: 0.8614


Epoch [1286/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.26it/s, loss=0.898]


Epoch [1286/5000]: Train loss: 0.9136, Valid loss: 0.8752


Epoch [1287/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.02it/s, loss=1.01]


Epoch [1287/5000]: Train loss: 0.9226, Valid loss: 0.8694


Epoch [1288/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.46it/s, loss=0.772]


Epoch [1288/5000]: Train loss: 0.9066, Valid loss: 0.9019


Epoch [1289/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.38it/s, loss=0.886]


Epoch [1289/5000]: Train loss: 0.9436, Valid loss: 0.8700


Epoch [1290/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.69it/s, loss=0.775]


Epoch [1290/5000]: Train loss: 0.9198, Valid loss: 0.9285


Epoch [1291/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.78it/s, loss=1.28]


Epoch [1291/5000]: Train loss: 0.9499, Valid loss: 0.8898


Epoch [1292/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.17it/s, loss=0.796]


Epoch [1292/5000]: Train loss: 0.9107, Valid loss: 0.8382


Epoch [1293/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.00it/s, loss=0.978]


Epoch [1293/5000]: Train loss: 0.9301, Valid loss: 0.8828


Epoch [1294/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.62it/s, loss=1.19]


Epoch [1294/5000]: Train loss: 0.9329, Valid loss: 0.8720


Epoch [1295/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.79it/s, loss=0.957]


Epoch [1295/5000]: Train loss: 0.9196, Valid loss: 0.8597


Epoch [1296/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.35it/s, loss=0.828]


Epoch [1296/5000]: Train loss: 0.9128, Valid loss: 0.8587


Epoch [1297/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.93it/s, loss=0.959]


Epoch [1297/5000]: Train loss: 0.9184, Valid loss: 0.9117


Epoch [1298/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.66it/s, loss=1.06]


Epoch [1298/5000]: Train loss: 0.9281, Valid loss: 0.8613


Epoch [1299/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.13it/s, loss=1.42]


Epoch [1299/5000]: Train loss: 0.9504, Valid loss: 0.8413


Epoch [1300/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.48it/s, loss=0.799]


Epoch [1300/5000]: Train loss: 0.9074, Valid loss: 0.9192


Epoch [1301/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.55it/s, loss=0.832]


Epoch [1301/5000]: Train loss: 0.9116, Valid loss: 0.8861


Epoch [1302/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.43it/s, loss=0.63]


Epoch [1302/5000]: Train loss: 0.8975, Valid loss: 0.9103


Epoch [1303/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.28it/s, loss=1.02]


Epoch [1303/5000]: Train loss: 0.9223, Valid loss: 0.8890


Epoch [1304/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.54it/s, loss=0.737]


Epoch [1304/5000]: Train loss: 0.9045, Valid loss: 0.9161


Epoch [1305/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.87it/s, loss=0.993]


Epoch [1305/5000]: Train loss: 0.9208, Valid loss: 0.8877


Epoch [1306/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.33it/s, loss=0.755]


Epoch [1306/5000]: Train loss: 0.9047, Valid loss: 0.8784


Epoch [1307/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.39it/s, loss=0.943]


Epoch [1307/5000]: Train loss: 0.9208, Valid loss: 0.9228


Epoch [1308/5000]: 100%|██████████| 10/10 [00:00<00:00, 319.80it/s, loss=0.991]


Epoch [1308/5000]: Train loss: 0.9212, Valid loss: 0.8872


Epoch [1309/5000]: 100%|██████████| 10/10 [00:00<00:00, 293.88it/s, loss=0.859]


Epoch [1309/5000]: Train loss: 0.9107, Valid loss: 0.8632


Epoch [1310/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.98it/s, loss=1.01]


Epoch [1310/5000]: Train loss: 0.9277, Valid loss: 0.9271


Epoch [1311/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.98it/s, loss=0.688]


Epoch [1311/5000]: Train loss: 0.9088, Valid loss: 0.8959


Epoch [1312/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.32it/s, loss=0.818]


Epoch [1312/5000]: Train loss: 0.9176, Valid loss: 0.8845


Epoch [1313/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.11it/s, loss=0.706]


Epoch [1313/5000]: Train loss: 0.9023, Valid loss: 0.8848


Epoch [1314/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.13it/s, loss=0.902]


Epoch [1314/5000]: Train loss: 0.9200, Valid loss: 0.9115


Epoch [1315/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.09it/s, loss=0.857]


Epoch [1315/5000]: Train loss: 0.9134, Valid loss: 0.8751


Epoch [1316/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.73it/s, loss=0.94]


Epoch [1316/5000]: Train loss: 0.9183, Valid loss: 0.9177


Epoch [1317/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.78it/s, loss=0.927]


Epoch [1317/5000]: Train loss: 0.9156, Valid loss: 0.8978


Epoch [1318/5000]: 100%|██████████| 10/10 [00:00<00:00, 260.03it/s, loss=0.865]


Epoch [1318/5000]: Train loss: 0.9124, Valid loss: 0.9138


Epoch [1319/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.62it/s, loss=0.838]


Epoch [1319/5000]: Train loss: 0.9097, Valid loss: 0.9943


Epoch [1320/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.44it/s, loss=0.948]


Epoch [1320/5000]: Train loss: 0.9413, Valid loss: 0.9681


Epoch [1321/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.06it/s, loss=0.879]


Epoch [1321/5000]: Train loss: 0.9594, Valid loss: 0.9141


Epoch [1322/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.18it/s, loss=0.748]


Epoch [1322/5000]: Train loss: 0.9366, Valid loss: 0.8798


Epoch [1323/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.33it/s, loss=0.987]


Epoch [1323/5000]: Train loss: 0.9227, Valid loss: 0.8623


Epoch [1324/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.14it/s, loss=0.898]


Epoch [1324/5000]: Train loss: 0.9160, Valid loss: 0.8564


Epoch [1325/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.32it/s, loss=1.19]


Epoch [1325/5000]: Train loss: 0.9309, Valid loss: 0.8692


Epoch [1326/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.55it/s, loss=0.846]


Epoch [1326/5000]: Train loss: 0.9127, Valid loss: 0.9542


Epoch [1327/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.53it/s, loss=0.878]


Epoch [1327/5000]: Train loss: 0.9291, Valid loss: 0.9280


Epoch [1328/5000]: 100%|██████████| 10/10 [00:00<00:00, 40.66it/s, loss=0.841]


Epoch [1328/5000]: Train loss: 0.9229, Valid loss: 0.8726


Epoch [1329/5000]: 100%|██████████| 10/10 [00:00<00:00, 220.09it/s, loss=0.825]


Epoch [1329/5000]: Train loss: 0.9120, Valid loss: 0.9436


Epoch [1330/5000]: 100%|██████████| 10/10 [00:00<00:00, 260.83it/s, loss=0.952]


Epoch [1330/5000]: Train loss: 0.9385, Valid loss: 0.8754


Epoch [1331/5000]: 100%|██████████| 10/10 [00:00<00:00, 226.13it/s, loss=1.05]


Epoch [1331/5000]: Train loss: 0.9379, Valid loss: 0.9132


Epoch [1332/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.30it/s, loss=1.38]


Epoch [1332/5000]: Train loss: 0.9485, Valid loss: 0.8639


Epoch [1333/5000]: 100%|██████████| 10/10 [00:00<00:00, 200.51it/s, loss=0.868]


Epoch [1333/5000]: Train loss: 0.9128, Valid loss: 0.9446


Epoch [1334/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.55it/s, loss=0.933]


Epoch [1334/5000]: Train loss: 0.9270, Valid loss: 0.9507


Epoch [1335/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.05it/s, loss=1.58]


Epoch [1335/5000]: Train loss: 0.9791, Valid loss: 0.9424


Epoch [1336/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.89it/s, loss=1.22]


Epoch [1336/5000]: Train loss: 0.9530, Valid loss: 0.9594


Epoch [1337/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.93it/s, loss=0.765]


Epoch [1337/5000]: Train loss: 0.9220, Valid loss: 0.9026


Epoch [1338/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.10it/s, loss=0.81]


Epoch [1338/5000]: Train loss: 0.9291, Valid loss: 0.9068


Epoch [1339/5000]: 100%|██████████| 10/10 [00:00<00:00, 243.24it/s, loss=0.857]


Epoch [1339/5000]: Train loss: 0.9100, Valid loss: 0.8841


Epoch [1340/5000]: 100%|██████████| 10/10 [00:00<00:00, 262.24it/s, loss=0.744]


Epoch [1340/5000]: Train loss: 0.9051, Valid loss: 0.9601


Epoch [1341/5000]: 100%|██████████| 10/10 [00:00<00:00, 255.90it/s, loss=1.01]


Epoch [1341/5000]: Train loss: 0.9293, Valid loss: 0.8793


Epoch [1342/5000]: 100%|██████████| 10/10 [00:00<00:00, 242.82it/s, loss=1.14]


Epoch [1342/5000]: Train loss: 0.9289, Valid loss: 0.8800


Epoch [1343/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.22it/s, loss=0.525]


Epoch [1343/5000]: Train loss: 0.8975, Valid loss: 0.9009


Epoch [1344/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.60it/s, loss=0.893]


Epoch [1344/5000]: Train loss: 0.9194, Valid loss: 0.9255


Epoch [1345/5000]: 100%|██████████| 10/10 [00:00<00:00, 232.48it/s, loss=1.01]


Epoch [1345/5000]: Train loss: 0.9272, Valid loss: 0.8672


Epoch [1346/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.15it/s, loss=0.601]


Epoch [1346/5000]: Train loss: 0.8958, Valid loss: 0.8993


Epoch [1347/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.04it/s, loss=0.791]


Epoch [1347/5000]: Train loss: 0.9118, Valid loss: 0.8260
Saving model with loss 0.826...


Epoch [1348/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.43it/s, loss=0.897]


Epoch [1348/5000]: Train loss: 0.9135, Valid loss: 0.9535


Epoch [1349/5000]: 100%|██████████| 10/10 [00:00<00:00, 269.11it/s, loss=0.914]


Epoch [1349/5000]: Train loss: 0.9296, Valid loss: 0.8752


Epoch [1350/5000]: 100%|██████████| 10/10 [00:00<00:00, 223.62it/s, loss=0.939]


Epoch [1350/5000]: Train loss: 0.9209, Valid loss: 0.9070


Epoch [1351/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.18it/s, loss=0.964]


Epoch [1351/5000]: Train loss: 0.9135, Valid loss: 0.9021


Epoch [1352/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.33it/s, loss=0.787]


Epoch [1352/5000]: Train loss: 0.9089, Valid loss: 0.8828


Epoch [1353/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.57it/s, loss=0.849]


Epoch [1353/5000]: Train loss: 0.9361, Valid loss: 0.9058


Epoch [1354/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.16it/s, loss=0.763]


Epoch [1354/5000]: Train loss: 0.9065, Valid loss: 0.8891


Epoch [1355/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.28it/s, loss=1.1]


Epoch [1355/5000]: Train loss: 0.9368, Valid loss: 0.9423


Epoch [1356/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.66it/s, loss=0.921]


Epoch [1356/5000]: Train loss: 0.9215, Valid loss: 0.9047


Epoch [1357/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.27it/s, loss=0.995]


Epoch [1357/5000]: Train loss: 0.9251, Valid loss: 0.8788


Epoch [1358/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.71it/s, loss=1.22]


Epoch [1358/5000]: Train loss: 0.9361, Valid loss: 0.8744


Epoch [1359/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.41it/s, loss=1.1]


Epoch [1359/5000]: Train loss: 0.9216, Valid loss: 0.8664


Epoch [1360/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.14it/s, loss=1.12]


Epoch [1360/5000]: Train loss: 0.9234, Valid loss: 0.8775


Epoch [1361/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.10it/s, loss=1.04]


Epoch [1361/5000]: Train loss: 0.9203, Valid loss: 0.9553


Epoch [1362/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.16it/s, loss=0.927]


Epoch [1362/5000]: Train loss: 0.9221, Valid loss: 0.9199


Epoch [1363/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.76it/s, loss=0.832]


Epoch [1363/5000]: Train loss: 0.9168, Valid loss: 0.9161


Epoch [1364/5000]: 100%|██████████| 10/10 [00:00<00:00, 207.75it/s, loss=0.769]


Epoch [1364/5000]: Train loss: 0.9171, Valid loss: 0.9428


Epoch [1365/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.54it/s, loss=0.781]


Epoch [1365/5000]: Train loss: 0.9245, Valid loss: 0.9519


Epoch [1366/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.23it/s, loss=1.19]


Epoch [1366/5000]: Train loss: 0.9326, Valid loss: 0.8517


Epoch [1367/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.43it/s, loss=0.754]


Epoch [1367/5000]: Train loss: 0.9233, Valid loss: 0.9331


Epoch [1368/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.97it/s, loss=1.16]


Epoch [1368/5000]: Train loss: 0.9378, Valid loss: 0.8637


Epoch [1369/5000]: 100%|██████████| 10/10 [00:00<00:00, 253.14it/s, loss=0.881]


Epoch [1369/5000]: Train loss: 0.9192, Valid loss: 0.8618


Epoch [1370/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.67it/s, loss=0.917]


Epoch [1370/5000]: Train loss: 0.9146, Valid loss: 0.8582


Epoch [1371/5000]: 100%|██████████| 10/10 [00:00<00:00, 248.53it/s, loss=0.928]


Epoch [1371/5000]: Train loss: 0.9108, Valid loss: 0.9522


Epoch [1372/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.56it/s, loss=0.842]


Epoch [1372/5000]: Train loss: 0.9108, Valid loss: 0.9204


Epoch [1373/5000]: 100%|██████████| 10/10 [00:00<00:00, 199.27it/s, loss=0.802]


Epoch [1373/5000]: Train loss: 0.9143, Valid loss: 0.8737


Epoch [1374/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.80it/s, loss=1.1]


Epoch [1374/5000]: Train loss: 0.9232, Valid loss: 0.8577


Epoch [1375/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.69it/s, loss=0.808]


Epoch [1375/5000]: Train loss: 0.9043, Valid loss: 0.8595


Epoch [1376/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.93it/s, loss=0.834]


Epoch [1376/5000]: Train loss: 0.9049, Valid loss: 0.8432


Epoch [1377/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.50it/s, loss=0.679]


Epoch [1377/5000]: Train loss: 0.8964, Valid loss: 0.8593


Epoch [1378/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.26it/s, loss=0.892]


Epoch [1378/5000]: Train loss: 0.9092, Valid loss: 0.9224


Epoch [1379/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.62it/s, loss=0.952]


Epoch [1379/5000]: Train loss: 0.9137, Valid loss: 0.9019


Epoch [1380/5000]: 100%|██████████| 10/10 [00:00<00:00, 269.71it/s, loss=1]


Epoch [1380/5000]: Train loss: 0.9249, Valid loss: 0.9454


Epoch [1381/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.22it/s, loss=0.845]


Epoch [1381/5000]: Train loss: 0.9336, Valid loss: 0.9643


Epoch [1382/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.01it/s, loss=0.787]


Epoch [1382/5000]: Train loss: 0.9176, Valid loss: 0.9475


Epoch [1383/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.59it/s, loss=0.966]


Epoch [1383/5000]: Train loss: 0.9274, Valid loss: 0.9775


Epoch [1384/5000]: 100%|██████████| 10/10 [00:00<00:00, 275.61it/s, loss=0.933]


Epoch [1384/5000]: Train loss: 0.9251, Valid loss: 0.9094


Epoch [1385/5000]: 100%|██████████| 10/10 [00:00<00:00, 230.03it/s, loss=0.775]


Epoch [1385/5000]: Train loss: 0.9107, Valid loss: 0.8687


Epoch [1386/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.72it/s, loss=0.905]


Epoch [1386/5000]: Train loss: 0.9195, Valid loss: 0.8808


Epoch [1387/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.10it/s, loss=0.865]


Epoch [1387/5000]: Train loss: 0.9137, Valid loss: 0.8863


Epoch [1388/5000]: 100%|██████████| 10/10 [00:00<00:00, 245.12it/s, loss=0.871]


Epoch [1388/5000]: Train loss: 0.9232, Valid loss: 0.8580


Epoch [1389/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.26it/s, loss=0.821]


Epoch [1389/5000]: Train loss: 0.9105, Valid loss: 0.8625


Epoch [1390/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=0.725]


Epoch [1390/5000]: Train loss: 0.9063, Valid loss: 0.8852


Epoch [1391/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.55it/s, loss=0.758]


Epoch [1391/5000]: Train loss: 0.9118, Valid loss: 0.8864


Epoch [1392/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.59it/s, loss=0.869]


Epoch [1392/5000]: Train loss: 0.9103, Valid loss: 0.8864


Epoch [1393/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.65it/s, loss=0.871]


Epoch [1393/5000]: Train loss: 0.9222, Valid loss: 0.9323


Epoch [1394/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.34it/s, loss=1.03]


Epoch [1394/5000]: Train loss: 0.9299, Valid loss: 0.8479


Epoch [1395/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.77it/s, loss=1.08]


Epoch [1395/5000]: Train loss: 0.9230, Valid loss: 0.9286


Epoch [1396/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.37it/s, loss=0.668]


Epoch [1396/5000]: Train loss: 0.8990, Valid loss: 0.8154
Saving model with loss 0.815...


Epoch [1397/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=0.922]


Epoch [1397/5000]: Train loss: 0.9111, Valid loss: 0.9017


Epoch [1398/5000]: 100%|██████████| 10/10 [00:00<00:00, 188.75it/s, loss=0.951]


Epoch [1398/5000]: Train loss: 0.9120, Valid loss: 0.9349


Epoch [1399/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.77it/s, loss=0.707]


Epoch [1399/5000]: Train loss: 0.8991, Valid loss: 0.9156


Epoch [1400/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.14it/s, loss=0.872]


Epoch [1400/5000]: Train loss: 0.9083, Valid loss: 0.8889


Epoch [1401/5000]: 100%|██████████| 10/10 [00:00<00:00, 292.72it/s, loss=0.838]


Epoch [1401/5000]: Train loss: 0.9122, Valid loss: 0.9109


Epoch [1402/5000]: 100%|██████████| 10/10 [00:00<00:00, 279.16it/s, loss=1.07]


Epoch [1402/5000]: Train loss: 0.9327, Valid loss: 0.9426


Epoch [1403/5000]: 100%|██████████| 10/10 [00:00<00:00, 171.19it/s, loss=0.854]


Epoch [1403/5000]: Train loss: 0.9285, Valid loss: 0.9267


Epoch [1404/5000]: 100%|██████████| 10/10 [00:00<00:00, 204.84it/s, loss=0.86]


Epoch [1404/5000]: Train loss: 0.9074, Valid loss: 0.9201


Epoch [1405/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.84it/s, loss=0.994]


Epoch [1405/5000]: Train loss: 0.9150, Valid loss: 0.8762


Epoch [1406/5000]: 100%|██████████| 10/10 [00:00<00:00, 150.14it/s, loss=0.928]


Epoch [1406/5000]: Train loss: 0.9126, Valid loss: 0.8846


Epoch [1407/5000]: 100%|██████████| 10/10 [00:00<00:00, 175.91it/s, loss=0.804]


Epoch [1407/5000]: Train loss: 0.9028, Valid loss: 0.9035


Epoch [1408/5000]: 100%|██████████| 10/10 [00:00<00:00, 199.06it/s, loss=1.01]


Epoch [1408/5000]: Train loss: 0.9140, Valid loss: 0.8607


Epoch [1409/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.03it/s, loss=1.09]


Epoch [1409/5000]: Train loss: 0.9265, Valid loss: 0.9015


Epoch [1410/5000]: 100%|██████████| 10/10 [00:00<00:00, 188.99it/s, loss=1.24]


Epoch [1410/5000]: Train loss: 0.9302, Valid loss: 0.8164


Epoch [1411/5000]: 100%|██████████| 10/10 [00:00<00:00, 235.74it/s, loss=1.42]


Epoch [1411/5000]: Train loss: 0.9460, Valid loss: 0.9584


Epoch [1412/5000]: 100%|██████████| 10/10 [00:00<00:00, 218.12it/s, loss=1.01]


Epoch [1412/5000]: Train loss: 0.9427, Valid loss: 0.8809


Epoch [1413/5000]: 100%|██████████| 10/10 [00:00<00:00, 197.47it/s, loss=0.819]


Epoch [1413/5000]: Train loss: 0.9223, Valid loss: 0.9152


Epoch [1414/5000]: 100%|██████████| 10/10 [00:00<00:00, 78.63it/s, loss=1.21]


Epoch [1414/5000]: Train loss: 0.9284, Valid loss: 0.9023


Epoch [1415/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.07it/s, loss=1.28]


Epoch [1415/5000]: Train loss: 0.9314, Valid loss: 0.8993


Epoch [1416/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.56it/s, loss=0.854]


Epoch [1416/5000]: Train loss: 0.9042, Valid loss: 0.9177


Epoch [1417/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.55it/s, loss=0.977]


Epoch [1417/5000]: Train loss: 0.9104, Valid loss: 0.9320


Epoch [1418/5000]: 100%|██████████| 10/10 [00:00<00:00, 225.08it/s, loss=0.729]

Epoch [1418/5000]: Train loss: 0.9147, Valid loss: 0.8998



Epoch [1419/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.85it/s, loss=0.766]


Epoch [1419/5000]: Train loss: 0.9121, Valid loss: 0.8719


Epoch [1420/5000]: 100%|██████████| 10/10 [00:00<00:00, 220.80it/s, loss=1.18]


Epoch [1420/5000]: Train loss: 0.9427, Valid loss: 0.9346


Epoch [1421/5000]: 100%|██████████| 10/10 [00:00<00:00, 209.74it/s, loss=1.01]


Epoch [1421/5000]: Train loss: 0.9247, Valid loss: 0.9060


Epoch [1422/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.94it/s, loss=0.839]


Epoch [1422/5000]: Train loss: 0.9093, Valid loss: 0.9121


Epoch [1423/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.95it/s, loss=0.775]


Epoch [1423/5000]: Train loss: 0.9014, Valid loss: 0.9269


Epoch [1424/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.85it/s, loss=0.802]


Epoch [1424/5000]: Train loss: 0.9111, Valid loss: 0.8920


Epoch [1425/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.23it/s, loss=0.743]


Epoch [1425/5000]: Train loss: 0.9142, Valid loss: 0.8812


Epoch [1426/5000]: 100%|██████████| 10/10 [00:00<00:00, 265.83it/s, loss=0.813]


Epoch [1426/5000]: Train loss: 0.9054, Valid loss: 0.8929


Epoch [1427/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.24it/s, loss=1.11]


Epoch [1427/5000]: Train loss: 0.9234, Valid loss: 0.8706


Epoch [1428/5000]: 100%|██████████| 10/10 [00:00<00:00, 272.28it/s, loss=1.23]


Epoch [1428/5000]: Train loss: 0.9285, Valid loss: 0.8260


Epoch [1429/5000]: 100%|██████████| 10/10 [00:00<00:00, 219.82it/s, loss=0.664]


Epoch [1429/5000]: Train loss: 0.8951, Valid loss: 0.8753


Epoch [1430/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.53it/s, loss=1.04]


Epoch [1430/5000]: Train loss: 0.9204, Valid loss: 0.8628


Epoch [1431/5000]: 100%|██████████| 10/10 [00:00<00:00, 195.76it/s, loss=1.03]


Epoch [1431/5000]: Train loss: 0.9303, Valid loss: 0.9772


Epoch [1432/5000]: 100%|██████████| 10/10 [00:00<00:00, 255.92it/s, loss=0.701]


Epoch [1432/5000]: Train loss: 0.9206, Valid loss: 0.8900


Epoch [1433/5000]: 100%|██████████| 10/10 [00:00<00:00, 261.17it/s, loss=1.02]


Epoch [1433/5000]: Train loss: 0.9166, Valid loss: 0.9271


Epoch [1434/5000]: 100%|██████████| 10/10 [00:00<00:00, 234.50it/s, loss=1.25]


Epoch [1434/5000]: Train loss: 0.9502, Valid loss: 0.8665


Epoch [1435/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.65it/s, loss=0.912]


Epoch [1435/5000]: Train loss: 0.9131, Valid loss: 0.8136
Saving model with loss 0.814...


Epoch [1436/5000]: 100%|██████████| 10/10 [00:00<00:00, 236.54it/s, loss=0.778]


Epoch [1436/5000]: Train loss: 0.9010, Valid loss: 0.8986


Epoch [1437/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.19it/s, loss=0.982]


Epoch [1437/5000]: Train loss: 0.9121, Valid loss: 0.9396


Epoch [1438/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.01it/s, loss=0.721]


Epoch [1438/5000]: Train loss: 0.8974, Valid loss: 0.9092


Epoch [1439/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.05it/s, loss=1.21]


Epoch [1439/5000]: Train loss: 0.9310, Valid loss: 0.8790


Epoch [1440/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.27it/s, loss=1.16]


Epoch [1440/5000]: Train loss: 0.9207, Valid loss: 0.8520


Epoch [1441/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.09it/s, loss=0.919]


Epoch [1441/5000]: Train loss: 0.9069, Valid loss: 0.8589


Epoch [1442/5000]: 100%|██████████| 10/10 [00:00<00:00, 192.95it/s, loss=0.863]


Epoch [1442/5000]: Train loss: 0.9039, Valid loss: 0.8451


Epoch [1443/5000]: 100%|██████████| 10/10 [00:00<00:00, 247.64it/s, loss=0.821]


Epoch [1443/5000]: Train loss: 0.9024, Valid loss: 0.8731


Epoch [1444/5000]: 100%|██████████| 10/10 [00:00<00:00, 293.48it/s, loss=0.91]


Epoch [1444/5000]: Train loss: 0.9235, Valid loss: 0.8444


Epoch [1445/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.10it/s, loss=0.751]

Epoch [1445/5000]: Train loss: 0.9044, Valid loss: 0.9057



Epoch [1446/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.02it/s, loss=0.758]


Epoch [1446/5000]: Train loss: 0.9006, Valid loss: 0.8691


Epoch [1447/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.72it/s, loss=1.1]


Epoch [1447/5000]: Train loss: 0.9267, Valid loss: 0.8737


Epoch [1448/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.16it/s, loss=1.05]


Epoch [1448/5000]: Train loss: 0.9166, Valid loss: 0.9486


Epoch [1449/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.23it/s, loss=0.889]


Epoch [1449/5000]: Train loss: 0.9035, Valid loss: 0.9401


Epoch [1450/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.84it/s, loss=1.26]


Epoch [1450/5000]: Train loss: 0.9269, Valid loss: 0.8321


Epoch [1451/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.04it/s, loss=0.757]


Epoch [1451/5000]: Train loss: 0.9005, Valid loss: 0.9210


Epoch [1452/5000]: 100%|██████████| 10/10 [00:00<00:00, 217.45it/s, loss=1.11]


Epoch [1452/5000]: Train loss: 0.9213, Valid loss: 0.8857


Epoch [1453/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.13it/s, loss=1.27]


Epoch [1453/5000]: Train loss: 0.9267, Valid loss: 0.8410


Epoch [1454/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.06it/s, loss=0.755]


Epoch [1454/5000]: Train loss: 0.8962, Valid loss: 0.9111


Epoch [1455/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.71it/s, loss=0.744]


Epoch [1455/5000]: Train loss: 0.9022, Valid loss: 0.9007


Epoch [1456/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.57it/s, loss=1.16]


Epoch [1456/5000]: Train loss: 0.9384, Valid loss: 0.8944


Epoch [1457/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.92it/s, loss=1.09]


Epoch [1457/5000]: Train loss: 0.9241, Valid loss: 0.9688


Epoch [1458/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.27it/s, loss=0.816]


Epoch [1458/5000]: Train loss: 0.9016, Valid loss: 0.9464


Epoch [1459/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.15it/s, loss=1.25]


Epoch [1459/5000]: Train loss: 0.9307, Valid loss: 0.8376


Epoch [1460/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.98it/s, loss=0.74]


Epoch [1460/5000]: Train loss: 0.9031, Valid loss: 0.8541


Epoch [1461/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.30it/s, loss=1.24]


Epoch [1461/5000]: Train loss: 0.9382, Valid loss: 0.8900


Epoch [1462/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.06it/s, loss=0.979]


Epoch [1462/5000]: Train loss: 0.9174, Valid loss: 0.8529


Epoch [1463/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.00it/s, loss=1.13]


Epoch [1463/5000]: Train loss: 0.9230, Valid loss: 0.9072


Epoch [1464/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.33it/s, loss=1.05]


Epoch [1464/5000]: Train loss: 0.9174, Valid loss: 0.8797


Epoch [1465/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.67it/s, loss=0.807]


Epoch [1465/5000]: Train loss: 0.9006, Valid loss: 0.9206


Epoch [1466/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.49it/s, loss=0.744]


Epoch [1466/5000]: Train loss: 0.8969, Valid loss: 0.9140


Epoch [1467/5000]: 100%|██████████| 10/10 [00:00<00:00, 256.96it/s, loss=1]


Epoch [1467/5000]: Train loss: 0.9112, Valid loss: 0.8744


Epoch [1468/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.51it/s, loss=0.79]


Epoch [1468/5000]: Train loss: 0.8974, Valid loss: 0.8376


Epoch [1469/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.11it/s, loss=0.737]


Epoch [1469/5000]: Train loss: 0.8954, Valid loss: 0.8795


Epoch [1470/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.45it/s, loss=1.03]


Epoch [1470/5000]: Train loss: 0.9131, Valid loss: 0.8504


Epoch [1471/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.89it/s, loss=0.956]


Epoch [1471/5000]: Train loss: 0.9095, Valid loss: 0.8625


Epoch [1472/5000]: 100%|██████████| 10/10 [00:00<00:00, 236.06it/s, loss=0.675]


Epoch [1472/5000]: Train loss: 0.8985, Valid loss: 0.8602


Epoch [1473/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.32it/s, loss=0.817]


Epoch [1473/5000]: Train loss: 0.9026, Valid loss: 0.9226


Epoch [1474/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.64it/s, loss=0.759]


Epoch [1474/5000]: Train loss: 0.9116, Valid loss: 0.8939


Epoch [1475/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.70it/s, loss=1.01]


Epoch [1475/5000]: Train loss: 0.9185, Valid loss: 0.8829


Epoch [1476/5000]: 100%|██████████| 10/10 [00:00<00:00, 260.48it/s, loss=1.11]


Epoch [1476/5000]: Train loss: 0.9172, Valid loss: 0.8982


Epoch [1477/5000]: 100%|██████████| 10/10 [00:00<00:00, 268.85it/s, loss=1.07]


Epoch [1477/5000]: Train loss: 0.9188, Valid loss: 0.8175


Epoch [1478/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.30it/s, loss=0.669]


Epoch [1478/5000]: Train loss: 0.8935, Valid loss: 0.9066


Epoch [1479/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.95it/s, loss=0.915]


Epoch [1479/5000]: Train loss: 0.9142, Valid loss: 0.8677


Epoch [1480/5000]: 100%|██████████| 10/10 [00:00<00:00, 309.89it/s, loss=0.727]


Epoch [1480/5000]: Train loss: 0.9018, Valid loss: 0.8994


Epoch [1481/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.18it/s, loss=0.874]


Epoch [1481/5000]: Train loss: 0.9068, Valid loss: 0.9192


Epoch [1482/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.44it/s, loss=1.45]


Epoch [1482/5000]: Train loss: 0.9380, Valid loss: 0.8459


Epoch [1483/5000]: 100%|██████████| 10/10 [00:00<00:00, 229.61it/s, loss=0.826]


Epoch [1483/5000]: Train loss: 0.8997, Valid loss: 0.8896


Epoch [1484/5000]: 100%|██████████| 10/10 [00:00<00:00, 263.05it/s, loss=0.879]


Epoch [1484/5000]: Train loss: 0.9028, Valid loss: 0.8734


Epoch [1485/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.43it/s, loss=0.886]


Epoch [1485/5000]: Train loss: 0.9027, Valid loss: 0.8913


Epoch [1486/5000]: 100%|██████████| 10/10 [00:00<00:00, 362.90it/s, loss=1.29]


Epoch [1486/5000]: Train loss: 0.9283, Valid loss: 0.8708


Epoch [1487/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.24it/s, loss=0.991]


Epoch [1487/5000]: Train loss: 0.9118, Valid loss: 0.8673


Epoch [1488/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.38it/s, loss=0.637]


Epoch [1488/5000]: Train loss: 0.8890, Valid loss: 0.8535


Epoch [1489/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.16it/s, loss=1.13]


Epoch [1489/5000]: Train loss: 0.9173, Valid loss: 0.9229


Epoch [1490/5000]: 100%|██████████| 10/10 [00:00<00:00, 310.88it/s, loss=0.915]


Epoch [1490/5000]: Train loss: 0.9124, Valid loss: 0.8857


Epoch [1491/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.83it/s, loss=1.04]


Epoch [1491/5000]: Train loss: 0.9155, Valid loss: 0.8728


Epoch [1492/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.46it/s, loss=0.693]


Epoch [1492/5000]: Train loss: 0.8993, Valid loss: 0.8406


Epoch [1493/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.30it/s, loss=1.12]


Epoch [1493/5000]: Train loss: 0.9216, Valid loss: 0.8613


Epoch [1494/5000]: 100%|██████████| 10/10 [00:00<00:00, 378.43it/s, loss=0.797]


Epoch [1494/5000]: Train loss: 0.9067, Valid loss: 0.8938


Epoch [1495/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.20it/s, loss=0.904]


Epoch [1495/5000]: Train loss: 0.9105, Valid loss: 0.8559


Epoch [1496/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.43it/s, loss=0.958]


Epoch [1496/5000]: Train loss: 0.9077, Valid loss: 0.8692


Epoch [1497/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.36it/s, loss=0.82]


Epoch [1497/5000]: Train loss: 0.9106, Valid loss: 0.9072


Epoch [1498/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.66it/s, loss=0.737]


Epoch [1498/5000]: Train loss: 0.9046, Valid loss: 0.9329


Epoch [1499/5000]: 100%|██████████| 10/10 [00:00<00:00, 262.99it/s, loss=0.868]


Epoch [1499/5000]: Train loss: 0.9104, Valid loss: 0.8845


Epoch [1500/5000]: 100%|██████████| 10/10 [00:00<00:00, 85.05it/s, loss=0.714]


Epoch [1500/5000]: Train loss: 0.8924, Valid loss: 0.8545


Epoch [1501/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.65it/s, loss=1.03]


Epoch [1501/5000]: Train loss: 0.9116, Valid loss: 0.8641


Epoch [1502/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.86it/s, loss=0.791]


Epoch [1502/5000]: Train loss: 0.8978, Valid loss: 0.8552


Epoch [1503/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.58it/s, loss=0.641]


Epoch [1503/5000]: Train loss: 0.8915, Valid loss: 0.8695


Epoch [1504/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.45it/s, loss=0.819]


Epoch [1504/5000]: Train loss: 0.9061, Valid loss: 0.9056


Epoch [1505/5000]: 100%|██████████| 10/10 [00:00<00:00, 301.42it/s, loss=1.17]


Epoch [1505/5000]: Train loss: 0.9281, Valid loss: 0.8728


Epoch [1506/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.44it/s, loss=0.748]


Epoch [1506/5000]: Train loss: 0.8975, Valid loss: 0.9258


Epoch [1507/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.49it/s, loss=0.947]


Epoch [1507/5000]: Train loss: 0.9159, Valid loss: 0.8945


Epoch [1508/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.21it/s, loss=0.826]


Epoch [1508/5000]: Train loss: 0.9050, Valid loss: 0.9277


Epoch [1509/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.05it/s, loss=0.634]


Epoch [1509/5000]: Train loss: 0.8872, Valid loss: 0.8756


Epoch [1510/5000]: 100%|██████████| 10/10 [00:00<00:00, 308.14it/s, loss=0.969]


Epoch [1510/5000]: Train loss: 0.9147, Valid loss: 0.9497


Epoch [1511/5000]: 100%|██████████| 10/10 [00:00<00:00, 243.64it/s, loss=1.38]


Epoch [1511/5000]: Train loss: 0.9444, Valid loss: 0.8204


Epoch [1512/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.29it/s, loss=0.871]


Epoch [1512/5000]: Train loss: 0.9053, Valid loss: 0.8365


Epoch [1513/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.14it/s, loss=0.932]


Epoch [1513/5000]: Train loss: 0.9069, Valid loss: 0.8846


Epoch [1514/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.32it/s, loss=0.947]


Epoch [1514/5000]: Train loss: 0.9267, Valid loss: 0.8931


Epoch [1515/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.30it/s, loss=0.933]


Epoch [1515/5000]: Train loss: 0.9153, Valid loss: 0.8590


Epoch [1516/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.10it/s, loss=0.982]


Epoch [1516/5000]: Train loss: 0.9088, Valid loss: 0.8847


Epoch [1517/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.51it/s, loss=1.15]


Epoch [1517/5000]: Train loss: 0.9242, Valid loss: 0.9263


Epoch [1518/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.23it/s, loss=0.772]


Epoch [1518/5000]: Train loss: 0.8967, Valid loss: 0.8631


Epoch [1519/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.49it/s, loss=1.43]


Epoch [1519/5000]: Train loss: 0.9384, Valid loss: 0.8555


Epoch [1520/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.47it/s, loss=0.951]


Epoch [1520/5000]: Train loss: 0.9097, Valid loss: 0.8562


Epoch [1521/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.49it/s, loss=0.706]


Epoch [1521/5000]: Train loss: 0.8914, Valid loss: 0.8801


Epoch [1522/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.81it/s, loss=0.814]


Epoch [1522/5000]: Train loss: 0.8985, Valid loss: 0.9136


Epoch [1523/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.28it/s, loss=0.857]


Epoch [1523/5000]: Train loss: 0.9024, Valid loss: 0.9186


Epoch [1524/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.20it/s, loss=0.874]


Epoch [1524/5000]: Train loss: 0.9123, Valid loss: 0.8360


Epoch [1525/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.91it/s, loss=1.05]


Epoch [1525/5000]: Train loss: 0.9120, Valid loss: 0.8597


Epoch [1526/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.24it/s, loss=1.04]


Epoch [1526/5000]: Train loss: 0.9161, Valid loss: 0.8949


Epoch [1527/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.12it/s, loss=1.01]


Epoch [1527/5000]: Train loss: 0.9111, Valid loss: 0.8834


Epoch [1528/5000]: 100%|██████████| 10/10 [00:00<00:00, 249.71it/s, loss=0.852]


Epoch [1528/5000]: Train loss: 0.9026, Valid loss: 0.9126


Epoch [1529/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.05it/s, loss=0.857]


Epoch [1529/5000]: Train loss: 0.9020, Valid loss: 0.8885


Epoch [1530/5000]: 100%|██████████| 10/10 [00:00<00:00, 262.11it/s, loss=0.972]


Epoch [1530/5000]: Train loss: 0.9071, Valid loss: 0.8589


Epoch [1531/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.86it/s, loss=0.925]


Epoch [1531/5000]: Train loss: 0.9065, Valid loss: 0.9662


Epoch [1532/5000]: 100%|██████████| 10/10 [00:00<00:00, 356.91it/s, loss=0.801]


Epoch [1532/5000]: Train loss: 0.9003, Valid loss: 0.8824


Epoch [1533/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.90it/s, loss=0.991]


Epoch [1533/5000]: Train loss: 0.9104, Valid loss: 0.8855


Epoch [1534/5000]: 100%|██████████| 10/10 [00:00<00:00, 307.82it/s, loss=0.76]


Epoch [1534/5000]: Train loss: 0.9027, Valid loss: 0.9219


Epoch [1535/5000]: 100%|██████████| 10/10 [00:00<00:00, 344.35it/s, loss=0.9]


Epoch [1535/5000]: Train loss: 0.9127, Valid loss: 0.8340


Epoch [1536/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.35it/s, loss=1.21]


Epoch [1536/5000]: Train loss: 0.9235, Valid loss: 0.8560


Epoch [1537/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.76it/s, loss=0.723]


Epoch [1537/5000]: Train loss: 0.8985, Valid loss: 0.9292


Epoch [1538/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=1.09]


Epoch [1538/5000]: Train loss: 0.9207, Valid loss: 0.8741


Epoch [1539/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.75it/s, loss=1.19]

Epoch [1539/5000]: Train loss: 0.9176, Valid loss: 0.8686



Epoch [1540/5000]: 100%|██████████| 10/10 [00:00<00:00, 293.93it/s, loss=0.765]


Epoch [1540/5000]: Train loss: 0.9085, Valid loss: 0.8586


Epoch [1541/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.49it/s, loss=1]


Epoch [1541/5000]: Train loss: 0.9219, Valid loss: 0.9085


Epoch [1542/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.93it/s, loss=0.667]


Epoch [1542/5000]: Train loss: 0.8968, Valid loss: 0.9177


Epoch [1543/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.80it/s, loss=0.876]


Epoch [1543/5000]: Train loss: 0.9192, Valid loss: 0.9190


Epoch [1544/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.12it/s, loss=0.83]


Epoch [1544/5000]: Train loss: 0.9304, Valid loss: 0.9423


Epoch [1545/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.46it/s, loss=0.933]


Epoch [1545/5000]: Train loss: 0.9164, Valid loss: 0.8430


Epoch [1546/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.11it/s, loss=0.731]


Epoch [1546/5000]: Train loss: 0.8996, Valid loss: 0.8660


Epoch [1547/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.95it/s, loss=0.915]


Epoch [1547/5000]: Train loss: 0.9077, Valid loss: 0.8851


Epoch [1548/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.15it/s, loss=0.7]


Epoch [1548/5000]: Train loss: 0.8978, Valid loss: 0.8792


Epoch [1549/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.83it/s, loss=0.656]


Epoch [1549/5000]: Train loss: 0.8870, Valid loss: 0.8690


Epoch [1550/5000]: 100%|██████████| 10/10 [00:00<00:00, 275.70it/s, loss=0.705]


Epoch [1550/5000]: Train loss: 0.8949, Valid loss: 0.8264


Epoch [1551/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.62it/s, loss=0.653]


Epoch [1551/5000]: Train loss: 0.8870, Valid loss: 0.9234


Epoch [1552/5000]: 100%|██████████| 10/10 [00:00<00:00, 273.59it/s, loss=0.699]


Epoch [1552/5000]: Train loss: 0.8889, Valid loss: 0.9033


Epoch [1553/5000]: 100%|██████████| 10/10 [00:00<00:00, 254.27it/s, loss=0.916]


Epoch [1553/5000]: Train loss: 0.9039, Valid loss: 0.9246


Epoch [1554/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.95it/s, loss=0.881]


Epoch [1554/5000]: Train loss: 0.9080, Valid loss: 0.9288


Epoch [1555/5000]: 100%|██████████| 10/10 [00:00<00:00, 269.10it/s, loss=0.915]


Epoch [1555/5000]: Train loss: 0.9013, Valid loss: 0.8431


Epoch [1556/5000]: 100%|██████████| 10/10 [00:00<00:00, 279.01it/s, loss=0.978]


Epoch [1556/5000]: Train loss: 0.9053, Valid loss: 0.8602


Epoch [1557/5000]: 100%|██████████| 10/10 [00:00<00:00, 309.40it/s, loss=0.941]


Epoch [1557/5000]: Train loss: 0.9085, Valid loss: 0.9036


Epoch [1558/5000]: 100%|██████████| 10/10 [00:00<00:00, 344.69it/s, loss=1.41]


Epoch [1558/5000]: Train loss: 0.9462, Valid loss: 0.9182


Epoch [1559/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.17it/s, loss=0.902]


Epoch [1559/5000]: Train loss: 0.9088, Valid loss: 0.8671


Epoch [1560/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.03it/s, loss=0.966]


Epoch [1560/5000]: Train loss: 0.9254, Valid loss: 0.8556


Epoch [1561/5000]: 100%|██████████| 10/10 [00:00<00:00, 247.12it/s, loss=0.752]


Epoch [1561/5000]: Train loss: 0.9051, Valid loss: 0.8826


Epoch [1562/5000]: 100%|██████████| 10/10 [00:00<00:00, 364.47it/s, loss=0.818]


Epoch [1562/5000]: Train loss: 0.8971, Valid loss: 0.8423


Epoch [1563/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.12it/s, loss=0.793]


Epoch [1563/5000]: Train loss: 0.9068, Valid loss: 0.9421


Epoch [1564/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.75it/s, loss=0.741]


Epoch [1564/5000]: Train loss: 0.9025, Valid loss: 0.8784


Epoch [1565/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.04it/s, loss=1.02]


Epoch [1565/5000]: Train loss: 0.9123, Valid loss: 0.9144


Epoch [1566/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.03it/s, loss=1.14]


Epoch [1566/5000]: Train loss: 0.9147, Valid loss: 0.9205


Epoch [1567/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.46it/s, loss=0.814]


Epoch [1567/5000]: Train loss: 0.8967, Valid loss: 0.8489


Epoch [1568/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.43it/s, loss=0.832]


Epoch [1568/5000]: Train loss: 0.8948, Valid loss: 0.8937


Epoch [1569/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.72it/s, loss=0.678]


Epoch [1569/5000]: Train loss: 0.8960, Valid loss: 0.8571


Epoch [1570/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.10it/s, loss=0.945]


Epoch [1570/5000]: Train loss: 0.9047, Valid loss: 0.7920
Saving model with loss 0.792...


Epoch [1571/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.56it/s, loss=0.923]


Epoch [1571/5000]: Train loss: 0.9021, Valid loss: 0.8817


Epoch [1572/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.45it/s, loss=1.03]


Epoch [1572/5000]: Train loss: 0.9095, Valid loss: 0.9155


Epoch [1573/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.63it/s, loss=0.752]


Epoch [1573/5000]: Train loss: 0.8909, Valid loss: 0.8698


Epoch [1574/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.31it/s, loss=0.81]


Epoch [1574/5000]: Train loss: 0.8957, Valid loss: 0.9055


Epoch [1575/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.76it/s, loss=0.675]


Epoch [1575/5000]: Train loss: 0.8998, Valid loss: 0.9615


Epoch [1576/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.17it/s, loss=0.89]


Epoch [1576/5000]: Train loss: 0.9122, Valid loss: 0.8799


Epoch [1577/5000]: 100%|██████████| 10/10 [00:00<00:00, 243.41it/s, loss=0.718]


Epoch [1577/5000]: Train loss: 0.8984, Valid loss: 0.8809


Epoch [1578/5000]: 100%|██████████| 10/10 [00:00<00:00, 309.91it/s, loss=0.836]


Epoch [1578/5000]: Train loss: 0.8980, Valid loss: 0.8648


Epoch [1579/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.39it/s, loss=0.854]


Epoch [1579/5000]: Train loss: 0.9009, Valid loss: 0.9388


Epoch [1580/5000]: 100%|██████████| 10/10 [00:00<00:00, 344.65it/s, loss=0.733]


Epoch [1580/5000]: Train loss: 0.8920, Valid loss: 0.8696


Epoch [1581/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.99it/s, loss=1.03]


Epoch [1581/5000]: Train loss: 0.9142, Valid loss: 0.8655


Epoch [1582/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.12it/s, loss=0.834]


Epoch [1582/5000]: Train loss: 0.9030, Valid loss: 0.8444


Epoch [1583/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.48it/s, loss=0.828]


Epoch [1583/5000]: Train loss: 0.9074, Valid loss: 0.9032


Epoch [1584/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.61it/s, loss=1.02]


Epoch [1584/5000]: Train loss: 0.9092, Valid loss: 0.8321


Epoch [1585/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.45it/s, loss=0.85]


Epoch [1585/5000]: Train loss: 0.8981, Valid loss: 0.9003


Epoch [1586/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.85it/s, loss=0.936]


Epoch [1586/5000]: Train loss: 0.9093, Valid loss: 0.8529


Epoch [1587/5000]: 100%|██████████| 10/10 [00:00<00:00, 85.55it/s, loss=1.05]


Epoch [1587/5000]: Train loss: 0.9135, Valid loss: 0.8629


Epoch [1588/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.73it/s, loss=0.965]


Epoch [1588/5000]: Train loss: 0.9044, Valid loss: 0.8463


Epoch [1589/5000]: 100%|██████████| 10/10 [00:00<00:00, 237.84it/s, loss=1.24]


Epoch [1589/5000]: Train loss: 0.9234, Valid loss: 0.8881


Epoch [1590/5000]: 100%|██████████| 10/10 [00:00<00:00, 350.27it/s, loss=0.963]


Epoch [1590/5000]: Train loss: 0.9117, Valid loss: 0.9036


Epoch [1591/5000]: 100%|██████████| 10/10 [00:00<00:00, 266.97it/s, loss=0.964]


Epoch [1591/5000]: Train loss: 0.9242, Valid loss: 0.9690


Epoch [1592/5000]: 100%|██████████| 10/10 [00:00<00:00, 357.59it/s, loss=0.96]


Epoch [1592/5000]: Train loss: 0.9334, Valid loss: 0.9023


Epoch [1593/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.94it/s, loss=0.808]


Epoch [1593/5000]: Train loss: 0.9230, Valid loss: 0.8453


Epoch [1594/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.31it/s, loss=0.912]


Epoch [1594/5000]: Train loss: 0.9279, Valid loss: 0.8768


Epoch [1595/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.23it/s, loss=0.869]


Epoch [1595/5000]: Train loss: 0.9041, Valid loss: 0.8484


Epoch [1596/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.38it/s, loss=0.633]


Epoch [1596/5000]: Train loss: 0.8895, Valid loss: 0.8592


Epoch [1597/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.97it/s, loss=0.731]


Epoch [1597/5000]: Train loss: 0.8902, Valid loss: 0.8583


Epoch [1598/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.93it/s, loss=1.14]


Epoch [1598/5000]: Train loss: 0.9151, Valid loss: 0.8338


Epoch [1599/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=0.994]


Epoch [1599/5000]: Train loss: 0.9043, Valid loss: 0.8787


Epoch [1600/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.66it/s, loss=0.925]


Epoch [1600/5000]: Train loss: 0.9011, Valid loss: 0.8936


Epoch [1601/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.95it/s, loss=0.933]


Epoch [1601/5000]: Train loss: 0.9147, Valid loss: 0.8769


Epoch [1602/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.86it/s, loss=0.817]


Epoch [1602/5000]: Train loss: 0.8967, Valid loss: 0.8406


Epoch [1603/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.66it/s, loss=0.902]


Epoch [1603/5000]: Train loss: 0.9016, Valid loss: 0.8694


Epoch [1604/5000]: 100%|██████████| 10/10 [00:00<00:00, 244.30it/s, loss=1.09]


Epoch [1604/5000]: Train loss: 0.9127, Valid loss: 0.9261


Epoch [1605/5000]: 100%|██████████| 10/10 [00:00<00:00, 370.90it/s, loss=0.831]


Epoch [1605/5000]: Train loss: 0.9079, Valid loss: 0.8351


Epoch [1606/5000]: 100%|██████████| 10/10 [00:00<00:00, 311.99it/s, loss=0.732]


Epoch [1606/5000]: Train loss: 0.8988, Valid loss: 0.9482


Epoch [1607/5000]: 100%|██████████| 10/10 [00:00<00:00, 370.18it/s, loss=0.746]


Epoch [1607/5000]: Train loss: 0.8940, Valid loss: 0.8933


Epoch [1608/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.22it/s, loss=0.743]


Epoch [1608/5000]: Train loss: 0.8983, Valid loss: 0.8421


Epoch [1609/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.90it/s, loss=0.781]


Epoch [1609/5000]: Train loss: 0.8929, Valid loss: 0.8950


Epoch [1610/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.58it/s, loss=1.17]


Epoch [1610/5000]: Train loss: 0.9154, Valid loss: 0.8677


Epoch [1611/5000]: 100%|██████████| 10/10 [00:00<00:00, 332.26it/s, loss=1.15]


Epoch [1611/5000]: Train loss: 0.9202, Valid loss: 0.8814


Epoch [1612/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.41it/s, loss=0.99]


Epoch [1612/5000]: Train loss: 0.9082, Valid loss: 0.9012


Epoch [1613/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.64it/s, loss=0.694]


Epoch [1613/5000]: Train loss: 0.8876, Valid loss: 0.9171


Epoch [1614/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.62it/s, loss=1.06]


Epoch [1614/5000]: Train loss: 0.9187, Valid loss: 0.8566


Epoch [1615/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.95it/s, loss=0.989]


Epoch [1615/5000]: Train loss: 0.9164, Valid loss: 0.8991


Epoch [1616/5000]: 100%|██████████| 10/10 [00:00<00:00, 330.62it/s, loss=0.836]


Epoch [1616/5000]: Train loss: 0.9102, Valid loss: 0.9542


Epoch [1617/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.47it/s, loss=0.853]


Epoch [1617/5000]: Train loss: 0.8996, Valid loss: 0.8771


Epoch [1618/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.38it/s, loss=0.766]


Epoch [1618/5000]: Train loss: 0.8962, Valid loss: 0.8737


Epoch [1619/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.91it/s, loss=1.08]


Epoch [1619/5000]: Train loss: 0.9214, Valid loss: 0.8556


Epoch [1620/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.21it/s, loss=0.909]


Epoch [1620/5000]: Train loss: 0.9046, Valid loss: 0.8853


Epoch [1621/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.32it/s, loss=0.89]


Epoch [1621/5000]: Train loss: 0.9019, Valid loss: 0.8406


Epoch [1622/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.22it/s, loss=0.608]


Epoch [1622/5000]: Train loss: 0.8823, Valid loss: 0.7953


Epoch [1623/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.48it/s, loss=1.2]


Epoch [1623/5000]: Train loss: 0.9179, Valid loss: 0.9430


Epoch [1624/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.52it/s, loss=0.818]


Epoch [1624/5000]: Train loss: 0.8990, Valid loss: 0.9050


Epoch [1625/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.08it/s, loss=0.852]


Epoch [1625/5000]: Train loss: 0.8998, Valid loss: 0.9186


Epoch [1626/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.86it/s, loss=0.719]


Epoch [1626/5000]: Train loss: 0.8933, Valid loss: 0.8941


Epoch [1627/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.03it/s, loss=0.659]


Epoch [1627/5000]: Train loss: 0.8955, Valid loss: 0.8798


Epoch [1628/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.24it/s, loss=1.07]


Epoch [1628/5000]: Train loss: 0.9133, Valid loss: 0.8867


Epoch [1629/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.94it/s, loss=0.855]


Epoch [1629/5000]: Train loss: 0.9026, Valid loss: 0.8794


Epoch [1630/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.43it/s, loss=1.17]


Epoch [1630/5000]: Train loss: 0.9378, Valid loss: 0.9863


Epoch [1631/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.63it/s, loss=1.04]


Epoch [1631/5000]: Train loss: 0.9335, Valid loss: 1.0055


Epoch [1632/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.47it/s, loss=0.993]


Epoch [1632/5000]: Train loss: 0.9253, Valid loss: 0.8633


Epoch [1633/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.49it/s, loss=1.1]


Epoch [1633/5000]: Train loss: 0.9217, Valid loss: 0.8994


Epoch [1634/5000]: 100%|██████████| 10/10 [00:00<00:00, 392.57it/s, loss=0.868]


Epoch [1634/5000]: Train loss: 0.9099, Valid loss: 0.8557


Epoch [1635/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.23it/s, loss=0.717]


Epoch [1635/5000]: Train loss: 0.8948, Valid loss: 0.9017


Epoch [1636/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.98it/s, loss=0.883]


Epoch [1636/5000]: Train loss: 0.9188, Valid loss: 0.9025


Epoch [1637/5000]: 100%|██████████| 10/10 [00:00<00:00, 347.88it/s, loss=1.03]


Epoch [1637/5000]: Train loss: 0.9090, Valid loss: 0.8765


Epoch [1638/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.50it/s, loss=0.984]


Epoch [1638/5000]: Train loss: 0.9058, Valid loss: 0.8792


Epoch [1639/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.00it/s, loss=0.791]


Epoch [1639/5000]: Train loss: 0.8972, Valid loss: 0.9001


Epoch [1640/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.93it/s, loss=0.879]


Epoch [1640/5000]: Train loss: 0.8982, Valid loss: 0.8819


Epoch [1641/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.90it/s, loss=1.17]


Epoch [1641/5000]: Train loss: 0.9177, Valid loss: 0.8363


Epoch [1642/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.46it/s, loss=1.16]


Epoch [1642/5000]: Train loss: 0.9135, Valid loss: 0.8867


Epoch [1643/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.92it/s, loss=0.906]


Epoch [1643/5000]: Train loss: 0.9068, Valid loss: 0.8876


Epoch [1644/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.03it/s, loss=1.08]


Epoch [1644/5000]: Train loss: 0.9243, Valid loss: 0.8370


Epoch [1645/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.04it/s, loss=0.927]


Epoch [1645/5000]: Train loss: 0.9150, Valid loss: 0.9074


Epoch [1646/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.37it/s, loss=0.866]


Epoch [1646/5000]: Train loss: 0.9180, Valid loss: 0.8989


Epoch [1647/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.91it/s, loss=0.946]


Epoch [1647/5000]: Train loss: 0.9044, Valid loss: 0.9094


Epoch [1648/5000]: 100%|██████████| 10/10 [00:00<00:00, 248.49it/s, loss=1.09]


Epoch [1648/5000]: Train loss: 0.9096, Valid loss: 0.8808


Epoch [1649/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.93it/s, loss=0.825]


Epoch [1649/5000]: Train loss: 0.8994, Valid loss: 0.8491


Epoch [1650/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.26it/s, loss=0.987]


Epoch [1650/5000]: Train loss: 0.9161, Valid loss: 0.8833


Epoch [1651/5000]: 100%|██████████| 10/10 [00:00<00:00, 385.28it/s, loss=0.997]


Epoch [1651/5000]: Train loss: 0.9420, Valid loss: 0.8683


Epoch [1652/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.16it/s, loss=0.883]


Epoch [1652/5000]: Train loss: 0.9027, Valid loss: 0.9106


Epoch [1653/5000]: 100%|██████████| 10/10 [00:00<00:00, 290.64it/s, loss=0.889]


Epoch [1653/5000]: Train loss: 0.9056, Valid loss: 0.8926


Epoch [1654/5000]: 100%|██████████| 10/10 [00:00<00:00, 320.82it/s, loss=0.711]


Epoch [1654/5000]: Train loss: 0.9111, Valid loss: 0.8556


Epoch [1655/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.09it/s, loss=0.681]


Epoch [1655/5000]: Train loss: 0.8907, Valid loss: 0.9467


Epoch [1656/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.62it/s, loss=1.02]


Epoch [1656/5000]: Train loss: 0.9112, Valid loss: 0.8035


Epoch [1657/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.38it/s, loss=1.14]


Epoch [1657/5000]: Train loss: 0.9221, Valid loss: 0.8718


Epoch [1658/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.47it/s, loss=0.821]


Epoch [1658/5000]: Train loss: 0.8968, Valid loss: 0.9040


Epoch [1659/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.45it/s, loss=0.869]


Epoch [1659/5000]: Train loss: 0.8997, Valid loss: 0.8442


Epoch [1660/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.74it/s, loss=0.767]


Epoch [1660/5000]: Train loss: 0.8978, Valid loss: 0.9048


Epoch [1661/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.00it/s, loss=0.865]


Epoch [1661/5000]: Train loss: 0.9166, Valid loss: 0.8536


Epoch [1662/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.45it/s, loss=1.13]


Epoch [1662/5000]: Train loss: 0.9183, Valid loss: 0.8806


Epoch [1663/5000]: 100%|██████████| 10/10 [00:00<00:00, 319.19it/s, loss=0.801]


Epoch [1663/5000]: Train loss: 0.8909, Valid loss: 0.8784


Epoch [1664/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.60it/s, loss=0.945]


Epoch [1664/5000]: Train loss: 0.9111, Valid loss: 0.8695


Epoch [1665/5000]: 100%|██████████| 10/10 [00:00<00:00, 249.79it/s, loss=0.807]


Epoch [1665/5000]: Train loss: 0.8938, Valid loss: 0.8078


Epoch [1666/5000]: 100%|██████████| 10/10 [00:00<00:00, 370.33it/s, loss=0.793]


Epoch [1666/5000]: Train loss: 0.8913, Valid loss: 0.8548


Epoch [1667/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.41it/s, loss=0.876]


Epoch [1667/5000]: Train loss: 0.8983, Valid loss: 0.8715


Epoch [1668/5000]: 100%|██████████| 10/10 [00:00<00:00, 385.90it/s, loss=1.09]


Epoch [1668/5000]: Train loss: 0.9112, Valid loss: 0.8421


Epoch [1669/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.63it/s, loss=0.773]


Epoch [1669/5000]: Train loss: 0.8901, Valid loss: 0.9478


Epoch [1670/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.82it/s, loss=0.664]


Epoch [1670/5000]: Train loss: 0.8872, Valid loss: 0.9003


Epoch [1671/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.94it/s, loss=0.837]


Epoch [1671/5000]: Train loss: 0.8996, Valid loss: 0.8617


Epoch [1672/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.12it/s, loss=0.95]


Epoch [1672/5000]: Train loss: 0.9071, Valid loss: 0.8806


Epoch [1673/5000]: 100%|██████████| 10/10 [00:00<00:00, 86.66it/s, loss=0.71]


Epoch [1673/5000]: Train loss: 0.8911, Valid loss: 0.8801


Epoch [1674/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.50it/s, loss=0.818]


Epoch [1674/5000]: Train loss: 0.8983, Valid loss: 0.9093


Epoch [1675/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.65it/s, loss=1.22]


Epoch [1675/5000]: Train loss: 0.9234, Valid loss: 0.8862


Epoch [1676/5000]: 100%|██████████| 10/10 [00:00<00:00, 372.11it/s, loss=1.01]


Epoch [1676/5000]: Train loss: 0.9035, Valid loss: 0.9062


Epoch [1677/5000]: 100%|██████████| 10/10 [00:00<00:00, 332.99it/s, loss=1]


Epoch [1677/5000]: Train loss: 0.9059, Valid loss: 0.8684


Epoch [1678/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.75it/s, loss=0.741]


Epoch [1678/5000]: Train loss: 0.8947, Valid loss: 0.8238


Epoch [1679/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.27it/s, loss=0.827]


Epoch [1679/5000]: Train loss: 0.9064, Valid loss: 0.8634


Epoch [1680/5000]: 100%|██████████| 10/10 [00:00<00:00, 287.99it/s, loss=0.583]


Epoch [1680/5000]: Train loss: 0.8798, Valid loss: 0.8659


Epoch [1681/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.67it/s, loss=1.1]


Epoch [1681/5000]: Train loss: 0.9118, Valid loss: 0.9093


Epoch [1682/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.53it/s, loss=0.744]


Epoch [1682/5000]: Train loss: 0.8991, Valid loss: 0.9038


Epoch [1683/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.88it/s, loss=0.91]


Epoch [1683/5000]: Train loss: 0.9021, Valid loss: 0.8679


Epoch [1684/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.51it/s, loss=0.894]


Epoch [1684/5000]: Train loss: 0.9028, Valid loss: 0.8633


Epoch [1685/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.11it/s, loss=0.65]


Epoch [1685/5000]: Train loss: 0.8829, Valid loss: 0.8528


Epoch [1686/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.71it/s, loss=0.813]


Epoch [1686/5000]: Train loss: 0.8929, Valid loss: 0.8638


Epoch [1687/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.48it/s, loss=0.815]


Epoch [1687/5000]: Train loss: 0.8945, Valid loss: 0.9191


Epoch [1688/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.78it/s, loss=1.16]


Epoch [1688/5000]: Train loss: 0.9170, Valid loss: 0.7852
Saving model with loss 0.785...


Epoch [1689/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.94it/s, loss=0.97]


Epoch [1689/5000]: Train loss: 0.9042, Valid loss: 0.8515


Epoch [1690/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.61it/s, loss=0.859]


Epoch [1690/5000]: Train loss: 0.8966, Valid loss: 0.8812


Epoch [1691/5000]: 100%|██████████| 10/10 [00:00<00:00, 307.43it/s, loss=1.07]


Epoch [1691/5000]: Train loss: 0.9073, Valid loss: 0.8384


Epoch [1692/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.52it/s, loss=1.08]


Epoch [1692/5000]: Train loss: 0.9106, Valid loss: 0.9346


Epoch [1693/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=0.849]


Epoch [1693/5000]: Train loss: 0.8974, Valid loss: 0.8562


Epoch [1694/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.59it/s, loss=1.21]


Epoch [1694/5000]: Train loss: 0.9135, Valid loss: 0.8259


Epoch [1695/5000]: 100%|██████████| 10/10 [00:00<00:00, 242.44it/s, loss=1.15]


Epoch [1695/5000]: Train loss: 0.9221, Valid loss: 0.9095


Epoch [1696/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.98it/s, loss=0.755]


Epoch [1696/5000]: Train loss: 0.8938, Valid loss: 0.8218


Epoch [1697/5000]: 100%|██████████| 10/10 [00:00<00:00, 274.07it/s, loss=1.02]


Epoch [1697/5000]: Train loss: 0.9042, Valid loss: 0.8242


Epoch [1698/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.81it/s, loss=0.714]


Epoch [1698/5000]: Train loss: 0.8864, Valid loss: 0.8222


Epoch [1699/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.50it/s, loss=0.99]


Epoch [1699/5000]: Train loss: 0.9033, Valid loss: 0.8650


Epoch [1700/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.32it/s, loss=0.717]


Epoch [1700/5000]: Train loss: 0.8931, Valid loss: 0.9080


Epoch [1701/5000]: 100%|██████████| 10/10 [00:00<00:00, 285.94it/s, loss=0.964]


Epoch [1701/5000]: Train loss: 0.9088, Valid loss: 0.8589


Epoch [1702/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=1.15]


Epoch [1702/5000]: Train loss: 0.9204, Valid loss: 0.9457


Epoch [1703/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.43it/s, loss=1.15]


Epoch [1703/5000]: Train loss: 0.9315, Valid loss: 0.9135


Epoch [1704/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.87it/s, loss=0.712]


Epoch [1704/5000]: Train loss: 0.8878, Valid loss: 0.8685


Epoch [1705/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.35it/s, loss=0.89]


Epoch [1705/5000]: Train loss: 0.9098, Valid loss: 0.8803


Epoch [1706/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.15it/s, loss=0.906]


Epoch [1706/5000]: Train loss: 0.9133, Valid loss: 0.8288


Epoch [1707/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.42it/s, loss=0.758]


Epoch [1707/5000]: Train loss: 0.9027, Valid loss: 0.8482


Epoch [1708/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.56it/s, loss=0.977]


Epoch [1708/5000]: Train loss: 0.9118, Valid loss: 0.8213


Epoch [1709/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.72it/s, loss=1.14]


Epoch [1709/5000]: Train loss: 0.9143, Valid loss: 0.8295


Epoch [1710/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.73it/s, loss=0.867]


Epoch [1710/5000]: Train loss: 0.8998, Valid loss: 0.8511


Epoch [1711/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.88it/s, loss=0.849]


Epoch [1711/5000]: Train loss: 0.9001, Valid loss: 0.8635


Epoch [1712/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.92it/s, loss=1.15]


Epoch [1712/5000]: Train loss: 0.9165, Valid loss: 0.8871


Epoch [1713/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.44it/s, loss=0.865]


Epoch [1713/5000]: Train loss: 0.8962, Valid loss: 0.9025


Epoch [1714/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.97it/s, loss=0.816]


Epoch [1714/5000]: Train loss: 0.8927, Valid loss: 0.8750


Epoch [1715/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.90it/s, loss=0.887]


Epoch [1715/5000]: Train loss: 0.8980, Valid loss: 0.9417


Epoch [1716/5000]: 100%|██████████| 10/10 [00:00<00:00, 341.97it/s, loss=0.65]


Epoch [1716/5000]: Train loss: 0.8884, Valid loss: 0.8897


Epoch [1717/5000]: 100%|██████████| 10/10 [00:00<00:00, 370.78it/s, loss=1.01]


Epoch [1717/5000]: Train loss: 0.9222, Valid loss: 0.8778


Epoch [1718/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.36it/s, loss=1.26]


Epoch [1718/5000]: Train loss: 0.9335, Valid loss: 0.9101


Epoch [1719/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.98it/s, loss=0.711]


Epoch [1719/5000]: Train loss: 0.8972, Valid loss: 0.8659


Epoch [1720/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.06it/s, loss=0.847]


Epoch [1720/5000]: Train loss: 0.8963, Valid loss: 0.8468


Epoch [1721/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.44it/s, loss=1.2]


Epoch [1721/5000]: Train loss: 0.9173, Valid loss: 0.8682


Epoch [1722/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.78it/s, loss=0.813]


Epoch [1722/5000]: Train loss: 0.8895, Valid loss: 0.9052


Epoch [1723/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.79it/s, loss=0.738]


Epoch [1723/5000]: Train loss: 0.8874, Valid loss: 0.8611


Epoch [1724/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.98it/s, loss=0.858]


Epoch [1724/5000]: Train loss: 0.9063, Valid loss: 0.8477


Epoch [1725/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.35it/s, loss=0.814]


Epoch [1725/5000]: Train loss: 0.9064, Valid loss: 0.8383


Epoch [1726/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.72it/s, loss=0.593]


Epoch [1726/5000]: Train loss: 0.8810, Valid loss: 0.8824


Epoch [1727/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.57it/s, loss=0.718]


Epoch [1727/5000]: Train loss: 0.8891, Valid loss: 0.9304


Epoch [1728/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.90it/s, loss=0.77]


Epoch [1728/5000]: Train loss: 0.8939, Valid loss: 0.8280


Epoch [1729/5000]: 100%|██████████| 10/10 [00:00<00:00, 253.01it/s, loss=0.832]


Epoch [1729/5000]: Train loss: 0.8945, Valid loss: 0.9353


Epoch [1730/5000]: 100%|██████████| 10/10 [00:00<00:00, 370.62it/s, loss=0.952]


Epoch [1730/5000]: Train loss: 0.9093, Valid loss: 0.9242


Epoch [1731/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.61it/s, loss=0.809]


Epoch [1731/5000]: Train loss: 0.9028, Valid loss: 0.8696


Epoch [1732/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.65it/s, loss=0.915]


Epoch [1732/5000]: Train loss: 0.9027, Valid loss: 0.8430


Epoch [1733/5000]: 100%|██████████| 10/10 [00:00<00:00, 261.22it/s, loss=0.578]


Epoch [1733/5000]: Train loss: 0.8842, Valid loss: 0.8443


Epoch [1734/5000]: 100%|██████████| 10/10 [00:00<00:00, 258.18it/s, loss=0.734]


Epoch [1734/5000]: Train loss: 0.8931, Valid loss: 0.8814


Epoch [1735/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.10it/s, loss=1.36]


Epoch [1735/5000]: Train loss: 0.9235, Valid loss: 0.8724


Epoch [1736/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.54it/s, loss=1.17]


Epoch [1736/5000]: Train loss: 0.9218, Valid loss: 0.9734


Epoch [1737/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.59it/s, loss=0.98]


Epoch [1737/5000]: Train loss: 0.9074, Valid loss: 0.8429


Epoch [1738/5000]: 100%|██████████| 10/10 [00:00<00:00, 320.65it/s, loss=1.01]


Epoch [1738/5000]: Train loss: 0.9061, Valid loss: 0.8451


Epoch [1739/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.79it/s, loss=0.927]


Epoch [1739/5000]: Train loss: 0.9013, Valid loss: 0.8679


Epoch [1740/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.11it/s, loss=0.994]


Epoch [1740/5000]: Train loss: 0.9169, Valid loss: 0.8493


Epoch [1741/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.89it/s, loss=1.07]


Epoch [1741/5000]: Train loss: 0.9072, Valid loss: 0.9250


Epoch [1742/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.53it/s, loss=0.784]


Epoch [1742/5000]: Train loss: 0.8953, Valid loss: 0.9007


Epoch [1743/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.19it/s, loss=0.816]


Epoch [1743/5000]: Train loss: 0.8928, Valid loss: 0.8846


Epoch [1744/5000]: 100%|██████████| 10/10 [00:00<00:00, 319.37it/s, loss=0.698]


Epoch [1744/5000]: Train loss: 0.8918, Valid loss: 0.8171


Epoch [1745/5000]: 100%|██████████| 10/10 [00:00<00:00, 292.89it/s, loss=0.963]


Epoch [1745/5000]: Train loss: 0.9079, Valid loss: 0.8438


Epoch [1746/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.35it/s, loss=1.11]


Epoch [1746/5000]: Train loss: 0.9146, Valid loss: 0.9582


Epoch [1747/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.53it/s, loss=0.873]


Epoch [1747/5000]: Train loss: 0.8978, Valid loss: 0.8550


Epoch [1748/5000]: 100%|██████████| 10/10 [00:00<00:00, 350.52it/s, loss=0.79]


Epoch [1748/5000]: Train loss: 0.8878, Valid loss: 0.8397


Epoch [1749/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.19it/s, loss=0.895]


Epoch [1749/5000]: Train loss: 0.8972, Valid loss: 0.8234


Epoch [1750/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.93it/s, loss=0.848]


Epoch [1750/5000]: Train loss: 0.8934, Valid loss: 0.8972


Epoch [1751/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.46it/s, loss=0.853]


Epoch [1751/5000]: Train loss: 0.8939, Valid loss: 0.8322


Epoch [1752/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.49it/s, loss=0.994]


Epoch [1752/5000]: Train loss: 0.9052, Valid loss: 0.8737


Epoch [1753/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.66it/s, loss=1.01]


Epoch [1753/5000]: Train loss: 0.9035, Valid loss: 0.9325


Epoch [1754/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.67it/s, loss=0.884]


Epoch [1754/5000]: Train loss: 0.8958, Valid loss: 0.8921


Epoch [1755/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.29it/s, loss=1.14]


Epoch [1755/5000]: Train loss: 0.9227, Valid loss: 0.8287


Epoch [1756/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.71it/s, loss=0.665]


Epoch [1756/5000]: Train loss: 0.8861, Valid loss: 0.9084


Epoch [1757/5000]: 100%|██████████| 10/10 [00:00<00:00, 320.41it/s, loss=1.28]


Epoch [1757/5000]: Train loss: 0.9227, Valid loss: 0.8774


Epoch [1758/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.97it/s, loss=1.05]


Epoch [1758/5000]: Train loss: 0.9076, Valid loss: 0.9019


Epoch [1759/5000]: 100%|██████████| 10/10 [00:00<00:00, 80.54it/s, loss=0.667]


Epoch [1759/5000]: Train loss: 0.8837, Valid loss: 0.8933


Epoch [1760/5000]: 100%|██████████| 10/10 [00:00<00:00, 268.76it/s, loss=0.935]


Epoch [1760/5000]: Train loss: 0.9010, Valid loss: 0.8673


Epoch [1761/5000]: 100%|██████████| 10/10 [00:00<00:00, 222.79it/s, loss=0.91]


Epoch [1761/5000]: Train loss: 0.9082, Valid loss: 0.8541


Epoch [1762/5000]: 100%|██████████| 10/10 [00:00<00:00, 302.89it/s, loss=0.907]


Epoch [1762/5000]: Train loss: 0.9053, Valid loss: 0.8481


Epoch [1763/5000]: 100%|██████████| 10/10 [00:00<00:00, 280.09it/s, loss=1.37]


Epoch [1763/5000]: Train loss: 0.9316, Valid loss: 0.8143


Epoch [1764/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.81it/s, loss=0.811]


Epoch [1764/5000]: Train loss: 0.8910, Valid loss: 0.8807


Epoch [1765/5000]: 100%|██████████| 10/10 [00:00<00:00, 325.55it/s, loss=0.658]


Epoch [1765/5000]: Train loss: 0.8855, Valid loss: 0.8822


Epoch [1766/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.08it/s, loss=1.18]


Epoch [1766/5000]: Train loss: 0.9131, Valid loss: 0.8014


Epoch [1767/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.47it/s, loss=0.946]


Epoch [1767/5000]: Train loss: 0.9085, Valid loss: 0.8796


Epoch [1768/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.58it/s, loss=0.835]


Epoch [1768/5000]: Train loss: 0.8964, Valid loss: 0.8257


Epoch [1769/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.62it/s, loss=0.932]


Epoch [1769/5000]: Train loss: 0.9096, Valid loss: 0.8443


Epoch [1770/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.98it/s, loss=0.808]


Epoch [1770/5000]: Train loss: 0.8902, Valid loss: 0.8011


Epoch [1771/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.82it/s, loss=1.12]


Epoch [1771/5000]: Train loss: 0.9109, Valid loss: 0.8473


Epoch [1772/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.31it/s, loss=0.932]


Epoch [1772/5000]: Train loss: 0.9130, Valid loss: 0.8324


Epoch [1773/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.03it/s, loss=0.803]


Epoch [1773/5000]: Train loss: 0.9073, Valid loss: 0.8134


Epoch [1774/5000]: 100%|██████████| 10/10 [00:00<00:00, 344.85it/s, loss=0.807]


Epoch [1774/5000]: Train loss: 0.8988, Valid loss: 0.8011


Epoch [1775/5000]: 100%|██████████| 10/10 [00:00<00:00, 238.74it/s, loss=1.21]


Epoch [1775/5000]: Train loss: 0.9177, Valid loss: 0.9567


Epoch [1776/5000]: 100%|██████████| 10/10 [00:00<00:00, 348.87it/s, loss=1.01]


Epoch [1776/5000]: Train loss: 0.9125, Valid loss: 0.9393


Epoch [1777/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.11it/s, loss=0.903]


Epoch [1777/5000]: Train loss: 0.8974, Valid loss: 0.8496


Epoch [1778/5000]: 100%|██████████| 10/10 [00:00<00:00, 384.28it/s, loss=0.685]


Epoch [1778/5000]: Train loss: 0.8837, Valid loss: 0.8612


Epoch [1779/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.69it/s, loss=1.08]


Epoch [1779/5000]: Train loss: 0.9062, Valid loss: 0.8824


Epoch [1780/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.84it/s, loss=0.702]


Epoch [1780/5000]: Train loss: 0.8818, Valid loss: 0.8999


Epoch [1781/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.73it/s, loss=0.84]


Epoch [1781/5000]: Train loss: 0.8941, Valid loss: 0.8538


Epoch [1782/5000]: 100%|██████████| 10/10 [00:00<00:00, 297.65it/s, loss=0.494]


Epoch [1782/5000]: Train loss: 0.8707, Valid loss: 0.9650


Epoch [1783/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.45it/s, loss=0.65]


Epoch [1783/5000]: Train loss: 0.8848, Valid loss: 0.8178


Epoch [1784/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.18it/s, loss=0.914]


Epoch [1784/5000]: Train loss: 0.8962, Valid loss: 0.9277


Epoch [1785/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.84it/s, loss=0.923]


Epoch [1785/5000]: Train loss: 0.9012, Valid loss: 0.8230


Epoch [1786/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.68it/s, loss=0.933]


Epoch [1786/5000]: Train loss: 0.8976, Valid loss: 0.8796


Epoch [1787/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.01it/s, loss=0.898]


Epoch [1787/5000]: Train loss: 0.8952, Valid loss: 0.8605


Epoch [1788/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.68it/s, loss=0.876]


Epoch [1788/5000]: Train loss: 0.8955, Valid loss: 0.8674


Epoch [1789/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.29it/s, loss=0.841]


Epoch [1789/5000]: Train loss: 0.9040, Valid loss: 0.8895


Epoch [1790/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.92it/s, loss=0.85]


Epoch [1790/5000]: Train loss: 0.8948, Valid loss: 0.9114


Epoch [1791/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.28it/s, loss=1]


Epoch [1791/5000]: Train loss: 0.9035, Valid loss: 0.8418


Epoch [1792/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.77it/s, loss=0.879]


Epoch [1792/5000]: Train loss: 0.8946, Valid loss: 0.8179


Epoch [1793/5000]: 100%|██████████| 10/10 [00:00<00:00, 310.17it/s, loss=0.808]


Epoch [1793/5000]: Train loss: 0.8941, Valid loss: 0.9088


Epoch [1794/5000]: 100%|██████████| 10/10 [00:00<00:00, 332.05it/s, loss=0.916]


Epoch [1794/5000]: Train loss: 0.8984, Valid loss: 0.8386


Epoch [1795/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.01it/s, loss=0.756]


Epoch [1795/5000]: Train loss: 0.8900, Valid loss: 0.8920


Epoch [1796/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.70it/s, loss=0.91]


Epoch [1796/5000]: Train loss: 0.8993, Valid loss: 0.8530


Epoch [1797/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.41it/s, loss=0.894]


Epoch [1797/5000]: Train loss: 0.9154, Valid loss: 0.9415


Epoch [1798/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.60it/s, loss=0.828]


Epoch [1798/5000]: Train loss: 0.9165, Valid loss: 0.8495


Epoch [1799/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.18it/s, loss=1.27]


Epoch [1799/5000]: Train loss: 0.9191, Valid loss: 0.9210


Epoch [1800/5000]: 100%|██████████| 10/10 [00:00<00:00, 298.30it/s, loss=0.873]


Epoch [1800/5000]: Train loss: 0.9073, Valid loss: 0.8541


Epoch [1801/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.74it/s, loss=0.924]


Epoch [1801/5000]: Train loss: 0.8959, Valid loss: 0.8062


Epoch [1802/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.07it/s, loss=1.14]


Epoch [1802/5000]: Train loss: 0.9197, Valid loss: 0.8474


Epoch [1803/5000]: 100%|██████████| 10/10 [00:00<00:00, 252.21it/s, loss=1.02]


Epoch [1803/5000]: Train loss: 0.9084, Valid loss: 0.8267


Epoch [1804/5000]: 100%|██████████| 10/10 [00:00<00:00, 278.71it/s, loss=0.845]


Epoch [1804/5000]: Train loss: 0.9068, Valid loss: 0.8980


Epoch [1805/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.26it/s, loss=0.852]


Epoch [1805/5000]: Train loss: 0.9122, Valid loss: 0.9909


Epoch [1806/5000]: 100%|██████████| 10/10 [00:00<00:00, 250.75it/s, loss=0.737]


Epoch [1806/5000]: Train loss: 0.8938, Valid loss: 0.8248


Epoch [1807/5000]: 100%|██████████| 10/10 [00:00<00:00, 202.48it/s, loss=0.938]


Epoch [1807/5000]: Train loss: 0.9129, Valid loss: 0.9022


Epoch [1808/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.28it/s, loss=0.743]


Epoch [1808/5000]: Train loss: 0.8988, Valid loss: 0.8831


Epoch [1809/5000]: 100%|██████████| 10/10 [00:00<00:00, 289.84it/s, loss=1.13]


Epoch [1809/5000]: Train loss: 0.9074, Valid loss: 0.8375


Epoch [1810/5000]: 100%|██████████| 10/10 [00:00<00:00, 351.02it/s, loss=1.46]


Epoch [1810/5000]: Train loss: 0.9272, Valid loss: 0.9139


Epoch [1811/5000]: 100%|██████████| 10/10 [00:00<00:00, 279.49it/s, loss=0.889]


Epoch [1811/5000]: Train loss: 0.8949, Valid loss: 0.8159


Epoch [1812/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.21it/s, loss=0.835]


Epoch [1812/5000]: Train loss: 0.8905, Valid loss: 0.9081


Epoch [1813/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.63it/s, loss=0.939]


Epoch [1813/5000]: Train loss: 0.8984, Valid loss: 0.8518


Epoch [1814/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.46it/s, loss=1.03]


Epoch [1814/5000]: Train loss: 0.9058, Valid loss: 0.8543


Epoch [1815/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.89it/s, loss=0.874]


Epoch [1815/5000]: Train loss: 0.8935, Valid loss: 0.8714


Epoch [1816/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.46it/s, loss=1.02]


Epoch [1816/5000]: Train loss: 0.9035, Valid loss: 0.8488


Epoch [1817/5000]: 100%|██████████| 10/10 [00:00<00:00, 291.72it/s, loss=0.834]


Epoch [1817/5000]: Train loss: 0.8994, Valid loss: 0.8557


Epoch [1818/5000]: 100%|██████████| 10/10 [00:00<00:00, 349.74it/s, loss=0.657]


Epoch [1818/5000]: Train loss: 0.8923, Valid loss: 0.8789


Epoch [1819/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.22it/s, loss=0.733]


Epoch [1819/5000]: Train loss: 0.8842, Valid loss: 0.8950


Epoch [1820/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.26it/s, loss=1]


Epoch [1820/5000]: Train loss: 0.9079, Valid loss: 0.8434


Epoch [1821/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.73it/s, loss=0.9]


Epoch [1821/5000]: Train loss: 0.8965, Valid loss: 0.7994


Epoch [1822/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.23it/s, loss=1.11]


Epoch [1822/5000]: Train loss: 0.9084, Valid loss: 0.8312


Epoch [1823/5000]: 100%|██████████| 10/10 [00:00<00:00, 351.84it/s, loss=1.49]


Epoch [1823/5000]: Train loss: 0.9339, Valid loss: 0.8933


Epoch [1824/5000]: 100%|██████████| 10/10 [00:00<00:00, 269.25it/s, loss=0.794]


Epoch [1824/5000]: Train loss: 0.8903, Valid loss: 0.9291


Epoch [1825/5000]: 100%|██████████| 10/10 [00:00<00:00, 275.47it/s, loss=0.896]


Epoch [1825/5000]: Train loss: 0.8971, Valid loss: 0.8458


Epoch [1826/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.29it/s, loss=1.25]


Epoch [1826/5000]: Train loss: 0.9173, Valid loss: 0.9225


Epoch [1827/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.96it/s, loss=1.08]


Epoch [1827/5000]: Train loss: 0.9161, Valid loss: 0.8964


Epoch [1828/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.79it/s, loss=1.06]


Epoch [1828/5000]: Train loss: 0.9190, Valid loss: 0.8585


Epoch [1829/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.60it/s, loss=1.03]


Epoch [1829/5000]: Train loss: 0.9025, Valid loss: 0.8425


Epoch [1830/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.27it/s, loss=0.762]


Epoch [1830/5000]: Train loss: 0.8862, Valid loss: 0.9172


Epoch [1831/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.08it/s, loss=0.739]


Epoch [1831/5000]: Train loss: 0.8866, Valid loss: 0.9056


Epoch [1832/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.49it/s, loss=0.866]


Epoch [1832/5000]: Train loss: 0.8934, Valid loss: 0.8802


Epoch [1833/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.60it/s, loss=0.823]


Epoch [1833/5000]: Train loss: 0.8919, Valid loss: 0.8601


Epoch [1834/5000]: 100%|██████████| 10/10 [00:00<00:00, 339.15it/s, loss=0.918]


Epoch [1834/5000]: Train loss: 0.9005, Valid loss: 0.8471


Epoch [1835/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.45it/s, loss=0.996]


Epoch [1835/5000]: Train loss: 0.8990, Valid loss: 0.8503


Epoch [1836/5000]: 100%|██████████| 10/10 [00:00<00:00, 355.32it/s, loss=0.701]


Epoch [1836/5000]: Train loss: 0.8996, Valid loss: 0.9277


Epoch [1837/5000]: 100%|██████████| 10/10 [00:00<00:00, 317.76it/s, loss=1.08]


Epoch [1837/5000]: Train loss: 0.9211, Valid loss: 0.9017


Epoch [1838/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.02it/s, loss=1.3]


Epoch [1838/5000]: Train loss: 0.9321, Valid loss: 0.8841


Epoch [1839/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.68it/s, loss=1.12]


Epoch [1839/5000]: Train loss: 0.9068, Valid loss: 0.9028


Epoch [1840/5000]: 100%|██████████| 10/10 [00:00<00:00, 248.35it/s, loss=0.917]


Epoch [1840/5000]: Train loss: 0.8935, Valid loss: 0.8447


Epoch [1841/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.03it/s, loss=1.22]


Epoch [1841/5000]: Train loss: 0.9366, Valid loss: 0.8656


Epoch [1842/5000]: 100%|██████████| 10/10 [00:00<00:00, 362.97it/s, loss=0.908]


Epoch [1842/5000]: Train loss: 0.9179, Valid loss: 0.8365


Epoch [1843/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.56it/s, loss=1.15]


Epoch [1843/5000]: Train loss: 0.9122, Valid loss: 0.8409


Epoch [1844/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.44it/s, loss=0.687]


Epoch [1844/5000]: Train loss: 0.8927, Valid loss: 0.8320


Epoch [1845/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.56it/s, loss=0.993]


Epoch [1845/5000]: Train loss: 0.9009, Valid loss: 0.8685


Epoch [1846/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.18it/s, loss=0.879]


Epoch [1846/5000]: Train loss: 0.8919, Valid loss: 0.8289


Epoch [1847/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.54it/s, loss=0.57]


Epoch [1847/5000]: Train loss: 0.8788, Valid loss: 0.8820


Epoch [1848/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.70it/s, loss=0.748]


Epoch [1848/5000]: Train loss: 0.8885, Valid loss: 0.8117


Epoch [1849/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.43it/s, loss=0.692]


Epoch [1849/5000]: Train loss: 0.8917, Valid loss: 0.8579


Epoch [1850/5000]: 100%|██████████| 10/10 [00:00<00:00, 332.94it/s, loss=1.38]


Epoch [1850/5000]: Train loss: 0.9237, Valid loss: 0.9077


Epoch [1851/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.08it/s, loss=0.921]


Epoch [1851/5000]: Train loss: 0.9049, Valid loss: 0.8714


Epoch [1852/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.94it/s, loss=0.745]


Epoch [1852/5000]: Train loss: 0.8990, Valid loss: 0.8807


Epoch [1853/5000]: 100%|██████████| 10/10 [00:00<00:00, 254.58it/s, loss=0.72]


Epoch [1853/5000]: Train loss: 0.8859, Valid loss: 0.8594


Epoch [1854/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.31it/s, loss=0.89]


Epoch [1854/5000]: Train loss: 0.9020, Valid loss: 0.8717


Epoch [1855/5000]: 100%|██████████| 10/10 [00:00<00:00, 261.52it/s, loss=0.757]


Epoch [1855/5000]: Train loss: 0.9037, Valid loss: 0.8479


Epoch [1856/5000]: 100%|██████████| 10/10 [00:00<00:00, 368.52it/s, loss=1.69]


Epoch [1856/5000]: Train loss: 0.9492, Valid loss: 0.9024


Epoch [1857/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.29it/s, loss=0.825]


Epoch [1857/5000]: Train loss: 0.8899, Valid loss: 0.8023


Epoch [1858/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.38it/s, loss=1.21]


Epoch [1858/5000]: Train loss: 0.9114, Valid loss: 0.8138


Epoch [1859/5000]: 100%|██████████| 10/10 [00:00<00:00, 356.77it/s, loss=0.663]


Epoch [1859/5000]: Train loss: 0.8803, Valid loss: 0.8871


Epoch [1860/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.35it/s, loss=1.08]


Epoch [1860/5000]: Train loss: 0.9046, Valid loss: 0.8738


Epoch [1861/5000]: 100%|██████████| 10/10 [00:00<00:00, 328.51it/s, loss=1.26]


Epoch [1861/5000]: Train loss: 0.9187, Valid loss: 0.8124


Epoch [1862/5000]: 100%|██████████| 10/10 [00:00<00:00, 286.67it/s, loss=0.559]


Epoch [1862/5000]: Train loss: 0.8828, Valid loss: 0.8452


Epoch [1863/5000]: 100%|██████████| 10/10 [00:00<00:00, 324.49it/s, loss=0.806]


Epoch [1863/5000]: Train loss: 0.9030, Valid loss: 0.8998


Epoch [1864/5000]: 100%|██████████| 10/10 [00:00<00:00, 315.35it/s, loss=0.65]


Epoch [1864/5000]: Train loss: 0.9068, Valid loss: 0.8861


Epoch [1865/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.53it/s, loss=0.782]


Epoch [1865/5000]: Train loss: 0.8904, Valid loss: 0.8289


Epoch [1866/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.58it/s, loss=1.01]


Epoch [1866/5000]: Train loss: 0.9077, Valid loss: 0.7942


Epoch [1867/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.47it/s, loss=0.963]


Epoch [1867/5000]: Train loss: 0.8966, Valid loss: 0.9149


Epoch [1868/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.71it/s, loss=0.693]


Epoch [1868/5000]: Train loss: 0.8909, Valid loss: 0.9034


Epoch [1869/5000]: 100%|██████████| 10/10 [00:00<00:00, 245.41it/s, loss=1.06]


Epoch [1869/5000]: Train loss: 0.9120, Valid loss: 0.8559


Epoch [1870/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.29it/s, loss=1.09]


Epoch [1870/5000]: Train loss: 0.9107, Valid loss: 0.8501


Epoch [1871/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.01it/s, loss=0.928]


Epoch [1871/5000]: Train loss: 0.8999, Valid loss: 0.8286


Epoch [1872/5000]: 100%|██████████| 10/10 [00:00<00:00, 361.88it/s, loss=0.789]


Epoch [1872/5000]: Train loss: 0.9056, Valid loss: 0.8709


Epoch [1873/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.59it/s, loss=0.944]


Epoch [1873/5000]: Train loss: 0.8960, Valid loss: 0.9032


Epoch [1874/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.09it/s, loss=0.732]


Epoch [1874/5000]: Train loss: 0.8951, Valid loss: 0.8553


Epoch [1875/5000]: 100%|██████████| 10/10 [00:00<00:00, 264.02it/s, loss=0.912]


Epoch [1875/5000]: Train loss: 0.9005, Valid loss: 0.8632


Epoch [1876/5000]: 100%|██████████| 10/10 [00:00<00:00, 251.48it/s, loss=0.896]


Epoch [1876/5000]: Train loss: 0.8983, Valid loss: 0.8930


Epoch [1877/5000]: 100%|██████████| 10/10 [00:00<00:00, 259.65it/s, loss=1.1]


Epoch [1877/5000]: Train loss: 0.9112, Valid loss: 0.8592


Epoch [1878/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.00it/s, loss=0.59]


Epoch [1878/5000]: Train loss: 0.8817, Valid loss: 0.8230


Epoch [1879/5000]: 100%|██████████| 10/10 [00:00<00:00, 255.55it/s, loss=0.763]


Epoch [1879/5000]: Train loss: 0.8952, Valid loss: 0.8764


Epoch [1880/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.15it/s, loss=0.888]


Epoch [1880/5000]: Train loss: 0.9022, Valid loss: 0.9054


Epoch [1881/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.64it/s, loss=0.67]


Epoch [1881/5000]: Train loss: 0.8980, Valid loss: 0.9250


Epoch [1882/5000]: 100%|██████████| 10/10 [00:00<00:00, 307.06it/s, loss=1.12]


Epoch [1882/5000]: Train loss: 0.9351, Valid loss: 0.9022


Epoch [1883/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.65it/s, loss=0.884]


Epoch [1883/5000]: Train loss: 0.9302, Valid loss: 0.8764


Epoch [1884/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.80it/s, loss=0.894]


Epoch [1884/5000]: Train loss: 0.9089, Valid loss: 0.8872


Epoch [1885/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.30it/s, loss=0.869]


Epoch [1885/5000]: Train loss: 0.8995, Valid loss: 0.8906


Epoch [1886/5000]: 100%|██████████| 10/10 [00:00<00:00, 257.27it/s, loss=0.778]


Epoch [1886/5000]: Train loss: 0.9001, Valid loss: 0.8886


Epoch [1887/5000]: 100%|██████████| 10/10 [00:00<00:00, 314.60it/s, loss=0.748]


Epoch [1887/5000]: Train loss: 0.8866, Valid loss: 0.8807


Epoch [1888/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.14it/s, loss=0.97]


Epoch [1888/5000]: Train loss: 0.9057, Valid loss: 0.8424


Epoch [1889/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.05it/s, loss=1.11]


Epoch [1889/5000]: Train loss: 0.9435, Valid loss: 0.9680


Epoch [1890/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.24it/s, loss=1.43]


Epoch [1890/5000]: Train loss: 0.9777, Valid loss: 0.9582


Epoch [1891/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.54it/s, loss=0.769]


Epoch [1891/5000]: Train loss: 0.9092, Valid loss: 0.8849


Epoch [1892/5000]: 100%|██████████| 10/10 [00:00<00:00, 307.34it/s, loss=1.19]


Epoch [1892/5000]: Train loss: 0.9289, Valid loss: 0.8247


Epoch [1893/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.72it/s, loss=1.07]


Epoch [1893/5000]: Train loss: 0.9092, Valid loss: 0.8019


Epoch [1894/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.45it/s, loss=0.86]


Epoch [1894/5000]: Train loss: 0.8917, Valid loss: 0.9300


Epoch [1895/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.76it/s, loss=0.829]


Epoch [1895/5000]: Train loss: 0.8934, Valid loss: 0.9371


Epoch [1896/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.40it/s, loss=0.928]


Epoch [1896/5000]: Train loss: 0.8968, Valid loss: 0.8218


Epoch [1897/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.55it/s, loss=0.697]


Epoch [1897/5000]: Train loss: 0.8847, Valid loss: 0.8341


Epoch [1898/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.09it/s, loss=0.728]


Epoch [1898/5000]: Train loss: 0.8881, Valid loss: 0.8876


Epoch [1899/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.64it/s, loss=0.83]


Epoch [1899/5000]: Train loss: 0.9047, Valid loss: 0.8799


Epoch [1900/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.19it/s, loss=0.989]


Epoch [1900/5000]: Train loss: 0.9170, Valid loss: 0.8527


Epoch [1901/5000]: 100%|██████████| 10/10 [00:00<00:00, 254.90it/s, loss=0.689]


Epoch [1901/5000]: Train loss: 0.8877, Valid loss: 0.8855


Epoch [1902/5000]: 100%|██████████| 10/10 [00:00<00:00, 357.93it/s, loss=0.87]


Epoch [1902/5000]: Train loss: 0.9237, Valid loss: 0.9033


Epoch [1903/5000]: 100%|██████████| 10/10 [00:00<00:00, 276.56it/s, loss=0.719]


Epoch [1903/5000]: Train loss: 0.9080, Valid loss: 0.8751


Epoch [1904/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.21it/s, loss=0.858]


Epoch [1904/5000]: Train loss: 0.8989, Valid loss: 0.8756


Epoch [1905/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.80it/s, loss=0.993]


Epoch [1905/5000]: Train loss: 0.9011, Valid loss: 0.8388


Epoch [1906/5000]: 100%|██████████| 10/10 [00:00<00:00, 277.15it/s, loss=1.09]


Epoch [1906/5000]: Train loss: 0.9096, Valid loss: 0.8237


Epoch [1907/5000]: 100%|██████████| 10/10 [00:00<00:00, 262.00it/s, loss=0.514]


Epoch [1907/5000]: Train loss: 0.8751, Valid loss: 0.8077


Epoch [1908/5000]: 100%|██████████| 10/10 [00:00<00:00, 275.14it/s, loss=0.781]


Epoch [1908/5000]: Train loss: 0.8852, Valid loss: 0.8576


Epoch [1909/5000]: 100%|██████████| 10/10 [00:00<00:00, 270.76it/s, loss=0.864]


Epoch [1909/5000]: Train loss: 0.8916, Valid loss: 0.8551


Epoch [1910/5000]: 100%|██████████| 10/10 [00:00<00:00, 233.23it/s, loss=0.838]


Epoch [1910/5000]: Train loss: 0.8907, Valid loss: 0.8364


Epoch [1911/5000]: 100%|██████████| 10/10 [00:00<00:00, 283.27it/s, loss=0.951]

Epoch [1911/5000]: Train loss: 0.8966, Valid loss: 0.8777



Epoch [1912/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.11it/s, loss=0.842]


Epoch [1912/5000]: Train loss: 0.8887, Valid loss: 0.8605


Epoch [1913/5000]: 100%|██████████| 10/10 [00:00<00:00, 309.81it/s, loss=1.07]


Epoch [1913/5000]: Train loss: 0.9521, Valid loss: 0.9011


Epoch [1914/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.36it/s, loss=0.679]


Epoch [1914/5000]: Train loss: 0.8962, Valid loss: 0.8553


Epoch [1915/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.51it/s, loss=0.778]


Epoch [1915/5000]: Train loss: 0.9028, Valid loss: 0.9032


Epoch [1916/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.74it/s, loss=0.766]


Epoch [1916/5000]: Train loss: 0.8887, Valid loss: 0.8222


Epoch [1917/5000]: 100%|██████████| 10/10 [00:00<00:00, 333.79it/s, loss=0.679]


Epoch [1917/5000]: Train loss: 0.8934, Valid loss: 0.8438


Epoch [1918/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.90it/s, loss=0.813]


Epoch [1918/5000]: Train loss: 0.9108, Valid loss: 0.8285


Epoch [1919/5000]: 100%|██████████| 10/10 [00:00<00:00, 340.84it/s, loss=1.07]


Epoch [1919/5000]: Train loss: 0.9075, Valid loss: 0.8589


Epoch [1920/5000]: 100%|██████████| 10/10 [00:00<00:00, 253.62it/s, loss=0.69]


Epoch [1920/5000]: Train loss: 0.8856, Valid loss: 0.8696


Epoch [1921/5000]: 100%|██████████| 10/10 [00:00<00:00, 338.87it/s, loss=0.869]


Epoch [1921/5000]: Train loss: 0.8966, Valid loss: 0.8613


Epoch [1922/5000]: 100%|██████████| 10/10 [00:00<00:00, 288.89it/s, loss=0.91]


Epoch [1922/5000]: Train loss: 0.9025, Valid loss: 0.9523


Epoch [1923/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.06it/s, loss=0.777]


Epoch [1923/5000]: Train loss: 0.8978, Valid loss: 0.8937


Epoch [1924/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.15it/s, loss=0.916]


Epoch [1924/5000]: Train loss: 0.9038, Valid loss: 0.8521


Epoch [1925/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.21it/s, loss=0.647]


Epoch [1925/5000]: Train loss: 0.8853, Valid loss: 0.8510


Epoch [1926/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.20it/s, loss=1.07]


Epoch [1926/5000]: Train loss: 0.9056, Valid loss: 0.8840


Epoch [1927/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.07it/s, loss=0.942]


Epoch [1927/5000]: Train loss: 0.9003, Valid loss: 0.8696


Epoch [1928/5000]: 100%|██████████| 10/10 [00:00<00:00, 302.61it/s, loss=1.04]


Epoch [1928/5000]: Train loss: 0.9064, Valid loss: 0.8989


Epoch [1929/5000]: 100%|██████████| 10/10 [00:00<00:00, 337.99it/s, loss=1.16]


Epoch [1929/5000]: Train loss: 0.9210, Valid loss: 0.9265


Epoch [1930/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.38it/s, loss=0.829]


Epoch [1930/5000]: Train loss: 0.9089, Valid loss: 0.8606


Epoch [1931/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.52it/s, loss=0.981]

Epoch [1931/5000]: Train loss: 0.9030, Valid loss: 0.8261



Epoch [1932/5000]: 100%|██████████| 10/10 [00:00<00:00, 84.67it/s, loss=0.898]


Epoch [1932/5000]: Train loss: 0.9029, Valid loss: 0.8579


Epoch [1933/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.51it/s, loss=0.73]


Epoch [1933/5000]: Train loss: 0.9084, Valid loss: 0.8707


Epoch [1934/5000]: 100%|██████████| 10/10 [00:00<00:00, 253.30it/s, loss=0.729]


Epoch [1934/5000]: Train loss: 0.8941, Valid loss: 0.8186


Epoch [1935/5000]: 100%|██████████| 10/10 [00:00<00:00, 345.06it/s, loss=0.844]


Epoch [1935/5000]: Train loss: 0.9010, Valid loss: 0.8642


Epoch [1936/5000]: 100%|██████████| 10/10 [00:00<00:00, 293.36it/s, loss=1.14]


Epoch [1936/5000]: Train loss: 0.9219, Valid loss: 0.8547


Epoch [1937/5000]: 100%|██████████| 10/10 [00:00<00:00, 335.39it/s, loss=0.79]


Epoch [1937/5000]: Train loss: 0.9032, Valid loss: 0.9151


Epoch [1938/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.35it/s, loss=0.733]


Epoch [1938/5000]: Train loss: 0.8933, Valid loss: 0.8456


Epoch [1939/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.13it/s, loss=1.07]


Epoch [1939/5000]: Train loss: 0.9122, Valid loss: 0.8410


Epoch [1940/5000]: 100%|██████████| 10/10 [00:00<00:00, 307.03it/s, loss=0.913]


Epoch [1940/5000]: Train loss: 0.8960, Valid loss: 0.8623


Epoch [1941/5000]: 100%|██████████| 10/10 [00:00<00:00, 294.84it/s, loss=1.02]


Epoch [1941/5000]: Train loss: 0.9008, Valid loss: 0.8453


Epoch [1942/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.67it/s, loss=0.818]


Epoch [1942/5000]: Train loss: 0.8961, Valid loss: 0.8552


Epoch [1943/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.23it/s, loss=1.34]


Epoch [1943/5000]: Train loss: 0.9265, Valid loss: 0.8727


Epoch [1944/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.06it/s, loss=0.945]


Epoch [1944/5000]: Train loss: 0.9012, Valid loss: 0.8609


Epoch [1945/5000]: 100%|██████████| 10/10 [00:00<00:00, 284.28it/s, loss=0.9]


Epoch [1945/5000]: Train loss: 0.8995, Valid loss: 0.8411


Epoch [1946/5000]: 100%|██████████| 10/10 [00:00<00:00, 358.29it/s, loss=0.925]


Epoch [1946/5000]: Train loss: 0.9032, Valid loss: 0.9037


Epoch [1947/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.09it/s, loss=0.798]


Epoch [1947/5000]: Train loss: 0.9122, Valid loss: 0.8883


Epoch [1948/5000]: 100%|██████████| 10/10 [00:00<00:00, 321.26it/s, loss=0.82]


Epoch [1948/5000]: Train loss: 0.9048, Valid loss: 0.8569


Epoch [1949/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.45it/s, loss=1.27]


Epoch [1949/5000]: Train loss: 0.9288, Valid loss: 0.8712


Epoch [1950/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.57it/s, loss=0.921]


Epoch [1950/5000]: Train loss: 0.8968, Valid loss: 0.8997


Epoch [1951/5000]: 100%|██████████| 10/10 [00:00<00:00, 248.88it/s, loss=0.68]


Epoch [1951/5000]: Train loss: 0.8809, Valid loss: 0.8251


Epoch [1952/5000]: 100%|██████████| 10/10 [00:00<00:00, 282.92it/s, loss=0.885]


Epoch [1952/5000]: Train loss: 0.8914, Valid loss: 0.9169


Epoch [1953/5000]: 100%|██████████| 10/10 [00:00<00:00, 378.48it/s, loss=1.5]


Epoch [1953/5000]: Train loss: 0.9333, Valid loss: 0.8614


Epoch [1954/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.29it/s, loss=0.959]


Epoch [1954/5000]: Train loss: 0.8966, Valid loss: 0.8544


Epoch [1955/5000]: 100%|██████████| 10/10 [00:00<00:00, 312.63it/s, loss=0.954]


Epoch [1955/5000]: Train loss: 0.8959, Valid loss: 0.8842


Epoch [1956/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.95it/s, loss=0.755]


Epoch [1956/5000]: Train loss: 0.8986, Valid loss: 0.9114


Epoch [1957/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.55it/s, loss=0.809]


Epoch [1957/5000]: Train loss: 0.8999, Valid loss: 0.8338


Epoch [1958/5000]: 100%|██████████| 10/10 [00:00<00:00, 336.08it/s, loss=1.03]


Epoch [1958/5000]: Train loss: 0.9201, Valid loss: 0.8136


Epoch [1959/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.45it/s, loss=1.13]


Epoch [1959/5000]: Train loss: 0.9111, Valid loss: 0.8732


Epoch [1960/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.05it/s, loss=0.835]

Epoch [1960/5000]: Train loss: 0.8864, Valid loss: 0.8624



Epoch [1961/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.10it/s, loss=1.08]


Epoch [1961/5000]: Train loss: 0.9085, Valid loss: 0.8792


Epoch [1962/5000]: 100%|██████████| 10/10 [00:00<00:00, 313.49it/s, loss=0.474]


Epoch [1962/5000]: Train loss: 0.8697, Valid loss: 0.8457


Epoch [1963/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.50it/s, loss=0.633]


Epoch [1963/5000]: Train loss: 0.8760, Valid loss: 0.8341


Epoch [1964/5000]: 100%|██████████| 10/10 [00:00<00:00, 296.31it/s, loss=0.938]


Epoch [1964/5000]: Train loss: 0.8967, Valid loss: 0.9210


Epoch [1965/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.01it/s, loss=1.07]


Epoch [1965/5000]: Train loss: 0.9027, Valid loss: 0.8360


Epoch [1966/5000]: 100%|██████████| 10/10 [00:00<00:00, 239.14it/s, loss=0.658]


Epoch [1966/5000]: Train loss: 0.8778, Valid loss: 0.8855


Epoch [1967/5000]: 100%|██████████| 10/10 [00:00<00:00, 372.31it/s, loss=0.946]


Epoch [1967/5000]: Train loss: 0.9004, Valid loss: 0.8131


Epoch [1968/5000]: 100%|██████████| 10/10 [00:00<00:00, 267.28it/s, loss=0.993]


Epoch [1968/5000]: Train loss: 0.9150, Valid loss: 0.8785


Epoch [1969/5000]: 100%|██████████| 10/10 [00:00<00:00, 363.63it/s, loss=0.879]


Epoch [1969/5000]: Train loss: 0.8992, Valid loss: 0.8290


Epoch [1970/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.98it/s, loss=0.968]


Epoch [1970/5000]: Train loss: 0.8972, Valid loss: 0.8051


Epoch [1971/5000]: 100%|██████████| 10/10 [00:00<00:00, 322.78it/s, loss=1.1]


Epoch [1971/5000]: Train loss: 0.9021, Valid loss: 0.8639


Epoch [1972/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.24it/s, loss=0.684]


Epoch [1972/5000]: Train loss: 0.8791, Valid loss: 0.9343


Epoch [1973/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.75it/s, loss=1.27]


Epoch [1973/5000]: Train loss: 0.9146, Valid loss: 0.8510


Epoch [1974/5000]: 100%|██████████| 10/10 [00:00<00:00, 300.31it/s, loss=0.761]


Epoch [1974/5000]: Train loss: 0.8851, Valid loss: 0.8998


Epoch [1975/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.63it/s, loss=0.775]


Epoch [1975/5000]: Train loss: 0.8912, Valid loss: 0.8801


Epoch [1976/5000]: 100%|██████████| 10/10 [00:00<00:00, 303.02it/s, loss=0.982]


Epoch [1976/5000]: Train loss: 0.9124, Valid loss: 0.9367


Epoch [1977/5000]: 100%|██████████| 10/10 [00:00<00:00, 281.46it/s, loss=0.993]


Epoch [1977/5000]: Train loss: 0.9030, Valid loss: 0.8382


Epoch [1978/5000]: 100%|██████████| 10/10 [00:00<00:00, 295.05it/s, loss=0.892]


Epoch [1978/5000]: Train loss: 0.8930, Valid loss: 0.9043


Epoch [1979/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.05it/s, loss=0.629]


Epoch [1979/5000]: Train loss: 0.8752, Valid loss: 0.9027


Epoch [1980/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.61it/s, loss=0.803]


Epoch [1980/5000]: Train loss: 0.8941, Valid loss: 0.8744


Epoch [1981/5000]: 100%|██████████| 10/10 [00:00<00:00, 344.78it/s, loss=0.89]


Epoch [1981/5000]: Train loss: 0.9055, Valid loss: 0.8484


Epoch [1982/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.37it/s, loss=0.629]


Epoch [1982/5000]: Train loss: 0.8864, Valid loss: 0.9294


Epoch [1983/5000]: 100%|██████████| 10/10 [00:00<00:00, 271.99it/s, loss=0.765]


Epoch [1983/5000]: Train loss: 0.8839, Valid loss: 0.8385


Epoch [1984/5000]: 100%|██████████| 10/10 [00:00<00:00, 371.61it/s, loss=1.32]


Epoch [1984/5000]: Train loss: 0.9168, Valid loss: 0.8935


Epoch [1985/5000]: 100%|██████████| 10/10 [00:00<00:00, 365.47it/s, loss=0.814]


Epoch [1985/5000]: Train loss: 0.8909, Valid loss: 0.8926


Epoch [1986/5000]: 100%|██████████| 10/10 [00:00<00:00, 299.91it/s, loss=0.728]


Epoch [1986/5000]: Train loss: 0.8818, Valid loss: 0.8667


Epoch [1987/5000]: 100%|██████████| 10/10 [00:00<00:00, 330.76it/s, loss=1.29]


Epoch [1987/5000]: Train loss: 0.9174, Valid loss: 0.8785


Epoch [1988/5000]: 100%|██████████| 10/10 [00:00<00:00, 306.66it/s, loss=0.759]


Epoch [1988/5000]: Train loss: 0.8823, Valid loss: 0.8394


Epoch [1989/5000]: 100%|██████████| 10/10 [00:00<00:00, 310.93it/s, loss=0.964]


Epoch [1989/5000]: Train loss: 0.8961, Valid loss: 0.8897


Epoch [1990/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.52it/s, loss=0.876]


Epoch [1990/5000]: Train loss: 0.9028, Valid loss: 0.8908


Epoch [1991/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.65it/s, loss=0.498]


Epoch [1991/5000]: Train loss: 0.8728, Valid loss: 0.8511


Epoch [1992/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.82it/s, loss=1.07]


Epoch [1992/5000]: Train loss: 0.9037, Valid loss: 0.8487


Epoch [1993/5000]: 100%|██████████| 10/10 [00:00<00:00, 332.93it/s, loss=0.873]


Epoch [1993/5000]: Train loss: 0.8968, Valid loss: 0.8534


Epoch [1994/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.52it/s, loss=0.791]


Epoch [1994/5000]: Train loss: 0.8844, Valid loss: 0.8510


Epoch [1995/5000]: 100%|██████████| 10/10 [00:00<00:00, 351.11it/s, loss=0.983]


Epoch [1995/5000]: Train loss: 0.8962, Valid loss: 0.8996


Epoch [1996/5000]: 100%|██████████| 10/10 [00:00<00:00, 326.24it/s, loss=0.658]


Epoch [1996/5000]: Train loss: 0.8955, Valid loss: 0.8801


Epoch [1997/5000]: 100%|██████████| 10/10 [00:00<00:00, 320.65it/s, loss=0.709]


Epoch [1997/5000]: Train loss: 0.8852, Valid loss: 0.8473


Epoch [1998/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.39it/s, loss=0.768]


Epoch [1998/5000]: Train loss: 0.8890, Valid loss: 0.8446


Epoch [1999/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.94it/s, loss=0.964]


Epoch [1999/5000]: Train loss: 0.9061, Valid loss: 0.8465


Epoch [2000/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.60it/s, loss=0.908]

Epoch [2000/5000]: Train loss: 0.9322, Valid loss: 0.8673



Epoch [2001/5000]: 100%|██████████| 10/10 [00:00<00:00, 318.86it/s, loss=0.658]


Epoch [2001/5000]: Train loss: 0.8994, Valid loss: 0.8937


Epoch [2002/5000]: 100%|██████████| 10/10 [00:00<00:00, 305.40it/s, loss=1.04]


Epoch [2002/5000]: Train loss: 0.9298, Valid loss: 0.8480


Epoch [2003/5000]: 100%|██████████| 10/10 [00:00<00:00, 346.74it/s, loss=0.875]


Epoch [2003/5000]: Train loss: 0.9168, Valid loss: 0.8398


Epoch [2004/5000]: 100%|██████████| 10/10 [00:00<00:00, 304.83it/s, loss=0.8]


Epoch [2004/5000]: Train loss: 0.9028, Valid loss: 0.8006


Epoch [2005/5000]: 100%|██████████| 10/10 [00:00<00:00, 320.58it/s, loss=0.789]


Epoch [2005/5000]: Train loss: 0.8877, Valid loss: 0.9614


Epoch [2006/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.11it/s, loss=1.01]


Epoch [2006/5000]: Train loss: 0.9075, Valid loss: 0.8972


Epoch [2007/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.68it/s, loss=0.845]


Epoch [2007/5000]: Train loss: 0.8953, Valid loss: 0.9736


Epoch [2008/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.20it/s, loss=0.808]


Epoch [2008/5000]: Train loss: 0.9036, Valid loss: 0.9028


Epoch [2009/5000]: 100%|██████████| 10/10 [00:00<00:00, 343.90it/s, loss=0.973]


Epoch [2009/5000]: Train loss: 0.9058, Valid loss: 0.8871


Epoch [2010/5000]: 100%|██████████| 10/10 [00:00<00:00, 316.57it/s, loss=1.01]


Epoch [2010/5000]: Train loss: 0.9067, Valid loss: 0.8312


Epoch [2011/5000]: 100%|██████████| 10/10 [00:00<00:00, 323.67it/s, loss=0.846]


Epoch [2011/5000]: Train loss: 0.8919, Valid loss: 0.8895


Epoch [2012/5000]: 100%|██████████| 10/10 [00:00<00:00, 334.38it/s, loss=0.773]


Epoch [2012/5000]: Train loss: 0.8913, Valid loss: 0.8759


Epoch [2013/5000]: 100%|██████████| 10/10 [00:00<00:00, 327.90it/s, loss=0.991]


Epoch [2013/5000]: Train loss: 0.9027, Valid loss: 0.9164


Epoch [2014/5000]:   0%|          | 0/10 [00:00<?, ?it/s, loss=0.991]

In [20]:
# same_seed(config['seed'])

train_data, test_data = pd.read_csv(File_Path_Train).values, pd.read_csv(File_Path_Test).values
train_data, valid_data = train_valid_split(train_data, config['valid_ratio'], config['seed'])


# Select features
x_train, x_valid, x_test, y_train, y_valid = select_feat(train_data, valid_data, test_data, config['select_all'], config['select_features'])

# Rebuild the validation dataset from the split above so it matches x_valid and y_valid.
valid_dataset = COVID19Dataset(x_valid, y_valid)

# The PyTorch data loader groups a dataset into batches.
# train_loader = DataLoader(train_dataset, batch_size=config['batch_size'], shuffle=True, pin_memory=True)
valid_loader = DataLoader(valid_dataset, batch_size=config['batch_size'], shuffle=True, pin_memory=True)
# test_loader = DataLoader(test_dataset, batch_size=config['batch_size'], shuffle=False, pin_memory=True)

criterion = nn.MSELoss(reduction='mean') # Define your loss function, do not modify this.

model = My_Model(input_dim=x_train.shape[1]).to(device)
model.load_state_dict(torch.load(config['save_path']))
model.eval() # Set the model to evaluation mode before measuring the validation loss.
loss_record = []
for x, y in valid_loader:
    x, y = x.to(device), y.to(device)
    with torch.no_grad(): # No gradients are needed for evaluation.
        pred = model(x)
        loss = criterion(pred, y)

    loss_record.append(loss.item())

mean_valid_loss = sum(loss_record)/len(loss_record)


# preds = predict(valid_loader, model, device)
# for idx,x, y in enumerate(valid_loader):
#     loss=criterion(preds[index],y)
#     loss_record.append(loss.item())

# mean_valid_loss = sum(loss_record)/len(loss_record)
print(mean_valid_loss)

0.8569455941518148
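
The cell above averages the per-batch losses, so a smaller final batch counts as much as a full one. Because `nn.MSELoss(reduction='mean')` returns the mean over each batch, a per-sample average can be recovered by weighting each batch loss by its batch size. The helper below is a minimal sketch of that arithmetic in plain Python; `weighted_mean_loss` and the example numbers are hypothetical and not part of the notebook.

```python
def weighted_mean_loss(batch_losses, batch_sizes):
    """Average per-batch mean losses, weighting each batch by its number of samples."""
    if len(batch_losses) != len(batch_sizes):
        raise ValueError("batch_losses and batch_sizes must have the same length")
    total = sum(batch_sizes)
    if total == 0:
        raise ValueError("no samples")
    return sum(loss * n for loss, n in zip(batch_losses, batch_sizes)) / total


# Made-up example: two full batches of 256 samples and a final batch of 28.
losses = [0.80, 0.90, 1.50]
sizes = [256, 256, 28]

unweighted = sum(losses) / len(losses)        # plain mean of the batch losses
weighted = weighted_mean_loss(losses, sizes)  # per-sample mean
print(unweighted, weighted)
```

In the evaluation loop, this would amount to recording `len(x)` next to `loss.item()` for every batch.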


# Plot learning curves with `tensorboard` (optional)

`tensorboard` is a tool that allows you to visualize your training progress.

If this block does not display your learning curve, wait a few minutes and re-run it. The logging information may take some time to load.

In [None]:
%reload_ext tensorboard
%tensorboard --logdir=./runs/
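
If TensorBoard is not available, the epoch summaries printed during training can also be turned into a learning curve. The following is a minimal sketch, assuming the log keeps the `Epoch [n/N]: Train loss: ..., Valid loss: ...` format shown in the training output; `parse_epoch_log` is a hypothetical helper and not part of the notebook.

```python
import re

# Matches summary lines such as "Epoch [1844/5000]: Train loss: 0.8927, Valid loss: 0.8320".
EPOCH_LINE = re.compile(
    r"Epoch \[(\d+)/\d+\]: Train loss: ([0-9.]+), Valid loss: ([0-9.]+)"
)


def parse_epoch_log(text):
    """Return a list of (epoch, train_loss, valid_loss) tuples found in the log text."""
    records = []
    for line in text.splitlines():
        match = EPOCH_LINE.search(line)
        if match:
            records.append(
                (int(match.group(1)), float(match.group(2)), float(match.group(3)))
            )
    return records


# A few lines in the same format as the training output; the progress-bar line is ignored.
sample_log = """
Epoch [1844/5000]: 100%|##########| 10/10 [00:00<00:00, 334.44it/s, loss=0.687]
Epoch [1844/5000]: Train loss: 0.8927, Valid loss: 0.8320
Epoch [1845/5000]: Train loss: 0.9009, Valid loss: 0.8685
Epoch [1846/5000]: Train loss: 0.8919, Valid loss: 0.8289
"""

records = parse_epoch_log(sample_log)
# Pick the epoch with the lowest validation loss.
best_epoch, _, best_valid = min(records, key=lambda r: r[2])
print(best_epoch, best_valid)
```

The resulting tuples can be passed to any plotting library to draw the train and validation curves.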

# Testing
The predictions of your model on the testing set will be stored in `pred.csv`.

In [None]:
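def save_pred(preds, file):
    ''' Save predictions to specified file '''
    # newline='' is recommended by the csv module; without it, Windows writes a blank line after every row.
    with open(file, 'w', newline='') as fp:
        writer = csv.writer(fp)
        writer.writerow(['id', 'tested_positive'])
        for i, p in enumerate(preds):
            writer.writerow([i, p])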

model = My_Model(input_dim=x_train.shape[1]).to(device)
model.load_state_dict(torch.load(config['save_path']))
preds = predict(test_loader, model, device)
save_pred(preds, 'pred.csv')
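
Before submitting, it can help to read the file back and confirm it has the expected header and one row per prediction. The following is a minimal sketch using only the standard library; `check_pred_file` is a hypothetical helper, and the demo writes made-up values to a temporary file rather than touching the real `pred.csv`.

```python
import csv
import os
import tempfile


def check_pred_file(path, expected_rows):
    """Check for the 'id,tested_positive' header, consecutive ids, and numeric predictions."""
    with open(path, newline='') as fp:
        rows = list(csv.reader(fp))
    if not rows or rows[0] != ['id', 'tested_positive']:
        return False
    body = rows[1:]
    if len(body) != expected_rows:
        return False
    for i, row in enumerate(body):
        # Each row must hold the running index and a value that parses as a float.
        if len(row) != 2 or row[0] != str(i):
            return False
        try:
            float(row[1])
        except ValueError:
            return False
    return True


# Demo with made-up predictions, using the same column layout as save_pred.
demo_preds = [1.5, 2.25, 3.0]
demo_path = os.path.join(tempfile.mkdtemp(), 'pred_demo.csv')
with open(demo_path, 'w', newline='') as fp:
    writer = csv.writer(fp)
    writer.writerow(['id', 'tested_positive'])
    for i, p in enumerate(demo_preds):
        writer.writerow([i, p])

ok = check_pred_file(demo_path, expected_rows=len(demo_preds))
print(ok)
```

For the real file, the expected row count would be the number of samples in the test set.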

# Download

Run this block to download `pred.csv` automatically. The code uses `google.colab`, which is only available when running on Google Colab.

In [None]:
# from google.colab import files
# files.download('pred.csv')

# Reference
This notebook uses code written by Heng-Jui Chang @ NTUEE (https://github.com/ga642381/ML2021-Spring/blob/main/HW01/HW01.ipynb)