# Introduction

### Problem

Huang, Zhe, Wessler, Benjamin S., and Hughes, Michael C. (2023), *Detecting Heart
Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance
Learning*, addresses the clinical problem of under-diagnosis and under-treatment
of aortic stenosis (AS), a degenerative valve condition. In clinical practice, AS is diagnosed
by manual expert review of a transthoracic echocardiogram (TTE), which uses ultrasound
to produce many images of the heart. AS can be treated effectively, but only if it is
identified early. Automatic screening for AS from TTE imagery could improve the rate of
detection and decrease mortality.

### Paper Explanation
The challenge with automatic detection is that each TTE “consists of dozens of images or videos
(typically 27-97 in our data) that show the heart’s complex anatomy from different
acquisition angles” (Huang, Wessler, and Hughes, 2023). A clinical expert identifies the
images where the aortic valve is clearly visible, then assesses severity on a 3-level scale
(no, early, or significant disease). Traditional deep learning approaches classify a single
image with a single result, whereas the clinical expert makes a single “coherent prediction”
(Huang, Wessler, and Hughes, 2023) from knowledge gathered across the whole set of images.
Additionally, the views of the images produced by a TTE are often unlabeled in electronic
health records, which further complicates deep learning approaches.

The paper finds previous approaches to automatic detection, such as attention-based
multiple instance learning (MIL), to be insufficient in accuracy and detection yield,
and explores a novel MIL approach that mimics the methodology of a clinical expert
to improve the detection of AS.

The paper outlines two novel contributions to automatic AS detection:

1) Supervised attention mechanism that identifies relevant TTE views (often unlabeled),
mimicking human filtering done by a clinical expert. This is accomplished by introducing a
new loss term, “supervised attention (SA)”, to match attention weights to the relevance
scores from a View Relevance classifier.

2) Self-supervised pretraining strategy through contrastive learning on the embedding of
the entire TTE study (i.e., a “bag of images”), in contrast to traditional pretraining
strategies, which focus on individual images.
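
To make the first contribution more concrete, the following is a minimal sketch of a supervised attention loss. It assumes the mismatch between attention weights and view relevance is measured with a KL divergence, and that relevance scores are turned into a distribution with a temperature-scaled softmax. The function names, scores, and temperature are illustrative assumptions, not the paper's implementation.

```python
import math


def softmax(scores, temperature=1.0):
    """Turn raw scores into a probability distribution over the images in a bag."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted) for two distributions of equal length."""
    return sum(t * math.log((t + eps) / (p + eps)) for t, p in zip(target, predicted))


# Hypothetical bag of four images: the first two look like relevant views.
relevance_scores = [0.9, 0.8, 0.1, 0.05]  # assumed output of a view-relevance classifier
attention_logits = [0.2, 0.1, 0.3, 0.0]   # assumed raw attention scores from the MIL model

target = softmax(relevance_scores, temperature=0.1)  # temperature is an assumed value
attention = softmax(attention_logits)

# The penalty shrinks as the attention distribution approaches the relevance distribution.
sa_loss = kl_divergence(target, attention)
print(f"supervised attention loss: {sa_loss:.4f}")
```

In training, a term like this would be added to the diagnosis loss with a weighting coefficient, so that the model is rewarded for attending to the views an expert would select.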

### Paper Results

The paper uses *balanced accuracy* as the performance metric because class imbalance in the TMED-2 dataset makes standard accuracy "less suitable" (Huang et al., 2023). The proposed method (SAMIL) was compared to general-purpose multiple-instance algorithms and to prior deep-neural-network methods for AS diagnosis.
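
As a rough illustration of why balanced accuracy suits an imbalanced dataset, the sketch below computes it as the unweighted mean of per-class recall, which is the standard definition (the paper's own evaluation code may differ in detail). The toy labels are hypothetical and are not drawn from TMED-2.

```python
from collections import defaultdict


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1
    recalls = [correct[label] / total[label] for label in total]
    return sum(recalls) / len(recalls)


# Hypothetical imbalanced toy set: six studies of class 2, three of class 1, one of class 0.
y_true = [2] * 6 + [1] * 3 + [0] * 1
# A degenerate model that always predicts the majority class.
y_pred = [2] * 10

plain_accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"plain accuracy:    {plain_accuracy:.3f}")
print(f"balanced accuracy: {balanced_accuracy(y_true, y_pred):.3f}")
```

The always-majority model looks acceptable under plain accuracy but scores no better than chance under balanced accuracy, because it has zero recall on two of the three classes.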

SAMIL achieved 76% balanced accuracy, compared with a range of 60-67% for the 4 other state-of-the-art attention-based MIL architectures tested.

The chart below, from the original paper, compares the balanced accuracy of SAMIL against other approaches dedicated to AS diagnosis (*Filter then Average* and *Weighted Average by View Relevance*) and against general-purpose approaches including ABMIL, Set Transformer, and DSMIL.

![](https://drive.google.com/uc?export=view&id=1IpY4tUQAVBqCfwaPM73vmP8bVcFHnqe6)

(Huang et al., 2023)

# Scope of Reproducibility

### Hypotheses To Be Tested

1. A supervised attention mechanism will provide significant improvements over
standard MIL approaches in AS detection rates and detection accuracy, with a
smaller model size.
2. Self-supervised pretraining of “study-level” TTE artifacts provides improvements in
AS detection rates and detection accuracy over traditional “image-level” pretraining,
or no pretraining at all.

### Planned Ablations

The paper has two ablations targeting the attention strategy and the pretraining strategy.

1. Attention: The attention mechanisms within the pooling layer σ to be tested are the
baseline ABMIL model, ABMIL with gated attention, and the SAMIL model without
pretraining. The paper compares these three approaches and finds that SAMIL’s
supervised attention outperforms ABMIL (the baseline model that SAMIL builds upon)
by 12 percentage points (+1200 bps). The Github repo scripts include parameters to control
the attention mechanism for ABMIL (gated_attention vs. attention) and to run SAMIL with
and without pretraining.
2. Pretraining: The paper introduces a novel approach of built-in study-level (i.e., bag-level)
pretraining. This ablation compares the study-level approach against image-level
contrastive learning and against no pretraining. The paper finds no improvement from
image-level pretraining, whereas study-level pretraining shows an improvement of
4.8 percentage points (+480 bps). The Github repo scripts include parameters to select
the pretraining option: study level, image level, or none.

# Methodology

To reproduce this paper, the following pre-requisites must be acquired:

1. Access to the TMED-2 dataset [here](https://tmed.cs.tufts.edu/tmed_v2.html)
2. Download the pretrained view classifiers, MOCO pretrained checkpoints, and training curves of SAMIL from the paper's Github repo [here](https://tufts.box.com/s/c5w8123j7h3dpls75jye1363uh8qv8us). Once downloaded, upload the entire unzipped folder to your Google Drive (see path below).

The methodology for reproduction is as follows:

1. Create and train the ABMIL model
2. Create and train the SAMIL model with no Pretraining
3. Train the SAMIL model with Image Level Pretraining
4. Train the SAMIL model with Study Level Pretraining

The model definitions and helper methods are pulled from the paper's Github repo.

In [2]:
import os
import zipfile
from google.colab import drive
drive.mount('/content/drive', force_remount=True)

# Modify the paths below if they differ from your upload locations.
# Assumes the SAMIL Github repo has been cloned and uploaded to drive in the `SAMIL` folder.
MODEL_CHECKPOINTS = '/content/drive/MyDrive/SAMIL/model_checkpoints'
ROOT_DIR = '/content/drive/MyDrive/SAMIL'
DATA_INFO_DIR = '/content/drive/MyDrive/SAMIL/data_info'
DATA_DIR = '/content/drive/MyDrive/DL4H-TMED2/'

with zipfile.ZipFile(DATA_DIR + 'labeled.zip', 'r') as zip_ref:
    zip_ref.extractall('/content/data')

with zipfile.ZipFile(DATA_DIR + 'unlabeled.zip', 'r') as zip_ref:
    zip_ref.extractall('/content/data')

LOCAL_DATA_DIR = '/content/data/'

Mounted at /content/drive


## Environment Setup

In [3]:
import math
import glob
import random

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
from torch.utils.data import Dataset, DataLoader
import torchvision
from torchvision import transforms
from tqdm import tqdm

trainingSeed = 0
batchSize = 1
numWorkers = 8
random.seed(trainingSeed)
np.random.seed(trainingSeed)
torch.manual_seed(trainingSeed)

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

##  Data
The dataset used by the paper is the [TMED-2 dataset](https://tmed.cs.tufts.edu/tmed_v2.html), containing transthoracic echocardiogram (TTE) imagery from routine care of patients at Tufts Medical Center.

The paper uses the `view_and_diagnosis_labeled_set` from TMED-2, consisting of 599 studies from 577 patients. Each study is labeled by board-certified medical staff with one of the following values: none, early AS, or significant AS. The dataset has been partitioned into different splits, each containing 360 training studies, 119 validation studies, and 120 test studies.

Code blocks below should only be executed after you have acquired the TMED-2 dataset and uploaded the `view_and_diagnosis_labeled_set` folder to Drive.

In [4]:
labeled_dir = '/content/drive/MyDrive/DL4H-TMED2/labeled'
unlabeled_dir = '/content/drive/MyDrive/DL4H-TMED2/unlabeled'

# Assumes the `view_and_diagnosis_labeled_set` is uploaded to a folder named `DL4H-TMED2`

TMED2SummaryTable = pd.read_csv(os.path.join(DATA_INFO_DIR, 'TMED2SummaryTable.csv'))
SEED_DIR = DATA_INFO_DIR + '/DataPartition/seed0/DEV479/FullyLabeledSet_studies'

train_PatientStudy_list = pd.read_csv(os.path.join(SEED_DIR, "train_studies.csv"))
val_PatientStudy_list = pd.read_csv(os.path.join(SEED_DIR, "val_studies.csv"))
test_PatientStudy_list = pd.read_csv(os.path.join(SEED_DIR, "test_studies.csv"))

train_PatientStudy_ids = train_PatientStudy_list["study"].values
val_PatientStudy_ids = val_PatientStudy_list["study"].values
test_PatientStudy_ids = test_PatientStudy_list["study"].values

# Debug
# print(train_PatientStudy_ids)
# print(val_PatientStudy_ids)
# print(test_PatientStudy_ids)

## EchoDataset

The class below is directly from the paper repo and handles transforming and loading the TMED-2 image data.

In [5]:
from PIL import Image
from torch.utils.data import Dataset

DiagnosisStr_to_Int_Mapping={
    'no_AS':0,
    'mild_AS':1,
    'mildtomod_AS':1,
    'moderate_AS':2,
    'severe_AS':2

}

class EchoDataset(Dataset):
    def __init__(self, PatientStudy_list, TMED2SummaryTable, ML_DATA_dir, sampling_strategy='first_frame', training_seed=0, transform_fn=None):

        self.PatientStudy_list = PatientStudy_list
        # Note: the patient_id column in TMED2SummaryTable uniquely identifies a patient_study.
        # No patient_study belongs to more than one of diagnosis_labeled/, unlabeled/,
        # view_and_diagnosis_labeled_set/, and view_labeled at the same time.
        self.TMED2SummaryTable = TMED2SummaryTable

        self.ML_DATA_dir = ML_DATA_dir

        self.sampling_strategy = sampling_strategy

        self.training_seed=training_seed

        self.transform_fn = transform_fn

        self.bag_of_PiatentStudy_images, self.bag_of_PatientStudy_DiagnosisLabels = self._create_bags()



    def _create_bags(self):

        bag_of_PatientStudy_images = []
        bag_of_PatientStudy_DiagnosisLabels = []

        for PatientStudy in self.PatientStudy_list:
            this_PatientStudyRecords_from_TMED2SummaryTable = self.TMED2SummaryTable[self.TMED2SummaryTable['patient_study']==PatientStudy]
            assert this_PatientStudyRecords_from_TMED2SummaryTable.shape[0]!=0, 'every PatientStudy from the studylist should be found in TMED2SummaryTable'

            this_PatientStudyRecords_from_TMED2SummaryTable_DiagnosisLabel = list(set(this_PatientStudyRecords_from_TMED2SummaryTable.diagnosis_label.values))
            assert len(this_PatientStudyRecords_from_TMED2SummaryTable_DiagnosisLabel)==1, 'every PatientStudy should only have one diagnosis label'

            this_PatientStudy_DiagnosisLabel = this_PatientStudyRecords_from_TMED2SummaryTable_DiagnosisLabel[0]
            this_PatientStudy_DiagnosisLabel = DiagnosisStr_to_Int_Mapping[this_PatientStudy_DiagnosisLabel]

            this_PatientStudy_Id_ImagesPattern = PatientStudy + "_*.png"
            this_PatientStudy_Id_LabeledImages: list[str] = glob.glob(pathname=this_PatientStudy_Id_ImagesPattern, root_dir=os.path.join(self.ML_DATA_dir, "labeled"))
            this_PatientStudy_Id_UnlabeledImages: list[str] = glob.glob(pathname=this_PatientStudy_Id_ImagesPattern, root_dir=os.path.join(self.ML_DATA_dir, "unlabeled"))

            # From paper repo, sort to ensure order of images are consistent each run.
            this_PatientStudy_Id_LabeledImages.sort()
            this_PatientStudy_Id_UnlabeledImages.sort()

            this_PatientStudyImages = []

            for ImagePath in this_PatientStudy_Id_LabeledImages:
                this_PatientStudyImages.append(
                    np.array(Image.open(self.ML_DATA_dir + '/labeled/' + ImagePath).convert(mode="RGB"))
                )

            for ImagePath in this_PatientStudy_Id_UnlabeledImages:
                this_PatientStudyImages.append(
                    np.array(Image.open(self.ML_DATA_dir + '/unlabeled/' + ImagePath).convert(mode="RGB"))
                )

            bag_of_PatientStudy_images.append(np.array(this_PatientStudyImages))
            bag_of_PatientStudy_DiagnosisLabels.append(this_PatientStudy_DiagnosisLabel)

        return bag_of_PatientStudy_images, bag_of_PatientStudy_DiagnosisLabels


    def __len__(self):
        return len(self.bag_of_PiatentStudy_images)


    def __getitem__(self, index):

        bag_image = self.bag_of_PiatentStudy_images[index]

        if self.transform_fn is not None:
            bag_image = torch.stack([self.transform_fn(Image.fromarray(image)) for image in bag_image])


        DiagnosisLabel = self.bag_of_PatientStudy_DiagnosisLabels[index]

        return bag_image, DiagnosisLabel



### Transformations

In [6]:
import PIL
import PIL.ImageOps
import PIL.ImageEnhance
import PIL.ImageDraw
from PIL import Image

PARAMETER_MAX = 10


def AutoContrast(img, **kwarg):
    return PIL.ImageOps.autocontrast(img)


def Brightness(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    return PIL.ImageEnhance.Brightness(img).enhance(v)


def Color(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    return PIL.ImageEnhance.Color(img).enhance(v)


def Contrast(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    return PIL.ImageEnhance.Contrast(img).enhance(v)


def Cutout(img, v, max_v, bias=0):
    if v == 0:
        return img
    v = _float_parameter(v, max_v) + bias
    v = int(v * min(img.size))
    return CutoutAbs(img, v)


def CutoutAbs(img, v, **kwarg):
    w, h = img.size
    x0 = np.random.uniform(0, w)
    y0 = np.random.uniform(0, h)
    x0 = int(max(0, x0 - v / 2.))
    y0 = int(max(0, y0 - v / 2.))
    x1 = int(min(w, x0 + v))
    y1 = int(min(h, y0 + v))
    xy = (x0, y0, x1, y1)
    # gray
    color = (127, 127, 127)
    img = img.copy()
    PIL.ImageDraw.Draw(img).rectangle(xy, color)
    return img


def Equalize(img, **kwarg):
    return PIL.ImageOps.equalize(img)


def Identity(img, **kwarg):
    return img


def Invert(img, **kwarg):
    return PIL.ImageOps.invert(img)


def Posterize(img, v, max_v, bias=0):
    v = _int_parameter(v, max_v) + bias
    return PIL.ImageOps.posterize(img, v)


def Rotate(img, v, max_v, bias=0):
    v = _int_parameter(v, max_v) + bias
    if random.random() < 0.5:
        v = -v
    return img.rotate(v)


def Sharpness(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    return PIL.ImageEnhance.Sharpness(img).enhance(v)


def ShearX(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    if random.random() < 0.5:
        v = -v
    return img.transform(img.size, PIL.Image.AFFINE, (1, v, 0, 0, 1, 0))


def ShearY(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    if random.random() < 0.5:
        v = -v
    return img.transform(img.size, PIL.Image.AFFINE, (1, 0, 0, v, 1, 0))


def Solarize(img, v, max_v, bias=0):
    v = _int_parameter(v, max_v) + bias
    return PIL.ImageOps.solarize(img, 256 - v)


def SolarizeAdd(img, v, max_v, bias=0, threshold=128):
    v = _int_parameter(v, max_v) + bias
    if random.random() < 0.5:
        v = -v
    img_np = np.array(img).astype(int)  # np.int alias was removed in NumPy 1.24; use the builtin int
    img_np = img_np + v
    img_np = np.clip(img_np, 0, 255)
    img_np = img_np.astype(np.uint8)
    img = Image.fromarray(img_np)
    return PIL.ImageOps.solarize(img, threshold)


def TranslateX(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    if random.random() < 0.5:
        v = -v
    v = int(v * img.size[0])
    return img.transform(img.size, PIL.Image.AFFINE, (1, 0, v, 0, 1, 0))


def TranslateY(img, v, max_v, bias=0):
    v = _float_parameter(v, max_v) + bias
    if random.random() < 0.5:
        v = -v
    v = int(v * img.size[1])
    return img.transform(img.size, PIL.Image.AFFINE, (1, 0, 0, 0, 1, v))


def _float_parameter(v, max_v):
    return float(v) * max_v / PARAMETER_MAX


def _int_parameter(v, max_v):
    return int(v * max_v / PARAMETER_MAX)


def fixmatch_augment_pool():
    # FixMatch paper
    augs = [(AutoContrast, None, None),
            (Brightness, 0.9, 0.05),
            (Color, 0.9, 0.05),
            (Contrast, 0.9, 0.05),
            (Equalize, None, None),
            (Identity, None, None),
            (Posterize, 4, 4),
            (Rotate, 30, 0),
            (Sharpness, 0.9, 0.05),
            (ShearX, 0.3, 0),
            (ShearY, 0.3, 0),
            (Solarize, 256, 0),
            (TranslateX, 0.3, 0),
            (TranslateY, 0.3, 0)]
    return augs

class RandAugmentMC(object):
    def __init__(self, n, m):
        assert n >= 1
        assert 1 <= m <= 10
        self.n = n
        self.m = m
        self.augment_pool = fixmatch_augment_pool()

    def __call__(self, img):
        ops = random.choices(self.augment_pool, k=self.n)
        for op, max_v, bias in ops:
            v = np.random.randint(1, self.m)
            if random.random() < 0.5:
                img = op(img, v=v, max_v=max_v, bias=bias)
        img = CutoutAbs(img, int(32*0.5))
        return img


In [7]:
transform_eval = transforms.Compose([
    transforms.ToTensor(),
])

transform_labeledtrain = transforms.Compose([
    transforms.RandomHorizontalFlip(),
    transforms.RandomCrop(size=112,
                          padding=int(112*0.125),
                          padding_mode='reflect'),
    RandAugmentMC(n=2, m=10),
    transforms.ToTensor(),
])

## Create Dataset

In [8]:
train_dataset = EchoDataset(train_PatientStudy_ids, TMED2SummaryTable, LOCAL_DATA_DIR, sampling_strategy='first_frame', training_seed=trainingSeed, transform_fn=transform_labeledtrain)

trainmemory_dataset = EchoDataset(train_PatientStudy_ids, TMED2SummaryTable, LOCAL_DATA_DIR, sampling_strategy='first_frame', training_seed=trainingSeed, transform_fn=transform_eval)

val_dataset = EchoDataset(val_PatientStudy_ids, TMED2SummaryTable, LOCAL_DATA_DIR, sampling_strategy='first_frame', training_seed=trainingSeed, transform_fn=transform_eval)

test_dataset = EchoDataset(test_PatientStudy_ids, TMED2SummaryTable, LOCAL_DATA_DIR, sampling_strategy='first_frame', training_seed=trainingSeed, transform_fn=transform_eval)

In [12]:
print("train: {}, trainmemory: {}, val: {}, test: {}".format(len(train_dataset), len(trainmemory_dataset), len(val_dataset), len(test_dataset)))


train: 360, trainmemory: 360, val: 119, test: 120


In [9]:
train_loader = DataLoader(train_dataset, batch_size=batchSize, shuffle=True, num_workers=numWorkers)
trainmemory_loader = DataLoader(trainmemory_dataset, batch_size=batchSize, shuffle=False, num_workers=numWorkers)
val_loader = DataLoader(val_dataset, batch_size=batchSize, shuffle=False, num_workers=numWorkers)
test_loader = DataLoader(test_dataset, batch_size=batchSize, shuffle=False, num_workers=numWorkers)

## Dataset Analysis

The method below provides a high-level analysis of the TMED-2 dataset for the seed0 split.

The number of samples and classes in the train, validation, and test sets match the specification on the TMED-2 website.

The class distribution is imbalanced. Note that the display names used here are coarse: per `DiagnosisStr_to_Int_Mapping`, `mod_AS` is label 1 (mild and mild-to-moderate AS) and `sev_AS` is label 2 (moderate and severe AS). `sev_AS` dominates the dataset, while `no_AS` is the least represented class, which could bias predictions towards `sev_AS`.
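
To quantify the imbalance, the short sketch below takes the training-split counts that `dataset_statistics` prints for seed0 (76, 103, and 181) and derives the class proportions along with the score of a trivial model that always predicts the largest class. The variable names are illustrative only.

```python
# Training-split class counts for seed0, as printed by dataset_statistics.
train_counts = {"no_AS": 76, "mod_AS": 103, "sev_AS": 181}

total = sum(train_counts.values())
proportions = {label: count / total for label, count in train_counts.items()}

# A trivial baseline that always predicts the most frequent class.
majority_class = max(train_counts, key=train_counts.get)
majority_accuracy = train_counts[majority_class] / total
# Its recall is 1 on the majority class and 0 elsewhere, so balanced accuracy is 1 / n_classes.
majority_balanced_accuracy = 1 / len(train_counts)

for label, share in proportions.items():
    print(f"{label}: {share:.1%}")
print(f"always-{majority_class} accuracy:          {majority_accuracy:.1%}")
print(f"always-{majority_class} balanced accuracy: {majority_balanced_accuracy:.1%}")
```

Roughly half of the training studies fall in the largest class, so a model can reach about 50% plain accuracy without learning anything, which is the situation balanced accuracy is meant to expose.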

In [10]:
def dataset_statistics(loader, name):
    num_batches = len(loader)
    num_samples = len(loader.dataset)
    label_mapping = {0: "no_AS", 1: "mod_AS", 2: "sev_AS"}

    print(f"{name} DataLoader:")
    print(f"  Total number of batches: {num_batches}")
    print(f"  Total number of samples: {num_samples}")

    labels = []
    for _, batch_labels in loader:
        labels.extend(batch_labels.tolist())
    num_classes = len(set(labels))
    print(f"  Number of classes: {num_classes}")

    class_distribution = {label_mapping[label]: labels.count(label) for label in set(labels) if label in label_mapping}
    print(f"  Class distribution: {class_distribution}")

dataset_statistics(train_loader, "Train")
dataset_statistics(trainmemory_loader, "Train Memory")
dataset_statistics(val_loader, "Validation")
dataset_statistics(test_loader, "Test")

Train DataLoader:
  Total number of batches: 360
  Total number of samples: 360




  Number of classes: 3
  Class distribution: {'no_AS': 76, 'mod_AS': 103, 'sev_AS': 181}
Train Memory DataLoader:
  Total number of batches: 360
  Total number of samples: 360
  Number of classes: 3
  Class distribution: {'no_AS': 76, 'mod_AS': 103, 'sev_AS': 181}
Validation DataLoader:
  Total number of batches: 119
  Total number of samples: 119
  Number of classes: 3
  Class distribution: {'no_AS': 25, 'mod_AS': 34, 'sev_AS': 60}
Test DataLoader:
  Total number of batches: 120
  Total number of samples: 120
  Number of classes: 3
  Class distribution: {'no_AS': 26, 'mod_AS': 34, 'sev_AS': 60}


##   Model

The paper evaluates multiple models in addition to the proposed SAMIL model, including ABMIL, Set Transformer, and DSMIL.

### SAMIL

#### View Classifier

The following class for the View Classifier is directly ported from the paper's Github repo.

In [11]:
import logging

import torch
import torch.nn as nn
import torch.nn.functional as F
import sys

logging.basicConfig(format='%(asctime)s | %(levelname)s : %(message)s',
                     level=logging.INFO, stream=sys.stdout)
logger = logging.getLogger(__name__)
logger.setLevel(logging.DEBUG)



def mish(x):
    """Mish: A Self Regularized Non-Monotonic Neural Activation Function (https://arxiv.org/abs/1908.08681)"""
    return x * torch.tanh(F.softplus(x))


class PSBatchNorm2d(nn.BatchNorm2d):
    """How Does BN Increase Collapsed Neural Network Filters? (https://arxiv.org/abs/2001.11216)"""

    def __init__(self, num_features, alpha=0.1, eps=1e-05, momentum=0.001, affine=True, track_running_stats=True):
        super().__init__(num_features, eps, momentum, affine, track_running_stats)
        self.alpha = alpha

    def forward(self, x):
        return super().forward(x) + self.alpha


class BasicBlock(nn.Module):
    def __init__(self, in_planes, out_planes, stride, drop_rate=0.0, activate_before_residual=False):
        super(BasicBlock, self).__init__()
        self.bn1 = nn.BatchNorm2d(in_planes, momentum=0.001)
        self.relu1 = nn.LeakyReLU(negative_slope=0.1, inplace=True)
        self.conv1 = nn.Conv2d(in_planes, out_planes, kernel_size=3, stride=stride,
                               padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_planes, momentum=0.001)
        self.relu2 = nn.LeakyReLU(negative_slope=0.1, inplace=True)
        self.conv2 = nn.Conv2d(out_planes, out_planes, kernel_size=3, stride=1,
                               padding=1, bias=False)
        self.drop_rate = drop_rate
        self.equalInOut = (in_planes == out_planes)
        self.convShortcut = (not self.equalInOut) and nn.Conv2d(in_planes, out_planes, kernel_size=1, stride=stride,
                                                                padding=0, bias=False) or None
        self.activate_before_residual = activate_before_residual

    def forward(self, x):
        if not self.equalInOut and self.activate_before_residual == True:
            x = self.relu1(self.bn1(x))
        else:
            out = self.relu1(self.bn1(x))
        out = self.relu2(self.bn2(self.conv1(out if self.equalInOut else x)))
        if self.drop_rate > 0:
            out = F.dropout(out, p=self.drop_rate, training=self.training)
        out = self.conv2(out)
        return torch.add(x if self.equalInOut else self.convShortcut(x), out)


class NetworkBlock(nn.Module):
    def __init__(self, nb_layers, in_planes, out_planes, block, stride, drop_rate=0.0, activate_before_residual=False):
        super(NetworkBlock, self).__init__()
        self.layer = self._make_layer(
            block, in_planes, out_planes, nb_layers, stride, drop_rate, activate_before_residual)

    def _make_layer(self, block, in_planes, out_planes, nb_layers, stride, drop_rate, activate_before_residual):
        layers = []
        for i in range(int(nb_layers)):
            layers.append(block(i == 0 and in_planes or out_planes, out_planes,
                                i == 0 and stride or 1, drop_rate, activate_before_residual))
        return nn.Sequential(*layers)

    def forward(self, x):
        return self.layer(x)

# args.model_depth = 28
# args.model_width = 2

class WideResNet(nn.Module):
    def __init__(self, num_classes, depth=28, widen_factor=2, drop_rate=0.0):
        super(WideResNet, self).__init__()
        channels = [16, 16*widen_factor, 32*widen_factor, 64*widen_factor, 128*widen_factor]
        assert((depth - 4) % 6 == 0)
        n = (depth - 4) / 6 #equivalent to 'repeat' in tf repo
        block = BasicBlock
        # 1st conv before any network block
        self.conv1 = nn.Conv2d(3, channels[0], kernel_size=3, stride=1,
                               padding=1, bias=False)
        # 1st block
        self.block1 = NetworkBlock(
            n, channels[0], channels[1], block, 1, drop_rate, activate_before_residual=True)
        # 2nd block
        self.block2 = NetworkBlock(
            n, channels[1], channels[2], block, 2, drop_rate)
        # 3rd block
        self.block3 = NetworkBlock(
            n, channels[2], channels[3], block, 2, drop_rate)

        # 4th block (hz added)
        self.block4 = NetworkBlock(
            n, channels[3], channels[4], block, 2, drop_rate)

        # global average pooling and classifier
        self.bn1 = nn.BatchNorm2d(channels[4], momentum=0.001)
        self.relu = nn.LeakyReLU(negative_slope=0.1, inplace=True)
        self.fc = nn.Linear(channels[4], num_classes)
        self.channels = channels[4]

        for m in self.modules():
            if isinstance(m, nn.Conv2d):
                nn.init.kaiming_normal_(m.weight,
                                        mode='fan_out',
                                        nonlinearity='leaky_relu')
            elif isinstance(m, nn.BatchNorm2d):
                nn.init.constant_(m.weight, 1.0)
                nn.init.constant_(m.bias, 0.0)
            elif isinstance(m, nn.Linear):
                nn.init.xavier_normal_(m.weight)
                nn.init.constant_(m.bias, 0.0)

    def forward(self, x):
        out = self.conv1(x)
        out = self.block1(out)
        out = self.block2(out)
        out = self.block3(out)
        out = self.block4(out)

        out = self.relu(self.bn1(out))
        out = F.adaptive_avg_pool2d(out, 1)
        out = out.view(-1, self.channels)
        return self.fc(out)


def build_wideresnet(depth, widen_factor, dropout, num_classes):
    logger.info(f"Model: WideResNet {depth}x{widen_factor}")
    return WideResNet(depth=depth,
                      widen_factor=widen_factor,
                      drop_rate=dropout,
                      num_classes=num_classes)


#### SAMIL Model

The following class represents the SAMIL model. This code is unchanged from the paper's Github repo.
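
Before reading the full class, it may help to see its pooling step in isolation. The framework-free sketch below mirrors what the `forward` method does with its two attention branches: each branch is softmax-normalized over the images in the bag, the two are multiplied element-wise and renormalized, and the result weights the per-image features into one bag embedding. The helper names and toy numbers are assumptions made for illustration.

```python
import math


def softmax(scores):
    """Normalize raw scores into weights that sum to 1 across the bag."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def combine_attention(view_logits, diagnosis_logits):
    """Multiply the two softmaxed branches element-wise, then renormalize."""
    a_view = softmax(view_logits)
    a_diag = softmax(diagnosis_logits)
    product = [v * d for v, d in zip(a_view, a_diag)]
    total = sum(product)
    return [p / total for p in product]


def pool(attention, features):
    """Attention-weighted sum of per-image feature vectors (one bag embedding)."""
    dim = len(features[0])
    return [sum(a * f[d] for a, f in zip(attention, features)) for d in range(dim)]


# Hypothetical bag of three images with 2-dimensional features.
view_logits = [5.0, 0.0, 0.0]       # view branch strongly prefers image 0
diagnosis_logits = [0.0, 1.0, 0.0]  # diagnosis branch mildly prefers image 1
features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

attention = combine_attention(view_logits, diagnosis_logits)
bag_embedding = pool(attention, features)
print("attention:", [round(a, 3) for a in attention])
print("bag embedding:", [round(x, 3) for x in bag_embedding])
```

Because the branches are multiplied, an image needs non-negligible weight from both to contribute, which is how the view branch can filter out irrelevant views.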

In [12]:
import torch
import torch.nn as nn
import torch.nn.functional as F

class SAMIL(nn.Module):
    def __init__(self, num_classes=3):
        super(SAMIL, self).__init__()
        self.L = 500
        self.B = 250
        self.D = 128
        self.K = 1
        self.num_classes = num_classes

        self.feature_extractor_part1 = nn.Sequential(
#             nn.Conv2d(1, 20, kernel_size=5),
            nn.Conv2d(3, 20, kernel_size=5),
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),
            nn.Conv2d(20, 50, kernel_size=5),
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),
             #hz added
            nn.Conv2d(50, 100, kernel_size=5),
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),
            nn.Conv2d(100, 200, kernel_size=3),
            nn.ReLU(),
            nn.MaxPool2d(2, stride=2),
        )

        self.feature_extractor_part2 = nn.Sequential(
#             nn.Linear(50 * 4 * 4, self.L),
            nn.Linear(200 * 4 * 4, self.L),
            nn.ReLU(),
        )

        self.feature_extractor_part3 = nn.Sequential(

            nn.Linear(self.L, self.B),
            nn.ReLU(),
            nn.Linear(self.B, self.L),
            nn.ReLU(),
        )

        self.attention_V = nn.Sequential(
            nn.Linear(self.L, self.D),
            nn.Tanh(),
            nn.Linear(self.D, self.K)
        )

        self.attention_U = nn.Sequential(
            nn.Linear(self.L, self.D),
            nn.Tanh(),
            nn.Linear(self.D, self.K)
        )

#         self.attention_weights = nn.Linear(self.D, self.K)

        self.classifier = nn.Sequential(
#             nn.Linear(self.L*self.K, 1),
            nn.Linear(self.L*self.K, self.num_classes),
#             nn.Sigmoid()
        )

    def forward(self, x):

#         print('Inside forward: input x shape: {}'.format(x.shape))
        x = x.squeeze(0)
#         print('Inside forward: after squeeze x shape: {}'.format(x.shape))

        H = self.feature_extractor_part1(x)
#         print('Inside forward: after feature_extractor_part1 H shape: {}'.format(H.shape))


#         H = H.view(-1, 50 * 4 * 4)
        H = H.view(-1, 200 * 4 * 4)
#         print('Inside forward: after view H shape: {}'.format(H.shape))

        H = self.feature_extractor_part2(H)  # NxL
#         print('Inside forward: after feature_extractor_part2 H shape: {}'.format(H.shape))

        A_V = self.attention_V(H)  # NxK
#         print('Inside forward: A_V is {}, shape: {}'.format(A_V, A_V.shape))

        A_V = torch.transpose(A_V, 1, 0)  # KxN
#         print('Inside forward: A_V is {}, shape: {}'.format(A_V, A_V.shape))

        A_V = F.softmax(A_V, dim=1)  # softmax over N
#         print('Inside forward: A_V (View) is {}, shape: {}'.format(A_V, A_V.shape))


        H = self.feature_extractor_part3(H)

        A_U = self.attention_U(H)  # NxK
#         print('Inside forward: A_U is {}, shape: {}'.format(A_U, A_U.shape))

        A_U = torch.transpose(A_U, 1, 0)  # KxN
#         print('Inside forward: A_U is {}, shape: {}'.format(A_U, A_U.shape))

        A_U = F.softmax(A_U, dim=1)  # softmax over N
#         print('Inside forward: A_U (Diagnosis) is {}, shape: {}'.format(A_U, A_U.shape))

#         A = A_V * A_U
#         print('Inside forward: final A is {}, shape: {}'.format(A, A.shape))
        A = torch.exp(torch.log(A_V) + torch.log(A_U)) #numerically more stable?

        A = A/torch.sum(A)
#         A = F.softmax(A, dim=1)
#         print('Inside forward: final A is {}, shape: {}'.format(A, A.shape))
#         A = self.attention_weights(A_V * A_U) # element wise multiplication # NxK
#         print('Inside forward: A is {}, shape: {}'.format(A, A.shape))

#         A = torch.transpose(A, 1, 0)  # KxN
# #         print('Inside forward: A is {}, shape: {}'.format(A, A.shape))

#         A = F.softmax(A, dim=1)  # softmax over N
# #         print('Inside forward: A is {}, shape: {}'.format(A, A.shape))

        M = torch.mm(A, H)  # KxL #M can be regarded as final representation of this bag
#         print('Inside forward: M is {}, shape: {}'.format(M, M.shape))

        out = self.classifier(M)


        return out, A_V #only view regularize one branch of the attention weights



#### SAMIL Helpers
The following methods are from the `src/SAMIL/main.py` file in the paper's Github repo.
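
One of these helpers, `get_cosine_schedule_with_warmup`, returns a multiplier on the base learning rate for each epoch. The sketch below restates that multiplier as a standalone function so its shape can be inspected without an optimizer. The function name `cosine_warmup_multiplier` and the example values (5 warmup epochs, 100-epoch cycle) are assumptions chosen for illustration.

```python
import math


def cosine_warmup_multiplier(current_epoch, lr_warmup_epochs, lr_cycle_epochs,
                             num_cycles=7.0 / 16.0):
    """Learning-rate multiplier: linear warmup, then cosine decay with restarts."""
    if current_epoch < lr_warmup_epochs:
        return float(current_epoch) / float(max(1, lr_warmup_epochs))

    # Position within the current cycle; a multiple of the cycle length counts as its end.
    if current_epoch % lr_cycle_epochs == 0:
        current_cycle_epoch = lr_cycle_epochs
    else:
        current_cycle_epoch = current_epoch % lr_cycle_epochs

    progress = float(current_cycle_epoch - lr_warmup_epochs) / \
        float(max(1, float(lr_cycle_epochs) - lr_warmup_epochs))
    return max(0.0, math.cos(math.pi * num_cycles * progress))


# Hypothetical settings: 5 warmup epochs and a 100-epoch cycle.
for epoch in (0, 2, 5, 50, 100, 105):
    print(epoch, round(cosine_warmup_multiplier(epoch, 5, 100), 4))
```

With `num_cycles=7/16` the cosine never reaches zero within a cycle, so the learning rate ends each cycle at a fraction of its base value and then jumps back up when the next cycle starts.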

In [13]:
import pandas as pd
import numpy as np
import torch
import torch.nn.functional as F
import torch.optim as optim
from torch.optim.lr_scheduler import LambdaLR
from torch.utils.data import DataLoader
from torchvision import transforms

from torch.utils.tensorboard import SummaryWriter

logger = logging.getLogger(__name__)

def str2bool(s):
    if s == 'True':
        return True
    elif s == 'False':
        return False
    else:
        raise NameError('Bad string')

def save_checkpoint(state, checkpoint_dir, filename='last_checkpoint.pth.tar'):
    '''last_checkpoint.pth.tar or xxx_model_best.pth.tar'''

    filepath = os.path.join(checkpoint_dir, filename)
    torch.save(state, filepath)

def set_seed(seed):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)

def get_cosine_schedule_with_warmup(optimizer,
                                    lr_warmup_epochs,
                                    lr_cycle_epochs, #total train epochs
                                    num_cycles=7./16.,
                                    last_epoch=-1):
    def _lr_lambda(current_epoch):
        if current_epoch < lr_warmup_epochs:
            return float(current_epoch) / float(max(1, lr_warmup_epochs))
#         no_progress = float(current_epoch - lr_warmup_epochs) / \
#             float(max(1, float(lr_cycle_epochs) - lr_warmup_epochs))

        #see if using restart
        ###############################################################
        if current_epoch%lr_cycle_epochs==0:
            current_cycle_epoch=lr_cycle_epochs
        else:
            current_cycle_epoch = current_epoch%lr_cycle_epochs

        no_progress = float(current_cycle_epoch - lr_warmup_epochs) / \
            float(max(1, float(lr_cycle_epochs) - lr_warmup_epochs))
        #################################################################

        return max(0., math.cos(math.pi * num_cycles * no_progress))

    return LambdaLR(optimizer, _lr_lambda, last_epoch)

def get_fixed_lr(optimizer,
                lr_warmup_epochs,
                lr_cycle_epochs, #total train iterations
                num_cycles=7./16.,
                last_epoch=-1):
    def _lr_lambda(current_epoch):

        return 1.0

    return LambdaLR(optimizer, _lr_lambda, last_epoch)

def create_view_model(args):

    view_model = build_wideresnet(depth=28,
                                        widen_factor=2,
                                        dropout=0.0,
                                        num_classes=3)

    logger.info("Total params for View Model: {:.2f}M".format(
        sum(p.numel() for p in view_model.parameters())/1e6))


    #load the saved checkpoint
    if args['data_seed']==0:
        args['view_checkpoint_path'] = os.path.join(args['checkpoint_dir'], 'view_classifier', 'seed0_model_best.pth.tar')
    elif args['data_seed']==1:
        args['view_checkpoint_path'] = os.path.join(args['checkpoint_dir'], 'view_classifier', 'seed1_model_best.pth.tar')
    elif args['data_seed']==2:
        args['view_checkpoint_path'] = os.path.join(args['checkpoint_dir'], 'view_classifier', 'seed2_model_best.pth.tar')
    else:
        raise NameError('No view classifier checkpoint for data_seed {}'.format(args['data_seed']))


    view_checkpoint = torch.load(args['view_checkpoint_path'], map_location=device)

    view_model.load_state_dict(view_checkpoint['ema_state_dict'])

    view_model.eval()

    return view_model

def create_model(args):
    model = SAMIL()

    if args['MIL_checkpoint_path'] !='':
        print('!!!!!!!!!!!!!!!!!!!!!initializing from pretrained checkpoint!!!!!!!!!!!!!!!!!!!!!')
        pretrained_dict = torch.load(args['MIL_checkpoint_path'], map_location=device)

        #https://discuss.pytorch.org/t/dataparallel-changes-parameter-names-issue-with-load-state-dict/60211
        #rename tensor in the pretrained dict
        from collections import OrderedDict
        new_state_dict = OrderedDict()
        for k, v in pretrained_dict.items():
#                         print(k)
            if 'encoder_q' in k:
#                             print('!extract: {}'.format(k))
                name = '.'.join(k.split('.')[1:])
#                             print('new_name: {}'.format(name))
                new_state_dict[name] = v

        model_dict = model.state_dict()

        new_state_dict = {k: v for k, v in new_state_dict.items() if k in model_dict}
        model_dict.update(new_state_dict)

        # 3. load the new state dict
        model.load_state_dict(model_dict)



    logger.info("Total params: {:.2f}M".format(
        sum(p.numel() for p in model.parameters() if p.requires_grad)/1e6))

    return model

## SAMIL Training

The function below sets up the training arguments, including the pretraining option. The paper explores three training strategies: no pretraining, pretraining the feature extractor (to learn instance-level representations), and pretraining study-level representations of all *K* images in a routine echocardiogram.
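
The mapping from pretraining option to checkpoint path can be summarised in a small lookup. The sketch below is only an illustration: the option strings and folder names are copied from the cell that follows, while the helper name `resolve_mil_checkpoint` and the `PRETRAIN_DIRS` table are hypothetical.

```python
import os

# Folder names as used in setup_samil_train; the dict itself is an assumption.
PRETRAIN_DIRS = {
    'Whole': 'MOCO_Pretraining_StudyLevel',              # study-level pretraining
    'FeatureExtractor1': 'MOCO_Pretraining_ImageLevel',  # image-level pretraining
}

def resolve_mil_checkpoint(pretrained, data_seed, checkpoint_dir):
    """Return the checkpoint path for a pretraining option ('' means train from scratch)."""
    if pretrained == 'NoPretrain':
        return ''
    if pretrained not in PRETRAIN_DIRS:
        raise ValueError('invalid pretrain option: {}'.format(pretrained))
    if data_seed not in (0, 1, 2):
        raise ValueError('no pretrained checkpoint for data_seed={}'.format(data_seed))
    return os.path.join(checkpoint_dir, PRETRAIN_DIRS[pretrained],
                        'seed{}_checkpoint.pt'.format(data_seed))

print(resolve_mil_checkpoint('Whole', 1, 'model_checkpoints'))
```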

In [14]:
logging.basicConfig(
    format="%(asctime)s - %(levelname)s - %(name)s -   %(message)s",
    datefmt="%m/%d/%Y %H:%M:%S",
    level=logging.INFO
)

def setup_samil_train(args):

    if args['training_seed'] is not None:
        print('setting training seed{}'.format(args['training_seed']), flush=True)
        set_seed(args['training_seed'])


    if args['Pretrained'] == 'Whole':

        if args['data_seed']==0:
            args['MIL_checkpoint_path'] = os.path.join(args['checkpoint_dir'],'MOCO_Pretraining_StudyLevel', 'seed0_checkpoint.pt')
        elif args['data_seed']==1:
            args['MIL_checkpoint_path'] = os.path.join(args['checkpoint_dir'],'MOCO_Pretraining_StudyLevel', 'seed1_checkpoint.pt')
        elif args['data_seed']==2:
            args['MIL_checkpoint_path'] = os.path.join(args['checkpoint_dir'],'MOCO_Pretraining_StudyLevel', 'seed2_checkpoint.pt')
        else:
            raise NameError('NOT VALID PRETRAINED MODEL')

    elif args['Pretrained'] == 'FeatureExtractor1':

        if args['data_seed']==0:
            args['MIL_checkpoint_path']=os.path.join(args['checkpoint_dir'], 'MOCO_Pretraining_ImageLevel', 'seed0_checkpoint.pt')
        elif args['data_seed']==1:
            args['MIL_checkpoint_path']=os.path.join(args['checkpoint_dir'], 'MOCO_Pretraining_ImageLevel', 'seed1_checkpoint.pt')
        elif args['data_seed']==2:
            args['MIL_checkpoint_path']=os.path.join(args['checkpoint_dir'], 'MOCO_Pretraining_ImageLevel', 'seed2_checkpoint.pt')
        else:
            raise NameError('NOT VALID PRETRAINED MODEL')


    elif args['Pretrained'] == 'NoPretrain':
        args['MIL_checkpoint_path']=''

    else:
        raise NameError('invalid pretrain option')

    if args['use_class_weights'] == 'True':
        print('!!!!!!!!Using pre-calculated class weights!!!!!!!!')

        #indeed, every split should have the same class weight for diagnosis by our dataset construction
        if args['data_seed'] == 0 and args['development_size'] == 'DEV479':
            args['class_weights'] = '0.463,0.342,0.195'
        elif args['data_seed'] == 1 and args['development_size'] == 'DEV479':
            args['class_weights'] = '0.463,0.342,0.195'
        elif args['data_seed'] == 2 and args['development_size'] == 'DEV479':
            args['class_weights'] = '0.463,0.342,0.195'
        else:
            raise NameError('not valid class weights setting')

    else:
        args['class_weights'] = '1.0,1.0,1.0'
        print('?????????Not using pre-calculated class weights?????????')

    experiment_name = "{}".format(args['Pretrained'])

    args['experiment_dir'] = os.path.join(args['train_dir'], experiment_name)

    if args['resume'] != 'None':
        args['resume_checkpoint_fullpath'] = os.path.join(args['experiment_dir'], args['resume'])
        print('args.resume_checkpoint_fullpath: {}'.format(args['resume_checkpoint_fullpath']))
    else:
        args['resume_checkpoint_fullpath'] = None

    os.makedirs(args['experiment_dir'], exist_ok=True)
    args['writer'] = SummaryWriter(args['experiment_dir'])

    brief_summary = {}
    brief_summary['val_progression_view'] = {}

    brief_summary['dataset_name'] = args['dataset_name']
    brief_summary['algorithm'] = 'Echo_MIL'
    brief_summary['hyperparameters'] = {
        'train_epoch': args['train_epoch'],
        'optimizer': args['optimizer_type'],
        'lr': args['lr'],
        'wd': args['wd'],
        'T':args['T'],
        'lambda_ViewRegularization':args['lambda_ViewRegularization']
    }

    return args, brief_summary

#### Train One Epoch and Early Stop Logic

The code block below contains the functions that train a single epoch, the early-stopping logic, the exponential moving average (EMA) copy of the model, and the evaluation helpers.
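
The view-regularization term inside `train_one_epoch` compares the model's attention over the images of one study with a target built from the view classifier: the per-image relevance scores are passed through a softmax with temperature `T`, and the loss is the KL divergence from that target to the attention weights. The sketch below reproduces the arithmetic in plain Python as a stand-in for the tensor code. The function names and the relevance values are made up for illustration.

```python
import math

def softmax(values, temperature=1.0):
    """Temperature-scaled softmax over a list of floats."""
    scaled = [v / temperature for v in values]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def view_regularization_loss(attention, relevance, temperature):
    """KL(target || attention), with target = softmax(relevance / T) over the bag."""
    target = softmax(relevance, temperature)
    return sum(t * (math.log(t) - math.log(a))
               for t, a in zip(target, attention) if t > 0)

# Hypothetical relevance scores for a bag of four images.
relevance = [0.9, 0.8, 0.1, 0.05]
target = softmax(relevance, temperature=0.1)
loss_uniform = view_regularization_loss([0.25] * 4, relevance, 0.1)
loss_matched = view_regularization_loss(target, relevance, 0.1)
print(loss_uniform, loss_matched)
```

A lower temperature sharpens the target towards the most relevant images, so uniform attention is penalised more heavily. In the notebook the same quantity is computed with `F.kl_div(..., reduction='batchmean')` on a batch of one study, which reduces to this sum.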

In [15]:
from copy import deepcopy
from sklearn.metrics import balanced_accuracy_score

import torch

class ModelEMA(object):
    def __init__(self, args, model, decay):
        self.ema = deepcopy(model)
        self.ema.to(args['device'])
        self.ema.eval()
        self.decay = decay
        self.ema_has_module = hasattr(self.ema, 'module')
        # Fix EMA. https://github.com/valencebond/FixMatch_pytorch thank you!
        self.param_keys = [k for k, _ in self.ema.named_parameters()]
        self.buffer_keys = [k for k, _ in self.ema.named_buffers()]

        print('self.param_keys: {}'.format(self.param_keys))
        print('self.buffer_keys: {}'.format(self.buffer_keys))

        for p in self.ema.parameters():
#             print('Inside ModelEMA, p dtype is {}'.format(p.dtype))
            p.requires_grad_(False)


    def update(self, model):
        needs_module = hasattr(model, 'module') and not self.ema_has_module
        with torch.no_grad():
            msd = model.state_dict()
            esd = self.ema.state_dict()
            for k in self.param_keys:
                if needs_module:
                    j = 'module.' + k
                else:
                    j = k
                model_v = msd[j].detach()
                ema_v = esd[k]
                esd[k].copy_(ema_v * self.decay + (1. - self.decay) * model_v)

            for k in self.buffer_keys:
                if needs_module:
                    j = 'module.' + k
                else:
                    j = k
                esd[k].copy_(msd[j])

import logging
import math
import os
import pickle
import time

import numpy as np
import torch
import torch.nn as nn
import torch.nn.functional as F
from sklearn.metrics import confusion_matrix as sklearn_cm
from tqdm import tqdm


class EarlyStopping:
    """Early stops the training if validation acc doesn't improve after a given patience."""

    def __init__(self, patience=300, initial_count=0, delta=0):

        """
        Args:
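            patience (int): Number of evaluations to wait after the last improvement in validation accuracy.
                            Default: 300
            initial_count (int): Starting value of the no-improvement counter, e.g. when resuming from a checkpoint.
                            Default: 0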
            delta (float): Minimum change in the monitored quantity to qualify as an improvement.
                            Default: 0

        """

        self.patience = patience
        self.counter = initial_count
        self.best_score = None
        self.early_stop = False
        self.delta = delta


    def __call__(self, val_acc):

        score = val_acc

        if self.best_score is None:
            self.best_score = score

        elif score <= self.best_score + self.delta:
            self.counter += 1
            if self.counter >= self.patience:
                self.early_stop = True

        else:
            self.best_score = score
            self.counter = 0

        print('!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!counter: {}, score: {}, best_score: {}'.format(self.counter, score, self.best_score))

        return self.counter


def train_one_epoch(args, weights, train_loader, model, ema_model, view_model, optimizer, scheduler, epoch):

    args['writer'].add_scalar('train/lr', scheduler.get_last_lr()[0], epoch)

    model.train()

    TotalLoss_this_epoch, LabeledCELoss_this_epoch, ViewRegularizationLoss_this_epoch, scaled_ViewRegularizationLoss_this_epoch = [], [], [], []

    train_iter = iter(train_loader)
    n_steps_per_epoch = 360 #360 train studies, batch size 1
    p_bar = tqdm(range(n_steps_per_epoch), disable=False)

#     for batch_idx, (data, bag_label, view_relevance) in enumerate(tqdm(train_loader)):
    for batch_idx in range(n_steps_per_epoch):

        try:
            data, bag_label = next(train_iter)
        except StopIteration:  # loader exhausted, restart it
            train_iter = iter(train_loader)
            data, bag_label = next(train_iter)

#         print('batch_idx: {}'.format(batch_idx))

#         print('type(data): {}, data.size: {}, require grad: {}'.format(type(data), data.size(), data.requires_grad))
#         print('type(bag_label): {}, bag_label: {}'.format(type(bag_label), bag_label))
#         print('type(view_relevance): {}, view_relevance: {}'.format(type(view_relevance), view_relevance))
        data, bag_label = data.to(args['device']), bag_label.to(args['device'])


        outputs, attentions = model(data)

        log_attentions = torch.log(attentions)


        with torch.no_grad():
            view_predictions = view_model(data.squeeze(0))
            softmax_view_predictions = F.softmax(view_predictions, dim=1)
            predicted_relevance = softmax_view_predictions[:, :2]
            predicted_relevance = torch.sum(predicted_relevance, dim=1)
            predicted_relative_relevance = F.softmax(predicted_relevance/args['T'], dim=0)  # softmax over the N images in the bag
            predicted_relative_relevance = predicted_relative_relevance.unsqueeze(0)


        #element shape in F.cross_entropy: prediction torch.size([batch_size, num_classes]) and true label torch.size([batch_size])
        if args['use_class_weights'] == 'True':
            LabeledCELoss = F.cross_entropy(outputs, bag_label, weights, reduction='mean')
        else:
            LabeledCELoss = F.cross_entropy(outputs, bag_label, reduction='mean')


#         parser.add_argument('--ViewRegularization_warmup_pos', default=0.4, type=float, help='position at which view regularization loss warmup ends') #following MixMatch and FixMatch repo

# parser.add_argument('--ViewRegularization_warmup_schedule_type', default='NoWarmup', choices=['NoWarmup', 'Linear', 'Sigmoid', ], type=str)

        #ViewRegularization warmup schedule choice
        if args['ViewRegularization_warmup_schedule_type'] == 'NoWarmup':
            current_warmup = 1
        elif args['ViewRegularization_warmup_schedule_type'] == 'Linear':
            current_warmup = np.clip(epoch/(float(args['ViewRegularization_warmup_pos']) * args['train_epoch']), 0, 1)
        elif args['ViewRegularization_warmup_schedule_type'] == 'Sigmoid':
            current_warmup = math.exp(-5 * (1 - min(epoch/(float(args['ViewRegularization_warmup_pos']) * args['train_epoch']), 1))**2)
        else:
            raise NameError('Not supported ViewRegularization warmup schedule')



        ViewRegularizationLoss = F.kl_div(input=log_attentions, target=predicted_relative_relevance, log_target=False, reduction='batchmean')

        # backward pass
        total_loss = LabeledCELoss + args['lambda_ViewRegularization'] * ViewRegularizationLoss * current_warmup

        total_loss.backward()



        TotalLoss_this_epoch.append(total_loss.item())
        LabeledCELoss_this_epoch.append(LabeledCELoss.item())
        ViewRegularizationLoss_this_epoch.append(ViewRegularizationLoss.item())
        scaled_ViewRegularizationLoss_this_epoch.append(args['lambda_ViewRegularization'] * ViewRegularizationLoss.item() * current_warmup)

        # step
        optimizer.step()

        #update ema model
        ema_model.update(model)

        model.zero_grad()

    scheduler.step()

    return TotalLoss_this_epoch, LabeledCELoss_this_epoch, ViewRegularizationLoss_this_epoch, scaled_ViewRegularizationLoss_this_epoch



#regular eval_model
def eval_model(args, data_loader, raw_model, ema_model, epoch):

    raw_model.eval()
    ema_model.eval()

    data_loader = tqdm(data_loader, disable=False)

    with torch.no_grad():
        total_targets = []
        total_raw_outputs = []
        total_ema_outputs = []


        for batch_idx, (data, bag_label) in enumerate(data_loader):

#             print('EVAL type(data): {}, data.size: {}, require grad: {}'.format(type(data), data.size(), data.requires_grad))
#             print('EVAL type(bag_label): {}, bag_label: {}'.format(type(bag_label), bag_label))

            data, bag_label = data.to(args['device']), bag_label.to(args['device'])

            raw_outputs, raw_attention_weights = raw_model(data)
            ema_outputs, ema_attention_weights = ema_model(data)
#             print('target is {}, raw_outputs is: {}, ema_outputs is {}'.format(bag_label, raw_outputs, ema_outputs))

            total_targets.append(bag_label.detach().cpu())
            total_raw_outputs.append(raw_outputs.detach().cpu())
            total_ema_outputs.append(ema_outputs.detach().cpu())


        total_targets = np.concatenate(total_targets, axis=0)
        total_raw_outputs = np.concatenate(total_raw_outputs, axis=0)
        total_ema_outputs = np.concatenate(total_ema_outputs, axis=0)
#         print('RegularEval total_targets: {}'.format(total_targets))
#         print('RegularEval total_raw_outputs: {}'.format(total_raw_outputs))
#         print('RegularEval total_ema_outputs: {}'.format(total_ema_outputs))

        raw_Bacc = calculate_balanced_accuracy(total_raw_outputs, total_targets)
        ema_Bacc = calculate_balanced_accuracy(total_ema_outputs, total_targets)

#         print('raw Bacc this evaluation step: {}'.format(raw_Bacc), flush=True)
#         print('ema Bacc this evaluation step: {}'.format(ema_Bacc), flush=True)


    return raw_Bacc, ema_Bacc, total_targets, total_raw_outputs, total_ema_outputs

def eval_model_test(args, data_loader, raw_model):
    raw_model.eval()

    with torch.no_grad():
        ground_truth_labels = []
        pred_labels = []

        for data, bag_label in data_loader:
            data, bag_label = data.to(device), bag_label.to(device)

            pred_logit, _ = raw_model(data)

            pred_label = torch.softmax(pred_logit, dim=-1)
            pred_label = torch.argmax(pred_label).item()

            pred_labels.append(pred_label)
            ground_truth_labels.append(bag_label.item())

        bal_acc = balanced_accuracy_score(ground_truth_labels, pred_labels)

    return bal_acc

def calculate_balanced_accuracy(prediction, true_target, return_type = 'only balanced_accuracy'):

    confusion_matrix = sklearn_cm(true_target, prediction.argmax(1))
    n_class = confusion_matrix.shape[0]
    print('Inside calculate_balanced_accuracy, {} classes passed in'.format(n_class), flush=True)

    assert n_class==3

    recalls = []
    for i in range(n_class):
        recall = confusion_matrix[i,i]/np.sum(confusion_matrix[i])
        recalls.append(recall)
        print('class{} recall: {}'.format(i, recall), flush=True)

    balanced_accuracy = np.mean(np.array(recalls))


    if return_type == 'all':
#         return balanced_accuracy * 100, class0_recall * 100, class1_recall * 100, class2_recall * 100
        return balanced_accuracy * 100, recalls

    elif return_type == 'only balanced_accuracy':
        return balanced_accuracy * 100
    else:
        raise NameError('Unsupported return_type in this calculate_balanced_accuracy fn')


 #shared helper fct across different algos
def save_pickle(save_dir, save_file_name, data):
    if not os.path.exists(save_dir):
        os.makedirs(save_dir)

    data_save_fullpath = os.path.join(save_dir, save_file_name)
    with open(data_save_fullpath, 'wb') as handle:
        pickle.dump(data, handle, protocol=pickle.HIGHEST_PROTOCOL)


#### Training Runner
The code block below contains the logic to train the model, one epoch at a time, with early stop, and the logic to write the results out. This method is called in subsequent training blocks after the training arguments are defined.
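
The runner selects models by validation balanced accuracy and reports the test score measured at that same evaluation, rather than the best test score seen. A minimal sketch of that bookkeeping follows. The function name and the numbers are hypothetical.

```python
def track_best_at_val(history):
    """history: iterable of (val_acc, test_acc) pairs, one per evaluation.

    Returns the best validation accuracy and the test accuracy recorded
    at the evaluation where that best validation accuracy was first reached.
    """
    best_val, test_at_best_val = 0.0, 0.0
    for val_acc, test_acc in history:
        if val_acc > best_val:  # strict improvement, as in the runner
            best_val, test_at_best_val = val_acc, test_acc
    return best_val, test_at_best_val

# Made-up numbers for illustration only.
history = [(60.0, 58.0), (72.0, 65.0), (70.0, 80.0), (72.0, 70.0)]
print(track_best_at_val(history))
```

The higher test scores in the later entries are ignored because validation accuracy did not strictly improve there, which keeps the test set out of model selection.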

In [16]:
import json

def train_samil(args):
    best_val_ema_Bacc = 0
    best_test_ema_Bacc_at_val = 0
    best_train_ema_Bacc_at_val = 0

    best_val_raw_Bacc = 0
    best_test_raw_Bacc_at_val = 0
    best_train_raw_Bacc_at_val = 0


    current_count=0

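    # resume_checkpoint_fullpath is None when resuming is disabled; os.path.isfile(None) would raise
    if args.get('resume_checkpoint_fullpath') and os.path.isfile(args['resume_checkpoint_fullpath']):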

        print('Resuming from checkpoint: {}'.format(args.get('resume_checkpoint_fullpath')))

        checkpoint = torch.load(args['resume_checkpoint_fullpath'])
        args['start_epoch'] = checkpoint['epoch']
        model.load_state_dict(checkpoint['state_dict'])
        ema_model.ema.load_state_dict(checkpoint['ema_state_dict'])
        current_count = checkpoint['current_count']
        optimizer.load_state_dict(checkpoint['optimizer'])
        scheduler.load_state_dict(checkpoint['scheduler'])

        best_val_ema_Bacc = checkpoint['val_progression_view']['best_val_ema_Bacc']
        best_test_ema_Bacc_at_val = checkpoint['val_progression_view']['best_test_ema_Bacc_at_val']
        best_train_ema_Bacc_at_val = checkpoint['val_progression_view']['best_train_ema_Bacc_at_val']

        best_val_raw_Bacc = checkpoint['val_progression_view']['best_val_raw_Bacc']
        best_test_raw_Bacc_at_val = checkpoint['val_progression_view']['best_test_raw_Bacc_at_val']
        best_train_raw_Bacc_at_val = checkpoint['val_progression_view']['best_train_raw_Bacc_at_val']


    else:
        print('!!!!Does not have checkpoint yet!!!!')


    logger.info("***** Running training *****")
    logger.info(f"  Task = {args['dataset_name']}")
    logger.info(f"  Num Epochs = {args['train_epoch']}")
    logger.info(f"  Total optimization steps = {args['train_epoch'] * len(train_dataset)}")

    train_loss_dict = dict()
    train_loss_dict['Totalloss'] = []
    train_loss_dict['LabeledCEloss'] = []
    train_loss_dict['ViewRegularizationLoss'] = []

    early_stopping = EarlyStopping(patience=args['patience'], initial_count=current_count)
    early_stopping_warmup = args['early_stopping_warmup']

    for epoch in tqdm(range(args['start_epoch'], args['train_epoch'])):
        val_predictions_save_dict = dict()
        test_predictions_save_dict = dict()
        train_predictions_save_dict = dict()

        TotalLoss_list, LabeledCEloss_list, ViewRegularizationLoss_list, scaled_ViewRegularizationLoss_list = train_one_epoch(args, weights, train_loader, model, ema_model, view_model, optimizer, scheduler, epoch)
        train_loss_dict['Totalloss'].extend(TotalLoss_list)
        train_loss_dict['LabeledCEloss'].extend(LabeledCEloss_list)
        train_loss_dict['ViewRegularizationLoss'].extend(ViewRegularizationLoss_list)

        if epoch % args['eval_every_Xepoch'] == 0:
            val_raw_Bacc, val_ema_Bacc, val_true_labels, val_raw_predictions, val_ema_predictions = eval_model(args, val_loader, model, ema_model.ema, epoch)
            val_predictions_save_dict['raw_Bacc'] = val_raw_Bacc
            val_predictions_save_dict['ema_Bacc'] = val_ema_Bacc
            val_predictions_save_dict['true_labels'] = val_true_labels
            val_predictions_save_dict['raw_predictions'] = val_raw_predictions
            val_predictions_save_dict['ema_predictions'] = val_ema_predictions

            test_raw_Bacc, test_ema_Bacc, test_true_labels, test_raw_predictions, test_ema_predictions = eval_model(args, test_loader, model, ema_model.ema, epoch)

            test_predictions_save_dict['raw_Bacc'] = test_raw_Bacc
            test_predictions_save_dict['ema_Bacc'] = test_ema_Bacc
            test_predictions_save_dict['true_labels'] = test_true_labels
            test_predictions_save_dict['raw_predictions'] = test_raw_predictions
            test_predictions_save_dict['ema_predictions'] = test_ema_predictions

            train_raw_Bacc, train_ema_Bacc, train_true_labels, train_raw_predictions, train_ema_predictions = eval_model(args, trainmemory_loader, model, ema_model.ema, epoch)

            train_predictions_save_dict['raw_Bacc'] = train_raw_Bacc
            train_predictions_save_dict['ema_Bacc'] = train_ema_Bacc
            train_predictions_save_dict['true_labels'] = train_true_labels
            train_predictions_save_dict['raw_predictions'] = train_raw_predictions
            train_predictions_save_dict['ema_predictions'] = train_ema_predictions

            if val_raw_Bacc > best_val_raw_Bacc:

                best_val_raw_Bacc = val_raw_Bacc
                best_test_raw_Bacc_at_val = test_raw_Bacc
                best_train_raw_Bacc_at_val = train_raw_Bacc

                save_pickle(os.path.join(args['experiment_dir'], 'val_progression_view', 'best_predictions_at_raw_val'), 'val_predictions.pkl', val_predictions_save_dict)

                save_pickle(os.path.join(args['experiment_dir'], 'val_progression_view', 'best_predictions_at_raw_val'), 'test_predictions.pkl', test_predictions_save_dict)


                save_pickle(os.path.join(args['experiment_dir'], 'val_progression_view', 'best_predictions_at_raw_val'), 'train_predictions.pkl', train_predictions_save_dict)

                save_checkpoint(
                {
                'epoch': epoch+1,
                'state_dict': model.state_dict(),
                'ema_state_dict': ema_model.ema.state_dict(),
                'current_count':current_count,
                'optimizer': optimizer.state_dict(),
                'scheduler': scheduler.state_dict(),

                'val_progression_view':
                    {'epoch': epoch+1,
                    'best_val_ema_Bacc': best_val_ema_Bacc,
                    'best_val_raw_Bacc': best_val_raw_Bacc,
                    'best_test_ema_Bacc_at_val': best_test_ema_Bacc_at_val,
                    'best_test_raw_Bacc_at_val': best_test_raw_Bacc_at_val,
                    'best_train_ema_Bacc_at_val': best_train_ema_Bacc_at_val,
                    'best_train_raw_Bacc_at_val': best_train_raw_Bacc_at_val,
                      },

                }, args['experiment_dir'], filename='val_progression_view/best_predictions_at_raw_val/best_model.pth.tar')


            if val_ema_Bacc > best_val_ema_Bacc:

                best_val_ema_Bacc = val_ema_Bacc
                best_test_ema_Bacc_at_val = test_ema_Bacc
                best_train_ema_Bacc_at_val = train_ema_Bacc

                save_pickle(os.path.join(args['experiment_dir'], 'val_progression_view', 'best_predictions_at_ema_val'), 'val_predictions.pkl', val_predictions_save_dict)

                save_pickle(os.path.join(args['experiment_dir'], 'val_progression_view', 'best_predictions_at_ema_val'), 'test_predictions.pkl', test_predictions_save_dict)

                save_pickle(os.path.join(args['experiment_dir'], 'val_progression_view', 'best_predictions_at_ema_val'), 'train_predictions.pkl', train_predictions_save_dict)

                save_checkpoint(
                {
                'epoch': epoch+1,
                'state_dict': model.state_dict(),
                'ema_state_dict': ema_model.ema.state_dict(),
                'current_count':current_count,
                'optimizer': optimizer.state_dict(),
                'scheduler': scheduler.state_dict(),

                'val_progression_view':
                    {'epoch': epoch+1,
                    #regular val
                    'best_val_ema_Bacc': best_val_ema_Bacc,
                    'best_val_raw_Bacc': best_val_raw_Bacc,
                    'best_test_ema_Bacc_at_val': best_test_ema_Bacc_at_val,
                    'best_test_raw_Bacc_at_val': best_test_raw_Bacc_at_val,
                    'best_train_ema_Bacc_at_val': best_train_ema_Bacc_at_val,
                    'best_train_raw_Bacc_at_val': best_train_raw_Bacc_at_val,
                      },

                }, args['experiment_dir'], filename='val_progression_view/best_predictions_at_ema_val/best_model.pth.tar')



            logger.info('val progression view:')
            logger.info('At RAW Best val, validation/test/train %.2f %.2f %.2f' % (best_val_raw_Bacc, best_test_raw_Bacc_at_val, best_train_raw_Bacc_at_val))
            logger.info('At EMA Best val, validation/test/train %.2f %.2f %.2f' % (best_val_ema_Bacc, best_test_ema_Bacc_at_val, best_train_ema_Bacc_at_val))

            args['writer'].add_scalar('train/1.train_raw_Bacc', train_raw_Bacc, epoch)
            args['writer'].add_scalar('train/1.train_ema_Bacc', train_ema_Bacc, epoch)
            args['writer'].add_scalar('train/1.LabeledCEloss', np.mean(LabeledCEloss_list), epoch)

            args['writer'].add_scalar('val/1.val_raw_Bacc', val_raw_Bacc, epoch)
            args['writer'].add_scalar('val/2.val_ema_Bacc', val_ema_Bacc, epoch)


            args['writer'].add_scalar('test/1.test_raw_Bacc', test_raw_Bacc, epoch)
            args['writer'].add_scalar('test/2.test_ema_Bacc', test_ema_Bacc, epoch)

            brief_summary['val_progression_view']['best_val_ema_Bacc'] = best_val_ema_Bacc
            brief_summary['val_progression_view']['best_val_raw_Bacc'] = best_val_raw_Bacc
            brief_summary['val_progression_view']['best_test_ema_Bacc_at_val'] = best_test_ema_Bacc_at_val
            brief_summary['val_progression_view']['best_test_raw_Bacc_at_val'] = best_test_raw_Bacc_at_val
            brief_summary['val_progression_view']['best_train_ema_Bacc_at_val'] = best_train_ema_Bacc_at_val
            brief_summary['val_progression_view']['best_train_raw_Bacc_at_val'] = best_train_raw_Bacc_at_val


            with open(os.path.join(args['experiment_dir'], "brief_summary.json"), "w") as f:
                json.dump(brief_summary, f)

            if epoch > early_stopping_warmup:
                current_count = early_stopping(val_ema_Bacc)

            save_checkpoint(
                {
                'epoch': epoch+1,
                'state_dict': model.state_dict(),
                'ema_state_dict': ema_model.ema.state_dict(),
                'current_count':current_count,
                'optimizer': optimizer.state_dict(),
                'scheduler': scheduler.state_dict(),

                'val_progression_view':
                    {'epoch': epoch+1,
                    #regular val
                    'best_val_ema_Bacc': best_val_ema_Bacc,
                    'best_val_raw_Bacc': best_val_raw_Bacc,
                    'best_test_ema_Bacc_at_val': best_test_ema_Bacc_at_val,
                    'best_test_raw_Bacc_at_val': best_test_raw_Bacc_at_val,
                    'best_train_ema_Bacc_at_val': best_train_ema_Bacc_at_val,
                    'best_train_raw_Bacc_at_val': best_train_raw_Bacc_at_val,
                      },


                }, args['experiment_dir'], filename='last_checkpoint.pth.tar')


            if early_stopping.early_stop:
                break

    brief_summary['val_progression_view']['best_val_ema_Bacc'] = best_val_ema_Bacc
    brief_summary['val_progression_view']['best_val_raw_Bacc'] = best_val_raw_Bacc
    brief_summary['val_progression_view']['best_test_ema_Bacc_at_val'] = best_test_ema_Bacc_at_val
    brief_summary['val_progression_view']['best_test_raw_Bacc_at_val'] = best_test_raw_Bacc_at_val
    brief_summary['val_progression_view']['best_train_ema_Bacc_at_val'] = best_train_ema_Bacc_at_val
    brief_summary['val_progression_view']['best_train_raw_Bacc_at_val'] = best_train_raw_Bacc_at_val



    args['writer'].close()

    with open(os.path.join(args['experiment_dir'], "brief_summary.json"), "w") as f:
        json.dump(brief_summary, f)

### Configure SAMIL Arguments for Training

The code block below configures the arguments for training the SAMIL model using the Split 1 hyperparameters from the paper, which are also listed in the GitHub repo: [Hyperparameters](https://github.com/tufts-ml/SAMIL/blob/main/Hyperparameters/Hyperparameters.txt)

| SAMIL (with study-level SSL) | split1 | split2 | split3 |
| ---------------------------- | ------ | ------ | ------ |
| Learning rate                | 0.0008 | 0.0005 | 0.0005 |
| Weight decay                 | 0.0001 | 0.0001 | 0.001  |
| Temperature T                | 0.1    | 0.05   | 0.1    |
| λ<sub>sA</sub>               | 15.0   | 20.0   | 20.0   |
| Learning rate schedule       | cosine | cosine | cosine |

Table C.1: Hyperparameter settings for SAMIL across different data splits.
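
For convenience, the table can be expressed as a lookup that fills the corresponding entries of the `args` dictionary. This is a sketch under assumptions: the values are transcribed from Table C.1 and the keys follow the `args` dictionary used in this notebook, but the names `SAMIL_HPARAMS` and `hparams_for_split` are hypothetical.

```python
# Values transcribed from Table C.1 (SAMIL with study-level SSL).
SAMIL_HPARAMS = {
    'split1': {'lr': 0.0008, 'wd': 0.0001, 'T': 0.1, 'lambda_ViewRegularization': 15.0},
    'split2': {'lr': 0.0005, 'wd': 0.0001, 'T': 0.05, 'lambda_ViewRegularization': 20.0},
    'split3': {'lr': 0.0005, 'wd': 0.001, 'T': 0.1, 'lambda_ViewRegularization': 20.0},
}

def hparams_for_split(split, base_args=None):
    """Return a copy of base_args with the table's hyperparameters for `split` applied."""
    merged = dict(base_args or {})
    merged.update(SAMIL_HPARAMS[split])
    merged['lr_schedule_type'] = 'CosineLR'  # cosine schedule for every split
    return merged

print(hparams_for_split('split2', {'optimizer_type': 'SGD'}))
```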

#### Training Type 1: No Pretraining

**Please change hyperparameters and epochs as needed.**

The epoch count defined in the paper is 2,000. To test the training below, set `train_epoch` to 1.



In [23]:
RUNS_DIR = '/content/runs/'

args = {
    'training_seed': 0,
    'Pretrained': 'NoPretrain',
    'data_seed': 0,
    'checkpoint_dir': MODEL_CHECKPOINTS,
    'MIL_checkpoint_path': '',
    'use_class_weights': 'True',
    'ViewRegularization_warmup_schedule_type': 'Linear',
    'optimizer_type': 'SGD',
    'lr_schedule_type': 'CosineLR',
    'lr_cycle_epochs': 1,
    'lr': 0.0008, # learning rate
    'wd': 0.0001, # weight decay
    'T': 0.1, # temperature
    'lambda_ViewRegularization': 15.0, # λsA
    'train_dir': RUNS_DIR + 'SAMIL',
    'resume': 'last_checkpoint.pth.tar',
    'dataset_name': 'echo',
    'train_epoch': 2000, # number of epochs, 2000 defined in the paper. CHANGE ME!
    'development_size': 'DEV479',
    'lr_warmup_epochs': 0,
    'ema_decay': 0.999,
    'device': device,
    'start_epoch': 0,
    'patience': 200,
    'early_stopping_warmup': 200,
    'ViewRegularization_warmup_pos': 0.4,
    'eval_every_Xepoch': 1
}

args, brief_summary = setup_samil_train(args)

weights = args['class_weights']
weights = [float(i) for i in weights.split(',')]
weights = torch.Tensor(weights)
weights = weights.to(device)

# Load the view model; its output is unnormalized logits, so softmax must be applied to the output
view_model = create_view_model(args)
view_model.to(device)

model = create_model(args)
model.to(device)

no_decay = ['bias', 'bn']
grouped_parameters = [
    {'params': [p for n, p in model.named_parameters() if not any(
        nd in n for nd in no_decay)], 'weight_decay': args['wd']},
    {'params': [p for n, p in model.named_parameters() if any(
        nd in n for nd in no_decay)], 'weight_decay': 0.0}
]

if args['optimizer_type'] == 'SGD':
    optimizer = optim.SGD(grouped_parameters, lr=args['lr'],
                          momentum=0.9, nesterov=True)

elif args['optimizer_type'] == 'Adam':
    optimizer = optim.Adam(grouped_parameters, lr=args['lr'])

elif args['optimizer_type'] == 'AdamW':
    optimizer = optim.AdamW(grouped_parameters, lr=args['lr'])

else:
    raise NameError('Not supported optimizer setting')

#lr_schedule_type choice
if args['lr_schedule_type'] == 'CosineLR':
    scheduler = get_cosine_schedule_with_warmup(optimizer, args['lr_warmup_epochs'], args['lr_cycle_epochs'])

elif args['lr_schedule_type'] == 'FixedLR':
    scheduler = get_fixed_lr(optimizer, args['lr_warmup_epochs'], args['lr_cycle_epochs'])

else:
    raise NameError('Not supported lr scheduler setting')


#instantiate the ema_model object
ema_model = ModelEMA(args, model, args['ema_decay'])


# !!! Start training
train_samil(args)

setting training seed0


INFO:__main__:Model: WideResNet 28x2


!!!!!!!!Using pre-calculated class weights!!!!!!!!
args.resume_checkpoint_fullpath: /content/runs/SAMIL/NoPretrain/last_checkpoint.pth.tar


INFO:__main__:Total params for View Model: 5.93M
INFO:__main__:Total params: 2.31M
INFO:__main__:***** Running training *****
INFO:__main__:  Task = echo
INFO:__main__:  Num Epochs = 2000
INFO:__main__:  Total optimization steps = 720000


self.param_keys: ['feature_extractor_part1.0.weight', 'feature_extractor_part1.0.bias', 'feature_extractor_part1.3.weight', 'feature_extractor_part1.3.bias', 'feature_extractor_part1.6.weight', 'feature_extractor_part1.6.bias', 'feature_extractor_part1.9.weight', 'feature_extractor_part1.9.bias', 'feature_extractor_part2.0.weight', 'feature_extractor_part2.0.bias', 'feature_extractor_part3.0.weight', 'feature_extractor_part3.0.bias', 'feature_extractor_part3.2.weight', 'feature_extractor_part3.2.bias', 'attention_V.0.weight', 'attention_V.0.bias', 'attention_V.2.weight', 'attention_V.2.bias', 'attention_U.0.weight', 'attention_U.0.bias', 'attention_U.2.weight', 'attention_U.2.bias', 'classifier.0.weight', 'classifier.0.bias']
self.buffer_keys: []
!!!!Does not have checkpoint yet!!!!



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
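
The 33.33 values in the summary above are consistent with balanced accuracy being the mean of the per-class recalls: with recalls of 0.0, 0.0 and 1.0 over three classes, the mean is 1/3. A recall pattern like this is what one would see if the model predicted class 2 for every study, although the log alone does not confirm that. The sketch below is a minimal, hypothetical reimplementation for illustration only; the function names `per_class_recall` and `balanced_accuracy` and the toy labels are assumptions, not the project's `calculate_balanced_accuracy`.

```python
from collections import Counter


def per_class_recall(y_true, y_pred, num_classes):
    """Recall per class: correct predictions of class c / true samples of class c."""
    support = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    # A class with no true samples gets recall 0.0 here (an assumption of this sketch).
    return [hits[c] / support[c] if support[c] else 0.0 for c in range(num_classes)]


def balanced_accuracy(y_true, y_pred, num_classes):
    """Balanced accuracy in percent: unweighted mean of the per-class recalls."""
    recalls = per_class_recall(y_true, y_pred, num_classes)
    return 100.0 * sum(recalls) / num_classes


# Hypothetical toy labels: two studies per class, model always predicts class 2.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [2, 2, 2, 2, 2, 2]

recalls = per_class_recall(y_true, y_pred, num_classes=3)
score = balanced_accuracy(y_true, y_pred, num_classes=3)
print(recalls)          # per-class recall for classes 0, 1, 2
print(round(score, 2))  # balanced accuracy in percent
```

Because every class contributes equally to the mean, a classifier that collapses onto one of three classes scores one third regardless of how the classes are distributed in the data.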
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()


  0%|          | 1/360 [00:00<05:05,  1.18it/s][A[A

  1%|          | 2/360 [00:00<02:33,  2.34it/s][A[A

  1%|          | 4/360 [00:01<01:16,  4.63it/s][A[A

  2%|▏         | 6/360 [00:01<00:54,  6.54it/s][A[A

  2%|▏         | 8/360 [00:01<00:40,  8.71it/s][A[A

  3%|▎         | 11/360 [00:01<00:28, 12.11it/s][A[A

  4%|▍         | 14/360 [00:01<00:23, 14.90it/s][A[A

  4%|▍         | 16/360 [00:01<00:21, 15.99it/s][A[A

  5%|▌         | 19/360 [00:01<00:18, 18.79it/s][A[A

  6%|▌         | 22/360 [00:02<00:16, 20.31it/s][A[A

  7%|▋         | 25/360 [00:02<00:16, 20.66it/s][A[A

  8%|▊         | 28/360 [00:02<00:15, 21.62it/s][A[A

  9%|▊         | 31/360 [00:02<00:14, 22.25it/s][A[A

  9%|▉         | 34/360 [00:02<00:14, 22.57it/s][A[A

 10%|█         | 37/360 [00:02<00:13, 23.90it/s][A[A

 11%|█         | 40/360 [00:02<00:13, 24.13it/s][A[A

 12%|█▏        | 43/360 [00:02<00:13, 24.20it/s][A[A

 13%|█▎        | 46/360 [

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  self.pid = os.fork()


  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





  self.pid = os.fork()



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
  5%|▍         | 93/2000 [2:38:03<54:00:56, 101.97s/it]


KeyboardInterrupt: 
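
The value of 33.33 in the log above is what one would expect if `calculate_balanced_accuracy` averages the per-class recalls, which is the usual definition of balanced accuracy: recalls of 0.0, 0.0 and 1.0 have a mean of 1/3. That pattern is consistent with the model predicting class 2 for every study in this interrupted run. The sketch below reproduces the arithmetic; the function names and the toy labels are illustrative assumptions, not the notebook's implementation.

```python
from collections import Counter


def per_class_recall(y_true, y_pred, num_classes):
    """Recall per class: correctly predicted members / true members."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    # Assumption: a class with no true members is reported as 0.0.
    return [hits[c] / totals[c] if totals[c] else 0.0 for c in range(num_classes)]


def balanced_accuracy_from_labels(y_true, y_pred, num_classes=3):
    """Balanced accuracy in percent, as the mean of per-class recall."""
    recalls = per_class_recall(y_true, y_pred, num_classes)
    return 100.0 * sum(recalls) / num_classes, recalls


# Toy labels (made up): two studies per class, every study predicted as class 2.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [2] * len(y_true)

score, recalls = balanced_accuracy_from_labels(y_true, y_pred)
print(recalls, round(score, 2))
```

A balanced accuracy of exactly one third on a 3-class problem is therefore a sign that training has not yet moved away from a constant prediction, rather than a meaningful baseline.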

#### Type 2 - Image-Level Pre Training

In [None]:
RUNS_DIR = '/content/runs/'

args = {
    'training_seed': 0,
    'Pretrained': 'FeatureExtractor', # Options are Whole for Study Level, FeatureExtractor for Image Level, or NoPretrain for no SSL.
    'data_seed': 0,
    'checkpoint_dir': MODEL_CHECKPOINTS,
    'MIL_checkpoint_path': '',
    'use_class_weights': 'True',
    'ViewRegularization_warmup_schedule_type': 'Linear',
    'optimizer_type': 'SGD',
    'lr_schedule_type': 'CosineLR',
    'lr_cycle_epochs': 1,
    'lr': 0.0008, # learning rate
    'wd': 0.0001, # weight decay
    'T': 0.1, # temperature
    'lambda_ViewRegularization': 15.0, # λsA
    'train_dir': RUNS_DIR + 'SAMIL',
    'resume': 'last_checkpoint.pth.tar',
    'dataset_name': 'echo',
    'train_epoch': 2000, # number of epochs, 2000 defined in the paper. CHANGE ME!
    'development_size': 'DEV479',
    'lr_warmup_epochs': 0,
    'ema_decay': 0.999,
    'device': device,
    'start_epoch': 0,
    'patience': 200,
    'early_stopping_warmup': 200,
    'ViewRegularization_warmup_pos': 0.4,
    'eval_every_Xepoch': 1
}

args, brief_summary = setup_samil_train(args)

weights = args['class_weights']
weights = [float(i) for i in weights.split(',')]
weights = torch.Tensor(weights)
weights = weights.to(device)

#load the view model, the output is unnormalized logits, need to use softmax on the output
view_model = create_view_model(args)
view_model.to(device)

model = create_model(args)
model.to(device)

no_decay = ['bias', 'bn']
grouped_parameters = [
    {'params': [p for n, p in model.named_parameters() if not any(
        nd in n for nd in no_decay)], 'weight_decay': args['wd']},
    {'params': [p for n, p in model.named_parameters() if any(
        nd in n for nd in no_decay)], 'weight_decay': 0.0}
]

if args['optimizer_type'] == 'SGD':
    optimizer = optim.SGD(grouped_parameters, lr=args['lr'],
                          momentum=0.9, nesterov=True)

elif args['optimizer_type'] == 'Adam':
    optimizer = optim.Adam(grouped_parameters, lr=args['lr'])

elif args['optimizer_type'] == 'AdamW':
    optimizer = optim.AdamW(grouped_parameters, lr=args['lr'])

else:
    raise NameError('Not supported optimizer setting')

#lr_schedule_type choice
if args['lr_schedule_type'] == 'CosineLR':
    scheduler = get_cosine_schedule_with_warmup(optimizer, args['lr_warmup_epochs'], args['lr_cycle_epochs'])

elif args['lr_schedule_type'] == 'FixedLR':
    scheduler = get_fixed_lr(optimizer, args['lr_warmup_epochs'], args['lr_cycle_epochs'])

else:
    raise NameError('Not supported lr scheduler setting')


#instantiate the ema_model object
ema_model = ModelEMA(args, model, args['ema_decay'])


# !!! Start training
train_samil(args)
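
The `grouped_parameters` list in the cell above splits parameters by name: any parameter whose name contains `bias` or `bn` gets a `weight_decay` of 0.0, and everything else gets `args['wd']`. The sketch below applies the same substring test to a few parameter names taken from the `self.param_keys` line of the training output, without needing PyTorch. `split_by_decay` is a hypothetical helper name used only for illustration.

```python
NO_DECAY = ['bias', 'bn']


def split_by_decay(names, no_decay=NO_DECAY):
    """Partition parameter names with the same substring test as the cell."""
    decay = [n for n in names if not any(nd in n for nd in no_decay)]
    skip = [n for n in names if any(nd in n for nd in no_decay)]
    return decay, skip


# A subset of the names printed under self.param_keys in the training log.
names = [
    'feature_extractor_part1.0.weight',
    'feature_extractor_part1.0.bias',
    'attention_V.0.weight',
    'attention_V.0.bias',
    'classifier.0.weight',
    'classifier.0.bias',
]

decay, skip = split_by_decay(names)
print('weight decay applied:', decay)
print('weight decay skipped:', skip)
```

Because the test is a substring match, a name such as `bn1.weight` would also land in the no-decay group. None of the names listed under `self.param_keys` contain `bn`, so in this run only the bias terms appear to be exempt.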

#### Training Type 3 - Study-Level (i.e., Bag-Level) Pre Training

This run trains the model with both novel contributions of the paper: the supervised attention mechanism and self-supervised (SSL) pre-training of entire study-level representations, which builds upon MoCo (v2). MoCo is a "recent method for self-supervised image-level contrastive learning (img-CL)" (Huang, Wessler, and Hughes, 2023).
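
One setting shared by the training configurations is the temperature `'T': 0.1`. The training output quotes the line `F.softmax(predicted_relevance/args['T'])`, so the predicted view-relevance scores are divided by `T` before the softmax. A minimal pure-Python sketch of that scaling follows; the scores are made up, and `softmax_with_temperature` is an illustrative name rather than a function from the notebook.

```python
import math


def softmax_with_temperature(scores, temperature=1.0):
    """Softmax over scores / temperature, shifted by the max for stability."""
    scaled = [s / temperature for s in scores]
    shift = max(scaled)
    exps = [math.exp(s - shift) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical relevance scores for three images in one study.
scores = [0.2, 0.5, 0.3]

plain = softmax_with_temperature(scores, temperature=1.0)
sharp = softmax_with_temperature(scores, temperature=0.1)

print([round(p, 3) for p in plain])
print([round(p, 3) for p in sharp])
```

With a temperature below 1 the distribution concentrates on the highest-scoring image, which is the usual reason for choosing a small `T`. The quoted `F.softmax` line is most likely printed as part of PyTorch's warning about calling softmax without a `dim` argument; passing `dim` explicitly is the usual remedy, though the correct axis depends on the shape of `predicted_relevance`, which is not shown here.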

In [None]:
RUNS_DIR = '/content/runs/'

args = {
    'training_seed': 0,
    'Pretrained': 'Whole', # Options are Whole for Study Level, FeatureExtractor for Image Level, or NoPretrain for no SSL.
    'data_seed': 0,
    'checkpoint_dir': MODEL_CHECKPOINTS,
    'MIL_checkpoint_path': '',
    'use_class_weights': 'True',
    'ViewRegularization_warmup_schedule_type': 'Linear',
    'optimizer_type': 'SGD',
    'lr_schedule_type': 'CosineLR',
    'lr_cycle_epochs': 1,
    'lr': 0.0008, # learning rate
    'wd': 0.0001, # weight decay
    'T': 0.1, # temperature
    'lambda_ViewRegularization': 15.0, # λsA
    'train_dir': RUNS_DIR + 'SAMIL',
    'resume': 'last_checkpoint.pth.tar',
    'dataset_name': 'echo',
    'train_epoch': 2000, # number of epochs, 2000 defined in the paper. CHANGE ME!
    'development_size': 'DEV479',
    'lr_warmup_epochs': 0,
    'ema_decay': 0.999,
    'device': device,
    'start_epoch': 0,
    'patience': 200,
    'early_stopping_warmup': 200,
    'ViewRegularization_warmup_pos': 0.4,
    'eval_every_Xepoch': 1
}

args, brief_summary = setup_samil_train(args)

weights = args['class_weights']
weights = [float(i) for i in weights.split(',')]
weights = torch.Tensor(weights)
weights = weights.to(device)

#load the view model, the output is unnormalized logits, need to use softmax on the output
view_model = create_view_model(args)
view_model.to(device)

model = create_model(args)
model.to(device)

no_decay = ['bias', 'bn']
grouped_parameters = [
    {'params': [p for n, p in model.named_parameters() if not any(
        nd in n for nd in no_decay)], 'weight_decay': args['wd']},
    {'params': [p for n, p in model.named_parameters() if any(
        nd in n for nd in no_decay)], 'weight_decay': 0.0}
]

if args['optimizer_type'] == 'SGD':
    optimizer = optim.SGD(grouped_parameters, lr=args['lr'],
                          momentum=0.9, nesterov=True)

elif args['optimizer_type'] == 'Adam':
    optimizer = optim.Adam(grouped_parameters, lr=args['lr'])

elif args['optimizer_type'] == 'AdamW':
    optimizer = optim.AdamW(grouped_parameters, lr=args['lr'])

else:
    raise NameError('Not supported optimizer setting')

#lr_schedule_type choice
if args['lr_schedule_type'] == 'CosineLR':
    scheduler = get_cosine_schedule_with_warmup(optimizer, args['lr_warmup_epochs'], args['lr_cycle_epochs'])

elif args['lr_schedule_type'] == 'FixedLR':
    scheduler = get_fixed_lr(optimizer, args['lr_warmup_epochs'], args['lr_cycle_epochs'])

else:
    raise NameError('Not supported lr scheduler setting')


#instantiate the ema_model object
ema_model = ModelEMA(args, model, args['ema_decay'])


# !!! Start training
train_samil(args)
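
The `no_decay` filter above matches by substring, so it can be checked against the parameter names that the training log prints as `self.param_keys`. The sketch below is a plain-Python illustration that needs no PyTorch. `split_by_weight_decay` is a hypothetical helper, not part of the SAMIL code, and the name list is a shortened copy of the logged keys.

```python
# Hypothetical helper illustrating the substring-based grouping used above.
NO_DECAY = ['bias', 'bn']

def split_by_weight_decay(param_names, no_decay=NO_DECAY):
    """Return (decayed, not_decayed) lists of parameter names."""
    decayed = [n for n in param_names if not any(nd in n for nd in no_decay)]
    not_decayed = [n for n in param_names if any(nd in n for nd in no_decay)]
    return decayed, not_decayed

# A few of the names printed as self.param_keys in the training log.
param_names = [
    'feature_extractor_part1.0.weight',
    'feature_extractor_part1.0.bias',
    'attention_V.0.weight',
    'attention_V.0.bias',
    'classifier.0.weight',
    'classifier.0.bias',
]

decayed, not_decayed = split_by_weight_decay(param_names)
print(decayed)
print(not_decayed)
```

None of the logged names contain `bn`, so for this model only the bias terms fall into the group with zero weight decay.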

setting training seed0


INFO:__main__:Model: WideResNet 28x2


!!!!!!!!Using pre-calculated class weights!!!!!!!!
args.resume_checkpoint_fullpath: /content/runs/SAMIL/Whole/last_checkpoint.pth.tar


INFO:__main__:Total params for View Model: 5.93M


!!!!!!!!!!!!!!!!!!!!!initializing from pretrained checkpoint!!!!!!!!!!!!!!!!!!!!!


INFO:__main__:Total params: 2.31M
INFO:__main__:***** Running training *****
INFO:__main__:  Task = echo
INFO:__main__:  Num Epochs = 2000
INFO:__main__:  Total optimization steps = 720000


self.param_keys: ['feature_extractor_part1.0.weight', 'feature_extractor_part1.0.bias', 'feature_extractor_part1.3.weight', 'feature_extractor_part1.3.bias', 'feature_extractor_part1.6.weight', 'feature_extractor_part1.6.bias', 'feature_extractor_part1.9.weight', 'feature_extractor_part1.9.bias', 'feature_extractor_part2.0.weight', 'feature_extractor_part2.0.bias', 'feature_extractor_part3.0.weight', 'feature_extractor_part3.0.bias', 'feature_extractor_part3.2.weight', 'feature_extractor_part3.2.bias', 'attention_V.0.weight', 'attention_V.0.bias', 'attention_V.2.weight', 'attention_V.2.bias', 'attention_U.0.weight', 'attention_U.0.bias', 'attention_U.2.weight', 'attention_U.2.bias', 'classifier.0.weight', 'classifier.0.bias']
self.buffer_keys: []
!!!!Does not have checkpoint yet!!!!



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.04
class1 recall: 0.0
class2 recall: 0.9333333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.038461538461538464
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.07894736842105263
class1 recall: 0.0
class2 recall: 0.9060773480662984



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 32.44 34.62 32.83
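
The logged balanced accuracies are consistent with averaging the three per-class recalls. The following is a minimal sketch under that assumption. `balanced_accuracy` is an illustrative stand-in, not the notebook's `calculate_balanced_accuracy`, and the recall values are copied from the log above in the order they appear.

```python
def balanced_accuracy(recalls):
    """Mean of per-class recalls, expressed as a percentage."""
    return 100.0 * sum(recalls) / len(recalls)

# Recalls for class0, class1, class2 as printed in the log.
raw = balanced_accuracy([0.0, 0.0, 1.0])
ema_val = balanced_accuracy([0.04, 0.0, 0.9333333333333333])
ema_test = balanced_accuracy([0.038461538461538464, 0.0, 1.0])
ema_train = balanced_accuracy([0.07894736842105263, 0.0, 0.9060773480662984])

print(f"{raw:.2f} {ema_val:.2f} {ema_test:.2f} {ema_train:.2f}")
```

A value of 33.33 with recalls of 0, 0, and 1 is chance level for three classes, and is consistent with the model assigning nearly every study to class 2.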

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
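
The RAW and EMA rows appear to compare the trained weights with an exponential moving average of them (`ema_decay` is 0.999 in `args`). `ModelEMA` itself is not shown in this section, so the sketch below is the standard EMA update on plain floats, using a hypothetical helper `ema_update` rather than the notebook's implementation.

```python
def ema_update(ema_params, params, decay=0.999):
    """Blend the current parameters into the running average in place."""
    for name, value in params.items():
        ema_params[name] = decay * ema_params[name] + (1.0 - decay) * value
    return ema_params

# Toy example: the "trained" weight is held at 1.0 while the average starts at 0.0.
ema = {'w': 0.0}
for _ in range(1000):
    ema_update(ema, {'w': 1.0})

print(round(ema['w'], 3))
```

With a decay of 0.999 the average moves slowly, so early in training the EMA metrics can differ from the RAW ones, as in the first evaluation above.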

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 33.33 33.33 33.33
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.08
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.15384615384615385
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.10526315789473684
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 36.00 38.46 36.84
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33
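
The RAW figures in this summary appear to be the unweighted mean of the three per-class recalls printed earlier in the same epoch, expressed as a percentage. The sketch below checks that reading against the logged numbers. It is an illustration only: `balanced_accuracy` is a hypothetical helper, not the notebook's own `calculate_balanced_accuracy`, whose body is not shown in this log.

```python
def balanced_accuracy(per_class_recall):
    """Unweighted mean of per-class recalls, returned as a percentage."""
    return 100.0 * sum(per_class_recall) / len(per_class_recall)

# Per-class recalls copied from the log lines above (first call in each split).
logged_recalls = {
    "validation": [0.08, 0.0, 1.0],
    "test": [0.15384615384615385, 0.0, 1.0],
    "train": [0.10526315789473684, 0.0, 1.0],
}

for split, recalls in logged_recalls.items():
    print(f"{split}: {balanced_accuracy(recalls):.2f}")

# A model that predicts only class 2 has recalls [0, 0, 1],
# which gives the 33.33 plateau seen in the earlier epochs.
print(f"single-class predictor: {balanced_accuracy([0.0, 0.0, 1.0]):.2f}")
```

Under this reading, the 33.33 values mean the model is assigning every study to one class, and the first movement away from 33.33 comes entirely from class 0 recall becoming non-zero.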

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.24
class1 recall: 0.0
class2 recall: 0.9833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.23076923076923078
class1 recall: 0.0
class2 recall: 0.9833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.34210526315789475
class1 recall: 0.0
class2 recall: 0.9502762430939227
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 40.78 40.47 43.08
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.0
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.0
class2 recall: 0.95
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4605263157894737
class1 recall: 0.0
class2 recall: 0.8729281767955801
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.0
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 42.56 47.05 44.45
INFO:__main__:At EMA Best val, validation/test/train 33.33 33.33 33.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.12
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.08
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.15384615384615385
class1 recall: 0.0
class2 recall: 1.0
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.11538461538461539
class1 recall: 0.0
class2 recall: 1.0





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.21052631578947367
class1 recall: 0.0
class2 recall: 0.9779005524861878
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.07894736842105263
class1 recall: 0.0
class2 recall: 1.0



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 42.56 47.05 44.45
INFO:__main__:At EMA Best val, validation/test/train 36.00 37.18 35.96

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.32
class1 recall: 0.0
class2 recall: 0.9833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.2
class1 recall: 0.0
class2 recall: 0.9833333333333333






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.2692307692307692
class1 recall: 0.0
class2 recall: 0.9666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.19230769230769232
class1 recall: 0.0
class2 recall: 1.0




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.3157894736842105
class1 recall: 0.0
class2 recall: 0.9337016574585635
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.17105263157894737
class1 recall: 0.0
class2 recall: 0.9723756906077348



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 43.44 41.20 41.65
INFO:__main__:At EMA Best val, validation/test/train 39.44 39.74 38.11

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.0
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.28
class1 recall: 0.0
class2 recall: 0.9833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.0
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.2692307692307692
class1 recall: 0.0
class2 recall: 0.9666666666666667



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7631578947368421
class1 recall: 0.038834951456310676
class2 recall: 0.6298342541436464
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.35526315789473684
class1 recall: 0.0
class2 recall: 0.9392265193370166



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 44.67 43.46 47.73
INFO:__main__:At EMA Best val, validation/test/train 42.11 41.20 43.15

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.0
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.0
class2 recall: 0.9666666666666667



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.0
class2 recall: 0.9666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6578947368421053
class1 recall: 0.08737864077669903
class2 recall: 0.6795580110497238
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4473684210526316
class1 recall: 0.0
class2 recall: 0.8950276243093923



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 45.00 45.08 47.49
INFO:__main__:At EMA Best val, validation/test/train 44.22 47.61 44.75

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.058823529411764705
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.0
class2 recall: 0.9333333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.38461538461538464
class1 recall: 0.0
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.0
class2 recall: 0.9333333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.47368421052631576
class1 recall: 0.06796116504854369
class2 recall: 0.8674033149171271
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5131578947368421
class1 recall: 0.009708737864077669
class2 recall: 0.8839779005524862


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 45.00 45.08 47.49
INFO:__main__:At EMA Best val, validation/test/train 45.78 46.50 46.89

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.0
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.0
class2 recall: 0.9166666666666666




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.0
class2 recall: 0.9333333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.10679611650485436
class2 recall: 0.6906077348066298
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5657894736842105
class1 recall: 0.02912621359223301
class2 recall: 0.8453038674033149



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 45.00 45.08 47.49
INFO:__main__:At EMA Best val, validation/test/train 47.89 47.78 48.01

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.72
class1 recall: 0.029411764705882353
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.0
class2 recall: 0.8833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.029411764705882353
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.0
class2 recall: 0.9




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.08737864077669903
class2 recall: 0.6022099447513812
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5789473684210527
class1 recall: 0.038834951456310676
class2 recall: 0.8232044198895028



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 45.00 45.08 47.49
INFO:__main__:At EMA Best val, validation/test/train 47.89 47.78 48.01

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.0
class2 recall: 0.8833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.0
class2 recall: 0.8333333333333334




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.029411764705882353
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.0
class2 recall: 0.85




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5789473684210527
class1 recall: 0.06796116504854369
class2 recall: 0.8121546961325967
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6052631578947368
class1 recall: 0.04854368932038835
class2 recall: 0.8011049723756906



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.89 47.78 48.01

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.058823529411764705
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.0
class2 recall: 0.8166666666666667



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.058823529411764705
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5394736842105263
class1 recall: 0.10679611650485436
class2 recall: 0.856353591160221
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.618421052631579
class1 recall: 0.07766990291262135
class2 recall: 0.7513812154696132



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.89 47.78 48.01

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.0
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.0
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.07766990291262135
class2 recall: 0.7237569060773481
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6052631578947368
class1 recall: 0.0970873786407767
class2 recall: 0.7458563535911602


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.89 47.78 48.01

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.058823529411764705
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.0
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.8
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5921052631578947
class1 recall: 0.10679611650485436
class2 recall: 0.7734806629834254
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6052631578947368
class1 recall: 0.0970873786407767
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.89 47.78 48.01

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.029411764705882353
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.029411764705882353
class2 recall: 0.85





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.058823529411764705
class2 recall: 0.6166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.85





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.1262135922330097
class2 recall: 0.6408839779005525
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6052631578947368
class1 recall: 0.10679611650485436
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.98 48.24 49.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.029411764705882353
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.029411764705882353
class2 recall: 0.85





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.6666666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.1650485436893204
class2 recall: 0.6795580110497238
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.618421052631579
class1 recall: 0.0970873786407767
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.98 48.24 49.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.029411764705882353
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.6333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.22330097087378642
class2 recall: 0.6685082872928176
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.631578947368421
class1 recall: 0.11650485436893204
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 47.98 48.24 49.33

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.2647058823529412
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.058823529411764705
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5131578947368421
class1 recall: 0.4368932038834951
class2 recall: 0.7071823204419889
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.11650485436893204
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 49.18 45.47 50.60

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.32
class1 recall: 0.058823529411764705
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.08823529411764706
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.34615384615384615
class1 recall: 0.08823529411764706
class2 recall: 0.95
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.08823529411764706
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.39473684210526316
class1 recall: 0.18446601941747573
class2 recall: 0.850828729281768
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6578947368421053
class1 recall: 0.13592233009708737
class2 recall: 0.7513812154696132


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 50.16 45.89 51.51

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.32
class1 recall: 0.0
class2 recall: 0.9666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.11764705882352941
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.2692307692307692
class1 recall: 0.029411764705882353
class2 recall: 0.9833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.08823529411764706
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.2631578947368421
class1 recall: 0.05825242718446602
class2 recall: 0.9281767955801105
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.14563106796116504
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 46.78 47.82 48.64
INFO:__main__:At EMA Best val, validation/test/train 50.59 45.33 52.64

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.72
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.11764705882352941
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.029411764705882353
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.1553398058252427
class2 recall: 0.7182320441988951
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.13592233009708737
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 51.92 44.35 52.31
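
The RAW figures reported above can be cross-checked against the per-class recalls printed earlier in this epoch. The following is a minimal sketch, assuming (as the logged numbers suggest, but not confirmed from the `calculate_balanced_accuracy` source) that balanced accuracy is the unweighted mean of the three class recalls, expressed as a percentage. The helper name `balanced_accuracy_pct` is hypothetical, and the reading that the first recall print of each evaluation pass belongs to the RAW model is likewise inferred from the numbers.

```python
def balanced_accuracy_pct(recalls):
    """Unweighted mean of per-class recalls, as a percentage (assumed definition)."""
    return 100.0 * sum(recalls) / len(recalls)


# Per-class recalls copied from the log for this epoch: the first print of each
# evaluation pass, which appears to correspond to the RAW (non-EMA) model.
val_recalls = [0.72, 0.058823529411764705, 0.7833333333333333]
test_recalls = [0.6153846153846154, 0.029411764705882353, 0.75]
train_recalls = [0.6973684210526315, 0.1553398058252427, 0.7182320441988951]

for name, recalls in [
    ("validation", val_recalls),
    ("test", test_recalls),
    ("train", train_recalls),
]:
    print(f"{name}: {balanced_accuracy_pct(recalls):.2f}")
```

Run as written, this prints 52.07, 46.49 and 52.36, which agrees with the `At RAW Best val` line. The class 1 recall stays far below the other two classes, which is what holds the balanced accuracy near 50% despite class 2 recall above 0.7.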

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.17647058823529413
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.08823529411764706
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.17647058823529413
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.029411764705882353
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5921052631578947
class1 recall: 0.2621359223300971
class2 recall: 0.7458563535911602
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.1553398058252427
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 51.92 44.35 52.31

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.24
class1 recall: 0.058823529411764705
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.08823529411764706
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.08823529411764706
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.42105263157894735
class1 recall: 0.17475728155339806
class2 recall: 0.8397790055248618
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.1553398058252427
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 51.92 44.35 52.31

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.0
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.08823529411764706
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.029411764705882353
class2 recall: 0.95
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.058823529411764705
class2 recall: 0.75



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5789473684210527
class1 recall: 0.0970873786407767
class2 recall: 0.8342541436464088
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.1650485436893204
class2 recall: 0.7569060773480663


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 51.92 44.35 52.31

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.72
class1 recall: 0.0
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.11764705882352941
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.0
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8157894736842105
class1 recall: 0.10679611650485436
class2 recall: 0.7071823204419889
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.17475728155339806
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 51.92 44.35 52.31

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.0
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.11764705882352941
class2 recall: 0.7833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.0
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.04854368932038835
class2 recall: 0.7900552486187845
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.17475728155339806
class2 recall: 0.7513812154696132



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 51.92 44.35 52.31

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.11764705882352941
class2 recall: 0.7666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.08823529411764706
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.13592233009708737
class2 recall: 0.8176795580110497
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.17475728155339806
class2 recall: 0.7458563535911602



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.058823529411764705
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.029411764705882353
class2 recall: 0.7666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.058823529411764705
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.85




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5657894736842105
class1 recall: 0.1650485436893204
class2 recall: 0.856353591160221
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.17475728155339806
class2 recall: 0.7458563535911602



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.029411764705882353
class2 recall: 0.8833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.029411764705882353
class2 recall: 0.7666666666666667



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.029411764705882353
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.08823529411764706
class2 recall: 0.8333333333333334




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5789473684210527
class1 recall: 0.13592233009708737
class2 recall: 0.8453038674033149
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.1941747572815534
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.058823529411764705
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.11764705882352941
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.8666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5789473684210527
class1 recall: 0.18446601941747573
class2 recall: 0.8232044198895028
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.22330097087378642
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.72
class1 recall: 0.20588235294117646
class2 recall: 0.5666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333



Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.17647058823529413
class2 recall: 0.5333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.8833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:57,  2.02it/s][A
  2%|▏         | 7/360 [00:00<00:23, 14.84it/s][A
  4%|▍         | 15/360 [00:00<00:11, 29.92it/s][A
  7%|▋         | 24/360 [00:00<00:07, 43.16it/s][A
  9%|▊         | 31/360 [00:00<00:06, 49.26it/s][A
 11%|█         | 38/360 [00:01<00:05, 53.76it/s][A
 12%|█▎        | 45/360 [00:01<00:05, 56.26it/s][A
 14%|█▍        | 52/360 [00:01<00:05, 57.24it/s][A
 17%|█▋        | 60/360 [00:01<00:04, 62.57it/s][A
 19%|█▉        | 68/360 [00:01<00:04, 64.65it/s][A
 21%|██        | 75/360 [00:01<00:04, 65.02it/s][A
 23%|██▎       | 83/360 [00:01<00:04, 68.56it/s][A
 25%|██▌       | 91/360 [00:01<00:03, 67.85it/s][A
 27%|██▋       | 98/360 [00:01<00:04, 65.43it/s][A
 29%|██▉       | 106/360 [00:02<00:03, 67.35it/s][A
 31%|███▏      | 113/360 [00:02<00:03, 67.02it/s][A
 34%|███▎      | 121/360 [00:02<00:03, 70.31it/s][A
 36%|███▌      | 129/360 [00:02<00:03, 68.69it/s][A
 39%|███▊      | 139/360 [00:02<00:0

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7894736842105263
class1 recall: 0.27184466019417475
class2 recall: 0.5580110497237569
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.22330097087378642
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.10it/s][A
  5%|▌         | 6/119 [00:00<00:08, 13.00it/s][A
 11%|█         | 13/119 [00:00<00:03, 26.56it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 37.67it/s][A
 24%|██▍       | 29/119 [00:00<00:01, 50.36it/s][A
 31%|███       | 37/119 [00:01<00:01, 57.73it/s][A
 38%|███▊      | 45/119 [00:01<00:01, 62.42it/s][A
 45%|████▍     | 53/119 [00:01<00:00, 67.03it/s][A
 52%|█████▏    | 62/119 [00:01<00:00, 72.29it/s][A
 59%|█████▉    | 70/119 [00:01<00:00, 74.03it/s][A
 66%|██████▋   | 79/119 [00:01<00:00, 76.51it/s][A
 73%|███████▎  | 87/119 [00:01<00:00, 76.63it/s][A
 80%|███████▉  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.0
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333



  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:56,  2.10it/s][A
  6%|▌         | 7/120 [00:00<00:07, 15.43it/s][A
 12%|█▏        | 14/120 [00:00<00:03, 28.82it/s][A
 18%|█▊        | 22/120 [00:00<00:02, 42.10it/s][A
 26%|██▌       | 31/120 [00:00<00:01, 53.67it/s][A
 32%|███▏      | 38/120 [00:00<00:01, 56.59it/s][A
 38%|███▊      | 46/120 [00:01<00:01, 60.11it/s][A
 44%|████▍     | 53/120 [00:01<00:01, 61.52it/s][A
 50%|█████     | 60/120 [00:01<00:00, 61.94it/s][A
 56%|█████▌    | 67/120 [00:01<00:00, 63.24it/s][A
 62%|██████▎   | 75/120 [00:01<00:00, 67.18it/s][A
 69%|██████▉   | 83/120 [00:01<00:00, 69.09it/s][A
 76%|███████▌  | 91/120 [00:01<00:00, 70.35it/s][A
 82%|████████▎ | 99/120 [00:01<00:00, 71.36it/s][A
 89%|████████▉ | 107/120 [00:01<00:00, 71.69it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.40it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.029411764705882353
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.8833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:49,  2.11it/s][A
  2%|▏         | 8/360 [00:00<00:20, 17.52it/s][A
  5%|▍         | 17/360 [00:00<00:10, 33.82it/s][A
  7%|▋         | 25/360 [00:00<00:07, 43.94it/s][A
  9%|▉         | 32/360 [00:00<00:06, 49.06it/s][A
 11%|█         | 40/360 [00:01<00:05, 56.24it/s][A
 13%|█▎        | 47/360 [00:01<00:05, 58.58it/s][A
 16%|█▌        | 56/360 [00:01<00:04, 66.70it/s][A
 18%|█▊        | 65/360 [00:01<00:04, 71.45it/s][A
 20%|██        | 73/360 [00:01<00:04, 70.38it/s][A
 22%|██▎       | 81/360 [00:01<00:03, 71.95it/s][A
 25%|██▍       | 89/360 [00:01<00:04, 67.41it/s][A
 27%|██▋       | 97/360 [00:01<00:03, 68.00it/s][A
 29%|██▉       | 104/360 [00:01<00:03, 66.04it/s][A
 31%|███       | 112/360 [00:02<00:03, 68.89it/s][A
 33%|███▎      | 120/360 [00:02<00:03, 69.38it/s][A
 36%|███▌      | 128/360 [00:02<00:03, 68.97it/s][A
 38%|███▊      | 136/360 [00:02<00:03, 70.42it/s][A
 40%|████      | 144/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.10679611650485436
class2 recall: 0.8121546961325967
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.22330097087378642
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.09it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.87it/s][A
 11%|█         | 13/119 [00:00<00:04, 26.02it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 36.84it/s][A
 23%|██▎       | 27/119 [00:00<00:02, 44.64it/s][A
 29%|██▉       | 35/119 [00:01<00:01, 53.38it/s][A
 36%|███▌      | 43/119 [00:01<00:01, 59.51it/s][A
 42%|████▏     | 50/119 [00:01<00:01, 62.23it/s][A
 48%|████▊     | 57/119 [00:01<00:00, 62.25it/s][A
 55%|█████▍    | 65/119 [00:01<00:00, 65.82it/s][A
 61%|██████▏   | 73/119 [00:01<00:00, 67.80it/s][A
 68%|██████▊   | 81/119 [00:01<00:00, 69.79it/s][A
 76%|███████▌  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.0
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333



  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.96it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.64it/s][A
 12%|█▎        | 15/120 [00:00<00:03, 28.53it/s][A
 18%|█▊        | 22/120 [00:00<00:02, 38.42it/s][A
 26%|██▌       | 31/120 [00:00<00:01, 50.36it/s][A
 32%|███▎      | 39/120 [00:01<00:01, 56.38it/s][A
 38%|███▊      | 46/120 [00:01<00:01, 60.03it/s][A
 45%|████▌     | 54/120 [00:01<00:01, 64.99it/s][A
 52%|█████▏    | 62/120 [00:01<00:00, 66.74it/s][A
 58%|█████▊    | 70/120 [00:01<00:00, 69.61it/s][A
 65%|██████▌   | 78/120 [00:01<00:00, 72.34it/s][A
 72%|███████▏  | 86/120 [00:01<00:00, 73.66it/s][A
 78%|███████▊  | 94/120 [00:01<00:00, 72.54it/s][A
 85%|████████▌ | 102/120 [00:01<00:00, 73.81it/s][A
 92%|█████████▎| 111/120 [00:02<00:00, 76.37it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.88it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.029411764705882353
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.11764705882352941
class2 recall: 0.9




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:52,  2.08it/s][A
  2%|▏         | 7/360 [00:00<00:23, 15.31it/s][A
  4%|▍         | 15/360 [00:00<00:11, 31.03it/s][A
  6%|▋         | 23/360 [00:00<00:07, 43.45it/s][A
  8%|▊         | 30/360 [00:00<00:06, 50.58it/s][A
 11%|█         | 38/360 [00:00<00:05, 58.62it/s][A
 13%|█▎        | 46/360 [00:01<00:04, 64.26it/s][A
 15%|█▌        | 54/360 [00:01<00:04, 67.55it/s][A
 17%|█▋        | 62/360 [00:01<00:04, 70.80it/s][A
 19%|█▉        | 70/360 [00:01<00:04, 72.12it/s][A
 22%|██▏       | 78/360 [00:01<00:03, 73.42it/s][A
 24%|██▍       | 86/360 [00:01<00:03, 69.45it/s][A
 26%|██▌       | 94/360 [00:01<00:03, 67.67it/s][A
 28%|██▊       | 101/360 [00:01<00:03, 65.39it/s][A
 30%|███       | 108/360 [00:01<00:03, 66.15it/s][A
 32%|███▏      | 116/360 [00:02<00:03, 69.47it/s][A
 35%|███▍      | 125/360 [00:02<00:03, 73.28it/s][A
 37%|███▋      | 133/360 [00:02<00:03, 74.23it/s][A
 39%|███▉      | 141/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.1262135922330097
class2 recall: 0.8342541436464088
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.23300970873786409
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:54,  2.18it/s][A
  4%|▍         | 5/119 [00:00<00:10, 11.04it/s][A
  9%|▉         | 11/119 [00:00<00:04, 23.18it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 39.74it/s][A
 24%|██▎       | 28/119 [00:00<00:01, 49.72it/s][A
 30%|███       | 36/119 [00:00<00:01, 57.27it/s][A
 37%|███▋      | 44/119 [00:01<00:01, 63.18it/s][A
 44%|████▎     | 52/119 [00:01<00:01, 63.04it/s][A
 50%|█████     | 60/119 [00:01<00:00, 67.66it/s][A
 57%|█████▋    | 68/119 [00:01<00:00, 68.61it/s][A
 64%|██████▍   | 76/119 [00:01<00:00, 70.00it/s][A
 71%|███████   | 84/119 [00:01<00:00, 69.80it/s][A
 77%|███████▋  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.0
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.8




  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:59,  2.00it/s][A
  6%|▌         | 7/120 [00:00<00:07, 14.78it/s][A
 12%|█▎        | 15/120 [00:00<00:03, 29.88it/s][A
 19%|█▉        | 23/120 [00:00<00:02, 42.27it/s][A
 25%|██▌       | 30/120 [00:00<00:01, 49.06it/s][A
 32%|███▏      | 38/120 [00:01<00:01, 56.43it/s][A
 38%|███▊      | 46/120 [00:01<00:01, 60.50it/s][A
 44%|████▍     | 53/120 [00:01<00:01, 60.97it/s][A
 51%|█████     | 61/120 [00:01<00:00, 64.26it/s][A
 57%|█████▊    | 69/120 [00:01<00:00, 68.03it/s][A
 64%|██████▍   | 77/120 [00:01<00:00, 70.00it/s][A
 71%|███████   | 85/120 [00:01<00:00, 69.53it/s][A
 78%|███████▊  | 93/120 [00:01<00:00, 66.90it/s][A
 84%|████████▍ | 101/120 [00:01<00:00, 69.01it/s][A
 91%|█████████ | 109/120 [00:02<00:00, 70.89it/s][A
100%|██████████| 120/120 [00:02<00:00, 52.85it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.08823529411764706
class2 recall: 0.85
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.11764705882352941
class2 recall: 0.8833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:50,  2.11it/s][A
  2%|▏         | 7/360 [00:00<00:22, 15.38it/s][A
  4%|▍         | 14/360 [00:00<00:12, 28.73it/s][A
  6%|▌         | 22/360 [00:00<00:08, 42.04it/s][A
  8%|▊         | 30/360 [00:00<00:06, 49.95it/s][A
 11%|█         | 39/360 [00:01<00:05, 59.59it/s][A
 13%|█▎        | 48/360 [00:01<00:04, 66.17it/s][A
 16%|█▌        | 56/360 [00:01<00:04, 67.70it/s][A
 18%|█▊        | 65/360 [00:01<00:04, 72.61it/s][A
 20%|██        | 73/360 [00:01<00:03, 71.86it/s][A
 23%|██▎       | 82/360 [00:01<00:03, 75.35it/s][A
 25%|██▌       | 90/360 [00:01<00:03, 69.74it/s][A
 27%|██▋       | 98/360 [00:01<00:03, 68.48it/s][A
 30%|██▉       | 107/360 [00:01<00:03, 70.92it/s][A
 32%|███▏      | 115/360 [00:02<00:03, 70.99it/s][A
 34%|███▍      | 123/360 [00:02<00:03, 71.95it/s][A
 36%|███▋      | 131/360 [00:02<00:03, 71.30it/s][A
 39%|███▊      | 139/360 [00:02<00:03, 72.34it/s][A
 41%|████      | 147/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.1941747572815534
class2 recall: 0.7734806629834254
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.23300970873786409
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.10it/s][A
  6%|▌         | 7/119 [00:00<00:07, 14.86it/s][A
 11%|█         | 13/119 [00:00<00:04, 25.64it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 36.29it/s][A
 24%|██▍       | 29/119 [00:00<00:01, 48.22it/s][A
 30%|███       | 36/119 [00:01<00:01, 53.14it/s][A
 36%|███▌      | 43/119 [00:01<00:01, 56.27it/s][A
 43%|████▎     | 51/119 [00:01<00:01, 59.62it/s][A
 49%|████▊     | 58/119 [00:01<00:01, 59.70it/s][A
 56%|█████▋    | 67/119 [00:01<00:00, 64.45it/s][A
 62%|██████▏   | 74/119 [00:01<00:00, 65.53it/s][A
 68%|██████▊   | 81/119 [00:01<00:00, 66.05it/s][A
 76%|███████▌  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.0
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667



  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.96it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.58it/s][A
 14%|█▍        | 17/120 [00:00<00:03, 33.09it/s][A
 20%|██        | 24/120 [00:00<00:02, 42.05it/s][A
 27%|██▋       | 32/120 [00:00<00:01, 51.68it/s][A
 32%|███▎      | 39/120 [00:01<00:01, 56.44it/s][A
 39%|███▉      | 47/120 [00:01<00:01, 61.70it/s][A
 46%|████▌     | 55/120 [00:01<00:01, 63.76it/s][A
 52%|█████▏    | 62/120 [00:01<00:00, 63.76it/s][A
 57%|█████▊    | 69/120 [00:01<00:00, 65.19it/s][A
 63%|██████▎   | 76/120 [00:01<00:00, 66.15it/s][A
 69%|██████▉   | 83/120 [00:01<00:00, 65.15it/s][A
 75%|███████▌  | 90/120 [00:01<00:00, 65.71it/s][A
 81%|████████  | 97/120 [00:01<00:00, 64.54it/s][A
 88%|████████▊ | 106/120 [00:02<00:00, 69.62it/s][A
100%|██████████| 120/120 [00:02<00:00, 52.35it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.0
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.11764705882352941
class2 recall: 0.8833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<03:05,  1.94it/s][A
  2%|▏         | 7/360 [00:00<00:24, 14.37it/s][A
  4%|▍         | 16/360 [00:00<00:10, 31.46it/s][A
  7%|▋         | 24/360 [00:00<00:08, 41.96it/s][A
  9%|▊         | 31/360 [00:00<00:07, 46.48it/s][A
 11%|█         | 39/360 [00:01<00:05, 54.19it/s][A
 13%|█▎        | 47/360 [00:01<00:05, 59.56it/s][A
 16%|█▌        | 56/360 [00:01<00:04, 66.45it/s][A
 18%|█▊        | 64/360 [00:01<00:04, 69.24it/s][A
 20%|██        | 73/360 [00:01<00:03, 72.29it/s][A
 22%|██▎       | 81/360 [00:01<00:03, 73.96it/s][A
 25%|██▍       | 89/360 [00:01<00:03, 72.96it/s][A
 27%|██▋       | 97/360 [00:01<00:03, 73.08it/s][A
 29%|██▉       | 105/360 [00:01<00:03, 72.23it/s][A
 31%|███▏      | 113/360 [00:02<00:03, 73.79it/s][A
 34%|███▍      | 122/360 [00:02<00:03, 77.04it/s][A
 36%|███▋      | 131/360 [00:02<00:02, 78.75it/s][A
 39%|███▉      | 140/360 [00:02<00:02, 80.91it/s][A
 41%|████▏     | 149/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.10679611650485436
class2 recall: 0.8066298342541437
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.24271844660194175
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:57,  2.05it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.78it/s][A
 12%|█▏        | 14/119 [00:00<00:03, 28.81it/s][A
 18%|█▊        | 21/119 [00:00<00:02, 38.55it/s][A
 24%|██▎       | 28/119 [00:00<00:01, 46.61it/s][A
 30%|███       | 36/119 [00:01<00:01, 55.28it/s][A
 38%|███▊      | 45/119 [00:01<00:01, 63.58it/s][A
 45%|████▍     | 53/119 [00:01<00:00, 66.22it/s][A
 52%|█████▏    | 62/119 [00:01<00:00, 71.46it/s][A
 60%|█████▉    | 71/119 [00:01<00:00, 75.19it/s][A
 67%|██████▋   | 80/119 [00:01<00:00, 77.77it/s][A
 75%|███████▍  | 89/119 [00:01<00:00, 78.86it/s][A
 82%|████████▏ | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.0
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.029411764705882353
class2 recall: 0.7666666666666667




  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:59,  2.00it/s][A
  8%|▊         | 9/120 [00:00<00:05, 19.29it/s][A
 15%|█▌        | 18/120 [00:00<00:02, 35.41it/s][A
 22%|██▏       | 26/120 [00:00<00:02, 46.09it/s][A
 28%|██▊       | 34/120 [00:00<00:01, 54.04it/s][A
 35%|███▌      | 42/120 [00:01<00:01, 57.88it/s][A
 42%|████▎     | 51/120 [00:01<00:01, 64.85it/s][A
 50%|█████     | 60/120 [00:01<00:00, 70.69it/s][A
 57%|█████▋    | 68/120 [00:01<00:00, 72.81it/s][A
 63%|██████▎   | 76/120 [00:01<00:00, 73.00it/s][A
 70%|███████   | 84/120 [00:01<00:00, 73.02it/s][A
 77%|███████▋  | 92/120 [00:01<00:00, 71.26it/s][A
 83%|████████▎ | 100/120 [00:01<00:00, 72.87it/s][A
 90%|█████████ | 108/120 [00:01<00:00, 74.85it/s][A
100%|██████████| 120/120 [00:02<00:00, 55.53it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.029411764705882353
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.11764705882352941
class2 recall: 0.8833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:55,  2.05it/s][A
  2%|▏         | 8/360 [00:00<00:20, 17.17it/s][A
  5%|▍         | 17/360 [00:00<00:10, 34.23it/s][A
  7%|▋         | 24/360 [00:00<00:07, 43.25it/s][A
  9%|▉         | 33/360 [00:00<00:05, 54.77it/s][A
 12%|█▏        | 42/360 [00:01<00:04, 63.77it/s][A
 14%|█▍        | 50/360 [00:01<00:04, 66.13it/s][A
 16%|█▋        | 59/360 [00:01<00:04, 70.83it/s][A
 19%|█▉        | 68/360 [00:01<00:03, 74.45it/s][A
 21%|██        | 76/360 [00:01<00:03, 74.63it/s][A
 24%|██▎       | 85/360 [00:01<00:03, 76.52it/s][A
 26%|██▌       | 93/360 [00:01<00:03, 74.47it/s][A
 28%|██▊       | 101/360 [00:01<00:03, 70.06it/s][A
 30%|███       | 109/360 [00:01<00:03, 71.93it/s][A
 32%|███▎      | 117/360 [00:02<00:03, 72.86it/s][A
 35%|███▌      | 126/360 [00:02<00:03, 75.23it/s][A
 38%|███▊      | 135/360 [00:02<00:02, 78.61it/s][A
 40%|███▉      | 143/360 [00:02<00:02, 78.33it/s][A
 42%|████▏     | 151/360 [00:02<00

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.10679611650485436
class2 recall: 0.7845303867403315
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.2621359223300971
class2 recall: 0.7679558011049724


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:55,  2.11it/s][A
  5%|▌         | 6/119 [00:00<00:08, 13.12it/s][A
 10%|█         | 12/119 [00:00<00:04, 24.67it/s][A
 16%|█▌        | 19/119 [00:00<00:02, 35.46it/s][A
 22%|██▏       | 26/119 [00:00<00:02, 44.57it/s][A
 28%|██▊       | 33/119 [00:00<00:01, 51.20it/s][A
 34%|███▎      | 40/119 [00:01<00:01, 55.18it/s][A
 41%|████      | 49/119 [00:01<00:01, 62.88it/s][A
 48%|████▊     | 57/119 [00:01<00:00, 66.00it/s][A
 55%|█████▍    | 65/119 [00:01<00:00, 69.81it/s][A
 61%|██████▏   | 73/119 [00:01<00:00, 68.14it/s][A
 68%|██████▊   | 81/119 [00:01<00:00, 69.65it/s][A
 75%|███████▍  | 8

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.84
class1 recall: 0.08823529411764706
class2 recall: 0.6333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667



  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:59,  1.99it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.54it/s][A
 13%|█▎        | 16/120 [00:00<00:03, 30.23it/s][A
 20%|██        | 24/120 [00:00<00:02, 41.54it/s][A
 27%|██▋       | 32/120 [00:00<00:01, 51.27it/s][A
 33%|███▎      | 40/120 [00:01<00:01, 58.38it/s][A
 40%|████      | 48/120 [00:01<00:01, 63.86it/s][A
 47%|████▋     | 56/120 [00:01<00:00, 67.87it/s][A
 53%|█████▎    | 64/120 [00:01<00:00, 71.29it/s][A
 60%|██████    | 72/120 [00:01<00:00, 68.21it/s][A
 68%|██████▊   | 81/120 [00:01<00:00, 72.08it/s][A
 74%|███████▍  | 89/120 [00:01<00:00, 71.67it/s][A
 81%|████████  | 97/120 [00:01<00:00, 71.11it/s][A
 88%|████████▊ | 106/120 [00:01<00:00, 74.68it/s][A
100%|██████████| 120/120 [00:02<00:00, 54.10it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.058823529411764705
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.17647058823529413
class2 recall: 0.8666666666666667



  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:57,  2.02it/s][A
  2%|▏         | 7/360 [00:00<00:23, 14.75it/s][A
  4%|▍         | 16/360 [00:00<00:10, 32.28it/s][A
  7%|▋         | 25/360 [00:00<00:07, 45.68it/s][A
  9%|▉         | 33/360 [00:00<00:06, 54.23it/s][A
 11%|█▏        | 41/360 [00:01<00:05, 59.50it/s][A
 14%|█▎        | 49/360 [00:01<00:04, 64.44it/s][A
 16%|█▌        | 58/360 [00:01<00:04, 69.81it/s][A
 18%|█▊        | 66/360 [00:01<00:04, 70.67it/s][A
 21%|██        | 74/360 [00:01<00:04, 69.05it/s][A
 23%|██▎       | 83/360 [00:01<00:03, 72.51it/s][A
 25%|██▌       | 91/360 [00:01<00:03, 70.42it/s][A
 28%|██▊       | 99/360 [00:01<00:03, 66.54it/s][A
 29%|██▉       | 106/360 [00:01<00:03, 66.47it/s][A
 31%|███▏      | 113/360 [00:02<00:03, 65.47it/s][A
 34%|███▎      | 121/360 [00:02<00:03, 67.69it/s][A
 36%|███▌      | 129/360 [00:02<00:03, 70.13it/s][A
 38%|███▊      | 137/360 [00:02<00:03, 70.33it/s][A
 40%|████      | 145/360 [00:02<00:0

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8157894736842105
class1 recall: 0.23300970873786409
class2 recall: 0.6795580110497238
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.2815533980582524
class2 recall: 0.7679558011049724


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  self.pid = os.fork()
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.08it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.80it/s][A
 11%|█         | 13/119 [00:00<00:04, 26.37it/s][A
 18%|█▊        | 21/119 [00:00<00:02, 39.62it/s][A
 24%|██▍       | 29/119 [00:00<00:01, 49.92it/s][A
 31%|███       | 37/119 [00:01<00:01, 57.73it/s][A
 38%|███▊      | 45/119 [00:01<00:01, 62.95it/s][A
 45%|████▍     | 53/119 [00:01<00:00, 66.40it/s][A
 52%|█████▏    | 62/119 [00:01<00:00, 71.55it/s][A
 59%|█████▉    | 70/119 [00:01<00:00, 69.48it/s][A
 66%|██████▌   | 78/119 [00:01<00:00, 71.13it/s][A
 73%|███████▎  | 87/119 [00:01<00:00, 74.25it/s

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.029411764705882353
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.11764705882352941
class2 recall: 0.7666666666666667




  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.96it/s][A
  8%|▊         | 9/120 [00:00<00:06, 18.48it/s][A
 13%|█▎        | 16/120 [00:00<00:03, 29.99it/s][A
 20%|██        | 24/120 [00:00<00:02, 41.68it/s][A
 27%|██▋       | 32/120 [00:00<00:01, 51.04it/s][A
 33%|███▎      | 40/120 [00:01<00:01, 57.76it/s][A
 40%|████      | 48/120 [00:01<00:01, 62.44it/s][A
 47%|████▋     | 56/120 [00:01<00:00, 66.95it/s][A
 53%|█████▎    | 64/120 [00:01<00:00, 69.86it/s][A
 60%|██████    | 72/120 [00:01<00:00, 71.66it/s][A
 67%|██████▋   | 80/120 [00:01<00:00, 73.04it/s][A
 73%|███████▎  | 88/120 [00:01<00:00, 72.18it/s][A
 80%|████████  | 96/120 [00:01<00:00, 70.33it/s][A
 87%|████████▋ | 104/120 [00:01<00:00, 69.47it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.93it/s]


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.029411764705882353
class2 recall: 0.8833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667



  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:58,  2.01it/s][A
  2%|▏         | 8/360 [00:00<00:20, 16.81it/s][A
  4%|▍         | 16/360 [00:00<00:10, 31.48it/s][A
  7%|▋         | 25/360 [00:00<00:07, 44.82it/s][A
  9%|▉         | 33/360 [00:00<00:06, 52.80it/s][A
 12%|█▏        | 42/360 [00:01<00:05, 61.15it/s][A
 14%|█▍        | 50/360 [00:01<00:04, 63.63it/s][A
 16%|█▌        | 58/360 [00:01<00:04, 67.86it/s][A
 18%|█▊        | 66/360 [00:01<00:04, 69.47it/s][A
 21%|██        | 74/360 [00:01<00:04, 69.12it/s][A
 23%|██▎       | 82/360 [00:01<00:03, 72.07it/s][A
 25%|██▌       | 90/360 [00:01<00:03, 72.58it/s][A
 27%|██▋       | 98/360 [00:01<00:03, 70.01it/s][A
 30%|██▉       | 107/360 [00:01<00:03, 71.66it/s][A
 32%|███▏      | 116/360 [00:02<00:03, 74.87it/s][A
 35%|███▍      | 125/360 [00:02<00:03, 77.30it/s][A
 37%|███▋      | 133/360 [00:02<00:02, 76.33it/s][A
 39%|███▉      | 142/360 [00:02<00:02, 77.87it/s][A
 42%|████▏     | 150/360 [00:02<00:0

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.14563106796116504
class2 recall: 0.8121546961325967
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.3106796116504854
class2 recall: 0.7734806629834254


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:57,  2.05it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.57it/s][A
 10%|█         | 12/119 [00:00<00:04, 23.81it/s][A
 16%|█▌        | 19/119 [00:00<00:02, 35.14it/s][A
 22%|██▏       | 26/119 [00:00<00:02, 43.91it/s][A
 28%|██▊       | 33/119 [00:01<00:01, 50.76it/s][A
 34%|███▍      | 41/119 [00:01<00:01, 57.27it/s][A
 40%|████      | 48/119 [00:01<00:01, 59.10it/s][A
 46%|████▌     | 55/119 [00:01<00:01, 60.80it/s][A
 54%|█████▍    | 64/119 [00:01<00:00, 67.23it/s][A
 61%|██████    | 72/119 [00:01<00:00, 68.00it/s][A
 67%|██████▋   | 80/119 [00:01<00:00, 69.35it/s][A
 74%|███████▍  | 8

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.08823529411764706
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.08823529411764706
class2 recall: 0.7666666666666667



  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:02,  1.90it/s][A
  8%|▊         | 9/120 [00:00<00:06, 18.36it/s][A
 15%|█▌        | 18/120 [00:00<00:02, 34.17it/s][A
 22%|██▎       | 27/120 [00:00<00:01, 46.85it/s][A
 29%|██▉       | 35/120 [00:00<00:01, 53.92it/s][A
 36%|███▌      | 43/120 [00:01<00:01, 58.30it/s][A
 42%|████▎     | 51/120 [00:01<00:01, 62.00it/s][A
 49%|████▉     | 59/120 [00:01<00:00, 66.67it/s][A
 56%|█████▌    | 67/120 [00:01<00:00, 66.85it/s][A
 62%|██████▎   | 75/120 [00:01<00:00, 69.87it/s][A
 70%|███████   | 84/120 [00:01<00:00, 72.73it/s][A
 77%|███████▋  | 92/120 [00:01<00:00, 71.98it/s][A
 83%|████████▎ | 100/120 [00:01<00:00, 73.40it/s][A
 91%|█████████ | 109/120 [00:01<00:00, 76.68it/s][A
100%|██████████| 120/120 [00:02<00:00, 54.68it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.08823529411764706
class2 recall: 0.8833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:49,  2.12it/s][A
  2%|▏         | 7/360 [00:00<00:23, 14.99it/s][A
  4%|▍         | 15/360 [00:00<00:11, 30.66it/s][A
  7%|▋         | 24/360 [00:00<00:07, 44.52it/s][A
  9%|▉         | 32/360 [00:00<00:06, 53.21it/s][A
 11%|█▏        | 41/360 [00:01<00:05, 62.74it/s][A
 14%|█▎        | 49/360 [00:01<00:04, 65.86it/s][A
 16%|█▌        | 57/360 [00:01<00:04, 69.74it/s][A
 18%|█▊        | 65/360 [00:01<00:04, 71.67it/s][A
 20%|██        | 73/360 [00:01<00:04, 70.96it/s][A
 22%|██▎       | 81/360 [00:01<00:03, 72.98it/s][A
 25%|██▍       | 89/360 [00:01<00:03, 68.49it/s][A
 27%|██▋       | 97/360 [00:01<00:03, 70.35it/s][A
 29%|██▉       | 105/360 [00:01<00:03, 70.80it/s][A
 31%|███▏      | 113/360 [00:01<00:03, 72.24it/s][A
 34%|███▎      | 121/360 [00:02<00:03, 72.10it/s][A
 36%|███▌      | 129/360 [00:02<00:03, 74.13it/s][A
 38%|███▊      | 138/360 [00:02<00:02, 75.87it/s][A
 41%|████      | 146/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.631578947368421
class1 recall: 0.2815533980582524
class2 recall: 0.8674033149171271
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.3300970873786408
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 52.07 46.49 52.36
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.6764705882352942
class2 recall: 0.4166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.6176470588235294
class2 recall: 0.43333333333333335
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.6601941747572816
class2 recall: 0.4861878453038674
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.32038834951456313
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.20588235294117646
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.11764705882352941
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.44660194174757284
class2 recall: 0.7624309392265194
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.33980582524271846
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4
class1 recall: 0.6470588235294118
class2 recall: 0.45
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.14705882352941177
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.34615384615384615
class1 recall: 0.7941176470588235
class2 recall: 0.43333333333333335
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.631578947368421
class1 recall: 0.7281553398058253
class2 recall: 0.46408839779005523
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.33980582524271846
class2 recall: 0.7513812154696132



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.76
class1 recall: 0.058823529411764705
class2 recall: 0.6666666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.14705882352941177
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.08823529411764706
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.17647058823529413
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8026315789473685
class1 recall: 0.24271844660194175
class2 recall: 0.6629834254143646
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.33980582524271846
class2 recall: 0.7403314917127072



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.84
class1 recall: 0.029411764705882353
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.029411764705882353
class2 recall: 0.85
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.20588235294117646
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7894736842105263
class1 recall: 0.1650485436893204
class2 recall: 0.7237569060773481
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.3592233009708738
class2 recall: 0.7348066298342542



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 52.14 50.68 53.93

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.11764705882352941
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.17647058823529413
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.11764705882352941
class2 recall: 0.85
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.17647058823529413
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.30097087378640774
class2 recall: 0.7513812154696132
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.3300970873786408
class2 recall: 0.7403314917127072



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 52.22 56.18 60.68

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.84
class1 recall: 0.058823529411764705
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7692307692307693
class1 recall: 0.11764705882352941
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.17647058823529413
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.20388349514563106
class2 recall: 0.6077348066298343
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.33980582524271846
class2 recall: 0.7513812154696132


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.35294117647058826
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.47058823529411764
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.17647058823529413
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.5631067961165048
class2 recall: 0.6629834254143646
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.3106796116504854
class2 recall: 0.7458563535911602



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.17647058823529413
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.20588235294117646
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.14705882352941177
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.618421052631579
class1 recall: 0.33980582524271846
class2 recall: 0.8397790055248618
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.3592233009708738
class2 recall: 0.7458563535911602



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.20588235294117646
class2 recall: 0.75




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.11764705882352941
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.631578947368421
class1 recall: 0.30097087378640774
class2 recall: 0.8397790055248618
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.3592233009708738
class2 recall: 0.7513812154696132



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.0
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.029411764705882353
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.05825242718446602
class2 recall: 0.9226519337016574
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.3592233009708738
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.35294117647058826
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.4411764705882353
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8333333333333334




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.5145631067961165
class2 recall: 0.6574585635359116
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.3786407766990291
class2 recall: 0.7624309392265194



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.058823529411764705
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.058823529411764705
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.14705882352941177
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.21359223300970873
class2 recall: 0.7513812154696132
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.39805825242718446
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.058823529411764705
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.20588235294117646
class2 recall: 0.7666666666666667






Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.029411764705882353
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.11650485436893204
class2 recall: 0.9171270718232044
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4077669902912621
class2 recall: 0.7569060773480663


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.84
class1 recall: 0.029411764705882353
class2 recall: 0.6666666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.20588235294117646
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7692307692307693
class1 recall: 0.029411764705882353
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.868421052631579
class1 recall: 0.18446601941747573
class2 recall: 0.6574585635359116
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.3883495145631068
class2 recall: 0.7513812154696132



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.20 56.18 61.37

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8
class1 recall: 0.14705882352941177
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.20588235294117646
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.23529411764705882
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8157894736842105
class1 recall: 0.36893203883495146
class2 recall: 0.6298342541436464
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.3883495145631068
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.75 55.20 63.18

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.11764705882352941
class2 recall: 0.36666666666666664
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.20588235294117646
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7692307692307693
class1 recall: 0.08823529411764706
class2 recall: 0.45
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.17647058823529413
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9210526315789473
class1 recall: 0.22330097087378642
class2 recall: 0.4696132596685083
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.39805825242718446
class2 recall: 0.7569060773480663



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 53.75 55.20 63.18

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.2647058823529412
class2 recall: 0.6166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.20588235294117646
class2 recall: 0.7666666666666667




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 53.26it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.4411764705882353
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.14705882352941177
class2 recall: 0.8333333333333334




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.5145631067961165
class2 recall: 0.7071823204419889
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.39805825242718446
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.08823529411764706
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.23529411764705882
class2 recall: 0.7333333333333333



  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 55.84it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.11764705882352941
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.20588235294117646
class2 recall: 0.8




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.868421052631579
class1 recall: 0.21359223300970873
class2 recall: 0.6795580110497238
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.39805825242718446
class2 recall: 0.7569060773480663


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.029411764705882353
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.20588235294117646
class2 recall: 0.75




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 54.35it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8076923076923077
class1 recall: 0.029411764705882353
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.2647058823529412
class2 recall: 0.8




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9078947368421053
class1 recall: 0.1650485436893204
class2 recall: 0.6408839779005525
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.3883495145631068
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.2647058823529412
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.20588235294117646
class2 recall: 0.7333333333333333




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 54.63it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.23529411764705882
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.2647058823529412
class2 recall: 0.7833333333333333




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.46601941747572817
class2 recall: 0.8121546961325967
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.39805825242718446
class2 recall: 0.7679558011049724


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.29411764705882354
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.20588235294117646
class2 recall: 0.7166666666666667




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 54.95it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.4117647058823529
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.23529411764705882
class2 recall: 0.8333333333333334




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.4854368932038835
class2 recall: 0.7182320441988951
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4174757281553398
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.14705882352941177
class2 recall: 0.5833333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.20588235294117646
class2 recall: 0.7166666666666667




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 54.78it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.17647058823529413
class2 recall: 0.6166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.23529411764705882
class2 recall: 0.8




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.868421052631579
class1 recall: 0.33980582524271846
class2 recall: 0.6464088397790055
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4077669902912621
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.17647058823529413
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.17647058823529413
class2 recall: 0.7166666666666667




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 53.58it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.14705882352941177
class2 recall: 0.8833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.20588235294117646
class2 recall: 0.8




  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.3106796116504854
class2 recall: 0.861878453038674
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.39805825242718446
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.2647058823529412
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.17647058823529413
class2 recall: 0.7166666666666667




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 52.17it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.34615384615384615
class1 recall: 0.38235294117647056
class2 recall: 0.8
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.23529411764705882
class2 recall: 0.8



  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5921052631578947
class1 recall: 0.5631067961165048
class2 recall: 0.8011049723756906
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4174757281553398
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.35294117647058826
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.23529411764705882
class2 recall: 0.7166666666666667




  self.pid = os.fork()

100%|██████████| 120/120 [00:02<00:00, 55.29it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.3076923076923077
class1 recall: 0.5294117647058824
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.23529411764705882
class2 recall: 0.8



  self.pid = os.fork()


Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5394736842105263
class1 recall: 0.6407766990291263
class2 recall: 0.7182320441988951
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4174757281553398
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 55.10 54.26 60.58
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.14705882352941177
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.2647058823529412
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.058823529411764705
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.23529411764705882
class2 recall: 0.7833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.2524271844660194
class2 recall: 0.7679558011049724
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.42718446601941745
class2 recall: 0.7679558011049724



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.23529411764705882
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.29411764705882354
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.20588235294117646
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.7833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.32038834951456313
class2 recall: 0.8066298342541437
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.4077669902912621
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.17647058823529413
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.29411764705882354
class2 recall: 0.7166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.14705882352941177
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.7833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.631578947368421
class1 recall: 0.30097087378640774
class2 recall: 0.8729281767955801
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.42718446601941745
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.5294117647058824
class2 recall: 0.5833333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.2647058823529412
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.5588235294117647
class2 recall: 0.6833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.618421052631579
class1 recall: 0.6407766990291263
class2 recall: 0.6574585635359116
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.39805825242718446
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.20588235294117646
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.23529411764705882
class2 recall: 0.7333333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.3076923076923077
class1 recall: 0.2647058823529412
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5131578947368421
class1 recall: 0.4563106796116505
class2 recall: 0.8397790055248618
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.3786407766990291
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.76
class1 recall: 0.14705882352941177
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.23529411764705882
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.17647058823529413
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.2912621359223301
class2 recall: 0.7403314917127072
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.4174757281553398
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.20588235294117646
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.23529411764705882
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.23529411764705882
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.42718446601941745
class2 recall: 0.8397790055248618
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7105263157894737
class1 recall: 0.4563106796116505
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.029411764705882353
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.23529411764705882
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7692307692307693
class1 recall: 0.0
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.29411764705882354
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9342105263157895
class1 recall: 0.1262135922330097
class2 recall: 0.712707182320442
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.44660194174757284
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.20588235294117646
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.2647058823529412
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.35294117647058826
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.29411764705882354
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:58,  2.01it/s][A
  2%|▏         | 7/360 [00:00<00:24, 14.63it/s][A
  4%|▍         | 16/360 [00:00<00:10, 32.33it/s][A
  7%|▋         | 25/360 [00:00<00:07, 46.06it/s][A
  9%|▉         | 34/360 [00:00<00:05, 56.25it/s][A
 12%|█▏        | 43/360 [00:01<00:04, 63.85it/s][A
 14%|█▍        | 51/360 [00:01<00:04, 67.50it/s][A
 17%|█▋        | 60/360 [00:01<00:04, 72.56it/s][A
 19%|█▉        | 69/360 [00:01<00:03, 76.03it/s][A
 22%|██▏       | 78/360 [00:01<00:03, 76.60it/s][A
 24%|██▍       | 86/360 [00:01<00:03, 73.61it/s][A
 26%|██▌       | 94/360 [00:01<00:03, 73.60it/s][A
 28%|██▊       | 102/360 [00:01<00:03, 71.24it/s][A
 31%|███       | 110/360 [00:01<00:03, 72.42it/s][A
 33%|███▎      | 119/360 [00:02<00:03, 75.00it/s][A
 36%|███▌      | 128/360 [00:02<00:02, 78.15it/s][A
 38%|███▊      | 138/360 [00:02<00:02, 82.14it/s][A
 41%|████      | 147/360 [00:02<00:02, 80.36it/s][A
 43%|████▎     | 156/360 [00:02<00

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4077669902912621
class2 recall: 0.7734806629834254
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.42718446601941745
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.09it/s][A
  7%|▋         | 8/119 [00:00<00:06, 17.05it/s][A
 14%|█▍        | 17/119 [00:00<00:03, 33.86it/s][A
 22%|██▏       | 26/119 [00:00<00:01, 46.71it/s][A
 29%|██▊       | 34/119 [00:00<00:01, 53.31it/s][A
 34%|███▍      | 41/119 [00:01<00:01, 57.02it/s][A
 41%|████      | 49/119 [00:01<00:01, 61.71it/s][A
 47%|████▋     | 56/119 [00:01<00:00, 63.03it/s][A
 54%|█████▍    | 64/119 [00:01<00:00, 66.61it/s][A
 61%|██████    | 72/119 [00:01<00:00, 69.94it/s][A
 68%|██████▊   | 81/119 [00:01<00:00, 73.42it/s][A
 76%|███████▌  | 90/119 [00:01<00:00, 76.47it/s][A
 82%|████████▏ | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.47058823529411764
class2 recall: 0.5666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.2647058823529412
class2 recall: 0.75




  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.97it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.25it/s][A
 14%|█▍        | 17/120 [00:00<00:03, 32.43it/s][A
 21%|██        | 25/120 [00:00<00:02, 43.85it/s][A
 28%|██▊       | 34/120 [00:00<00:01, 54.59it/s][A
 35%|███▌      | 42/120 [00:01<00:01, 57.37it/s][A
 41%|████      | 49/120 [00:01<00:01, 60.56it/s][A
 48%|████▊     | 58/120 [00:01<00:00, 66.69it/s][A
 55%|█████▌    | 66/120 [00:01<00:00, 67.51it/s][A
 62%|██████▏   | 74/120 [00:01<00:00, 70.14it/s][A
 69%|██████▉   | 83/120 [00:01<00:00, 73.17it/s][A
 76%|███████▌  | 91/120 [00:01<00:00, 72.47it/s][A
 82%|████████▎ | 99/120 [00:01<00:00, 70.90it/s][A
 89%|████████▉ | 107/120 [00:01<00:00, 72.58it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.93it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.5588235294117647
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<03:03,  1.96it/s][A
  2%|▏         | 7/360 [00:00<00:24, 14.26it/s][A
  4%|▍         | 16/360 [00:00<00:10, 31.75it/s][A
  7%|▋         | 24/360 [00:00<00:07, 43.40it/s][A
  9%|▉         | 32/360 [00:00<00:06, 52.42it/s][A
 11%|█▏        | 41/360 [00:01<00:05, 62.11it/s][A
 14%|█▍        | 50/360 [00:01<00:04, 68.18it/s][A
 16%|█▌        | 58/360 [00:01<00:04, 70.11it/s][A
 18%|█▊        | 66/360 [00:01<00:04, 69.95it/s][A
 21%|██        | 74/360 [00:01<00:04, 70.64it/s][A
 23%|██▎       | 82/360 [00:01<00:04, 67.68it/s][A
 25%|██▌       | 90/360 [00:01<00:04, 66.53it/s][A
 27%|██▋       | 97/360 [00:01<00:03, 67.32it/s][A
 29%|██▉       | 104/360 [00:01<00:03, 67.35it/s][A
 31%|███▏      | 113/360 [00:02<00:03, 71.09it/s][A
 34%|███▎      | 121/360 [00:02<00:03, 70.80it/s][A
 36%|███▌      | 129/360 [00:02<00:03, 71.35it/s][A
 38%|███▊      | 138/360 [00:02<00:02, 75.72it/s][A
 41%|████      | 146/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.5922330097087378
class2 recall: 0.6408839779005525
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.42718446601941745
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:55,  2.12it/s][A
  5%|▌         | 6/119 [00:00<00:08, 13.19it/s][A
 11%|█         | 13/119 [00:00<00:03, 26.64it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 37.62it/s][A
 24%|██▎       | 28/119 [00:00<00:01, 48.62it/s][A
 29%|██▉       | 35/119 [00:00<00:01, 53.02it/s][A
 36%|███▌      | 43/119 [00:01<00:01, 58.81it/s][A
 44%|████▎     | 52/119 [00:01<00:01, 65.60it/s][A
 50%|█████     | 60/119 [00:01<00:00, 69.57it/s][A
 57%|█████▋    | 68/119 [00:01<00:00, 70.68it/s][A
 64%|██████▍   | 76/119 [00:01<00:00, 72.72it/s][A
 71%|███████   | 84/119 [00:01<00:00, 73.71it/s][A
 77%|███████▋  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.29411764705882354
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.2647058823529412
class2 recall: 0.7166666666666667




  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:02,  1.91it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.39it/s][A
 13%|█▎        | 16/120 [00:00<00:03, 30.99it/s][A
 21%|██        | 25/120 [00:00<00:02, 44.61it/s][A
 27%|██▋       | 32/120 [00:00<00:01, 50.50it/s][A
 32%|███▎      | 39/120 [00:01<00:01, 54.79it/s][A
 38%|███▊      | 46/120 [00:01<00:01, 58.17it/s][A
 45%|████▌     | 54/120 [00:01<00:01, 63.89it/s][A
 52%|█████▎    | 63/120 [00:01<00:00, 69.74it/s][A
 59%|█████▉    | 71/120 [00:01<00:00, 70.36it/s][A
 66%|██████▌   | 79/120 [00:01<00:00, 71.13it/s][A
 72%|███████▎  | 87/120 [00:01<00:00, 70.44it/s][A
 79%|███████▉  | 95/120 [00:01<00:00, 69.58it/s][A
 86%|████████▌ | 103/120 [00:01<00:00, 69.88it/s][A
 92%|█████████▎| 111/120 [00:02<00:00, 71.62it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.07it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:55,  2.04it/s][A
  2%|▏         | 7/360 [00:00<00:23, 14.74it/s][A
  4%|▍         | 16/360 [00:00<00:10, 32.06it/s][A
  7%|▋         | 25/360 [00:00<00:07, 45.46it/s][A
  9%|▉         | 33/360 [00:00<00:06, 53.11it/s][A
 11%|█▏        | 41/360 [00:01<00:05, 60.14it/s][A
 14%|█▎        | 49/360 [00:01<00:04, 65.41it/s][A
 16%|█▌        | 57/360 [00:01<00:04, 67.83it/s][A
 18%|█▊        | 65/360 [00:01<00:04, 68.38it/s][A
 20%|██        | 73/360 [00:01<00:04, 68.25it/s][A
 22%|██▎       | 81/360 [00:01<00:04, 69.01it/s][A
 25%|██▍       | 89/360 [00:01<00:03, 70.02it/s][A
 27%|██▋       | 97/360 [00:01<00:03, 71.77it/s][A
 29%|██▉       | 105/360 [00:01<00:03, 71.62it/s][A
 31%|███▏      | 113/360 [00:02<00:03, 72.99it/s][A
 34%|███▎      | 121/360 [00:02<00:03, 73.15it/s][A
 36%|███▌      | 129/360 [00:02<00:03, 70.86it/s][A
 38%|███▊      | 137/360 [00:02<00:03, 71.74it/s][A
 40%|████      | 145/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.4174757281553398
class2 recall: 0.7734806629834254
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.4368932038834951
class2 recall: 0.7845303867403315


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.10it/s][A
  5%|▌         | 6/119 [00:00<00:08, 13.08it/s][A
 11%|█         | 13/119 [00:00<00:03, 26.96it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 37.98it/s][A
 24%|██▎       | 28/119 [00:00<00:01, 48.68it/s][A
 30%|███       | 36/119 [00:00<00:01, 56.20it/s][A
 37%|███▋      | 44/119 [00:01<00:01, 61.97it/s][A
 45%|████▍     | 53/119 [00:01<00:00, 66.88it/s][A
 52%|█████▏    | 62/119 [00:01<00:00, 72.36it/s][A
 59%|█████▉    | 70/119 [00:01<00:00, 71.69it/s][A
 66%|██████▌   | 78/119 [00:01<00:00, 72.64it/s][A
 73%|███████▎  | 87/119 [00:01<00:00, 76.40it/s][A
 80%|███████▉  | 9

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.36
class1 recall: 0.23529411764705882
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.2647058823529412
class2 recall: 0.7333333333333333



  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.96it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.76it/s][A
 13%|█▎        | 16/120 [00:00<00:03, 31.73it/s][A
 21%|██        | 25/120 [00:00<00:02, 45.35it/s][A
 28%|██▊       | 34/120 [00:00<00:01, 55.77it/s][A
 35%|███▌      | 42/120 [00:01<00:01, 61.95it/s][A
 42%|████▎     | 51/120 [00:01<00:01, 67.86it/s][A
 49%|████▉     | 59/120 [00:01<00:00, 68.33it/s][A
 56%|█████▌    | 67/120 [00:01<00:00, 69.75it/s][A
 62%|██████▎   | 75/120 [00:01<00:00, 68.80it/s][A
 69%|██████▉   | 83/120 [00:01<00:00, 69.79it/s][A
 76%|███████▌  | 91/120 [00:01<00:00, 70.51it/s][A
 82%|████████▎ | 99/120 [00:01<00:00, 69.85it/s][A
 89%|████████▉ | 107/120 [00:01<00:00, 72.49it/s][A
100%|██████████| 120/120 [00:02<00:00, 54.66it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.3076923076923077
class1 recall: 0.20588235294117646
class2 recall: 0.95
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:56,  2.03it/s][A
  2%|▏         | 8/360 [00:00<00:20, 16.82it/s][A
  5%|▍         | 17/360 [00:00<00:10, 33.26it/s][A
  7%|▋         | 25/360 [00:00<00:07, 44.20it/s][A
  9%|▉         | 33/360 [00:00<00:06, 53.42it/s][A
 11%|█▏        | 41/360 [00:01<00:05, 60.32it/s][A
 14%|█▍        | 50/360 [00:01<00:04, 66.55it/s][A
 16%|█▌        | 58/360 [00:01<00:04, 68.58it/s][A
 19%|█▊        | 67/360 [00:01<00:03, 74.52it/s][A
 21%|██        | 75/360 [00:01<00:03, 72.49it/s][A
 23%|██▎       | 84/360 [00:01<00:03, 76.00it/s][A
 26%|██▌       | 92/360 [00:01<00:03, 73.84it/s][A
 28%|██▊       | 100/360 [00:01<00:03, 70.37it/s][A
 30%|███       | 108/360 [00:01<00:03, 69.73it/s][A
 32%|███▏      | 116/360 [00:02<00:03, 70.74it/s][A
 34%|███▍      | 124/360 [00:02<00:03, 73.22it/s][A
 37%|███▋      | 132/360 [00:02<00:03, 74.62it/s][A
 39%|███▉      | 140/360 [00:02<00:02, 74.97it/s][A
 41%|████      | 148/360 [00:02<00

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.47368421052631576
class1 recall: 0.3300970873786408
class2 recall: 0.9171270718232044
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.4368932038834951
class2 recall: 0.7790055248618785


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.09it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.62it/s][A
 11%|█         | 13/119 [00:00<00:04, 26.07it/s][A
 17%|█▋        | 20/119 [00:00<00:02, 36.91it/s][A
 24%|██▎       | 28/119 [00:00<00:01, 47.82it/s][A
 30%|███       | 36/119 [00:01<00:01, 54.92it/s][A
 38%|███▊      | 45/119 [00:01<00:01, 63.52it/s][A
 45%|████▍     | 53/119 [00:01<00:00, 67.22it/s][A
 52%|█████▏    | 62/119 [00:01<00:00, 71.70it/s][A
 59%|█████▉    | 70/119 [00:01<00:00, 72.99it/s][A
 66%|██████▋   | 79/119 [00:01<00:00, 74.91it/s][A
 73%|███████▎  | 87/119 [00:01<00:00, 75.78it/s][A
 81%|████████  | 9

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.5
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.23529411764705882
class2 recall: 0.7333333333333333



  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.97it/s][A
  8%|▊         | 9/120 [00:00<00:05, 19.05it/s][A
 14%|█▍        | 17/120 [00:00<00:03, 33.57it/s][A
 20%|██        | 24/120 [00:00<00:02, 42.63it/s][A
 28%|██▊       | 33/120 [00:00<00:01, 54.50it/s][A
 34%|███▍      | 41/120 [00:01<00:01, 58.56it/s][A
 42%|████▏     | 50/120 [00:01<00:01, 65.93it/s][A
 49%|████▉     | 59/120 [00:01<00:00, 70.69it/s][A
 56%|█████▌    | 67/120 [00:01<00:00, 70.58it/s][A
 62%|██████▎   | 75/120 [00:01<00:00, 70.80it/s][A
 69%|██████▉   | 83/120 [00:01<00:00, 73.24it/s][A
 76%|███████▌  | 91/120 [00:01<00:00, 71.81it/s][A
 82%|████████▎ | 99/120 [00:01<00:00, 72.63it/s][A
 89%|████████▉ | 107/120 [00:01<00:00, 72.04it/s][A
100%|██████████| 120/120 [00:02<00:00, 55.06it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.4411764705882353
class2 recall: 0.7833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:54,  2.05it/s][A
  2%|▏         | 7/360 [00:00<00:23, 15.09it/s][A
  4%|▍         | 16/360 [00:00<00:10, 33.23it/s][A
  7%|▋         | 25/360 [00:00<00:07, 46.66it/s][A
  9%|▉         | 34/360 [00:00<00:05, 56.52it/s][A
 12%|█▏        | 43/360 [00:01<00:04, 64.89it/s][A
 14%|█▍        | 52/360 [00:01<00:04, 70.27it/s][A
 17%|█▋        | 60/360 [00:01<00:04, 71.65it/s][A
 19%|█▉        | 69/360 [00:01<00:03, 74.47it/s][A
 21%|██▏       | 77/360 [00:01<00:03, 75.49it/s][A
 24%|██▍       | 86/360 [00:01<00:03, 74.97it/s][A
 26%|██▌       | 94/360 [00:01<00:03, 75.40it/s][A
 28%|██▊       | 102/360 [00:01<00:03, 73.99it/s][A
 31%|███       | 110/360 [00:01<00:03, 74.71it/s][A
 33%|███▎      | 118/360 [00:01<00:03, 75.52it/s][A
 35%|███▌      | 127/360 [00:02<00:03, 76.75it/s][A
 38%|███▊      | 136/360 [00:02<00:02, 80.47it/s][A
 40%|████      | 145/360 [00:02<00:02, 78.11it/s][A
 42%|████▎     | 153/360 [00:02<00

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.6213592233009708
class2 recall: 0.7292817679558011
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.44660194174757284
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.10it/s][A
  4%|▍         | 5/119 [00:00<00:10, 10.63it/s][A
 10%|█         | 12/119 [00:00<00:04, 24.86it/s][A
 16%|█▌        | 19/119 [00:00<00:02, 35.98it/s][A
 24%|██▎       | 28/119 [00:00<00:01, 48.55it/s][A
 29%|██▉       | 35/119 [00:01<00:01, 53.14it/s][A
 36%|███▌      | 43/119 [00:01<00:01, 60.06it/s][A
 44%|████▎     | 52/119 [00:01<00:01, 66.70it/s][A
 51%|█████▏    | 61/119 [00:01<00:00, 71.43it/s][A
 58%|█████▊    | 69/119 [00:01<00:00, 73.31it/s][A
 65%|██████▍   | 77/119 [00:01<00:00, 71.49it/s][A
 71%|███████▏  | 85/119 [00:01<00:00, 70.09it/s][A
 78%|███████▊  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.5294117647058824
class2 recall: 0.5833333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.2647058823529412
class2 recall: 0.7333333333333333




  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:59,  1.99it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.68it/s][A
 14%|█▍        | 17/120 [00:00<00:03, 33.03it/s][A
 22%|██▏       | 26/120 [00:00<00:02, 46.53it/s][A
 28%|██▊       | 34/120 [00:00<00:01, 54.63it/s][A
 35%|███▌      | 42/120 [00:01<00:01, 60.81it/s][A
 42%|████▎     | 51/120 [00:01<00:01, 66.50it/s][A
 50%|█████     | 60/120 [00:01<00:00, 71.65it/s][A
 57%|█████▋    | 68/120 [00:01<00:00, 70.37it/s][A
 63%|██████▎   | 76/120 [00:01<00:00, 72.31it/s][A
 70%|███████   | 84/120 [00:01<00:00, 73.02it/s][A
 77%|███████▋  | 92/120 [00:01<00:00, 73.33it/s][A
 83%|████████▎ | 100/120 [00:01<00:00, 73.20it/s][A
 91%|█████████ | 109/120 [00:01<00:00, 76.43it/s][A
100%|██████████| 120/120 [00:02<00:00, 55.76it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.5882352941176471
class2 recall: 0.65
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:55,  2.05it/s][A
  2%|▏         | 7/360 [00:00<00:23, 14.98it/s][A
  4%|▍         | 15/360 [00:00<00:11, 30.17it/s][A
  6%|▌         | 22/360 [00:00<00:08, 40.25it/s][A
  8%|▊         | 29/360 [00:00<00:06, 48.06it/s][A
 10%|█         | 36/360 [00:01<00:05, 54.06it/s][A
 12%|█▏        | 43/360 [00:01<00:05, 58.32it/s][A
 14%|█▍        | 51/360 [00:01<00:04, 63.13it/s][A
 17%|█▋        | 60/360 [00:01<00:04, 69.91it/s][A
 19%|█▉        | 69/360 [00:01<00:03, 74.32it/s][A
 21%|██▏       | 77/360 [00:01<00:03, 74.76it/s][A
 24%|██▎       | 85/360 [00:01<00:03, 75.97it/s][A
 26%|██▌       | 93/360 [00:01<00:03, 72.95it/s][A
 28%|██▊       | 101/360 [00:01<00:03, 70.60it/s][A
 30%|███       | 109/360 [00:01<00:03, 72.88it/s][A
 33%|███▎      | 118/360 [00:02<00:03, 75.79it/s][A
 35%|███▌      | 126/360 [00:02<00:03, 76.94it/s][A
 38%|███▊      | 136/360 [00:02<00:02, 82.17it/s][A
 40%|████      | 145/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.6504854368932039
class2 recall: 0.6519337016574586
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7236842105263158
class1 recall: 0.4368932038834951
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:12<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:57,  2.06it/s][A
  6%|▌         | 7/119 [00:00<00:07, 15.06it/s][A
 13%|█▎        | 16/119 [00:00<00:03, 32.13it/s][A
 20%|██        | 24/119 [00:00<00:02, 43.23it/s][A
 27%|██▋       | 32/119 [00:00<00:01, 52.04it/s][A
 34%|███▎      | 40/119 [00:01<00:01, 58.31it/s][A
 40%|████      | 48/119 [00:01<00:01, 61.92it/s][A
 46%|████▌     | 55/119 [00:01<00:01, 63.67it/s][A
 54%|█████▍    | 64/119 [00:01<00:00, 69.24it/s][A
 61%|██████    | 72/119 [00:01<00:00, 68.73it/s][A
 67%|██████▋   | 80/119 [00:01<00:00, 70.77it/s][A
 74%|███████▍  | 88/119 [00:01<00:00, 71.25it/s][A
 81%|████████  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.23529411764705882
class2 recall: 0.5333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.29411764705882354
class2 recall: 0.7166666666666667




  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:58,  2.03it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.79it/s][A
 13%|█▎        | 16/120 [00:00<00:03, 31.09it/s][A
 19%|█▉        | 23/120 [00:00<00:02, 40.22it/s][A
 26%|██▌       | 31/120 [00:00<00:01, 50.17it/s][A
 32%|███▎      | 39/120 [00:01<00:01, 56.46it/s][A
 38%|███▊      | 46/120 [00:01<00:01, 59.87it/s][A
 45%|████▌     | 54/120 [00:01<00:01, 64.03it/s][A
 52%|█████▎    | 63/120 [00:01<00:00, 69.29it/s][A
 59%|█████▉    | 71/120 [00:01<00:00, 69.03it/s][A
 66%|██████▌   | 79/120 [00:01<00:00, 71.74it/s][A
 72%|███████▎  | 87/120 [00:01<00:00, 71.11it/s][A
 79%|███████▉  | 95/120 [00:01<00:00, 67.66it/s][A
 85%|████████▌ | 102/120 [00:01<00:00, 67.95it/s][A
 92%|█████████▏| 110/120 [00:02<00:00, 70.72it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.03it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.38235294117647056
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:55,  2.04it/s][A
  2%|▏         | 6/360 [00:00<00:27, 12.85it/s][A
  4%|▍         | 14/360 [00:00<00:12, 28.52it/s][A
  6%|▌         | 22/360 [00:00<00:08, 41.04it/s][A
  8%|▊         | 30/360 [00:00<00:06, 51.00it/s][A
 11%|█         | 39/360 [00:01<00:05, 61.35it/s][A
 13%|█▎        | 47/360 [00:01<00:04, 65.91it/s][A
 16%|█▌        | 56/360 [00:01<00:04, 72.08it/s][A
 18%|█▊        | 66/360 [00:01<00:03, 77.98it/s][A
 21%|██        | 75/360 [00:01<00:03, 74.74it/s][A
 23%|██▎       | 83/360 [00:01<00:03, 74.76it/s][A
 25%|██▌       | 91/360 [00:01<00:03, 69.75it/s][A
 28%|██▊       | 99/360 [00:01<00:03, 67.32it/s][A
 30%|███       | 108/360 [00:01<00:03, 71.03it/s][A
 32%|███▎      | 117/360 [00:02<00:03, 74.81it/s][A
 35%|███▌      | 126/360 [00:02<00:03, 77.30it/s][A
 38%|███▊      | 135/360 [00:02<00:02, 80.08it/s][A
 40%|████      | 144/360 [00:02<00:02, 81.23it/s][A
 42%|████▎     | 153/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.868421052631579
class1 recall: 0.5436893203883495
class2 recall: 0.6685082872928176
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.46601941747572817
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:56,  2.11it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.96it/s][A
 12%|█▏        | 14/119 [00:00<00:03, 28.42it/s][A
 18%|█▊        | 22/119 [00:00<00:02, 40.28it/s][A
 25%|██▌       | 30/119 [00:00<00:01, 49.69it/s][A
 32%|███▏      | 38/119 [00:01<00:01, 56.55it/s][A
 39%|███▊      | 46/119 [00:01<00:01, 62.01it/s][A
 45%|████▌     | 54/119 [00:01<00:01, 64.16it/s][A
 52%|█████▏    | 62/119 [00:01<00:00, 67.88it/s][A
 59%|█████▉    | 70/119 [00:01<00:00, 69.87it/s][A
 66%|██████▋   | 79/119 [00:01<00:00, 74.51it/s][A
 73%|███████▎  | 87/119 [00:01<00:00, 71.95it/s][A
 80%|███████▉  | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.76
class1 recall: 0.4411764705882353
class2 recall: 0.4
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.2647058823529412
class2 recall: 0.7166666666666667




  self.pid = os.fork()

  1%|          | 1/120 [00:00<00:59,  2.01it/s][A
  7%|▋         | 8/120 [00:00<00:06, 16.51it/s][A
 13%|█▎        | 16/120 [00:00<00:03, 30.99it/s][A
 20%|██        | 24/120 [00:00<00:02, 42.23it/s][A
 27%|██▋       | 32/120 [00:00<00:01, 51.06it/s][A
 33%|███▎      | 40/120 [00:01<00:01, 57.90it/s][A
 39%|███▉      | 47/120 [00:01<00:01, 60.47it/s][A
 45%|████▌     | 54/120 [00:01<00:01, 61.84it/s][A
 52%|█████▏    | 62/120 [00:01<00:00, 66.64it/s][A
 58%|█████▊    | 70/120 [00:01<00:00, 67.04it/s][A
 65%|██████▌   | 78/120 [00:01<00:00, 67.59it/s][A
 72%|███████▏  | 86/120 [00:01<00:00, 70.74it/s][A
 78%|███████▊  | 94/120 [00:01<00:00, 72.60it/s][A
 85%|████████▌ | 102/120 [00:01<00:00, 73.26it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.97it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7692307692307693
class1 recall: 0.4117647058823529
class2 recall: 0.45
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.7833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:55,  2.05it/s][A
  2%|▏         | 7/360 [00:00<00:24, 14.51it/s][A
  4%|▍         | 15/360 [00:00<00:11, 29.38it/s][A
  6%|▋         | 23/360 [00:00<00:08, 41.71it/s][A
  9%|▊         | 31/360 [00:00<00:06, 50.26it/s][A
 11%|█         | 40/360 [00:01<00:05, 60.00it/s][A
 13%|█▎        | 48/360 [00:01<00:04, 64.08it/s][A
 16%|█▌        | 56/360 [00:01<00:04, 65.65it/s][A
 18%|█▊        | 65/360 [00:01<00:04, 70.53it/s][A
 20%|██        | 73/360 [00:01<00:03, 72.74it/s][A
 23%|██▎       | 82/360 [00:01<00:03, 74.95it/s][A
 25%|██▌       | 90/360 [00:01<00:03, 71.92it/s][A
 27%|██▋       | 98/360 [00:01<00:03, 69.59it/s][A
 30%|██▉       | 107/360 [00:01<00:03, 71.25it/s][A
 32%|███▏      | 116/360 [00:02<00:03, 74.91it/s][A
 35%|███▍      | 125/360 [00:02<00:03, 78.08it/s][A
 38%|███▊      | 135/360 [00:02<00:02, 81.64it/s][A
 40%|████      | 144/360 [00:02<00:02, 79.46it/s][A
 42%|████▎     | 153/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9078947368421053
class1 recall: 0.5048543689320388
class2 recall: 0.4696132596685083
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7894736842105263
class1 recall: 0.46601941747572817
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 55.08 55.76 63.61
  self.pid = os.fork()

  predicted_relative_relevance = F.softmax(predicted_relevance/args['T'])
  0%|          | 0/360 [00:13<?, ?it/s]

  0%|          | 0/119 [00:00<?, ?it/s][A
  1%|          | 1/119 [00:00<00:57,  2.06it/s][A
  5%|▌         | 6/119 [00:00<00:08, 12.73it/s][A
 11%|█         | 13/119 [00:00<00:03, 26.53it/s][A
 18%|█▊        | 21/119 [00:00<00:02, 39.87it/s][A
 25%|██▌       | 30/119 [00:00<00:01, 52.29it/s][A
 32%|███▏      | 38/119 [00:01<00:01, 58.32it/s][A
 40%|████      | 48/119 [00:01<00:01, 67.88it/s][A
 47%|████▋     | 56/119 [00:01<00:00, 69.93it/s][A
 55%|█████▍    | 65/119 [00:01<00:00, 75.29it/s][A
 62%|██████▏   | 74/119 [00:01<00:00, 78.91it/s][A
 70%|██████▉   | 83/119 [00:01<00:00, 74.61it/s][A
 76%|███████▋  | 91/119 [00:01<00:00, 75.67it/s][A
 84%|████████▍ | 

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.17647058823529413
class2 recall: 0.8833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.35294117647058826
class2 recall: 0.7




  self.pid = os.fork()

  1%|          | 1/120 [00:00<01:00,  1.96it/s][A
  6%|▌         | 7/120 [00:00<00:07, 14.59it/s][A
 12%|█▎        | 15/120 [00:00<00:03, 30.06it/s][A
 19%|█▉        | 23/120 [00:00<00:02, 42.47it/s][A
 26%|██▌       | 31/120 [00:00<00:01, 51.24it/s][A
 32%|███▎      | 39/120 [00:01<00:01, 57.90it/s][A
 39%|███▉      | 47/120 [00:01<00:01, 61.95it/s][A
 46%|████▌     | 55/120 [00:01<00:01, 64.03it/s][A
 52%|█████▎    | 63/120 [00:01<00:00, 67.17it/s][A
 59%|█████▉    | 71/120 [00:01<00:00, 70.02it/s][A
 66%|██████▌   | 79/120 [00:01<00:00, 70.94it/s][A
 72%|███████▎  | 87/120 [00:01<00:00, 72.27it/s][A
 79%|███████▉  | 95/120 [00:01<00:00, 72.14it/s][A
 86%|████████▌ | 103/120 [00:01<00:00, 69.82it/s][A
100%|██████████| 120/120 [00:02<00:00, 53.62it/s]

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.34615384615384615
class1 recall: 0.35294117647058826
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.7833333333333333




  self.pid = os.fork()

  0%|          | 1/360 [00:00<02:49,  2.12it/s][A
  2%|▏         | 6/360 [00:00<00:26, 13.20it/s][A
  4%|▍         | 14/360 [00:00<00:11, 29.40it/s][A
  6%|▋         | 23/360 [00:00<00:07, 44.71it/s][A
  9%|▊         | 31/360 [00:00<00:06, 53.80it/s][A
 11%|█         | 40/360 [00:00<00:05, 63.28it/s][A
 13%|█▎        | 48/360 [00:01<00:04, 66.99it/s][A
 16%|█▌        | 56/360 [00:01<00:04, 69.85it/s][A
 18%|█▊        | 64/360 [00:01<00:04, 72.40it/s][A
 20%|██        | 73/360 [00:01<00:03, 75.16it/s][A
 22%|██▎       | 81/360 [00:01<00:03, 75.69it/s][A
 25%|██▍       | 89/360 [00:01<00:03, 74.47it/s][A
 27%|██▋       | 97/360 [00:01<00:03, 71.15it/s][A
 29%|██▉       | 105/360 [00:01<00:03, 71.96it/s][A
 31%|███▏      | 113/360 [00:01<00:03, 72.23it/s][A
 34%|███▎      | 121/360 [00:02<00:03, 73.70it/s][A
 36%|███▌      | 129/360 [00:02<00:03, 75.20it/s][A
 39%|███▊      | 139/360 [00:02<00:02, 80.96it/s][A
 41%|████      | 148/360 [00:02<00:

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6447368421052632
class1 recall: 0.3786407766990291
class2 recall: 0.8342541436464088
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.49514563106796117
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.38235294117647056
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.29411764705882354
class2 recall: 0.6833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.38461538461538464
class1 recall: 0.4117647058823529
class2 recall: 0.8
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.3235294117647059
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5921052631578947
class1 recall: 0.5242718446601942
class2 recall: 0.7900552486187845
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.5436893203883495
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.2647058823529412
class2 recall: 0.7
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.35294117647058826
class2 recall: 0.6666666666666666





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.5
class2 recall: 0.8
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.3235294117647059
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6973684210526315
class1 recall: 0.5533980582524272
class2 recall: 0.7845303867403315
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7631578947368421
class1 recall: 0.5533980582524272
class2 recall: 0.7734806629834254


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.44
class1 recall: 0.17647058823529413
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.35294117647058826
class2 recall: 0.6833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.38461538461538464
class1 recall: 0.20588235294117646
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.3235294117647059
class2 recall: 0.7833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6052631578947368
class1 recall: 0.33980582524271846
class2 recall: 0.8950276243093923
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7631578947368421
class1 recall: 0.5145631067961165
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.4117647058823529
class2 recall: 0.5833333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.3235294117647059
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.47058823529411764
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.7833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7631578947368421
class1 recall: 0.6407766990291263
class2 recall: 0.6850828729281768
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7631578947368421
class1 recall: 0.49514563106796117
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.4411764705882353
class2 recall: 0.5833333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.35294117647058826
class2 recall: 0.7





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.5588235294117647
class2 recall: 0.75
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.7833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.75
class1 recall: 0.6019417475728155
class2 recall: 0.7071823204419889
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7894736842105263
class1 recall: 0.4854368932038835
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.84
class1 recall: 0.3235294117647059
class2 recall: 0.45
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.35294117647058826
class2 recall: 0.7





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6538461538461539
class1 recall: 0.4117647058823529
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.29411764705882354
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8947368421052632
class1 recall: 0.5436893203883495
class2 recall: 0.580110497237569
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8026315789473685
class1 recall: 0.5048543689320388
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.3235294117647059
class2 recall: 0.7333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.35294117647058826
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.35294117647058826
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.3235294117647059
class2 recall: 0.7666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7368421052631579
class1 recall: 0.5145631067961165
class2 recall: 0.7955801104972375
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7894736842105263
class1 recall: 0.5242718446601942
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.23529411764705882
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.3235294117647059
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.3235294117647059
class2 recall: 0.8166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.46153846153846156
class1 recall: 0.3235294117647059
class2 recall: 0.7833333333333333




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7631578947368421
class1 recall: 0.44660194174757284
class2 recall: 0.8232044198895028
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7894736842105263
class1 recall: 0.5339805825242718
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 56.43 52.58 68.16

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.11764705882352941
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.3235294117647059
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.08823529411764706
class2 recall: 0.7166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.3235294117647059
class2 recall: 0.7833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9078947368421053
class1 recall: 0.21359223300970873
class2 recall: 0.7071823204419889
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8289473684210527
class1 recall: 0.49514563106796117
class2 recall: 0.8011049723756906


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 57.12 53.56 70.84

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.17647058823529413
class2 recall: 0.8833333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.29411764705882354
class2 recall: 0.75




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.058823529411764705
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.3235294117647059
class2 recall: 0.7666666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8026315789473685
class1 recall: 0.2621359223300971
class2 recall: 0.9060773480662984
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.49514563106796117
class2 recall: 0.7790055248618785



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 57.47 53.01 70.98

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.52
class1 recall: 0.17647058823529413
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.76
class1 recall: 0.29411764705882354
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4230769230769231
class1 recall: 0.2647058823529412
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5769230769230769
class1 recall: 0.3235294117647059
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6710526315789473
class1 recall: 0.30097087378640774
class2 recall: 0.9005524861878453
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.5048543689320388
class2 recall: 0.7734806629834254



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.4
class1 recall: 0.6764705882352942
class2 recall: 0.5333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.72
class1 recall: 0.2647058823529412
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.34615384615384615
class1 recall: 0.5882352941176471
class2 recall: 0.6166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.3235294117647059
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6052631578947368
class1 recall: 0.7281553398058253
class2 recall: 0.6519337016574586
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.49514563106796117
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.88
class1 recall: 0.08823529411764706
class2 recall: 0.6333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.29411764705882354
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8461538461538461
class1 recall: 0.029411764705882353
class2 recall: 0.7666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.3235294117647059
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9473684210526315
class1 recall: 0.10679611650485436
class2 recall: 0.7237569060773481
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.5048543689320388
class2 recall: 0.7845303867403315



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.20588235294117646
class2 recall: 0.8666666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.3235294117647059
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.23529411764705882
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.29411764705882354
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7763157894736842
class1 recall: 0.32038834951456313
class2 recall: 0.8729281767955801
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8552631578947368
class1 recall: 0.49514563106796117
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8
class1 recall: 0.17647058823529413
class2 recall: 0.6666666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.3235294117647059
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.08823529411764706
class2 recall: 0.8
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9078947368421053
class1 recall: 0.2815533980582524
class2 recall: 0.7513812154696132
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8421052631578947
class1 recall: 0.5048543689320388
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.17647058823529413
class2 recall: 0.8333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.3235294117647059
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.17647058823529413
class2 recall: 0.85
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8421052631578947
class1 recall: 0.33980582524271846
class2 recall: 0.8232044198895028
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8421052631578947
class1 recall: 0.49514563106796117
class2 recall: 0.7900552486187845



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.11764705882352941
class2 recall: 0.9166666666666666
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.3235294117647059
class2 recall: 0.7166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6153846153846154
class1 recall: 0.058823529411764705
class2 recall: 0.9833333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8421052631578947
class1 recall: 0.18446601941747573
class2 recall: 0.9226519337016574
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8421052631578947
class1 recall: 0.4854368932038835
class2 recall: 0.8066298342541437



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.76
class1 recall: 0.35294117647058826
class2 recall: 0.48333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.2647058823529412
class2 recall: 0.8




Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6923076923076923
class1 recall: 0.5
class2 recall: 0.6166666666666667
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8947368421052632
class1 recall: 0.6116504854368932
class2 recall: 0.6187845303867403
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8421052631578947
class1 recall: 0.46601941747572817
class2 recall: 0.8232044198895028


INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.48
class1 recall: 0.6176470588235294
class2 recall: 0.48333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.29411764705882354
class2 recall: 0.7666666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.38461538461538464
class1 recall: 0.5882352941176471
class2 recall: 0.6
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.8333333333333334





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6842105263157895
class1 recall: 0.7864077669902912
class2 recall: 0.6353591160220995
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8157894736842105
class1 recall: 0.46601941747572817
class2 recall: 0.8176795580110497



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.32
class1 recall: 0.7647058823529411
class2 recall: 0.45
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.6
class1 recall: 0.3235294117647059
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.3076923076923077
class1 recall: 0.6764705882352942
class2 recall: 0.55
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5394736842105263
class1 recall: 0.8155339805825242
class2 recall: 0.574585635359116
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8289473684210527
class1 recall: 0.49514563106796117
class2 recall: 0.8066298342541437



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.84
class1 recall: 0.38235294117647056
class2 recall: 0.4
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.2647058823529412
class2 recall: 0.7333333333333333





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.7307692307692307
class1 recall: 0.4411764705882353
class2 recall: 0.5833333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.2647058823529412
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.9342105263157895
class1 recall: 0.6213592233009708
class2 recall: 0.56353591160221
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8157894736842105
class1 recall: 0.46601941747572817
class2 recall: 0.8011049723756906



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12

Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.56
class1 recall: 0.20588235294117646
class2 recall: 0.9
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.68
class1 recall: 0.2647058823529412
class2 recall: 0.75





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5
class1 recall: 0.11764705882352941
class2 recall: 0.9333333333333333
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.5384615384615384
class1 recall: 0.3235294117647059
class2 recall: 0.8166666666666667





Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8026315789473685
class1 recall: 0.3883495145631068
class2 recall: 0.9005524861878453
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.8157894736842105
class1 recall: 0.47572815533980584
class2 recall: 0.7955801104972375



INFO:__main__:val progression view:
INFO:__main__:At RAW Best val, validation/test/train 59.79 49.87 59.89
INFO:__main__:At EMA Best val, validation/test/train 59.58 55.57 71.12


# Results


### Model comparison

#### SAMIL with No Pretraining

In [None]:
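# Target balanced accuracy taken from the paper
target_balanced_accuracy = 0.727

# Path to the last checkpoint saved for this run
load_model_path = os.path.join('runs/SAMIL/NoPretrain', "last_checkpoint.pth.tar")

# Restore the EMA weights; map_location lets the checkpoint load on the current device
loaded_SAMIL_Model = SAMIL().to(device)
checkpoint = torch.load(load_model_path, map_location=device)
loaded_SAMIL_Model.load_state_dict(checkpoint['ema_state_dict'])

# Switch to inference mode before evaluating
loaded_SAMIL_Model.eval()

# Evaluate on the test set with batch_size=1
test_loader = DataLoader(test_dataset, batch_size=1, shuffle=False, num_workers=8)
bal_acc = eval_model_test(args, test_loader, loaded_SAMIL_Model)

print(f"Target bal_acc (from paper): {target_balanced_accuracy}")
print(f"Reproduced bal_acc (from our implementation): {bal_acc}")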



Paper's bal_acc: 0.727
Bal_acc: 0.3333333333333333
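
For reference, balanced accuracy is the unweighted mean of per-class recall, which is what the `class0`/`class1`/`class2` recall lines in the training log feed into. The sketch below is a standalone illustration, not the notebook's `calculate_balanced_accuracy`; the function name and the toy labels are assumptions. It also shows that a 3-class model predicting a single class for every study scores exactly 1/3, which matches the value printed above and may indicate that this checkpoint has not yet learned to separate the classes.

```python
from collections import Counter


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall over the classes present in y_true."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    recalls = [hits[c] / totals[c] for c in sorted(totals)]
    return sum(recalls) / len(recalls)


# Toy labels (hypothetical): 0 = no AS, 1 = early AS, 2 = significant AS
y_true = [0, 0, 1, 1, 1, 2, 2, 2, 2]
always_class_2 = [2] * len(y_true)

# Recalls are 0, 0 and 1, so the mean is one third
print(balanced_accuracy(y_true, always_class_2))
```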


#### SAMIL with Image Level Pretraining

In [None]:
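# Target balanced accuracy taken from the paper
target_balanced_accuracy = 0.727

# Path to the last checkpoint saved for this run
load_model_path = os.path.join('runs/SAMIL/FeatureExtractor1', "last_checkpoint.pth.tar")

# Restore the EMA weights; map_location lets the checkpoint load on the current device
loaded_SAMIL_Model = SAMIL().to(device)
checkpoint = torch.load(load_model_path, map_location=device)
loaded_SAMIL_Model.load_state_dict(checkpoint['ema_state_dict'])

# Switch to inference mode before evaluating
loaded_SAMIL_Model.eval()

# Evaluate on the test set with batch_size=1
test_loader = DataLoader(test_dataset, batch_size=1, shuffle=False, num_workers=8)
bal_acc = eval_model_test(args, test_loader, loaded_SAMIL_Model)

print(f"Target bal_acc (from paper): {target_balanced_accuracy}")
print(f"Reproduced bal_acc (from our implementation): {bal_acc}")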

#### SAMIL with Study Level Pretraining

In [None]:
# Target balanced accuracy taken from the paper
target_balanced_accuracy = 0.754

# Testing by loading a saved checkpoint
load_model_path = os.path.join('runs/SAMIL/Whole', "last_checkpoint.pth.tar")

# Restore the EMA weights; map_location lets the checkpoint load on the current device
loaded_SAMIL_Model = SAMIL().to(device)
checkpoint = torch.load(load_model_path, map_location=device)
loaded_SAMIL_Model.load_state_dict(checkpoint['ema_state_dict'])

# Switch to inference mode before evaluating
loaded_SAMIL_Model.eval()

# Evaluate on the test set with batch_size=1
test_loader = DataLoader(test_dataset, batch_size=1, shuffle=False, num_workers=8)
bal_acc = eval_model_test(args, test_loader, loaded_SAMIL_Model)

print(f"Target bal_acc (from paper): {target_balanced_accuracy}")
print(f"Reproduced bal_acc (from our implementation): {bal_acc}")
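
The checkpoints above are read from the `ema_state_dict` key, i.e. an exponential moving average (EMA) of the weights rather than the raw weights. As a rough illustration of the idea only (the decay value, the function name, and the plain-float "weights" are assumptions, not the SAMIL code), an EMA update can be sketched as:

```python
def ema_update(ema_weights, raw_weights, decay=0.999):
    """Blend each parameter: decay * previous EMA + (1 - decay) * current raw value."""
    return {
        name: decay * ema_weights[name] + (1.0 - decay) * raw_weights[name]
        for name in raw_weights
    }


# Hypothetical single-parameter model tracked over three training steps
ema = {"w": 0.0}
for raw in ({"w": 1.0}, {"w": 1.0}, {"w": 1.0}):
    ema = ema_update(ema, raw, decay=0.5)

print(ema["w"])  # 0.875: the average approaches the raw value of 1.0 gradually
```

Because the averaged weights change more smoothly than the raw ones, the two can score differently on the same split, which is consistent with the log reporting separate RAW and EMA results.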

# Discussion


### Reproducibility

At this time, it cannot be concluded whether the results in the paper can be reproduced. However, the steps necessary to attempt a reproduction are feasible. The code in the paper's GitHub repo is usable and functional, and requires little to no modification to get started. The dataset, while difficult to obtain initially, is maintained and accessible once the owners grant access.


### Challenges

1. The code for this paper assumes it will be run outside of a Jupyter notebook, so the classes had to be refactored to work in a notebook.

2. While the TMED-2 dataset has an access request form, it took three attempts over several weeks to gain access. Ultimately, emailing the staff listed on the TMED-2 website was required in addition to the signup form. For individuals looking to reproduce the results of the paper on short notice, this might pose a challenge.

3. The documentation for the paper's code in the repo was poor. There is only one example of running an experiment, and it does not show how to change any of the parameters. The authors should have provided robust documentation in the GitHub repo for how to reproduce each model they evaluated in its various configurations.

4. The computational requirements to train each model are significant. Training on a standard GPU provided in Colab is estimated to take 3-4 days to run the 2,000-epoch upper bound defined in the paper. Premium A100 GPUs were acquired through Colab, which brought training down to around 16 hours per model. Reproducing this paper should only be considered feasible for individuals willing to acquire GPUs such as the A100 through Colab, AWS, Azure, or a similar provider.

5. The instructions in the GitHub repo README on how to load the dataset were unclear or incorrect. The data loading method had to be modified to match the structure of the data as delivered by TMED-2. Perhaps this was a misunderstanding on my part, but I could not get loading to work from the original instructions.

Despite the challenges above, once the code was refactored into a notebook and the A100 GPU was used in Colab, the code itself was functional without any bug fixing or adjustment to solve for out of date libraries.
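
As a rough worked calculation of what the training times in challenge 4 imply per epoch, the sketch below assumes every run uses the full 2,000-epoch upper bound with no early stopping (an assumption; the helper name is hypothetical):

```python
def seconds_per_epoch(total_hours, epochs=2000):
    """Average wall-clock seconds per epoch for a run of the given length."""
    return total_hours * 3600 / epochs


a100 = seconds_per_epoch(16)           # about 16 hours per model on an A100
standard_low = seconds_per_epoch(72)   # 3 days on a standard Colab GPU
standard_high = seconds_per_epoch(96)  # 4 days on a standard Colab GPU

print(f"A100: {a100:.1f} s/epoch")
print(f"Standard GPU: {standard_low:.1f} to {standard_high:.1f} s/epoch")
print(f"Speedup: {standard_low / a100:.1f}x to {standard_high / a100:.1f}x")
```

Under these assumptions the A100 averages roughly 29 seconds per epoch, against roughly 130 to 173 seconds on the standard GPU, a speedup of about 4.5x to 6x.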

### Suggestions

To improve reproducibility of this paper, the following suggestions are recommended:

1. Update the GitHub repo README to specify exactly how to structure the downloaded data from TMED-2 (where to put it, what the folder structure should be, etc.).

2. The TMED-2 dataset needs a faster, more transparent mechanism for granting access. The Google Form was filled out 3 times over 3 weeks with no response. An email to all the authors of the dataset was required to get a response and gain access.

3. The GitHub repo README should outline, step by step, how to reproduce the exact results in the paper. It currently gives only one example, with no guidance on what to adjust for each experiment run in the paper. In summary, each experiment in the paper should have a corresponding set of instructions in the README.

4. The paper's authors should include the runtime of their model on the A100. The paper mentions that it ran on a single A100, but not how long each run took.

### Future Plans

The following work remains to complete the reproduction of this paper. Each item will be completed by the final submission.

1. Include the ABMIL model class from the paper, train it, and include in the Results/Analysis section. This is one of the models evaluated by the paper.

2. Fully train the other two variants of SAMIL for evaluation. This notebook only completed the training of SAMIL with Study Level Pretraining; SAMIL without Pretraining and SAMIL with Image Level Pretraining must still be trained for evaluation and analysis.

3. Complete the following sections that were not completed in the draft, and the additional sections required in the final report:
    * Results: Include final Balanced Accuracy results for all 4 models.
    * Analysis: Complete an analysis of the 4 models for the following metrics: Balanced Accuracy and Number of Epochs before Early Stop. The analysis will include a chart of the values for each model, and graphs of Balanced Accuracy vs. Epoch and Loss vs. Epoch.
    * Include a section for Environment setup (Python version, packages), Data visualizations, detailed information on the Hyperparams used by this notebook vs. in the paper, additional writeups on the models, and include a section for the Ablation Study.
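
For the planned Balanced Accuracy vs. Epoch graph, one possible starting point is to recover balanced accuracy from the recall lines already printed during training. The sketch below assumes the log format seen in this notebook's training output (an `Inside calculate_balanced_accuracy` line followed by one `classN recall:` line per class). The function name is hypothetical, and it makes no assumption about which split or which weights (raw or EMA) each block belongs to.

```python
import re

# Matches lines such as "class1 recall: 0.35294117647058826"
RECALL_RE = re.compile(r"class(\d+) recall: (\S+)")


def balanced_accuracies_from_log(log_text):
    """Return one balanced accuracy (mean of class recalls) per logged evaluation."""
    results = []
    current = None  # recalls collected for the evaluation being read
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        if line.startswith("Inside calculate_balanced_accuracy"):
            if current:
                results.append(sum(current) / len(current))
            current = []
            continue
        match = RECALL_RE.match(line)
        if match and current is not None:
            current.append(float(match.group(2)))
    if current:
        results.append(sum(current) / len(current))
    return results


# Two evaluation blocks copied from the training output in this notebook
sample_log = """Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.76
class1 recall: 0.35294117647058826
class2 recall: 0.48333333333333334
Inside calculate_balanced_accuracy, 3 classes passed in
class0 recall: 0.64
class1 recall: 0.2647058823529412
class2 recall: 0.8
"""

for value in balanced_accuracies_from_log(sample_log):
    print(f"{value:.4f}")
```

The resulting list can then be paired with epoch numbers and plotted once the mapping from log blocks to epochs and splits has been confirmed.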

# References

1.   Huang, Zhe, Benjamin S. Wessler, and Michael C. Hughes. "Detecting Heart Disease from Multi-View Ultrasound Images via Supervised Attention Multiple Instance Learning." Machine Learning for Healthcare Conference. PMLR, 2023. https://doi.org/10.48550/arXiv.2306.00003

