# TabNet으로 하는 Classification: Deep Dive

출처: [https://syslog.ravelin.com/classification-with-tabnet-deep-dive-49a0dcc8f7e8](https://syslog.ravelin.com/classification-with-tabnet-deep-dive-49a0dcc8f7e8)

# Classification with TabNet: Deep Dive

![image.png](attachment:image.png)

여기서는 해석 가능하고 테이블 형식 데이터와 잘 작동하도록 설계된 TabNet ([Arik & Pfister(2019)](https://arxiv.org/abs/1908.07442))이라는 neural architecture에 대해 자세히 알아볼 것입니다. 핵심 building block과 그 이면의 아이디어를 설명한 후, TensorFlow에서 구현하는 방법과 사기 탐지 데이터 세트에 적용하는 방법을 알 수 있습니다. 대부분의 코드는 [여기](https://github.com/aruberts/blogs/blob/main/TabNet%20Classification%20Example.ipynb)에서 가져왔습니다.

# TabNet
TabNet은 Sequential Attention의 아이디어를 사용하여 decision tree의 동작을 모방합니다. 간단히 말해서 각 단계에서 두 가지 주요 작업을 적용하는 multi-step neural network로 생각할 수 있습니다:

1. Attentive Transformer는 다음 단계에서 처리할 가장 중요한 feature를 선택합니다.

2. Feature Transformer는 feature를 보다 유용한 표현으로 처리합니다.

Feature Transformer의 output은 나중에 예측에 사용됩니다. Attentive와 Feature Transformer를 모두 사용하여 TabNet은 tree-based model의 의사 결정 프로세스를 시뮬레이션할 수 있습니다.

모델은 해석 가능성과 학습을 향상시키는 task에 가장 유용한 feature를 선택하고 처리할 수 있습니다.

![image.png](attachment:image.png)

Attentive와 Feature Transformer의 key building block은 Feature Blocks입니다. 이제 이들을 살펴봅시다.

# Feature Blocks
Feature Block은 순차적으로 적용된 FC (Fully-Connected)(or Dense) layer와 Batch Normalization (BN)으로 구성됩니다. 또한, Feature Transformers의 경우 output이 GLU activation layer를 통과합니다.

![image.png](attachment:image.png)

GLU(sigmoid gate와 반대)의 주요 기능은 hidden unit이 모델에 더 깊이 전달되도록 하고, gradient exploding이나 vanishing을 방지하는 것입니다.

In [None]:
def glu(x, n_units=None):
    """Generalized linear unit nonlinear activation."""
    return x[:, :n_units] * tf.nn.sigmoid(x[:, n_units:])

또한, 원본 논문은 training 중 convergence speed를 향상시키기 위해 Ghost Batch Normalization을 사용합니다. 관심이 있는 경우, [여기](https://github.com/ostamand/tensorflow-tabnet/blob/master/tabnet/models/gbn.py)에서 Tensorflow implementation을 찾을 수 있지만, 이 tutorial에서는 default Batch Normalization layer를 사용합니다.

In [None]:
class FeatureBlock(tf.keras.Model):
    """
    Implementation of a FL->BN->GLU block
    """
    def __init__(
        self,
        feature_dim,
        apply_glu = True,
        bn_momentum = 0.9,
        fc = None,
        epsilon = 1e-5,
    ):
        super(FeatureBlock, self).__init__()
        self.apply_gpu = apply_glu
        self.feature_dim = feature_dim
        units = feature_dim * 2 if apply_glu else feature_dim # desired dimension gets multiplied by 2
                                                              # because GLU activation halves it

        self.fc = tf.keras.layers.Dense(units, use_bias=False) if fc is None else fc # shared layers can get re-used
        self.bn = tf.keras.layers.BatchNormalization(momentum=bn_momentum, epsilon=epsilon)

    def call(self, x, training = None):
        x = self.fc(x) # inputs passes through the FC layer
        x = self.bn(x, training=training) # FC layer output gets passed through the BN
        if self.apply_gpu: 
            return glu(x, self.feature_dim) # GLU activation applied to BN output
        return x

# Feature Transformers
FeatureTransformer(FT)는 기본적으로 순차적으로 적용되는 feature blocks의 collection입니다. 논문에서 하나의 FeatureTransformer는 2개의 shared blocks(즉, weight은 steps에서 재사용)과 2개의 step dependent blocks으로 구성됩니다. shared weights는 모델의 parameters를 줄이고 더 나은 일반화로 이어집니다.

![image.png](attachment:image.png)

이전 section의 Feature Block 구현을 고려하여, Feature Transformer를 구축하는 방법은 다음과 같습니다.

In [None]:
class FeatureTransformer(tf.keras.Model):
    def __init__(
        self,
        feature_dim,
        fcs = [],
        n_total = 4,
        n_shared = 2,
        bn_momentum = 0.9,
    ):
        super(FeatureTransformer, self).__init__()
        self.n_total, self.n_shared = n_total, n_shared

        kwrgs = {
            "feature_dim": feature_dim,
            "bn_momentum": bn_momentum,
        }

        # build blocks
        self.blocks = []
        for n in range(n_total):
            # some shared blocks
            if fcs and n < len(fcs):
                self.blocks.append(FeatureBlock(**kwrgs, fc=fcs[n])) # Building shared blocks by providing FC layers
            # build new blocks
            else:
                self.blocks.append(FeatureBlock(**kwrgs)) # Step dependent blocks without the shared FC layers

    def call(self, x, training = None):
        # input passes through the first block
        x = self.blocks[0](x, training=training) 
        # for the remaining blocks
        for n in range(1, self.n_total):
            # output from previous block gets multiplied by sqrt(0.5) and output of this block gets added
            x = x * tf.sqrt(0.5) + self.blocks[n](x, training=training) 
        return x

    @property
    def shared_fcs(self):
        return [self.blocks[i].fc for i in range(self.n_shared)]

# Attentive Transformer
AT(Attentive Transformer)는 각 step에서 feature selection을 담당합니다. feture selection은 prior scale을 고려하면서 (GLU 대신) sparsemax activation를 적용하여 수행됩니다. prior scale를 사용하면 모델에서 feature를 선택할 수 있는 빈도를 제어할 수 있으며, 이전 단계에서 사용된 빈도에 따라 제어할 수 있습니다.

![image.png](attachment:image.png)

이전 Attention Transformer는 이전 step에서 사용된 feature에 대한 정보를 따라 prior scale로 전달됩니다. Feature Transformer와 유사하게, Attentive Transformer는 나중에 더 큰 아키텍처에 통합될 TensorFlow 모델로 구현될 수 있습니다.

In [None]:
class AttentiveTransformer(tf.keras.Model):
    def __init__(self, feature_dim):
        super(AttentiveTransformer, self).__init__()
        self.block = FeatureBlock(
            feature_dim,
            apply_glu=False, # sparsemax instead of glu
        )

    def call(self, x, prior_scales, training=None):
        # Pass input trhough a FC-BN block
        x = self.block(x, training=training)
        # Pass the output through sparsemax activation
        return sparsemax(x * prior_scales)

Feature와 Attentive Transformer block은 parameter가 상당히 무거울 수 있으므로 TabNet은 몇 가지 메커니즘을 사용하여 복잡성을 제어하고 overfitting을 방지합니다.

# Regularisation

## Prior Scales Calculation
Prior scales(P)를 사용하면 모델에서 feature를 선택할 수 있는 빈도를 제어할 수 있습니다. prior scale(P)은 이전 Attentive Transformer activation 및 relaxation factor($γ$) parameter를 사용하여 계산됩니다. 다음은 논문에 제시된 공식입니다.

![image.png](attachment:image.png)

이 방정식은 prior scales가 어떻게 업데이트되는지 보여줍니다. 업데이트는 현재 step $i$까지의 모든 단계에 대한 product입니다. 직관적으로, 이전 steo에서 feature가 사용된 경우, 모델은 overfitting을 줄이기 위해 나머지 feature에 더 많은 주의를 기울입니다.

예를 들어, $γ$=1일 때 multiplicative activations(예: 0.9)가 있는 feature는 작은 prior scales(1–0.9=0.1)를 갖습니다.

## Sparsity regularisation
loss에 대한 sparsity regularization은 attention mask가 sparse하도록 장려하기 위해 사용됩니다. hyperparameter $λ$에 의해 스케일링된 entropy of activations는 전체 모델 loss에 추가됩니다.

In [None]:
def sparse_loss(at_mask):
    loss = tf.reduce_mean(
        tf.reduce_sum(tf.multiply(-at_mask, tf.math.log(at_mask + 1e-15)),
                      axis=1)
    )
    
    return loss

not_sparse_mask = np.array([[0.4, 0.5, 0.05, 0.05],
                      [0.2, 0.2, 0.5, 0.1]])

sparse_mask = np.array([[0.0, 0.0, 0.7, 0.3],
                      [0.0, 0.0, 1, 0.0]])

print('Loss for non-sparse attention mask:', sparse_loss(not_sparse_mask).numpy())
print('Loss for sparse attention mask:', sparse_loss(sparse_mask).numpy())

# Loss for non-sparse attention mask: 1.1166351874690217
# Loss for sparse attention mask: 0.3054321510274452

다음으로 이러한 구성 요소를 사용하여 TabNet 모델을 구축하는 방법을 알아보겠습니다.

# TabNet Architecture

## Putting It All Together

TabNet의 기본 아이디어는 Feature 및 Attentive Transformers 구성 요소가 순차적으로 적용되어 모델이 decision tree를 만드는 과정을 모방할 수 있다는 것입니다. Attentive Transformer는 feature selection을 수행하고 Feature Transformer는 모델이 데이터의 복잡한 패턴을 학습할 수 있도록 하는 transformations를 수행합니다. 아래에서 2-step TabNet 모델에 대한 data flow를 요약한 다이어그램을 볼 수 있습니다.

![image.png](attachment:image.png)

우선 초기 input features를 Feature Transformer를 통해 전달하여 초기 feature representations를 얻습니다. 이 Feature Transformer의 output은 다음 단계로 전달할 features의 subset을 선택하는 Attentive Transformer에 대한 input으로 사용됩니다. 이 프로세스는 필요한 단계 수만큼 반복됩니다. (이 [code snippet](https://gist.github.com/aruberts/5e97edb8e8d1820db70b427c7ee74995)에서 위에 정의된 클래스를 사용하여 TensorFlow 구현을 볼 수 있습니다)

모델은 각 decision step의 Feature Transformer output을 사용하여 최종 예측을 생성합니다. 또한 각 step에서 attention mask를 집계하여 예측에 사용된 feature를 이해할 수 있습니다. 이러한 mask는 global importances뿐만 아니라 local feature importances를 얻는 데 사용할 수 있습니다.

<br/>

이제 실습을 시작해봅시다.

# 실습: Fraud Detectioin

## imports

In [6]:
from tqdm import tqdm
import pandas as pd
import numpy as np
import tensorflow as tf
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler, OneHotEncoder
from category_encoders.woe import WOEEncoder
from sklearn.compose import ColumnTransformer
from sklearn.metrics import average_precision_score, roc_auc_score, confusion_matrix
from tensorflow_addons.activations import sparsemax
from scipy.special import softmax
import matplotlib.pyplot as plt
import seaborn as sns

## Data

([데이터 경로](https://www.kaggle.com/competitions/ieee-fraud-detection/data))

데이터가 2개로 나뉘어 있기 때문에, merge가 필요합니다

In [13]:
train_transactions = pd.read_csv('train_transaction.csv')
train_identity = pd.read_csv('train_identity.csv')

# merge two datasets
train = pd.merge(train_transactions, train_identity, on='TransactionID', how='left')
train['isFraud'].value_counts(normalize=True)

0    0.98441
1    0.01559
Name: isFraud, dtype: float64

In [14]:
test_transactions = pd.read_csv('test_transaction.csv')
test_identity = pd.read_csv('test_identity.csv')

# merge two datasets
test = pd.merge(test_transactions, test_identity, on='TransactionID', how='left')
test.columns = [c.replace('-', '_') for c in test.columns]

## Feature Engineering
feature engineering은 fraud detection 영역에서 가장 중요한 단계입니다. 그러나 이 프로젝트의 주요 목표가 아니므로 이 단계를 건너뛰고 독자에게 맡기겠습니다.

In [15]:
# Make an hour feature from datetime stamp 
def make_hour_feature(f):
    #Creates an hour of the day feature, encoded as 0-23.  
    hours = f / (3600)        
    encoded_hours = np.floor(hours) % 24
    return encoded_hours

train['hour'] = make_hour_feature(train['TransactionDT'])
test['hour'] = make_hour_feature(test['TransactionDT'])

## Data Cleaning

- missing values가 많은 칼럼 제거

- median 값을 missing values에 채워넣기

- categorical 칼럼의 missing values에 "missing"이란 값으로 채워 넣기

In [16]:
cat_features = ['ProductCD', 'card1', 'card2', 'card3', 'card4', 'card5', 'card6', 
               'addr1', 'addr2', 'P_emaildomain', 'R_emaildomain', 'M1',
               'M2', 'M3', 'M4', 'M5', 'M6', 'M7', 'M8', 'M9', 'DeviceType', 'DeviceInfo',
               'id_12', 'id_13', 'id_14', 'id_15', 'id_16', 'id_17', 'id_18', 'id_19', 'id_20',
               'id_21', 'id_22', 'id_23', 'id_24', 'id_25', 'id_26', 'id_27', 'id_28', 'id_29', 'id_30',
               'id_31', 'id_32', 'id_33', 'id_34', 'id_35', 'id_36', 'id_37', 'id_38']

exclude = ['TransactionID', 'TransactionDT', 'isFraud']
num_features = [f for f in train.columns if (f not in cat_features) & (f not in exclude)]

# 결측치 90% 이상 칼럼 제거
col_na = train.isnull().sum()
to_drop = col_na[(col_na /  train.shape[0]) > 0.9].index

use_cols = [f for f in train.columns if f not in to_drop]
cat_features = [f for f in cat_features if f not in to_drop]
num_features = [f for f in num_features if f not in to_drop]

train[cat_features] = train[cat_features].astype(str)
train[num_features] = train[num_features].astype(np.float)
train = train[use_cols]

test[cat_features] = test[cat_features].astype(str)
test[num_features] = test[num_features].astype(np.float)
test = test[[f for f in use_cols if f != 'isFraud']]

Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations
Deprecated in NumPy 1.20; for more details and guidance: https://numpy.org/devdocs/release/1.20.0-notes.html#deprecations


In [17]:
from sklearn.impute import SimpleImputer
# median 값 채워 넣기

train[num_features] = SimpleImputer(strategy="median").fit_transform(train[num_features])
train[cat_features] = train[cat_features].replace("nan", "missing")
train.isnull().sum().sum()

0

In [18]:
test[num_features] = SimpleImputer(strategy="median").fit_transform(test[num_features])
test[cat_features] = test[cat_features].replace("nan", "missing")
test.isna().sum().sum()

0

In [19]:
train.shape

(898, 423)

## Train/Val Split
testing은 미래 시점에 수행되므로 validationi split은 datetime 칼럼을 사용하여 수행합니다.

In [20]:
train_split = train["TransactionDT"] <= np.quantile(train["TransactionDT"], 0.9)

In [21]:
train_X = train.loc[train_split.values, num_features + cat_features]
train_y = train.loc[train_split.values, "isFraud"]

val_X = train.loc[~train_split.values, num_features + cat_features]
val_y = train.loc[~train_split.values, "isFraud"]

In [22]:
print(len(train_X), 'train examples')
print(len(val_X), 'validation examples')

808 train examples
90 validation examples


## Pre-processing
neural network는 숫자 데이터만 처리할 수 있으므로 input을 전처리해야 합니다. 아래 코드는 매우 단순화한 전처리 파이프라인이며 categoy embeddings와 같은 보다 정교한 방법 바꿀 수 있습니다.

- numeric features 스케일링

- categorical features 인코딩

In [23]:
scaler = StandardScaler()
woe = WOEEncoder()
column_trans = ColumnTransformer(
    [("scaler", scaler, num_features),
     ("woe", woe, cat_features)], remainder="passthrough", n_jobs=-1
)

train_X_transformed = column_trans.fit_transform(train_X, train_y)
val_X_transformed = column_trans.transform(val_X)

In [24]:
test_X_transformed = column_trans.transform(test[num_features + cat_features])
print(train_X_transformed.shape, val_X_transformed.shape, test_X_transformed.shape)

(808, 420) (90, 420) (1834, 420)


In [25]:
train_X_transformed = pd.DataFrame(train_X_transformed, columns=[num_features + cat_features])
val_X_transformed = pd.DataFrame(val_X_transformed, columns=[num_features + cat_features])
test_X_trinsformed = pd.DataFrame(test_X_transformed, columns=[num_features + cat_features])

## TF Data
training과 inference를 더 빠르게 하려면 데이터를 TF Data object로 변환해야 합니다.

In [26]:
def prepare_tf_dataset(X, batch_size, y=None, shuffle=False, drop_remainder=False):
    size_of_dataset = len(X)
    if y is not None:
        y = tf.one_hot(y.astype(int), 2)
        ds = tf.data.Dataset.from_tensor_slices((np.array(X.astype(np.float32)), y))
    else:
        ds = tf.data.Dataset.from_tensor_slices(np.array(X.astype(np.float32)))
    if shuffle:
        ds = ds.shuffle(buffer_size=size_of_dataset)
    ds = ds.batch(batch_size, drop_remainder=drop_remainder)
    
    autotune = tf.data.experimental.AUTOTUNE
    ds = ds.prefetch(autotune)
    return ds

train_ds = prepare_tf_dataset(train_X_transformed, 16384, train_y)
val_ds = prepare_tf_dataset(val_X_transformed, 16384, val_y)
test_ds = prepare_tf_dataset(test_X_transformed, 16384)

## Model

### TabNet

위에서 언급한 클래스를 모두 정의

In [27]:
def glu(x, n_units=None):
    """Generalized linear unit nonlinear activation."""
    return x[:, :n_units] * tf.nn.sigmoid(x[:, n_units:])

In [28]:
class FeatureBlock(tf.keras.Model):
    """
    Implementation of a FL->BN->GLU block
    """
    def __init__(
        self,
        feature_dim,
        apply_glu = True,
        bn_momentum = 0.9,
        fc = None,
        epsilon = 1e-5,
    ):
        super(FeatureBlock, self).__init__()
        self.apply_gpu = apply_glu
        self.feature_dim = feature_dim
        units = feature_dim * 2 if apply_glu else feature_dim # desired dimension gets multiplied by 2
                                                              # because GLU activation halves it

        self.fc = tf.keras.layers.Dense(units, use_bias=False) if fc is None else fc # shared layers can get re-used
        self.bn = tf.keras.layers.BatchNormalization(momentum=bn_momentum, epsilon=epsilon)

    def call(self, x, training = None):
        x = self.fc(x) # inputs passes through the FC layer
        x = self.bn(x, training=training) # FC layer output gets passed through the BN
        if self.apply_gpu: 
            return glu(x, self.feature_dim) # GLU activation applied to BN output
        return x

    
class FeatureTransformer(tf.keras.Model):
    def __init__(
        self,
        feature_dim,
        fcs = [],
        n_total = 4,
        n_shared = 2,
        bn_momentum = 0.9,
    ):
        super(FeatureTransformer, self).__init__()
        self.n_total, self.n_shared = n_total, n_shared

        kwrgs = {
            "feature_dim": feature_dim,
            "bn_momentum": bn_momentum,
        }

        # build blocks
        self.blocks = []
        for n in range(n_total):
            # some shared blocks
            if fcs and n < len(fcs):
                self.blocks.append(FeatureBlock(**kwrgs, fc=fcs[n])) # Building shared blocks by providing FC layers
            # build new blocks
            else:
                self.blocks.append(FeatureBlock(**kwrgs)) # Step dependent blocks without the shared FC layers

    def call(self, x, training = None):
        # input passes through the first block
        x = self.blocks[0](x, training=training) 
        # for the remaining blocks
        for n in range(1, self.n_total):
            # output from previous block gets multiplied by sqrt(0.5) and output of this block gets added
            x = x * tf.sqrt(0.5) + self.blocks[n](x, training=training) 
        return x

    @property
    def shared_fcs(self):
        return [self.blocks[i].fc for i in range(self.n_shared)]
    
class AttentiveTransformer(tf.keras.Model):
    def __init__(self, feature_dim):
        super(AttentiveTransformer, self).__init__()
        self.block = FeatureBlock(
            feature_dim,
            apply_glu=False,
        )

    def call(self, x, prior_scales, training=None):
        x = self.block(x, training=training)
        return sparsemax(x * prior_scales)
    
class TabNet(tf.keras.Model):
    def __init__(
        self,
        num_features,
        feature_dim,
        output_dim,
        n_step = 2,
        n_total = 4,
        n_shared = 2,
        relaxation_factor = 1.5,
        bn_epsilon = 1e-5,
        bn_momentum = 0.7,
        sparsity_coefficient = 1e-5
    ):
        super(TabNet, self).__init__()
        self.output_dim, self.num_features = output_dim, num_features
        self.n_step, self.relaxation_factor = n_step, relaxation_factor
        self.sparsity_coefficient = sparsity_coefficient

        self.bn = tf.keras.layers.BatchNormalization(
            momentum=bn_momentum, epsilon=bn_epsilon
        )

        kargs = {
            "feature_dim": feature_dim + output_dim,
            "n_total": n_total,
            "n_shared": n_shared,
            "bn_momentum": bn_momentum
        }

        # first feature transformer block is built first to get the shared blocks
        self.feature_transforms = [FeatureTransformer(**kargs)]
        self.attentive_transforms = []
            
        # each step consists out of FT and AT
        for i in range(n_step):
            self.feature_transforms.append(
                FeatureTransformer(**kargs, fcs=self.feature_transforms[0].shared_fcs)
            )
            self.attentive_transforms.append(
                AttentiveTransformer(num_features)
            )
        
        # Final output layer
        self.head = tf.keras.layers.Dense(2, activation="softmax", use_bias=False)

    def call(self, features, training = None):

        bs = tf.shape(features)[0] # get batch shape
        out_agg = tf.zeros((bs, self.output_dim)) # empty array with outputs to fill
        prior_scales = tf.ones((bs, self.num_features)) # prior scales initialised as 1s
        importance = tf.zeros([bs, self.num_features]) # importances
        masks = []

        features = self.bn(features, training=training) # Batch Normalisation
        masked_features = features

        total_entropy = 0.0

        for step_i in range(self.n_step + 1):
            # (masked) features go through the FT
            x = self.feature_transforms[step_i](
                masked_features, training=training
            )
            
            # first FT is not used to generate output
            if step_i > 0:
                # first half of the FT output goes towards the decision 
                out = tf.keras.activations.relu(x[:, : self.output_dim])
                out_agg += out
                scale_agg = tf.reduce_sum(out, axis=1, keepdims=True) / (self.n_step - 1)
                importance += mask_values * scale_agg
                

            # no need to build the features mask for the last step
            if step_i < self.n_step:
                # second half of the FT output goes as input to the AT
                x_for_mask = x[:, self.output_dim :]
                
                # apply AT with prior scales
                mask_values = self.attentive_transforms[step_i](
                    x_for_mask, prior_scales, training=training
                )

                # recalculate the prior scales
                prior_scales *= self.relaxation_factor - mask_values
                
                # multiply the second half of the FT output by the attention mask to enforce sparsity
                masked_features = tf.multiply(mask_values, features)

                # entropy is used to penalize the amount of sparsity in feature selection
                total_entropy += tf.reduce_mean(
                    tf.reduce_sum(
                        tf.multiply(-mask_values, tf.math.log(mask_values + 1e-15)),
                        axis=1,
                    )
                )
                
                # append mask values for later explainability
                masks.append(tf.expand_dims(tf.expand_dims(mask_values, 0), 3))
                
        #Per step selection masks        
        self.selection_masks = masks
        
        # Final output
        final_output = self.head(out)
        
        # Add sparsity loss
        loss = total_entropy / (self.n_step-1)
        self.add_loss(self.sparsity_coefficient * loss)
        
        return final_output, importance

## HP Tuning
메모리가 부족하면 pass

- Feature Dimension - between 32 and 512

- Number of steps - from 2 to 9

- Relaxation factor - from 1 to 3

- Sparsity Coefficiet - from 0 to 0.1

- Batch Momentum - from 0.9 to 0.9999

- Class weight - from 1 to 10

In [31]:
import optuna
from optuna import Trial, visualization

def Objective(trial):
    feature_dim = trial.suggest_categorical("feature_dim", [32, 64, 128, 256, 512])
    n_step = trial.suggest_int("n_step", 2, 9, step=1)
    n_shared = trial.suggest_int("n_shared", 0, 4, step=1)
    relaxation_factor = trial.suggest_float("relaxation_factor", 1., 3., step=0.1)
    sparsity_coefficient = trial.suggest_float("sparsity_coefficient", 0.00000001, 0.1, log=True)
    bn_momentum = trial.suggest_float("bn_momentum", 0.9, 0.9999)
    tabnet_params = dict(num_features=train_X_transformed.shape[1],
                         output_dim=feature_dim,
                         feature_dim=feature_dim,
                         n_step=n_step, 
                         relaxation_factor=relaxation_factor,
                         sparsity_coefficient=sparsity_coefficient,
                         n_shared = n_shared,
                         bn_momentum = bn_momentum)
    
    
    cbs = [tf.keras.callbacks.EarlyStopping(
        monitor="val_loss", patience=5, restore_best_weights=True
    )]
    
    tn = TabNet(**tabnet_params)
    optimizer = tf.keras.optimizers.Adam(learning_rate=0.001, clipnorm=10)
    loss = [tf.keras.losses.CategoricalCrossentropy(from_logits=False)]
    
    tn.compile(optimizer, loss=loss)
    tn.fit(train_ds, epochs=100, validation_data=val_ds, callbacks=cbs, verbose=1)
    
    val_preds, _ =  tn.predict(val_ds)
    pr_auc = average_precision_score(val_y, val_preds[:,1])
    
    return pr_auc

study = optuna.create_study(direction="maximize", study_name='TabNet optimization')
study.optimize(Objective, n_jobs=-1, n_trials=100, gc_after_trial=True, show_progress_bar=False)

[32m[I 2022-10-28 03:54:38,368][0m A new study created in memory with name: TabNet optimization[0m


Epoch 1/100
Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100


[32m[I 2022-10-28 03:55:25,976][0m Trial 0 finished with value: 0.1 and parameters: {'feature_dim': 512, 'n_step': 2, 'n_shared': 0, 'relaxation_factor': 1.4, 'sparsity_coefficient': 0.0072571939788118765, 'bn_momentum': 0.9283788473853494}. Best is trial 0 with value: 0.1.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78

[32m[I 2022-10-28 03:56:22,658][0m Trial 1 finished with value: 0.016129032258064516 and parameters: {'feature_dim': 128, 'n_step': 8, 'n_shared': 2, 'relaxation_factor': 2.9000000000000004, 'sparsity_coefficient': 0.004390369257508038, 'bn_momentum': 0.9487575515274111}. Best is trial 0 with value: 0.1.[0m


Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 81/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 82/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 84/100
Epoc

[32m[I 2022-10-28 03:57:01,075][0m Trial 2 finished with value: 0.013888888888888888 and parameters: {'feature_dim': 64, 'n_step': 9, 'n_shared': 1, 'relaxation_factor': 2.7, 'sparsity_coefficient': 0.0001388852628966294, 'bn_momentum': 0.9380306946377736}. Best is trial 0 with value: 0.1.[0m


Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100


[32m[I 2022-10-28 03:57:12,148][0m Trial 3 finished with value: 0.1 and parameters: {'feature_dim': 128, 'n_step': 4, 'n_shared': 2, 'relaxation_factor': 2.7, 'sparsity_coefficient': 8.376461623297162e-06, 'bn_momentum': 0.9262092665048373}. Best is trial 0 with value: 0.1.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 2/100
Epoch 3/100
Epoch 18/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 20/100
Epoch 8/100
Epoch 21/100
Epoch 9/100
Epoch 22/100
Epoch 23/100
Epoch 10/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 12/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 19/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 21/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 23/100
Epoch 46/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 49/100
Epoch 27/100
Epoch 51/100
Epoch 28/100
Epoch 52/100
Epoch 53/100
Epoch 29/100
Epoch 54/100
Epoch 55/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 57/100
Epoch 58/100
Epoch 34/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Ep





[32m[I 2022-10-28 03:57:45,679][0m Trial 5 finished with value: 0.03333333333333333 and parameters: {'feature_dim': 512, 'n_step': 2, 'n_shared': 4, 'relaxation_factor': 1.4, 'sparsity_coefficient': 1.20933305460041e-05, 'bn_momentum': 0.9227022894136431}. Best is trial 0 with value: 0.1.[0m


Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100






[32m[I 2022-10-28 03:58:01,871][0m Trial 4 finished with value: 0.1111111111111111 and parameters: {'feature_dim': 64, 'n_step': 5, 'n_shared': 1, 'relaxation_factor': 1.3, 'sparsity_coefficient': 0.00013712780924840138, 'bn_momentum': 0.9109992327159016}. Best is trial 4 with value: 0.1111111111111111.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 2/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 3/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 5/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 6/100
Epoch 63/100
Epoch 7/100
Epoch 64/100
Epoch 66/100
Epoch 8/100
Epoch 67/100
Epoch 68/100
Epoch 9/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 10/100
Epoch 72/100
Epoch 73/100
Ep

[32m[I 2022-10-28 03:58:32,741][0m Trial 7 finished with value: 0.012195121951219513 and parameters: {'feature_dim': 32, 'n_step': 2, 'n_shared': 2, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 1.2952477100350326e-06, 'bn_momentum': 0.9544608322993061}. Best is trial 4 with value: 0.1111111111111111.[0m


Epoch 30/100
Epoch 1/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 3/100
Epoch 76/100
Epoch 4/100
Epoch 5/100
Epoch 77/100
Epoch 78/100
Epoch 6/100
Epoch 79/100
Epoch 8/100
Epoch 80/100
Epoch 9/100
Epoch 81/100
Epoch 10/100
Epoch 82/100
Epoch 11/100
Epoch 12/100
Epoch 83/100
Epoch 13/100
Epoch 84/100
Epoch 14/100
Epoch 85/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 87/100
Epoch 88/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 

[32m[I 2022-10-28 03:59:01,635][0m Trial 6 finished with value: 0.05555555555555555 and parameters: {'feature_dim': 64, 'n_step': 6, 'n_shared': 0, 'relaxation_factor': 1.1, 'sparsity_coefficient': 5.5306566116128045e-05, 'bn_momentum': 0.9530038069091721}. Best is trial 4 with value: 0.1111111111111111.[0m


Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 03:59:24,196][0m Trial 8 finished with value: 0.05263157894736842 and parameters: {'feature_dim': 32, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.8, 'sparsity_coefficient': 1.6112957055175355e-08, 'bn_momentum': 0.9841503726761143}. Best is trial 4 with value: 0.1111111111111111.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 2/100
Epoch 27/100
Epoch 3/100
Epoch 28/100
Epoch 4/100
Epoch 29/100
Epoch 5/100
Epoch 30/100
Epoch 6/100
Epoch 7/100
Epoch 31/100
Epoch 8/100
Epoch 9/100
Epoch 32/100
Epoch 10/100
Epoch 33/100
Epoch 11/100
Epoch 12/100
Epoch 35/100
Epoch 13/100
Epoch 14/100
Epoch 36/100
Epoch 16/100
Epoch 37/100
Epoch 17/100
Epoch 38/100
Epoch 39/100
Epoch 19/100
Epoch 40/100
Epoch 20/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 31/100
Epoch 49/100
Epoch 32/100
Epoch 50/100
Epoch 33/100
Epoch 51/100
Epoch 34/100
Epoch 52/100
Epoch 35/100
Epoch 53/100
Epoch 54/100
Epoch 37/100
Epoch 55/100
Epo

[32m[I 2022-10-28 03:59:58,324][0m Trial 10 finished with value: 0.5 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 2.2, 'sparsity_coefficient': 1.3396191306549097e-05, 'bn_momentum': 0.9170835757295724}. Best is trial 10 with value: 0.5.[0m


Epoch 93/100
Epoch 94/100
Epoch 1/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:00:04,790][0m Trial 9 finished with value: 1.0 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 3, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 0.0003182140913822515, 'bn_momentum': 0.9497510368233014}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 6/100
Epoch 8/100
Epoch 7/100
Epoch 9/100
Epoch 8/100
Epoch 11/100
Epoch 9/100
Epoch 12/100
Epoch 13/100
Epoch 10/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 12/100
Epoch 18/100
Epoch 13/100
Epoch 19/100
Epoch 20/100
Epoch 14/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 16/100
Epoch 26/100
Epoch 17/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 19/100
Epoch 20/100
Epoch 31/100
Epoch 21/100
Epoch 32/100
Epoch 33/100
Epoch 35/100
Epoch 36/100
Epoch 24/100
Epoch 37/100
Epoch 25/100
Epoch 38/100
Epoch 40/100
Epoch 28/100
Epoch 41/100
Epoch 42/100
Epoch 29/100
Epoch 43/100
Epoch 30/100
Epoch 31/100
Epoch 45/100
Epoch 47/100
Epoch 33/100
Epoch 34/100
Epoch 48/100
Epoch 35/100
Epoch 50/100
Epoch 36/100
Epoch 51/100
Epoch 37/100
Epoch 38/100
Epoch 52/100
Epoch 39/100
Epoch 53/100
Epoch 40/100
Epoch 54/100
Epoch 55/100
Epoch 41/100
Epoch 42/100
Epoch 57/100


[32m[I 2022-10-28 04:01:12,947][0m Trial 12 finished with value: 0.02040816326530612 and parameters: {'feature_dim': 256, 'n_step': 7, 'n_shared': 4, 'relaxation_factor': 2.0, 'sparsity_coefficient': 0.08797868663193417, 'bn_momentum': 0.9014487103657095}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100


[32m[I 2022-10-28 04:01:14,497][0m Trial 11 finished with value: 0.14285714285714285 and parameters: {'feature_dim': 256, 'n_step': 7, 'n_shared': 0, 'relaxation_factor': 1.9, 'sparsity_coefficient': 1.1588281060085983e-07, 'bn_momentum': 0.9056341079753147}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 3/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 10/100
Epoch 13/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 17/100
Epoch 17/100
Epoch 19/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 21/100
Epoch 22/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 25/100
Epoch 26/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 28/100
Epoch 29/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 35/100
Epoch 36/100
Epoch 35/100
Epoch 38/100
Epoch 39/100
Epoch 38/100
Epoch 41/100
Epoch 40/100
Epoch 41/100
Epoch 43/100
Epoch 44/100
Epoch 42/100
Epoch 45/100
Epoch 43/100
Epoch 46/100
Epoch 44/100
Epoch 47/100
Epoch 46/100
Epoch 48/100
Epoch 50/100
Epoch 51/100
Epoch 50/100
Epoch 52/100
Epoch 53/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 54/100
Epoch 55/100
Epoch 55/100
Epoch 56/100
Epoch 56/100
Epoch 57/100
Epoch 58/1

[32m[I 2022-10-28 04:02:09,751][0m Trial 14 finished with value: 0.14285714285714285 and parameters: {'feature_dim': 32, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 0.0024747092055193757, 'bn_momentum': 0.974967288096383}. Best is trial 9 with value: 1.0.[0m




[32m[I 2022-10-28 04:02:10,072][0m Trial 13 finished with value: 0.03225806451612903 and parameters: {'feature_dim': 256, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.1, 'sparsity_coefficient': 7.142865950024566e-07, 'bn_momentum': 0.9780677799337008}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 6/100
Epoch 7/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 10/100
Epoch 11/100
Epoch 13/100
Epoch 12/100
Epoch 14/100
Epoch 13/100
Epoch 15/100
Epoch 14/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 16/100
Epoch 17/100
Epoch 19/100
Epoch 18/100
Epoch 20/100
Epoch 19/100
Epoch 21/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 28/100
Epoch 30/100
Epoch 29/100
Epoch 31/100
Epoch 30/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 32/100
Epoch 35/100
Epoch 33/100
Epoch 36/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 37/100
Epoch 38/100
Epoch 38/100
Epoch 39/100
Epoch 39/100
Epoch 40/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 41/100
Epoch 43/100
Epoch 42/100
Epoch 44/100
Epoch 45/100
Epoch 43/100
Epoch 44/100
Epoch 47/100
E

[32m[I 2022-10-28 04:03:10,062][0m Trial 16 finished with value: 0.09090909090909091 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 1, 'relaxation_factor': 1.7000000000000002, 'sparsity_coefficient': 0.0005476258424184111, 'bn_momentum': 0.9989901459571899}. Best is trial 9 with value: 1.0.[0m


Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100


[32m[I 2022-10-28 04:03:23,076][0m Trial 15 finished with value: 1.0 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 1, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.0006969105398008719, 'bn_momentum': 0.964270927501524}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78

[32m[I 2022-10-28 04:03:52,392][0m Trial 17 finished with value: 0.03333333333333333 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 3, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 5.988975233349416e-06, 'bn_momentum': 0.9634223611436797}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 3/100
Epoch 76/100
Epoch 77/100
Epoch 78/

[32m[I 2022-10-28 04:04:37,953][0m Trial 18 finished with value: 0.023255813953488372 and parameters: {'feature_dim': 256, 'n_step': 7, 'n_shared': 3, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 0.07898252596744143, 'bn_momentum': 0.9657998298695998}. Best is trial 9 with value: 1.0.[0m


Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 2/100
Epoch 3/100
Epoch 81/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 83/100
Epoch 7/100
Epoch 84/100
Epoch 85/100
Epoch 9/100
Epoch 86/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 88/100
Epoch 89/100
Epoch 15/100
Epoch 91/100
Epoch 16/100
Epoch 92/100
Epoch 17/100
Epoch 

[32m[I 2022-10-28 04:05:09,393][0m Trial 19 finished with value: 0.3333333333333333 and parameters: {'feature_dim': 256, 'n_step': 7, 'n_shared': 1, 'relaxation_factor': 1.8, 'sparsity_coefficient': 0.04314329396924392, 'bn_momentum': 0.9670172930997484}. Best is trial 9 with value: 1.0.[0m


Epoch 24/100
Epoch 1/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100


[32m[I 2022-10-28 04:05:25,785][0m Trial 20 finished with value: 0.25 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 1, 'relaxation_factor': 3.0, 'sparsity_coefficient': 0.0007197172439479281, 'bn_momentum': 0.943503526611746}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 2/100
Epoch 39/100
Epoch 40/100
Epoch 3/100
Epoch 41/100
Epoch 42/100
Epoch 5/100
Epoch 6/100
Epoch 44/100
Epoch 7/100
Epoch 45/100
Epoch 8/100
Epoch 46/100
Epoch 9/100
Epoch 10/100
Epoch 49/100
Epoch 11/100
Epoch 12/100
Epoch 50/100
Epoch 51/100
Epoch 13/100
Epoch 14/100
Epoch 52/100
Epoch 53/100
Epoch 15/100
Epoch 16/100
Epoch 54/100
Epoch 17/100
Epoch 55/100
Epoch 18/100
Epoch 57/100
Epoch 19/100
Epoch 20/100
Epoch 59/100
Epoch 60/100
Epoch 21/100
Epoch 61/100
Epoch 23/100
Epoch 64/100
Epoch 24/100
Ep

[32m[I 2022-10-28 04:06:07,427][0m Trial 21 finished with value: 0.043478260869565216 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 4, 'relaxation_factor': 3.0, 'sparsity_coefficient': 0.0008897387443439486, 'bn_momentum': 0.9407586161338476}. Best is trial 9 with value: 1.0.[0m


Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 1/100


[32m[I 2022-10-28 04:06:12,932][0m Trial 22 finished with value: 0.07692307692307693 and parameters: {'feature_dim': 32, 'n_step': 6, 'n_shared': 0, 'relaxation_factor': 2.2, 'sparsity_coefficient': 0.000463665967757629, 'bn_momentum': 0.9374389322891754}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 2/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 3/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 5/100
Epoch 30/100
Epoch 31/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100


[32m[I 2022-10-28 04:06:48,038][0m Trial 24 finished with value: 0.06666666666666667 and parameters: {'feature_dim': 512, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 2.6, 'sparsity_coefficient': 4.130547677311622e-05, 'bn_momentum': 0.9190223934912535}. Best is trial 9 with value: 1.0.[0m


Epoch 11/100
Epoch 12/100
Epoch 1/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100


[32m[I 2022-10-28 04:07:22,547][0m Trial 23 finished with value: 0.25 and parameters: {'feature_dim': 32, 'n_step': 6, 'n_shared': 0, 'relaxation_factor': 2.1, 'sparsity_coefficient': 5.9463880860714254e-05, 'bn_momentum': 0.9157997310757688}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 2/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 4/100
Epoch 68/100
Epoch 69/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 7/100
Epoch 8/100
Epoch 74/100
Epoch 75/100
Epoch 9/100


[32m[I 2022-10-28 04:08:15,397][0m Trial 25 finished with value: 0.047619047619047616 and parameters: {'feature_dim': 128, 'n_step': 8, 'n_shared': 1, 'relaxation_factor': 2.2, 'sparsity_coefficient': 1.4929655140575546e-06, 'bn_momentum': 0.9881547661463858}. Best is trial 9 with value: 1.0.[0m


Epoch 46/100
Epoch 47/100
Epoch 1/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 3/100
Epoch 4/100
Epoch 78/100
Epoch 79/100
Epoch 7/100
Epoch 80/100
Epoch 8/100
Epoch 10/100
Epoch 11/100
Epoch 81/100
Epoch 12/100
Epoch 13/100
Epoch 82/100
Epoch 15/100
Epoch 83/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 86/100
Epoch 21/100
Epoch 87/100
Epoch 88/100
Epoch 24/100
Epoch 89/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 91/100
Epoch 28/100
Epoch 30/100
Epoch 92/100
Epoch 93/100
Epoch 31/100
Epoch 94/100
Epoch 32/100
Epoch 95/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 98/100
Epoc

[32m[I 2022-10-28 04:08:43,210][0m Trial 26 finished with value: 0.011235955056179775 and parameters: {'feature_dim': 128, 'n_step': 8, 'n_shared': 1, 'relaxation_factor': 2.2, 'sparsity_coefficient': 1.9321062314538313e-06, 'bn_momentum': 0.9909194985124697}. Best is trial 9 with value: 1.0.[0m


Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100


[32m[I 2022-10-28 04:08:59,515][0m Trial 27 finished with value: 1.0 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 2, 'relaxation_factor': 1.7000000000000002, 'sparsity_coefficient': 0.017300412754893708, 'bn_momentum': 0.9609401679600647}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79

[32m[I 2022-10-28 04:09:24,805][0m Trial 28 finished with value: 0.16666666666666666 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 2, 'relaxation_factor': 2.5, 'sparsity_coefficient': 0.00019628732891701535, 'bn_momentum': 0.9590069037137885}. Best is trial 9 with value: 1.0.[0m


Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 2/100
Epoch 73/100
Epoch 74/100
Epoch 4/100
Epoch 75/100
Epoch 5/100
Epoch 6/100
Epoch 76/100
Epoch 7/100
Epoch 77/100
Epoch 8/100
Epoch 78/100
Epoch 9/

[32m[I 2022-10-28 04:10:00,040][0m Trial 29 finished with value: 1.0 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 2, 'relaxation_factor': 1.6, 'sparsity_coefficient': 0.018050384365139744, 'bn_momentum': 0.9592104224233506}. Best is trial 9 with value: 1.0.[0m


Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 2/100
Epoch 98/100
Epoch 3/100
Epoch 99/100
Epoch 100/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100


[32m[I 2022-10-28 04:10:27,684][0m Trial 30 finished with value: 0.07142857142857142 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 2, 'relaxation_factor': 1.6, 'sparsity_coefficient': 0.018412906111727707, 'bn_momentum': 0.9758301587323598}. Best is trial 9 with value: 1.0.[0m


Epoch 13/100
Epoch 1/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100


[32m[I 2022-10-28 04:11:01,917][0m Trial 31 finished with value: 0.09090909090909091 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 2, 'relaxation_factor': 1.6, 'sparsity_coefficient': 0.016160917850315284, 'bn_momentum': 0.968996705704612}. Best is trial 9 with value: 1.0.[0m


Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 2/100
Epoch 88/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epo

[32m[I 2022-10-28 04:11:33,720][0m Trial 32 finished with value: 0.2 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 2, 'relaxation_factor': 1.5, 'sparsity_coefficient': 0.012353734826356048, 'bn_momentum': 0.9598655030322786}. Best is trial 9 with value: 1.0.[0m


Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100

[32m[I 2022-10-28 04:12:12,834][0m Trial 33 finished with value: 0.125 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 2, 'relaxation_factor': 1.3, 'sparsity_coefficient': 0.0027647938303586024, 'bn_momentum': 0.9495039673551249}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 2/100
Epoch 50/100
Epoch 3/100
Epoch 51/100
Epoch 52/100
Epoch 6/100
Epoch 7/100
Epoch 54/100
Epoch 55/100
Epoch 9/100
Epoch 56/100
Epoch 10/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 12/100
Epoch 13/100
Epoch 61/100
Epoch 14/100
Epoch 62/100
Epoch 63/100
Epoch 15/100
Epoch 64/100
Epoch 16/100
Epoch 65/100
Epoch 17/100
Epoch 66/100
Epoch 18/100
Epoch 67/100
Epoch 68/100


[32m[I 2022-10-28 04:12:59,325][0m Trial 34 finished with value: 0.25 and parameters: {'feature_dim': 256, 'n_step': 9, 'n_shared': 3, 'relaxation_factor': 1.2, 'sparsity_coefficient': 0.003576562663922608, 'bn_momentum': 0.9495679718298294}. Best is trial 9 with value: 1.0.[0m


Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100


[32m[I 2022-10-28 04:13:13,875][0m Trial 35 finished with value: 0.012658227848101266 and parameters: {'feature_dim': 512, 'n_step': 7, 'n_shared': 3, 'relaxation_factor': 1.8, 'sparsity_coefficient': 0.0018421263573759196, 'bn_momentum': 0.9316697471389155}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 3/100
Epoch 45/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 48/100
Epoch 10/100
Epoch 49/100
Epoch 11/100
Epoch 50/100
Epoch 12/100
Epoch 13/100
Epoch 52/100
Epoch 14/100
Epoch 53/100
Epoch 54/100
Epoch 15/100
Epoch 55/100
Epoch 56/100
Epoch 16/100
Epoch 57/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100


[32m[I 2022-10-28 04:13:43,686][0m Trial 36 finished with value: 0.03333333333333333 and parameters: {'feature_dim': 512, 'n_step': 4, 'n_shared': 2, 'relaxation_factor': 1.9, 'sparsity_coefficient': 0.005900181853728779, 'bn_momentum': 0.9321997628472821}. Best is trial 9 with value: 1.0.[0m


Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 2/100
Epoch 60/100
Epoch 61/100
Epoch 4/100
Epoch 5/100
Epoch 63/100
Epoch 6/100
Epoch 7/100
Epoch 64/100
Epoch 9/100
Epoch 65/100
Epoch 10/100
Epoch 11/100
Epoch 67/100
Epoch 13/100
Epoch 14/100
Epoch 68/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 70/100
Epoch 19/100
Epoch 20/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 23/100
Epoch 24/100
Epoch 75/100
Epoch 25/100
Epoch 26/100
Epoch 76/100
Epoch 27/100
Epoch 77/100
Epoch 28/100
Epoch 78/100
Epoch 29/100
Epoch 79/100
Epoch 30/100
Epoch 31/100
Epoch

[32m[I 2022-10-28 04:14:06,745][0m Trial 37 finished with value: 0.058823529411764705 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 2, 'relaxation_factor': 1.9, 'sparsity_coefficient': 0.009320032915613771, 'bn_momentum': 0.9574678918876702}. Best is trial 9 with value: 1.0.[0m


Epoch 73/100
Epoch 74/100
Epoch 1/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:14:15,573][0m Trial 38 finished with value: 0.5 and parameters: {'feature_dim': 64, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 1.0, 'sparsity_coefficient': 0.0002213123732917068, 'bn_momentum': 0.955778675300332}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100


[32m[I 2022-10-28 04:14:35,770][0m Trial 39 finished with value: 0.125 and parameters: {'feature_dim': 256, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 1.0, 'sparsity_coefficient': 0.00026794163512046924, 'bn_momentum': 0.9444012196463457}. Best is trial 9 with value: 1.0.[0m


Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 8

[32m[I 2022-10-28 04:15:17,238][0m Trial 40 finished with value: 1.0 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 1.4, 'sparsity_coefficient': 0.03757950284574927, 'bn_momentum': 0.945069457379541}. Best is trial 9 with value: 1.0.[0m


Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 69/100
Epoch 6/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 73/100
Epoch 12/100
Epoch 13/100
Epoch 74/100
Epoch 14/100
Epoch 75/100
Epoch 15/100
Epoch 16/100
Epoch 76/100
Epoch 17/100
Epoch 

[32m[I 2022-10-28 04:15:55,389][0m Trial 41 finished with value: 0.03125 and parameters: {'feature_dim': 128, 'n_step': 8, 'n_shared': 2, 'relaxation_factor': 2.6, 'sparsity_coefficient': 0.0013184105653449912, 'bn_momentum': 0.9712582125605377}. Best is trial 9 with value: 1.0.[0m


Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:16:16,377][0m Trial 42 finished with value: 0.09090909090909091 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 1.5, 'sparsity_coefficient': 0.0411157457092957, 'bn_momentum': 0.9456768823733765}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 2/100
Epoch 57/100
Epoch 58/100
Epoch 5/100
Epoch 60/100
Epoch 6/100
Epoch 61/100
Epoch 62/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 64/100
Epoch 10/100
Epoch 66/100
Epoch 11/100
Epoch 67/100
Epoch 13/100
Epoch 70/100
Epoch 14/100
Epoch 71/100
Epoch 15/100
Epoch 16/100
Epoch 72/100
E

[32m[I 2022-10-28 04:16:51,947][0m Trial 43 finished with value: 0.05263157894736842 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 1.5, 'sparsity_coefficient': 0.028923383708785364, 'bn_momentum': 0.9622295977585122}. Best is trial 9 with value: 1.0.[0m


Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100


[32m[I 2022-10-28 04:17:13,977][0m Trial 44 finished with value: 0.25 and parameters: {'feature_dim': 256, 'n_step': 5, 'n_shared': 4, 'relaxation_factor': 1.4, 'sparsity_coefficient': 0.024004409895286342, 'bn_momentum': 0.9630692210334412}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 2/100
Epoch 55/100
Epoch 56/100
Epoch 3/100
Epoch 57/100
Epoch 58/100
Epoch 4/100
Epoch 5/100
Epoch 59/100
Epoch 60/100
Epoch 6/100
Epoch 7/100
Epoch 61/100
Epoch 8/100
Epoch 62/100
Epoch 9/100
Epoch 63/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 66/100
Epoch 67/100
Epoch 14/100
Epoch 68/100
Epoch 15/100
Epo

[32m[I 2022-10-28 04:17:55,121][0m Trial 45 finished with value: 0.04 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 4, 'relaxation_factor': 1.3, 'sparsity_coefficient': 0.00812796456212252, 'bn_momentum': 0.952852861505258}. Best is trial 9 with value: 1.0.[0m


Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 2/100
Epoch 3/100
Epoch 95/100
Epoch 4/100
Epoch 5/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 8/100
Epoch 100/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100

[32m[I 2022-10-28 04:18:18,214][0m Trial 46 finished with value: 0.043478260869565216 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 4, 'relaxation_factor': 1.3, 'sparsity_coefficient': 0.006213479594020745, 'bn_momentum': 0.9504921780251541}. Best is trial 9 with value: 1.0.[0m


Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 2/100
Epoch 66/100
Epoch 4/100
Epoch 5/100
Epoch 69/100
Epoch 70/100
Epoch 7/100
Epoch 71/100
Epoch 8/100
Epoch 72/100
Epoch 9/100
Epoch 73/100
Epoch 10/100
Epoch 74/100
Epoch 11/100
Epoch 75/100
Epoch 12/100
Epoch 13/100
Epoch 77/100
Epoch 78/100
Epoch 15/100
Epoch 79/100
Epoch 16/100
Epoch 17/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 20/100
Epoch 84/100
Epoch

[32m[I 2022-10-28 04:18:43,205][0m Trial 47 finished with value: 0.04 and parameters: {'feature_dim': 256, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 1.7000000000000002, 'sparsity_coefficient': 0.09268119552420263, 'bn_momentum': 0.9361504222784504}. Best is trial 9 with value: 1.0.[0m


Epoch 29/100
Epoch 30/100
Epoch 1/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100


[32m[I 2022-10-28 04:19:07,574][0m Trial 48 finished with value: 0.3333333333333333 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.0001142856870839636, 'bn_momentum': 0.9368077202589293}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 3/100
Epoch 2/100
Epoch 4/100
Epoch 3/100
Epoch 5/100
Epoch 4/100
Epoch 6/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 12/100
Epoch 16/100
Epoch 13/100
Epoch 17/100
Epoch 18/100
Epoch 14/100
Epoch 19/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 21/100
Epoch 18/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 22/100
Epoch 26/100
Epoch 23/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 25/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 27/100
Epoch 35/100
Epoch 28/100
Epoch 36/100
Epoch 29/100
Epoch 30/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 32/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 48/100
Epoch 50/100
Epoch 35/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 37/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 39/100
Epoch 58/100
Epoch 40/100
E

[32m[I 2022-10-28 04:19:44,660][0m Trial 50 finished with value: 0.16666666666666666 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 2, 'relaxation_factor': 2.0, 'sparsity_coefficient': 0.048013501604511746, 'bn_momentum': 0.9805721872556792}. Best is trial 9 with value: 1.0.[0m


Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:20:05,345][0m Trial 49 finished with value: 0.017857142857142856 and parameters: {'feature_dim': 64, 'n_step': 7, 'n_shared': 2, 'relaxation_factor': 2.0, 'sparsity_coefficient': 9.078886847707381e-05, 'bn_momentum': 0.9721382281424404}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 2/100
Epoch 13/100
Epoch 14/100
Epoch 3/100
Epoch 15/100
Epoch 4/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 21/100
Epoch 8/100
Epoch 22/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 14/100
Epoch 15/100
Epoch 29/100
Epoch 30/100
Epoch 16/100
Epoch 31/100
Epoch 17/100
Epoch 32/100
Epoch 33/100
Epoch 18/100
Epoch 34/100
Epoch 19/100
Epoch 36/100
Epoch 20/100
Epoch 38/100
Epoch 39/100
Epoch 21/100
Epoch 22/100
Epoch 40/100
Epoch 23/100
Epoch 42/100
Epoch 24/100
Epoch 43/100
Epoch 25/100
Epoch 44/100
Epoch 45/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 31/100
Epoch 53/100
Epoch 32/100
Epoch 54/100
Epoch 55/100
Epoch 33/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 34/100


[32m[I 2022-10-28 04:20:43,855][0m Trial 52 finished with value: 0.07692307692307693 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 2.3, 'sparsity_coefficient': 1.8435418548054585e-05, 'bn_momentum': 0.9549050086079555}. Best is trial 9 with value: 1.0.[0m


Epoch 67/100
Epoch 68/100
Epoch 1/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100


[32m[I 2022-10-28 04:20:51,029][0m Trial 51 finished with value: 0.0136986301369863 and parameters: {'feature_dim': 256, 'n_step': 7, 'n_shared': 2, 'relaxation_factor': 2.8, 'sparsity_coefficient': 2.2249970627835674e-05, 'bn_momentum': 0.9543205613453708}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 2/100
Epoch 22/100
Epoch 3/100
Epoch 23/100
Epoch 4/100
Epoch 24/100
Epoch 5/100
Epoch 25/100
Epoch 6/100
Epoch 7/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 10/100
Epoch 31/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 32/100
Epoch 14/100
Epoch 33/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 20/100
Epoch 37/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 27/100
Epoch 28/100
Epoch 41/100
Epoch 29/100
Epoch 42/100
Epoch 30/100
Epoch 43/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 45/100
Epoch 34/100
Epoch 35/100
Epoch 46/100
Epoch 47/100
Epoch 38/100
Epoch 48/100
Epoch 39/100
Epoch 49/100
Epoch 50/100
Epoch 41/100
Epoch 52/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
E

[32m[I 2022-10-28 04:21:14,474][0m Trial 53 finished with value: 0.2 and parameters: {'feature_dim': 64, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 1.0, 'sparsity_coefficient': 0.00037015746017264947, 'bn_momentum': 0.9541398455870448}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100


[32m[I 2022-10-28 04:21:25,022][0m Trial 54 finished with value: 0.021739130434782608 and parameters: {'feature_dim': 32, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 2.3, 'sparsity_coefficient': 2.871274744724478e-06, 'bn_momentum': 0.9256067047088674}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 2/100
Epoch 5/100
Epoch 6/100
Epoch 4/100
Epoch 7/100
Epoch 8/100
Epoch 5/100
Epoch 9/100
Epoch 6/100
Epoch 10/100
Epoch 7/100
Epoch 11/100
Epoch 8/100
Epoch 12/100
Epoch 9/100
Epoch 10/100
Epoch 13/100
Epoch 11/100
Epoch 12/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 18/100
Epoch 14/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 15/100
Epoch 22/100
Epoch 16/100
Epoch 23/100
Epoch 17/100
Epoch 24/100
Epoch 25/100
Epoch 18/100
Epoch 28/100
Epoch 29/100
Epoch 20/100
Epoch 31/100
Epoch 32/100
Epoch 21/100
Epoch 22/100
Epoch 33/100
Epoch 23/100
Epoch 34/100
Epoch 24/100
Epoch 35/100
Epoch 25/100
Epoch 36/100
Epoch 26/100
Epoch 37/100
Epoch 27/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 29/100
Epoch 30/100
Epoch 42/100
Epoch 31/100
Epoch 43/100
Epoch 32/100
Epoch 44/100
Epoch 33/100
Epoch 45/100
Epoch 34/100
Epoch 46/100
Epoch 35/100
Epoch 47/100
Epoch 36/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 38/100
Ep

[32m[I 2022-10-28 04:21:53,499][0m Trial 56 finished with value: 0.05555555555555555 and parameters: {'feature_dim': 64, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 1.1, 'sparsity_coefficient': 0.0015706304951285948, 'bn_momentum': 0.9460363045141679}. Best is trial 9 with value: 1.0.[0m


Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:22:01,250][0m Trial 55 finished with value: 0.020833333333333332 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 2.1, 'sparsity_coefficient': 5.948749302724076e-06, 'bn_momentum': 0.9247224460532608}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 2/100
Epoch 12/100
Epoch 3/100
Epoch 4/100
Epoch 14/100
Epoch 15/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 17/100
Epoch 9/100
Epoch 10/100
Epoch 18/100
Epoch 11/100
Epoch 12/100
Epoch 20/100
Epoch 14/100
Epoch 22/100
Epoch 15/100
Epoch 17/100
Epoch 24/100
Epoch 18/100
Epoch 25/100
Epoch 19/100
Epoch 26/100
Epoch 20/100
Epoch 21/100
Epoch 28/100
Epoch 22/100
Epoch 23/100
Epoch 29/100
Epoch 24/100
Epoch 30/100
Epoch 25/100
Epoch 31/100
Epoch 26/100
Epoch 33/100
Epoch 27/100
Epoch 34/100
Epoch 28/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 30/100
Epoch 38/100
Epoch 31/100
Epoch 39/100
Epoch 32/100
Epoch 33/100
Epoch 41/100
Epoch 34/100
Epoch 42/100
Epoch 43/100
Epoch 35/100
Epoch 44/100
Epoch 36/100
Epoch 45/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 48/100
Epoch 49/100
Epoch 41/100
Epoch 50/100
Epoch 42/100
Epoch 51/100
Epo

[32m[I 2022-10-28 04:22:48,632][0m Trial 58 finished with value: 0.034482758620689655 and parameters: {'feature_dim': 32, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 2.5, 'sparsity_coefficient': 0.0009172046761779756, 'bn_momentum': 0.9147020945110155}. Best is trial 9 with value: 1.0.[0m


Epoch 51/100
Epoch 52/100
Epoch 1/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:23:12,680][0m Trial 57 finished with value: 0.012658227848101266 and parameters: {'feature_dim': 64, 'n_step': 6, 'n_shared': 1, 'relaxation_factor': 1.6, 'sparsity_coefficient': 0.0008321421871016837, 'bn_momentum': 0.965771584957481}. Best is trial 9 with value: 1.0.[0m


Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 2/100
Epoch 24/100
Epoch 3/100
Epoch 4/100
Epoch 25/100
Epoch 26/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 27/100
Epoch 9/100
Epoch 10/100
Epoch 28/100
Epoch 11/100
Epoch 12/100
Epoch 29/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 30/100
Epoch 18/100
Epoch 19/100
Epoch 31/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 26/100
Epoch 35/100
Epoch 36/100
Epoch 28/100
Epoch 30/100
Epoch 38/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 39/100
Epoch 37/100
Epoch 38/100
Epoch 40/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 42/100
Epoch 47/100
Epoch 43/100
Epoch 44/100
E

[32m[I 2022-10-28 04:23:39,280][0m Trial 59 finished with value: 0.06666666666666667 and parameters: {'feature_dim': 128, 'n_step': 6, 'n_shared': 0, 'relaxation_factor': 1.7000000000000002, 'sparsity_coefficient': 6.971480762743128e-07, 'bn_momentum': 0.9079888851720942}. Best is trial 9 with value: 1.0.[0m


Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:23:47,500][0m Trial 60 finished with value: 0.02631578947368421 and parameters: {'feature_dim': 128, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 1.1, 'sparsity_coefficient': 0.004234637789703783, 'bn_momentum': 0.9404658454241085}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 2/100
Epoch 46/100
Epoch 47/100
Epoch 3/100
Epoch 48/100
Epoch 49/100
Epoch 4/100
Epoch 50/100
Epoch 5/100
Epoch 51/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 53/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 56/100
Epoch 12/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 17/100
Epoch 64/100
Epoch 65/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epo

[32m[I 2022-10-28 04:24:09,390][0m Trial 61 finished with value: 0.1111111111111111 and parameters: {'feature_dim': 256, 'n_step': 2, 'n_shared': 1, 'relaxation_factor': 1.1, 'sparsity_coefficient': 0.004370968333180712, 'bn_momentum': 0.9404376066030454}. Best is trial 9 with value: 1.0.[0m


Epoch 23/100
Epoch 1/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 2/100
Epoch 62/100
Epoch 3/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 6/100
Epoch 70/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 73/100
Epoch 11/100
Epoch 74/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 77/100
Epoch 16/100
Epoch 78/100
Epoch 17/100
Epoch 79/100
Epoch 80/100
Epoch 18/100
Epoch 19/100
Epoch 81/100
Epoch 82/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 85/100
Epoch 24/100
Epoch 

[32m[I 2022-10-28 04:24:31,799][0m Trial 62 finished with value: 0.038461538461538464 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.8, 'sparsity_coefficient': 7.403889492851949e-05, 'bn_momentum': 0.9578392693419874}. Best is trial 9 with value: 1.0.[0m


Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100


[32m[I 2022-10-28 04:24:39,178][0m Trial 63 finished with value: 0.05263157894736842 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.4, 'sparsity_coefficient': 7.032023891400261e-05, 'bn_momentum': 0.9580465293279258}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 2/100
Epoch 26/100
Epoch 27/100
Epoch 3/100
Epoch 4/100
Epoch 29/100
Epoch 5/100
Epoch 30/100
Epoch 6/100
Epoch 31/100
Epoch 32/100
Epoch 7/100
Epoch 33/100
Epoch 8/100
Epoch 34/100
Epoch 35/100
Epoch 9/100
Epoch 36/100
Epoch 10/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 12/100
Epoch 40/100
Epoch 41/100
Epoch 13/100
Epoch 42/100
Epoch 43/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 20/100
Epoch 52/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 25/100
Epoch 57/100
Epoch 26/100
Epoch 58/100
Epoch 59/100
Epoch 27/100
Epo

[32m[I 2022-10-28 04:25:26,712][0m Trial 64 finished with value: 0.012195121951219513 and parameters: {'feature_dim': 512, 'n_step': 3, 'n_shared': 2, 'relaxation_factor': 1.4, 'sparsity_coefficient': 1.100790310205412e-05, 'bn_momentum': 0.9665499836535817}. Best is trial 9 with value: 1.0.[0m


Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100


[32m[I 2022-10-28 04:25:41,293][0m Trial 65 finished with value: 0.09090909090909091 and parameters: {'feature_dim': 512, 'n_step': 4, 'n_shared': 2, 'relaxation_factor': 2.5, 'sparsity_coefficient': 0.00016125887206059134, 'bn_momentum': 0.9691459965799855}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 3/100
Epoch 45/100
Epoch 4/100
Epoch 46/100
Epoch 5/100
Epoch 47/100
Epoch 48/100
Epoch 7/100
Epoch 49/100
Epoch 8/100
Epoch 51/100
Epoch 10/100
Epoch 11/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 13/100
Epoch 55/100
Epoch 56/100
Epoch 14/100
Epoch 57/100
Epoch 16/100
Epoch 58/100
Epoch 59/100
Epoch 18/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 20/100
Epoch 63/100
Epoch 64/100
Epoch 21/100
Epoch 65/100
Epoch 22/100


[32m[I 2022-10-28 04:26:17,650][0m Trial 66 finished with value: 0.014285714285714285 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 2, 'relaxation_factor': 2.5, 'sparsity_coefficient': 0.00018721601477028046, 'bn_momentum': 0.9612591773685311}. Best is trial 9 with value: 1.0.[0m


Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:26:36,681][0m Trial 67 finished with value: 0.017241379310344827 and parameters: {'feature_dim': 64, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 2.1, 'sparsity_coefficient': 0.053015159455992944, 'bn_momentum': 0.9617525283232676}. Best is trial 9 with value: 1.0.[0m


Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 7

[32m[I 2022-10-28 04:27:13,967][0m Trial 68 finished with value: 0.07692307692307693 and parameters: {'feature_dim': 32, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 2.2, 'sparsity_coefficient': 3.990899806921439e-05, 'bn_momentum': 0.9640359972194441}. Best is trial 9 with value: 1.0.[0m


Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 2/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100


[32m[I 2022-10-28 04:27:45,921][0m Trial 69 finished with value: 0.1111111111111111 and parameters: {'feature_dim': 32, 'n_step': 7, 'n_shared': 3, 'relaxation_factor': 1.2, 'sparsity_coefficient': 2.8448046722926952e-05, 'bn_momentum': 0.9039025042431633}. Best is trial 9 with value: 1.0.[0m


Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100


[32m[I 2022-10-28 04:28:09,025][0m Trial 70 finished with value: 0.058823529411764705 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 1, 'relaxation_factor': 1.2, 'sparsity_coefficient': 1.2007412978766646e-08, 'bn_momentum': 0.9005075239349811}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 3/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 5/100
Epoch 69/100
Epoch 70/100
Epoch 6/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 8/100
Epoch 9/100


[32m[I 2022-10-28 04:28:49,689][0m Trial 71 finished with value: 0.05555555555555555 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 1, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 2.5042579804129487e-08, 'bn_momentum': 0.9478165971374257}. Best is trial 9 with value: 1.0.[0m


Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 62/100
Epoch 63/100
Epoch 8/100
Epoch 64/100
Epoch 9/100
Epoch 65/100
Epoch 66/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 67/100
Epoch 68/100
Epoch 16/100
Epoch 17/100
Epoch 69/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 71/100
Epoch 23/100
Epoch 72/100
Epoch 24/100
Epoch 25/100
Epoch 73/100
Epoch 27/100
Epoch 74/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 75/100
Epoch

[32m[I 2022-10-28 04:29:27,985][0m Trial 72 finished with value: 0.02857142857142857 and parameters: {'feature_dim': 256, 'n_step': 7, 'n_shared': 1, 'relaxation_factor': 1.6, 'sparsity_coefficient': 0.013331208941542973, 'bn_momentum': 0.9768020626410179}. Best is trial 9 with value: 1.0.[0m


Epoch 80/100
Epoch 81/100
Epoch 1/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100


[32m[I 2022-10-28 04:29:35,977][0m Trial 73 finished with value: 0.037037037037037035 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.00031065694835707904, 'bn_momentum': 0.9344206694860404}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 2/100
Epoch 3/100
Epoch 4/100


[32m[I 2022-10-28 04:30:16,538][0m Trial 74 finished with value: 1.0 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.0003211808675778546, 'bn_momentum': 0.9316851843962773}. Best is trial 9 with value: 1.0.[0m


Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 42/100
Epoch 43/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 46/100
Epoch 16/100
Epoch 47/100
Epoch 17/100
Epoch 48/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 49/100
Epoch 23/100
Epoch 24/100
Epoch 50/100
Epoch 25/100
Epoch 51/100
Epoch 26/100
Epoch 27/100
Epoch 52/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 33/100
Epoch 34/100
Epoch 54/100
Epoch 35/100
Epoch 55/100
Epoch 56/100
Epoch 38/100
Epoch 57/100
Epoch 58/

[32m[I 2022-10-28 04:30:55,588][0m Trial 76 finished with value: 0.021739130434782608 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.1, 'sparsity_coefficient': 0.0025300136818125736, 'bn_momentum': 0.9295290447418003}. Best is trial 9 with value: 1.0.[0m


Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100


[32m[I 2022-10-28 04:31:04,568][0m Trial 75 finished with value: 0.16666666666666666 and parameters: {'feature_dim': 256, 'n_step': 8, 'n_shared': 1, 'relaxation_factor': 1.8, 'sparsity_coefficient': 0.03189518864495295, 'bn_momentum': 0.9735054038214133}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 2/100
Epoch 27/100
Epoch 3/100
Epoch 28/100
Epoch 29/100
Epoch 6/100
Epoch 31/100
Epoch 8/100
Epoch 32/100
Epoch 9/100
Epoch 33/100
Epoch 10/100
Epoch 35/100
Epoch 11/100
Epoch 12/100
Epoch 36/100
Epoch 13/100
Epoch 37/100
Epoch 38/100
Epoch 14/100
Epoch 39/100
Epoch 40/100
Epoch 16/100
Epoch 41/100
Epoch 17/100
Epoch 42/100
Epoch 43/100
Epoch 18/100
Epoch 44/100
Epoch 19/100
Epoch 45/100
Epoch 20/100
Epoch 46/100
Epoch 47/100
Epoch 21/100
Epoch 48/100
Epoch 22/100
Epoch 23/100
Epoch 49/100
Epoch 50/100
Epoch 25/100
Epoch 51/100
Epoch 26/100
Epoch 52/100
Epoch 53/100
Epoch 27/100
Epoch 54/100
Epoch 55/100
Epoch 28/100
Epoch 56/100
Epoch 29/100
Epoch 30/100
Epoch 31/100


[32m[I 2022-10-28 04:31:50,792][0m Trial 77 finished with value: 0.05 and parameters: {'feature_dim': 64, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 2.2, 'sparsity_coefficient': 0.0005918221898844971, 'bn_momentum': 0.9203023586002974}. Best is trial 9 with value: 1.0.[0m


Epoch 68/100
Epoch 69/100
Epoch 1/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100


[32m[I 2022-10-28 04:31:58,288][0m Trial 78 finished with value: 0.012195121951219513 and parameters: {'feature_dim': 64, 'n_step': 5, 'n_shared': 4, 'relaxation_factor': 2.2, 'sparsity_coefficient': 0.00047802737605358954, 'bn_momentum': 0.9180139313799557}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 2/100
Epoch 3/100
Epoch 24/100
Epoch 4/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 32/100
Epoch 33/100
Epoch 13/100
Epoch 34/100
Epoch 15/100
Epoch 36/100
Epoch 17/100
Epoch 38/100
Epoch 40/100
Epoch 41/100
Epoch 21/100
Epoch 42/100
Epoch 23/100
Epoch 24/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 26/100
Epoch 49/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 52/100
Epoch 31/100
Epoch 32/100
Epoch 53/100
Epoch 33/100
Epoch 54/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 57/100
Epoch 58/100
Epoch 38/100
Epoch 39/100
Epoch 59/100
Epoch 60/100
Epoch 40/100
Epoch 61/100
Epoch 41/100
Ep

[32m[I 2022-10-28 04:32:31,459][0m Trial 80 finished with value: 0.08333333333333333 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 2, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 0.020218016716579284, 'bn_momentum': 0.9501106395191422}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100


[32m[I 2022-10-28 04:32:33,521][0m Trial 79 finished with value: 0.058823529411764705 and parameters: {'feature_dim': 256, 'n_step': 3, 'n_shared': 4, 'relaxation_factor': 2.6, 'sparsity_coefficient': 0.00025610963993493823, 'bn_momentum': 0.9509738104157784}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 2/100
Epoch 17/100
Epoch 3/100
Epoch 18/100
Epoch 19/100
Epoch 5/100
Epoch 21/100
Epoch 6/100
Epoch 22/100
Epoch 7/100
Epoch 23/100
Epoch 9/100
Epoch 24/100
Epoch 10/100
Epoch 11/100
Epoch 25/100
Epoch 12/100
Epoch 13/100
Epoch 28/100
Epoch 14/100
Epoch 15/100
Epoch 29/100
Epoch 16/100
Epoch 30/100
Epoch 31/100
Epoch 17/100
Epoch 32/100
Epoch 19/100
Epoch 33/100
Epoch 20/100
Epoch 34/100
Epoch 21/100
Epoch 22/100
Epoch 35/100
Epoch 23/100
Epoch 37/100
Epoch 39/100
Epoch 26/100
Epoch 40/100
Epoch 27/100
Epoch 28/100
Epoch 41/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 33/100
Epoch 48/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100

[32m[I 2022-10-28 04:33:12,099][0m Trial 81 finished with value: 0.3333333333333333 and parameters: {'feature_dim': 256, 'n_step': 4, 'n_shared': 2, 'relaxation_factor': 1.7000000000000002, 'sparsity_coefficient': 0.0011992246343041813, 'bn_momentum': 0.9107518944145984}. Best is trial 9 with value: 1.0.[0m


Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100


[32m[I 2022-10-28 04:33:30,708][0m Trial 82 finished with value: 0.5 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.00013843956896171202, 'bn_momentum': 0.9419033012760172}. Best is trial 9 with value: 1.0.[0m


Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 2/100
Epoch 3/100
Epoch 61/100
Epoch 62/100
Epoch 5/100
Epoch 63/100
Epoch 6/100
Epoch 64/100
Epoch 65/100
Epoch 7/100
Epoch 66/100
Epoch 8/100
Epoch 9/100
Epoch 67/100
Epoch 10/100
Epoch 11/100
Epoch 68/100
Epoch 12/100
Epoch 69/100
Epoch 70/100
E

[32m[I 2022-10-28 04:34:00,938][0m Trial 83 finished with value: 0.16666666666666666 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.00010190830341990044, 'bn_momentum': 0.9430989369751598}. Best is trial 9 with value: 1.0.[0m


Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:34:22,671][0m Trial 84 finished with value: 0.3333333333333333 and parameters: {'feature_dim': 64, 'n_step': 4, 'n_shared': 3, 'relaxation_factor': 2.3, 'sparsity_coefficient': 0.00011489169541261755, 'bn_momentum': 0.9564990143965427}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 2/100
Epoch 73/100
Epoch 3/100
Epoch 4/100
Epoch 74/100
Epoch 5/100

[32m[I 2022-10-28 04:34:57,556][0m Trial 85 finished with value: 0.0136986301369863 and parameters: {'feature_dim': 64, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 2.4000000000000004, 'sparsity_coefficient': 4.220150728384892e-05, 'bn_momentum': 0.952379923343806}. Best is trial 9 with value: 1.0.[0m


Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 66/100
Epoch 67/100
Epoch 68/100
Epoch 69/100
Epoch 70/100
Epoch 71/100
Epoch 72/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:35:22,068][0m Trial 86 finished with value: 0.3333333333333333 and parameters: {'feature_dim': 64, 'n_step': 5, 'n_shared': 3, 'relaxation_factor': 2.2, 'sparsity_coefficient': 5.741548306423301e-06, 'bn_momentum': 0.9423075485263551}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 59/100
Epoch 60/100
Epoch 61/100
Epoch 62/100
Epoch 2/100
Epoch 63/100
Epoch 64/100
Epoch 65/100
Epoch 3/100
Epoch 67/100
Epoch 5/100
Epoch 68/100
Epoch 6/100
Epoch 69/100
Epoch 7/100
Epoch 70/100
Epoch 71/100
Epoch 10/100
Epoch 73/100
Epoch 11/100


[32m[I 2022-10-28 04:35:59,267][0m Trial 87 finished with value: 0.02702702702702703 and parameters: {'feature_dim': 64, 'n_step': 6, 'n_shared': 3, 'relaxation_factor': 1.9, 'sparsity_coefficient': 0.07115302919014888, 'bn_momentum': 0.9428266116100652}. Best is trial 9 with value: 1.0.[0m


Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 54/100
Epoch 9/100
Epoch 10/100
Epoch 56/100
Epoch 11/100
Epoch 12/100
Epoch 57/100
Epoch 13/100
Epoch 58/100
Epoch 59/100
Epoch 14/100
Epoch 15/100
Epoch 17/100
Epoch 61/100
Epoch 19/100
Epoch 62/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 66/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 67/100
Epoch 29/100
Epoch 68/100
Epoch 30/100
Epoch 69/100
Epoch 31/100
Epoch 71/100
Epoch 32/100
Epoch 72/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 73/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 74/100
Epoch 4

[32m[I 2022-10-28 04:36:24,088][0m Trial 89 finished with value: 0.06666666666666667 and parameters: {'feature_dim': 256, 'n_step': 2, 'n_shared': 2, 'relaxation_factor': 2.0, 'sparsity_coefficient': 0.00021095244801502773, 'bn_momentum': 0.9474551746631757}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100


[32m[I 2022-10-28 04:36:25,435][0m Trial 88 finished with value: 0.025 and parameters: {'feature_dim': 256, 'n_step': 6, 'n_shared': 2, 'relaxation_factor': 1.9, 'sparsity_coefficient': 0.061166299535881416, 'bn_momentum': 0.9457743060906304}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 2/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 8/100
Epoch 5/100
Epoch 9/100
Epoch 6/100
Epoch 10/100
Epoch 11/100
Epoch 7/100
Epoch 8/100
Epoch 12/100
Epoch 9/100
Epoch 13/100
Epoch 10/100
Epoch 14/100
Epoch 11/100
Epoch 15/100
Epoch 12/100
Epoch 13/100
Epoch 16/100
Epoch 14/100
Epoch 17/100
Epoch 15/100
Epoch 16/100
Epoch 18/100
Epoch 17/100
Epoch 19/100
Epoch 18/100
Epoch 20/100
Epoch 21/100
Epoch 19/100
Epoch 20/100
Epoch 22/100
Epoch 21/100
Epoch 23/100
Epoch 22/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 24/100
Epoch 25/100
Epoch 28/100
Epoch 26/100
Epoch 29/100
Epoch 30/100
Epoch 27/100
Epoch 31/100
Epoch 32/100
Epoch 28/100
Epoch 33/100
Epoch 29/100
Epoch 34/100
Epoch 30/100
Epoch 35/100
Epoch 32/100
Epoch 33/100
Epoch 38/100
Epoch 39/100
Epoch 35/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 39/100
Epoch 40/100
Epoch 47/100
Epoch 41/100
Epoch 42/100
Epoch 48/100
Epoch 43/100


[32m[I 2022-10-28 04:37:10,017][0m Trial 90 finished with value: 0.07142857142857142 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.5, 'sparsity_coefficient': 1.6451359160381268e-05, 'bn_momentum': 0.9391933750769664}. Best is trial 9 with value: 1.0.[0m




[32m[I 2022-10-28 04:37:10,392][0m Trial 91 finished with value: 1.0 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.5, 'sparsity_coefficient': 1.7216138697333867e-05, 'bn_momentum': 0.9286379494836702}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 2/100
Epoch 3/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 5/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 7/100
Epoch 12/100
Epoch 8/100
Epoch 13/100
Epoch 9/100
Epoch 14/100
Epoch 10/100
Epoch 15/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 18/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 25/100
Epoch 26/100
Epoch 28/100
Epoch 29/100
Epoch 28/100
Epoch 30/100
Epoch 29/100
Epoch 31/100
Epoch 30/100
Epoch 32/100
Epoch 31/100
Epoch 32/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 36/100
Epoch 37/100
Epoch 37/100
Epoch 39/100
Epoch 38/100
Epoch 40/100
Epoch 39/100
Epoch 41/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 42/100
Epoch 44/100
Epoch 43/100
Epoch 45/100
Epoch 44/100
Epoch 46/100
Epoch 47/100
Epoch 45/100
Epoch 46/100
Epoch 49/100
Ep

[32m[I 2022-10-28 04:37:51,531][0m Trial 93 finished with value: 0.022727272727272728 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.5, 'sparsity_coefficient': 0.009413540159379579, 'bn_momentum': 0.9224773810572284}. Best is trial 9 with value: 1.0.[0m


Epoch 85/100
Epoch 1/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100


[32m[I 2022-10-28 04:37:56,372][0m Trial 92 finished with value: 0.045454545454545456 and parameters: {'feature_dim': 32, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.5, 'sparsity_coefficient': 1.1004203686392412e-05, 'bn_momentum': 0.9289041651479666}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 2/100
Epoch 29/100
Epoch 3/100
Epoch 30/100
Epoch 4/100
Epoch 31/100
Epoch 5/100
Epoch 32/100
Epoch 6/100
Epoch 33/100
Epoch 7/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 8/100
Epoch 37/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 41/100
Epoch 43/100
Epoch 44/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 45/100
Epoch 20/100
Epoch 47/100
Epoch 48/100
Epoch 21/100
Epoch 49/100
Epoch 50/100
Epoch 22/100
Epoch 23/100
Epoch 52/100
Epoch 24/100
Epoch 25/100
Epoch 53/100
Epoch 54/100
Epoch 27/100
Epoch 28/100
Epoch 55/100
Epoch 29/100
Epoch 56/100
Epoch 57/100
Epoch 58/100
Epoch 30/100
Epo

[32m[I 2022-10-28 04:38:25,060][0m Trial 94 finished with value: 0.25 and parameters: {'feature_dim': 32, 'n_step': 2, 'n_shared': 0, 'relaxation_factor': 1.6, 'sparsity_coefficient': 3.751516222448968e-06, 'bn_momentum': 0.932393672589251}. Best is trial 9 with value: 1.0.[0m


Epoch 71/100
Epoch 72/100
Epoch 1/100
Epoch 73/100
Epoch 74/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100


[32m[I 2022-10-28 04:38:30,866][0m Trial 95 finished with value: 0.5 and parameters: {'feature_dim': 32, 'n_step': 2, 'n_shared': 0, 'relaxation_factor': 1.6, 'sparsity_coefficient': 2.798183452513248e-05, 'bn_momentum': 0.9318437329503264}. Best is trial 9 with value: 1.0.[0m


Epoch 1/100
Epoch 2/100
Epoch 3/100
Epoch 2/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 3/100
Epoch 4/100
Epoch 7/100
Epoch 5/100
Epoch 6/100
Epoch 10/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 13/100
Epoch 15/100
Epoch 14/100
Epoch 16/100
Epoch 15/100
Epoch 17/100
Epoch 16/100
Epoch 18/100
Epoch 17/100
Epoch 19/100
Epoch 18/100
Epoch 20/100
Epoch 21/100
Epoch 20/100
Epoch 22/100
Epoch 21/100
Epoch 23/100
Epoch 24/100
Epoch 22/100
Epoch 25/100
Epoch 26/100
Epoch 23/100
Epoch 27/100
Epoch 24/100
Epoch 28/100
Epoch 29/100
Epoch 25/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 34/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 37/100
Epoch 38/100
Epoch 32/100
Epoch 39/100
Epoch 33/100
Epoch 40/100
Epoch 41/100
Epoch 34/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 35/100
Epoch 45/100
Epoch 46/100
Epoch 36/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 38/100
Epoch 52/100
Epoch 53/100


[32m[I 2022-10-28 04:39:04,592][0m Trial 97 finished with value: 0.03333333333333333 and parameters: {'feature_dim': 32, 'n_step': 2, 'n_shared': 0, 'relaxation_factor': 1.7000000000000002, 'sparsity_coefficient': 2.6641562966067956e-05, 'bn_momentum': 0.9145239930829965}. Best is trial 9 with value: 1.0.[0m


Epoch 74/100
Epoch 1/100
Epoch 75/100
Epoch 76/100
Epoch 77/100
Epoch 78/100
Epoch 79/100
Epoch 80/100
Epoch 81/100
Epoch 82/100
Epoch 83/100
Epoch 84/100
Epoch 85/100
Epoch 86/100
Epoch 87/100
Epoch 88/100
Epoch 89/100
Epoch 90/100
Epoch 91/100
Epoch 92/100
Epoch 93/100
Epoch 94/100
Epoch 95/100
Epoch 96/100
Epoch 97/100
Epoch 98/100
Epoch 99/100
Epoch 100/100


[32m[I 2022-10-28 04:39:14,765][0m Trial 96 finished with value: 0.016666666666666666 and parameters: {'feature_dim': 128, 'n_step': 3, 'n_shared': 0, 'relaxation_factor': 1.3, 'sparsity_coefficient': 0.00014945287699849686, 'bn_momentum': 0.959585490081182}. Best is trial 9 with value: 1.0.[0m


Epoch 2/100
Epoch 3/100
Epoch 4/100
Epoch 5/100
Epoch 6/100
Epoch 7/100
Epoch 8/100
Epoch 9/100
Epoch 10/100
Epoch 11/100
Epoch 12/100
Epoch 13/100
Epoch 14/100
Epoch 15/100
Epoch 16/100
Epoch 17/100
Epoch 18/100
Epoch 19/100
Epoch 20/100
Epoch 21/100
Epoch 22/100
Epoch 23/100
Epoch 24/100
Epoch 25/100
Epoch 26/100
Epoch 27/100
Epoch 28/100
Epoch 29/100
Epoch 30/100
Epoch 31/100
Epoch 32/100
Epoch 33/100
Epoch 34/100
Epoch 35/100
Epoch 36/100
Epoch 37/100
Epoch 38/100
Epoch 39/100
Epoch 40/100
Epoch 41/100
Epoch 42/100
Epoch 43/100
Epoch 44/100
Epoch 45/100
Epoch 46/100
Epoch 47/100
Epoch 48/100
Epoch 49/100
Epoch 50/100
Epoch 51/100
Epoch 52/100
Epoch 53/100
Epoch 54/100
Epoch 55/100
Epoch 56/100
Epoch 57/100
Epoch 2/100
Epoch 58/100
Epoch 3/100
Epoch 59/100
Epoch 4/100
Epoch 60/100
Epoch 61/100
Epoch 5/100
Epoch 62/100
Epoch 63/100
Epoch 64/100
Epoch 7/100
Epoch 8/100
Epoch 65/100
Epoch 66/100
Epoch 9/100
Epoch 67/100
Epoch 68/100
Epoch 10/100
Epoch 69/100
Epoch 11/100
Epoch 70/100
E

[32m[I 2022-10-28 04:39:39,687][0m Trial 98 finished with value: 0.011627906976744186 and parameters: {'feature_dim': 32, 'n_step': 2, 'n_shared': 0, 'relaxation_factor': 1.4, 'sparsity_coefficient': 1.7018038783648865e-05, 'bn_momentum': 0.9268480632442077}. Best is trial 9 with value: 1.0.[0m




[32m[I 2022-10-28 04:39:39,946][0m Trial 99 finished with value: 0.037037037037037035 and parameters: {'feature_dim': 512, 'n_step': 2, 'n_shared': 3, 'relaxation_factor': 1.6, 'sparsity_coefficient': 4.914270468796725e-05, 'bn_momentum': 0.9339691060033563}. Best is trial 9 with value: 1.0.[0m


In [32]:
tabnet = TabNet(num_features=train_X_transformed.shape[1],
                output_dim=128,
                feature_dim=512,
                n_step=2, 
                relaxation_factor=1.6,
                sparsity_coefficient=4.914270468796725e-05,
                n_shared=3,
                bn_momentum=0.9339691060033563)

cbs = [tf.keras.callbacks.EarlyStopping(
    monitor="val_loss", patience=30, restore_best_weights=True
)]

optimizer = tf.keras.optimizers.Adam(learning_rate=0.001, clipnorm=10)

loss = [tf.keras.losses.CategoricalCrossentropy(from_logits=False)]

tabnet.compile(optimizer, loss=loss)

tabnet.fit(train_ds, epochs=1000, validation_data=val_ds, callbacks=cbs,
           verbose=1, class_weight={0: 1, 1: 10})

Epoch 1/1000
Epoch 2/1000
Epoch 3/1000
Epoch 4/1000
Epoch 5/1000
Epoch 6/1000
Epoch 7/1000
Epoch 8/1000
Epoch 9/1000
Epoch 10/1000
Epoch 11/1000
Epoch 12/1000
Epoch 13/1000
Epoch 14/1000
Epoch 15/1000
Epoch 16/1000
Epoch 17/1000
Epoch 18/1000
Epoch 19/1000
Epoch 20/1000
Epoch 21/1000
Epoch 22/1000
Epoch 23/1000
Epoch 24/1000
Epoch 25/1000
Epoch 26/1000
Epoch 27/1000
Epoch 28/1000
Epoch 29/1000
Epoch 30/1000
Epoch 31/1000
Epoch 32/1000
Epoch 33/1000
Epoch 34/1000
Epoch 35/1000
Epoch 36/1000
Epoch 37/1000
Epoch 38/1000
Epoch 39/1000
Epoch 40/1000
Epoch 41/1000
Epoch 42/1000
Epoch 43/1000
Epoch 44/1000
Epoch 45/1000
Epoch 46/1000
Epoch 47/1000
Epoch 48/1000
Epoch 49/1000
Epoch 50/1000
Epoch 51/1000
Epoch 52/1000
Epoch 53/1000
Epoch 54/1000
Epoch 55/1000
Epoch 56/1000
Epoch 57/1000
Epoch 58/1000
Epoch 59/1000
Epoch 60/1000
Epoch 61/1000
Epoch 62/1000
Epoch 63/1000
Epoch 64/1000
Epoch 65/1000
Epoch 66/1000
Epoch 67/1000
Epoch 68/1000
Epoch 69/1000
Epoch 70/1000
Epoch 71/1000
Epoch 72/1000
E

<keras.callbacks.History at 0x7f4a8968b1d0>

In [33]:
from sklearn.metrics import roc_auc_score, average_precision_score

val_preds, val_imps = tabnet.predict(val_ds)

print('Test ROC AUC', np.round(roc_auc_score(val_y, val_preds[:, 1]), 4))
print('Test PR AUC', np.round(average_precision_score(val_y, val_preds[:, 1]), 4))

Test ROC AUC 0.6011
Test PR AUC 0.025


## Test Submission

In [34]:
test.head()

Unnamed: 0,TransactionID,TransactionDT,TransactionAmt,ProductCD,card1,card2,card3,card4,card5,card6,...,id_32,id_33,id_34,id_35,id_36,id_37,id_38,DeviceType,DeviceInfo,hour
0,3663549,18403224,31.95,W,10409,111.0,150.0,visa,226.0,debit,...,missing,missing,missing,missing,missing,missing,missing,missing,missing,0.0
1,3663550,18403263,49.0,W,4272,111.0,150.0,visa,226.0,debit,...,missing,missing,missing,missing,missing,missing,missing,missing,missing,0.0
2,3663551,18403310,171.0,W,4476,574.0,150.0,visa,226.0,debit,...,missing,missing,missing,missing,missing,missing,missing,missing,missing,0.0
3,3663552,18403310,284.95,W,10989,360.0,150.0,visa,166.0,debit,...,missing,missing,missing,missing,missing,missing,missing,missing,missing,0.0
4,3663553,18403317,67.95,W,18018,452.0,150.0,mastercard,117.0,debit,...,missing,missing,missing,missing,missing,missing,missing,missing,missing,0.0


In [35]:
test_preds, test_imp = tabnet.predict(test_ds)

submission_df  = pd.DataFrame({"TransactionID": test['TransactionID'].values,
                              'isFraud': test_preds[:, 1]})

submission_df.to_csv('tabnet_sumbission.csv', index=False)

