
Try the best sub's network structure with 3 inputs #17

Closed
4 tasks done
masatakashiwagi opened this issue Oct 11, 2020 · 4 comments


masatakashiwagi commented Oct 11, 2020

ToDo

  • Fork the best sub and run the experiments
    • Slightly adjust the number of units in the hidden layers
  • Experiments
  • 1. epoch=35, 3 seed patterns, activation function=ELU
    • Pass g- / c- / all feats (g-/c-/cp-) through separate MLPs, then feed each MLP's output into another MLP to produce the final output
  • 2. epoch=50, 2 seed patterns, activation function=ELU
    • Concatenate the g- output with c-, pass it through an MLP, and feed that output together with the all-feats MLP output into another MLP to produce the final output
  • 3. epoch=50, 2 seed patterns, activation function=ReLU
    • Same as Experiment 2

Model

Based on the model used in this sub.

Model characteristics

  • Model: the g- / c- / all feats (g-/c-/cp-) features are split into three separate inputs, each passed through its own MLP (a sketch of the input split follows below)
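As a rough illustration of the three-input split, the sketch below builds the gene, cell, and all-features matrices from the training dataframe by column prefix. The column naming (g-*, c-*, cp_*) follows the MoA dataset convention; note that the input sizes in hidden_size_list below (872 / 130 / 1007) imply engineered features (e.g. PCA components) are appended on top of the raw columns, which this sketch does not cover.

# Minimal sketch, assuming MoA-style column names; the actual sub's preprocessing may differ.
import pandas as pd

def split_inputs(df: pd.DataFrame):
    gene_cols = [c for c in df.columns if c.startswith("g-")]
    cell_cols = [c for c in df.columns if c.startswith("c-")]
    meta_cols = [c for c in df.columns if c.startswith("cp_")]  # cp- meta features

    input_gene = df[gene_cols].to_numpy(dtype="float32")
    input_cell = df[cell_cols].to_numpy(dtype="float32")
    # "all feats" = g- + c- + cp- concatenated into a single matrix
    input_all = df[gene_cols + cell_cols + meta_cols].to_numpy(dtype="float32")
    return input_gene, input_cell, input_all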
masatakashiwagi added the experiment label Oct 11, 2020
masatakashiwagi self-assigned this Oct 11, 2020

masatakashiwagi commented Oct 12, 2020

Experiment 1

Local Score

This experiment: 0.015264159755005106
Baseline: 0.014564886043488417
Difference: 0.0006992737115

LB Score

This experiment: 0.01927
Baseline: 0.01867
Difference: 0.0006

Local CV and LB appear to be correlated.

Model architecture

hidden_size_list = [
        [872,512,512,256,128], # genes
        [130,128,128,64,32], # cells
        [1007,512,512,256,128], # all
        [288,256,256] # last
]

import torch
import torch.nn as nn

class MultiInputTabularNN(nn.Module):
    def __init__(self, cfg):
        super(MultiInputTabularNN, self).__init__()
        # genes
        genes_layer = []
        hidden_size = cfg.hidden_size_list[0]
        for i in range(len(hidden_size)-1):
            genes_layer.append(nn.BatchNorm1d(hidden_size[i]))
            genes_layer.append(nn.Dropout(cfg.dropout))
            genes_layer.append(nn.utils.weight_norm(nn.Linear(hidden_size[i], hidden_size[i+1])))
            genes_layer.append(nn.ELU())
        self.genes_block = nn.Sequential(*genes_layer)
        
        # cells
        cells_layer = []
        hidden_size = cfg.hidden_size_list[1]
        for i in range(len(hidden_size)-1):
            cells_layer.append(nn.BatchNorm1d(hidden_size[i]))
            cells_layer.append(nn.Dropout(cfg.dropout))
            cells_layer.append(nn.utils.weight_norm(nn.Linear(hidden_size[i], hidden_size[i+1])))
            cells_layer.append(nn.ELU())
        self.cells_block = nn.Sequential(*cells_layer)
        
        # all = genes + cells + meta
        all_layer = []
        hidden_size = cfg.hidden_size_list[2]
        for i in range(len(hidden_size)-1):
            all_layer.append(nn.BatchNorm1d(hidden_size[i]))
            all_layer.append(nn.Dropout(cfg.dropout))
            all_layer.append(nn.utils.weight_norm(nn.Linear(hidden_size[i], hidden_size[i+1])))
            all_layer.append(nn.ELU())
        self.all_block = nn.Sequential(*all_layer)

        # full connection
        fc_layer = []
        hidden_size = cfg.hidden_size_list[3]
        for i in range(len(hidden_size)-1):
            fc_layer.append(nn.BatchNorm1d(hidden_size[i]))
            fc_layer.append(nn.Dropout(cfg.dropout))
            fc_layer.append(nn.utils.weight_norm(nn.Linear(hidden_size[i], hidden_size[i+1])))
            fc_layer.append(nn.ELU())
        fc_layer.append(nn.BatchNorm1d(hidden_size[-1]))
        fc_layer.append(nn.Dropout(cfg.dropout))
        fc_layer.append(nn.utils.weight_norm(nn.Linear(hidden_size[-1], len(cfg.target_cols))))
        self.fc_block = nn.Sequential(*fc_layer)

    def forward(self, input_gene, input_cell, input_all):
        # gene
        x1 = self.genes_block(input_gene)
        
        # cell
        x2 = self.cells_block(input_cell)
        
        # all = gene + cell + meta
        x3 = self.all_block(input_all)
        
        # concatenate
        x = torch.cat([x1, x2, x3], dim=1)
        x = self.fc_block(x)
        
        return x
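
As a sanity check of the Experiment 1 wiring, the snippet below instantiates the class with a stand-in config and dummy tensors. The SimpleNamespace cfg, the dropout value, and the 206 target columns are illustrative assumptions, not the repo's actual config object.

# Hypothetical shape check for the Experiment 1 architecture.
from types import SimpleNamespace
import torch

cfg = SimpleNamespace(
    hidden_size_list=[
        [872, 512, 512, 256, 128],   # genes
        [130, 128, 128, 64, 32],     # cells
        [1007, 512, 512, 256, 128],  # all
        [288, 256, 256],             # last (128 + 32 + 128 = 288)
    ],
    dropout=0.2,                     # assumed value
    target_cols=list(range(206)),    # stand-in for the 206 scored MoA targets
)

model = MultiInputTabularNN(cfg)
out = model(torch.randn(8, 872), torch.randn(8, 130), torch.randn(8, 1007))
print(out.shape)  # torch.Size([8, 206])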


masatakashiwagi commented Oct 12, 2020

Experiment 2

Local Score

This experiment: 0.015213956350176748
Baseline: 0.014564886043488417
Difference: 0.0006490703067

LB Score

This experiment: 0.01924
Baseline: 0.01867
Difference: 0.00057

Model architecture

hidden_size_list = [
        [872,512,512,256,256], # genes
        [256+130,256,256,128,128], # cells
        [1007,512,512,256,256], # all
        [384,256,256] # last
]

class MultiInputTabularNN(nn.Module):
    def __init__(self, cfg):
        super(MultiInputTabularNN, self).__init__()
        ... # same block construction as Experiment 1, only hidden_size_list differs

    def forward(self, input_gene, input_cell, input_all):
        # gene
        x1 = self.genes_block(input_gene)
        
        # cell
        input_cell_x1 = torch.cat([x1, input_cell], dim=1)
        x2 = self.cells_block(input_cell_x1)
        
        # all = gene + cell + meta
        x3 = self.all_block(input_all)
        
        # concatenate
        x = torch.cat([x2, x3], dim=1)
        x = self.fc_block(x)
        
        return x
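
The dimensions line up as follows: genes_block maps 872 to 256, the cells block consumes the 256-dim gene output concatenated with the 130 cell features (386 = hidden_size_list[1][0]) and outputs 128, all_block maps 1007 to 256, and the final concat is 128 + 256 = 384. The check below assumes the elided __init__ is filled in as in Experiment 1; the cfg object is again a hypothetical stand-in.

# Hypothetical shape check for the Experiment 2 variant.
from types import SimpleNamespace
import torch

cfg2 = SimpleNamespace(
    hidden_size_list=[
        [872, 512, 512, 256, 256],        # genes -> 256-dim output
        [256 + 130, 256, 256, 128, 128],  # cells block: gene output + 130 cell feats
        [1007, 512, 512, 256, 256],       # all -> 256-dim output
        [384, 256, 256],                  # last: 128 + 256 = 384
    ],
    dropout=0.2,                          # assumed value
    target_cols=list(range(206)),         # stand-in for the 206 scored MoA targets
)

model2 = MultiInputTabularNN(cfg2)  # the Experiment 2 class defined above
out = model2(torch.randn(8, 872), torch.randn(8, 130), torch.randn(8, 1007))
print(out.shape)  # torch.Size([8, 206])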


masatakashiwagi commented Oct 13, 2020

Experiment 3

  • ReLU seems better than ELU as the activation function
    • The loss improves more quickly with ReLU (see the activation-switch sketch below)
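
One way to toggle between ELU and ReLU for this comparison is to make the activation configurable, as sketched below. The cfg.activation field and get_activation helper are hypothetical; the actual sub may simply swap nn.ELU() for nn.ReLU() inside the block builders.

# Hypothetical activation switch for the ELU vs. ReLU comparison.
import torch.nn as nn

def get_activation(name: str) -> nn.Module:
    return {"relu": nn.ReLU(), "elu": nn.ELU()}[name.lower()]

# inside each block builder, instead of hard-coding nn.ELU():
#     genes_layer.append(get_activation(cfg.activation))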


masatakashiwagi commented Oct 13, 2020

Experiment summary
