
Switching training/test phase of DropoutLayer #26

Closed
lucidfrontier45 opened this issue Nov 22, 2016 · 9 comments

@lucidfrontier45

Dropout and Batch Normalization are two major structures that should behave differently in the training and test phases.

According to the API reference (http://tensorlayer.readthedocs.io/en/latest/modules/layers.html#dropout-layer), switching between train and test for the DropoutLayer is handled by feeding different data through feed_dict.
e.g.

# for training
feed_dict = {x: X_train_a, y_: y_train_a}
feed_dict.update( network.all_drop )     # enable noise layers

# for testing
dp_dict = tl.utils.dict_to_one( network.all_drop ) # disable noise layers
feed_dict = {x: X_val_a, y_: y_val_a}
feed_dict.update(dp_dict)
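
For completeness, this is roughly how those two feed_dict variants are used inside a training loop (a minimal sketch; sess, cost and train_op are illustrative names, not from the tutorial):

# training step: feed the real keep probabilities so dropout is active
feed_dict = {x: X_train_a, y_: y_train_a}
feed_dict.update(network.all_drop)
sess.run(train_op, feed_dict=feed_dict)

# evaluation step: tl.utils.dict_to_one sets every keep probability to 1,
# which turns every dropout layer into a no-op
dp_dict = tl.utils.dict_to_one(network.all_drop)
feed_dict = {x: X_val_a, y_: y_val_a}
feed_dict.update(dp_dict)
val_loss = sess.run(cost, feed_dict=feed_dict)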

I couldn't find how to switch the BatchNormLayer in the tutorial or API reference, but according to the DCGAN example (https://github.com/zsdonghao/dcgan), it creates two different networks:
is_train=True is passed to BatchNormLayer for the training phase and is_train=False for the test phase.

I think it is confusing that the switching method is not unified. Or is there a standard way for batch norm?
For example, TFLearn switches between training and test with its tflearn.is_training method.
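
Roughly, the TFLearn approach looks like this (a minimal sketch from memory, not taken from TFLearn's docs; the layers and arguments are illustrative):

import tensorflow as tf
import tflearn

# a small network whose dropout layer respects TFLearn's global training flag
net = tflearn.input_data(shape=[None, 784])
net = tflearn.fully_connected(net, 256, activation='relu')
net = tflearn.dropout(net, 0.5)
net = tflearn.fully_connected(net, 10, activation='softmax')

sess = tf.Session()
tflearn.is_training(True, session=sess)   # training phase: dropout active
# ... run training steps ...
tflearn.is_training(False, session=sess)  # test phase: dropout disabled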

@lucidfrontier45
Author

I think one way to unify the API is to add a new Dropout layer that receives an is_train argument.
See my test implementation below.

class Dropout(Layer):
    def __init__(self,
                 layer=None,
                 keep=0.5,
                 is_train=True,
                 name='dropout_layer'):
        # proposed layer; assumes it lives in tensorlayer/layers.py where
        # Layer, set_keep and tf are already defined
        Layer.__init__(self, name=name)
        self.inputs = layer.outputs
        print("  tensorlayer:Instantiate Dropout %s: keep: %f" % (self.name, keep))

        # the keep probability is a constant, so nothing has to be fed at run time
        set_keep[name] = tf.constant(keep, dtype=tf.float32)
        if is_train:
            # training phase: apply dropout
            self.outputs = tf.nn.dropout(self.inputs, set_keep[name], name=name)
        else:
            # test phase: pass the input through unchanged
            self.outputs = self.inputs

        self.all_layers = list(layer.all_layers)
        self.all_params = list(layer.all_params)
        self.all_drop = dict(layer.all_drop)
        self.all_drop.update({set_keep[name]: keep})
        self.all_layers.extend([self.outputs])
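
With such a layer, switching phases is just a matter of building the graph with the right flag (a usage sketch against the proposed class above; x and the dense layers are illustrative):

import tensorflow as tf
import tensorlayer as tl

x = tf.placeholder(tf.float32, shape=[None, 784])
net = tl.layers.InputLayer(x, name='input')
net = tl.layers.DenseLayer(net, n_units=256, act=tf.nn.relu, name='dense1')
# is_train=True applies dropout; is_train=False makes the layer an identity
net = Dropout(net, keep=0.5, is_train=True, name='drop1')
net = tl.layers.DenseLayer(net, n_units=10, name='output')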

@zsdonghao
Member

zsdonghao commented Nov 22, 2016

[NEW] FYI, the latest version of DropoutLayer has an is_fix setting; you can fix the keeping probability by setting it to True.


Previous answer:

This may be better?

class Dropout(Layer):
    def __init__(self,
                 layer=None,
                 keep=0.5,
                 is_fix=False,
                 name='dropout_layer'):
        Layer.__init__(self, name=name)
        self.inputs = layer.outputs
        print("  tensorlayer:Instantiate Dropout %s: keep: %f" % (self.name, keep))

        if is_fix:
            # keep probability is baked into the graph; nothing to feed at run time
            self.outputs = tf.nn.dropout(self.inputs, keep, name=name)
        else:
            # keep probability comes from a placeholder that is fed through all_drop
            set_keep[name] = tf.placeholder(tf.float32)
            self.outputs = tf.nn.dropout(self.inputs, set_keep[name], name=name)

        self.all_layers = list(layer.all_layers)
        self.all_params = list(layer.all_params)
        self.all_drop = dict(layer.all_drop)
        if not is_fix:
            self.all_drop.update({set_keep[name]: keep})
        self.all_layers.extend([self.outputs])
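
With this version, the two behaviours would be selected like this (sketch only; layer names are illustrative):

# is_fix=False (default): keep prob is a placeholder, switched at run time
# via network.all_drop / tl.utils.dict_to_one
net = Dropout(net, keep=0.5, is_fix=False, name='drop_feed')

# is_fix=True: keep prob is baked into the graph, no feed_dict entry needed
net = Dropout(net, keep=0.5, is_fix=True, name='drop_fixed')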

@wagamamaz
Collaborator

@lucidfrontier45 Does @zsdonghao's code work for you? If yes, you can open a pull request.

@lucidfrontier45
Author

@wagamamaz

Before talking about my code or @zsdonghao's, I want to make clear how batch normalization is supposed to be used.

class tensorlayer.layers.BatchNormLayer(
        layer = None,
        decay = 0.999,
        epsilon = 0.00001,
        act = tf.identity,
        is_train = None,
        beta_init = tf.zeros_initializer,
        gamma_init = tf.ones_initializer,
        name ='batchnorm_layer')

BatchNormLayer accepts is_train as a constructor argument, so the phase is fixed at graph-construction time, not at run time.
I couldn't find any example of batch normalization except the DCGAN example. It builds two networks, one with is_train=True and the other with is_train=False. Is this the intended usage of BatchNormLayer?

If so, I think it's confusing that DropoutLayer and BatchNormLayer have different APIs for switching between the training and test phases, and they should be unified.

One way is my Dropout implementation above, which accepts an is_train argument.
Is @zsdonghao's code meant for switching the training/test phase?

@zsdonghao
Member

@lucidfrontier45 Hi, your suggestion is good.

Now, BatchNormLayer has is_train, but DropoutLayer doesn't. However, if a model contains BatchNormLayer, we need to build separate inference graphs for training and testing, the way the PTB example does. In that case, we can use

if is_train:
    network = DropoutLayer(network, 0.8, name='xxx')

instead of putting is_train inside the DropoutLayer, or we can also enable/disable the dropout layer by setting feed_dict, see the MNIST CNN example.
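
A minimal sketch of that two-graph approach (not taken from the PTB example itself; the inference function and layer names are illustrative):

import tensorflow as tf
import tensorlayer as tl

def inference(x, is_train, reuse):
    # build the same graph twice, sharing weights through variable scope reuse
    with tf.variable_scope("model", reuse=reuse):
        tl.layers.set_name_reuse(reuse)
        net = tl.layers.InputLayer(x, name='input')
        net = tl.layers.DenseLayer(net, n_units=256, act=tf.nn.relu, name='dense1')
        net = tl.layers.BatchNormLayer(net, is_train=is_train, name='bn1')
        if is_train:
            # dropout only exists in the training graph; its keep probability
            # is still fed at run time through net.all_drop
            net = tl.layers.DropoutLayer(net, keep=0.8, name='drop1')
        net = tl.layers.DenseLayer(net, n_units=10, name='output')
    return net

x = tf.placeholder(tf.float32, shape=[None, 784])
net_train = inference(x, is_train=True, reuse=False)
net_test = inference(x, is_train=False, reuse=True)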

Please let me know if you have any suggestions.

@zsdonghao
Member

FYI, the latest version of DropoutLayer has an is_fix setting; you can fix the keeping probability by setting it to True.

@lucidfrontier45
Author

@zsdonghao

if is_train:
    network = DropoutLayer(network, 0.8, name='xxx')

This looks fine to me. Thank you.

@zsdonghao
Member

zsdonghao commented Dec 27, 2016

IMPORTANT

@lucidfrontier45 @wagamamaz the latest version of TL has an is_fix argument, so you can do as follows:

if is_train:
    network = DropoutLayer(network, 0.8, is_fix=True, name='xxx')

@zsdonghao changed the title from "Switching training/test phase" to "Switching training/test phase of DropoutLayer" on Dec 27, 2016
@quelle1

quelle1 commented Dec 15, 2017

network = Conv2d(net_in, df_dim, (k, k), (2, 2), act=lambda x: tl.act.lrelu(x, 0.2), padding='SAME', W_init=w_init, name='h0/conv2d')
tf.summary.histogram('h0/conv2d',tf.get_collection(tf.GraphKeys.VARIABLES, 'h0/conv2d'))

How do I get the variables of the network? TensorBoard shows nothing.

zsdonghao pushed a commit that referenced this issue May 4, 2019
add cloudpickle to requirement.txt