CudnnLSTM dropout takes no effect #6466

Closed
robotnc opened this issue Dec 23, 2016 · 13 comments
robotnc commented Dec 23, 2016

My environment is: TensorFlow 0.11.0rc2, on Ubuntu 16.04, with CUDA 8.0 and cuDNN 5.1; the GPU is a GTX 1080.

I am using CudnnLSTM from the tensorflow.contrib.cudnn_rnn package. I found that the dropout setting in CudnnLSTM seems to take no effect, and I checked that there is no test for dropout in the op unit tests. So I wrote a small script to test it, shown below:

import tensorflow as tf
from tensorflow.contrib.cudnn_rnn import CudnnLSTM

class Cudnn_model():
  def __init__(self, dropout):
    # Single-layer cuDNN LSTM; "skip_input" feeds the input straight into
    # the cell, so input_size must equal num_units.
    self.model = CudnnLSTM(
        num_layers=1,
        num_units=8,
        input_size=8,
        input_mode="skip_input",
        direction="unidirectional",
        dropout=dropout,
        )

    # cuDNN keeps all weights and biases in one flat parameter buffer.
    params_size_t = self.model.params_size()
    self.params = tf.Variable(tf.ones([params_size_t]), validate_shape=False)

  def run_step(self, rnn_inputs):
    outputs, output_h, output_c = self.model(
        input_data=rnn_inputs,
        input_h=tf.zeros([1, 1, 8]),
        input_c=tf.zeros([1, 1, 8]),
        params=self.params,
        is_training=True)
    self.outputs = outputs
    return outputs

def main():
  # One time step, batch size 1, 8 features.
  inputs = tf.pack([[[0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]]])
  m1 = Cudnn_model(dropout=0.0)
  output1 = m1.run_step(inputs)
  m2 = Cudnn_model(dropout=0.5)
  output2 = m2.run_step(inputs)
  output3 = tf.nn.dropout(output1, 0.5)
  output4 = m1.run_step(tf.nn.dropout(inputs, 0.5))

  config = tf.ConfigProto(allow_soft_placement=True)
  config.gpu_options.allow_growth = True
  sess = tf.Session(config=config)
  sess.run(tf.initialize_all_variables())

  for i in range(5):
    out1, out2, out3, out4 = sess.run([output1, output2, output3, output4])
    print " ----- Try time %d -----" % i
    print "cndnn_dropout=0 : ", out1
    print "cudnn_dropout=0.5 : ", out2
    print "tf_out_dropout=0.5 : ", out3
    print "tf_in_dropout=0.5 : ", out4

if __name__ == "__main__":
  main()

And the result is:

----- Try time 0 -----
cndnn_dropout=0 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
cudnn_dropout=0.5 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
tf_out_dropout=0.5 [[[ 1.2165668   0.          0.          0.          0.          1.52103424
1.52239561  1.52289677]]]
tf_in_dropout=0.5 [[[ 0.6082834   0.6082834   0.6082834   0.76119781  0.6082834   0.76158684
0.6082834   0.6082834 ]]]

 ----- Try time 1 -----
cndnn_dropout=0 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
cudnn_dropout=0.5 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
tf_out_dropout=0.5 [[[ 0.  0.  0.  0.  0.  0.  0.  0.]]]
tf_in_dropout=0.5 [[[ 0.6082834   0.74009657  0.6082834   0.76119781  0.6082834   0.76158684
0.6082834   0.6082834 ]]]

 ----- Try time 2 -----
cndnn_dropout=0 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
cudnn_dropout=0.5 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
tf_out_dropout=0.5 [[[ 1.2165668   0.          0.          1.50730526  1.51733613  1.52103424
1.52239561  0.        ]]]
tf_in_dropout=0.5 [[[ 0.6082834   0.74009657  0.6082834   0.76119781  0.6082834   0.76158684
0.6082834   0.761594  ]]]

 ----- Try time 3 -----
cndnn_dropout=0 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
cudnn_dropout=0.5 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
tf_out_dropout=0.5 [[[ 0.          0.          1.48019314  0.          0.          0.
1.52239561  1.52289677]]]
tf_in_dropout=0.5 [[[ 0.6082834   0.6082834   0.6082834   0.76119781  0.76154053  0.6082834
0.76159316  0.761594  ]]]

 ----- Try time 4 -----
cndnn_dropout=0 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
cudnn_dropout=0.5 [[[ 0.6082834   0.70377535  0.74009657  0.75365263  0.75866807  0.76051712
0.76119781  0.76144838]]]
tf_out_dropout=0.5 [[[ 0.          1.40755069  0.          1.50730526  1.51733613  0.
1.52239561  1.52289677]]]
tf_in_dropout=0.5 [[[ 0.6082834   0.74009657  0.75866807  0.76119781  0.6082834   0.76158684
0.76159316  0.6082834 ]]]

From the results I see that cudnn_dropout = 0.5 takes no effect; the output is always the same as with cudnn_dropout = 0.0.

robotnc changed the title from "CudnnLSTM dropout take no effect" to "CudnnLSTM dropout takes no effect" Dec 23, 2016
michaelisard added the type:bug label Jan 5, 2017
michaelisard assigned zhangyaobit and unassigned zheng-xq Jan 5, 2017
@zhangyaobit

@robotnc Sorry for my late response; I was on vacation. Yes, dropout is not supported yet, see here.

Adding @zheng-xq

@alquraishi

If we do get dropout support, can we also get recurrent dropout, since that's the form that actually seems to help? Similar to what's in LayerNormBasicLSTMCell.
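
For reference, here is a rough sketch of what recurrent dropout looks like with the existing LayerNormBasicLSTMCell; the dropout_keep_prob argument applies dropout inside the recurrent update rather than to the layer inputs or outputs. This is only an illustration against the contrib API, and the exact signature may differ between TF versions:

import tensorflow as tf

# Sketch: LayerNormBasicLSTMCell applies dropout to the candidate
# cell-state update (recurrent dropout), controlled by dropout_keep_prob.
cell = tf.contrib.rnn.LayerNormBasicLSTMCell(
    num_units=8,
    layer_norm=True,
    dropout_keep_prob=0.5)

# Dummy batch of one sequence with 4 time steps and 8 features.
inputs = tf.ones([1, 4, 8])
outputs, state = tf.nn.dynamic_rnn(cell, inputs, dtype=tf.float32)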

zhangyaobit added the stat:contribution welcome label Feb 16, 2017
zhangyaobit removed their assignment Feb 27, 2017
@fxsuper

fxsuper commented Apr 21, 2017

Any updates on this? Not having any dropout support for cuDNN-based RNNs seems really limiting, especially considering how much further ahead PyTorch's support for this is. This isn't exactly a new or rarely used feature.

@alquraishi

What's actually involved in adding (input) dropout support? Is it just a matter of wiring up the places in this file that are marked with /*dropout*/?

vrv removed the stat:contribution welcome label Apr 22, 2017
zheng-xq assigned protoget and unassigned zheng-xq Apr 23, 2017
@robotnc
Author

robotnc commented Apr 23, 2017 via email

robotnc closed this as completed Apr 23, 2017
@alquraishi

alquraishi commented Apr 23, 2017

My experience is very different. I still see a 3x gap between FusedBlockLSTM and CudnnLSTM, so I would still think this is very useful to have. And the issue is that the feature is present as an option but doesn't actually do anything, which is very misleading as it stands.

@robotnc
Author

robotnc commented Apr 23, 2017 via email

@alquraishi

alquraishi commented Apr 23, 2017

TF 1.1rc2. And as I mentioned, it's not just a performance difference but a bona fide bug, because the CudnnLSTM API exposes a dropout option that does not do anything. If you don't mind, please reopen the ticket; otherwise I'll start a new one.

FYI this is on a Pascal Titan X with a bidirectional LSTM of 800 units (each way) and 700 timesteps.

@robotnc
Author

robotnc commented Apr 23, 2017

Dropout is still needed; reopening.

robotnc reopened this Apr 23, 2017
@protoget
Member

protoget commented May 6, 2017

The CudnnRNN dropout change has been submitted; please keep an eye on the nightly builds.

@skye
Member

skye commented Jun 16, 2017

@protoget has this been resolved?

@alquraishi

@protoget I see your commit from last May, but I just tried again with TF 1.4.0rc0 and dropout still doesn't seem to do anything.

@protoget
Member

@skye @alquraishi
Dropout is supported. It is applied between layers, so if you only have one layer it has no effect even when the dropout ratio is nonzero.
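
To illustrate, here is a minimal sketch (assuming the later layer-style tf.contrib.cudnn_rnn.CudnnLSTM API; argument and method names may vary between releases): with num_layers of 2 or more and training enabled, repeated evaluations of the same input should differ because dropout is applied between the stacked layers, whereas a single-layer model stays deterministic.

import tensorflow as tf

# Sketch only: dropout is applied between stacked layers, so it needs
# num_layers >= 2 to have a visible effect.
lstm = tf.contrib.cudnn_rnn.CudnnLSTM(num_layers=2, num_units=8, dropout=0.5)

inputs = tf.ones([1, 1, 8])  # time-major: [time, batch, features]
out_train, _ = lstm(inputs, training=True)    # dropout active
out_infer, _ = lstm(inputs, training=False)   # dropout disabled

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print(sess.run(out_train))  # should vary from run to run
    print(sess.run(out_train))
    print(sess.run(out_infer))  # deterministic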
