Add support for variable length sequences in RNNs #873
Conversation
torch/nn/utils/rnn.py
Outdated
    if batch_first:
        output = output.transpose(0, 1)
    return output
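To illustrate what that transpose buys, a minimal sketch against the public torch.nn.utils.rnn API (sizes are made up): with batch_first the unpacked output comes back batch-major instead of time-major.

    import torch
    from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

    # Two sequences of true lengths 3 and 2, padded to length 3.
    padded = torch.randn(3, 2, 5)                  # (seq_len, batch, features)
    packed = pack_padded_sequence(padded, [3, 2])  # lengths sorted descending

    out, lengths = pad_packed_sequence(packed)
    assert out.shape == (3, 2, 5)                  # time-major by default

    out_bf, _ = pad_packed_sequence(packed, batch_first=True)
    assert out_bf.shape == (2, 3, 5)               # transposed, as in the diff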
Thanks for picking this up for me @apaszke; this looks great!
I'm going to let the PyOpenNMT people know about this, because I think switching to it could simplify some of the PyOpenNMT code, e.g. https://github.com/pytorch/examples/blob/master/OpenNMT/onmt/Translator.py#L56-L79
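For reference, the kind of simplification meant here, sketched against today's torch.nn.utils.rnn names (the encoder sizes are invented): instead of slicing per-sequence outputs by hand, sort by length, pack once, run the RNN, and unpack.

    import torch
    from torch import nn
    from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

    rnn = nn.LSTM(input_size=8, hidden_size=16)
    batch = torch.randn(5, 3, 8)       # (seq_len, batch, input_size), padded
    lengths = [5, 4, 2]                # true lengths, sorted descending

    packed = pack_padded_sequence(batch, lengths)
    packed_out, (h_n, c_n) = rnn(packed)           # padding never enters the RNN
    output, _ = pad_packed_sequence(packed_out)    # (5, 3, 16), padding restored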
        return descriptor


    def descriptor_sequence(tensor, batch_sizes):
         self.dropout_seed = torch.IntTensor(1).random_()[0]
         self.dropout_state = dropout_state

     def forward_extended(self, input, weight, hx):
-        assert(cudnn.is_acceptable(input))
+        assert cudnn.is_acceptable(input)
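The parenthesized form is worth avoiding because assert is a statement, not a function; the failure mode (my illustration, not from the thread) is that parentheses around multiple values build a truthy tuple:

    x = 0
    assert (x == 0)                   # redundant parens, but harmless

    # Bug: this builds the tuple (False, "x should be 1"), which is truthy,
    # so the assertion can never fire (recent CPython emits a SyntaxWarning).
    assert (x == 1, "x should be 1")  # always passes!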
Nice. @ngimel might want to review the cudnn parts.
One thought: there's an implicit (?) invariant that differentiable arguments to autograd functions are of type Variable; there are various places that check for this. Is the use of PackedSequence instead of a Variable going to break anything in e.g. DataParallel?
torch/nn/modules/rnn.py
Outdated
@@ -234,6 +236,8 @@ class LSTM(RNNBase):

     Inputs: input, (h_0, c_0)
+        - **input** (seq_len, batch, input_size): tensor containing the features of the input sequence.
+          The input can also be a packed variable length sequence. See :func:`torch.nn.utils.rnn.pack_padded_sequence`
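To make the docstring addition concrete, a small sketch (sizes invented, current API) of feeding a packed sequence straight into an LSTM together with explicit initial states:

    import torch
    from torch import nn
    from torch.nn.utils.rnn import pack_padded_sequence

    lstm = nn.LSTM(input_size=4, hidden_size=6, num_layers=1)
    padded = torch.randn(7, 2, 4)                 # (seq_len, batch, input_size)
    packed = pack_padded_sequence(padded, [7, 3])

    h_0 = torch.zeros(1, 2, 6)                    # (num_layers, batch, hidden)
    c_0 = torch.zeros(1, 2, 6)
    packed_out, (h_n, c_n) = lstm(packed, (h_0, c_0))
    # With packed input, h_n holds each sequence's state at its true last
    # step, not at the padded end.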
torch/nn/utils/rnn.py
Outdated
@@ -6,19 +6,43 @@

 PackedSequence = namedtuple('PackedSequence', ['data', 'batch_sizes'])


-def pack_padded_sequence(tensor, lengths, batch_first=False):
+def pack_padded_sequence(input, lengths, batch_first=False):
     """Packs a Variable containing padded sequences of variable length.
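For intuition about the (data, batch_sizes) layout named in the namedtuple, a tiny worked example (values chosen for illustration):

    import torch
    from torch.nn.utils.rnn import pack_padded_sequence

    # Two padded sequences: [1, 2, 3] and [4, 5, <pad>].
    padded = torch.tensor([[1., 4.],
                           [2., 5.],
                           [3., 0.]])        # (seq_len=3, batch=2)
    packed = pack_padded_sequence(padded, [3, 2])

    print(packed.data)          # tensor([1., 4., 2., 5., 3.]) -- time-major, no padding
    print(packed.batch_sizes)   # tensor([2, 2, 1]) -- active sequences per step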
@adamlerer, good point re: DataParallel. But a packed tensor can't be scattered, so the way to do multi-GPU with this is to have a module that accepts a padded tensor Variable as input, and wrap that module in DataParallel.
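A sketch of that pattern (the module, sizes, and the enforce_sorted flag are mine, written against today's API rather than the Variable-era one): packing happens inside the wrapped module, so DataParallel only ever scatters padded tensors along the batch dimension.

    import torch
    from torch import nn
    from torch.nn.utils.rnn import pack_padded_sequence, pad_packed_sequence

    class PackedRNN(nn.Module):
        """Takes a padded batch plus lengths; packs internally so that
        nn.DataParallel can scatter the padded input across devices."""
        def __init__(self):
            super().__init__()
            self.rnn = nn.GRU(input_size=8, hidden_size=16, batch_first=True)

        def forward(self, padded, lengths):
            packed = pack_padded_sequence(padded, lengths.cpu(),
                                          batch_first=True, enforce_sorted=False)
            out, _ = self.rnn(packed)
            out, _ = pad_packed_sequence(out, batch_first=True)
            return out

    # batch_first keeps the batch on dim 0, the dimension DataParallel scatters.
    model = nn.DataParallel(PackedRNN())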
The PR is still lacking docs and pep8 fixes, so it's not ready to merge yet, but I wanted to get it out today so it can be reviewed. I'll address any comments tomorrow.
Fixes #789.
cc @jekbradbury