
Add sequence reshape operator #7662

Merged · 6 commits merged into PaddlePaddle:develop on Jan 19, 2018
Conversation

@pkuyym (Contributor) commented on Jan 18, 2018

Resolves #6678

namespace ops = paddle::operators;
REGISTER_OP_CUDA_KERNEL(
    sequence_reshape,
    ops::SequenceReshapeKernel<paddle::platform::CUDADeviceContext, float>);
Reviewer (Contributor): You also need to register the double type for sequence_reshape.

Author (@pkuyym): Done.
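For reference, a minimal sketch of the change the reviewer asks for, assuming the kernel template instantiates cleanly for double (the merged registration may differ in detail):

namespace ops = paddle::operators;
REGISTER_OP_CUDA_KERNEL(
    sequence_reshape,
    ops::SequenceReshapeKernel<paddle::platform::CUDADeviceContext, float>,
    ops::SequenceReshapeKernel<paddle::platform::CUDADeviceContext, double>);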

then out is a LoDTensor:
    out.lod  = [[0, 1, 3]]
    out.data = [[0.1, 0.2, 0.3, 0.4],
                [0.5, 0.6, 0.7, 0.8], [0.9, 1.0, 1.1, 1.2]]
Reviewer (Contributor): These values can be written as integers so that the example is easier to read.

Author (@pkuyym): Done.
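A sketch of what the integer-valued example might look like; the input side is inferred from the output shown above with new_dim = 4, and the concrete values are illustrative rather than taken from the merged docstring:

x is a LoDTensor:
    x.lod  = [[0, 2, 6]]
    x.data = [[1, 2], [3, 4],
              [5, 6], [7, 8], [9, 10], [11, 12]]
    x.dims = [6, 2]

set new_dim = 4

then out is a LoDTensor:
    out.lod  = [[0, 1, 3]]
    out.data = [[1, 2, 3, 4],
                [5, 6, 7, 8], [9, 10, 11, 12]]
    out.dims = [3, 4]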

"to 0 after reshaped.",
i + 1);
out_lod[0].push_back(out_lod[0].back() + offset);
}
Reviewer (Contributor): I think lines 50~64 should be put in InferShape; this code belongs to input data validity checking.

Author (@pkuyym): I think it's OK to do this in the kernel.
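For context, a hypothetical reconstruction of the validity check being debated (whether it belongs in InferShape or in the kernel); variable names such as in_lod, in_width and out_width are assumptions based on the fragments quoted in this thread:

for (size_t i = 0; i + 1 < in_lod[0].size(); ++i) {
  size_t seq_len = in_lod[0][i + 1] - in_lod[0][i];
  // Each sequence's element count has to be divisible by new_dim, otherwise
  // its length would be ill-defined (e.g. rounded down to 0) after reshaping.
  PADDLE_ENFORCE_EQ((seq_len * in_width) % out_width, 0,
                    "Element count of sequence %d is not divisible by new_dim.",
                    i + 1);
  size_t offset = seq_len * in_width / out_width;
  out_lod[0].push_back(out_lod[0].back() + offset);
}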

auto x_dims = ctx->GetInputDim("X");
PADDLE_ENFORCE_EQ(x_dims.size(), 2U, "Rank of Input(X) should be 2.");
int dimension = ctx->Attrs().Get<int>("new_dim");
ctx->SetOutputDim("Out", {x_dims[0], static_cast<int64_t>(dimension)});
Reviewer (Contributor): The output dim may not be {x_dims[0], dimension}, and the output dim can be computed in InferShape.
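A hedged sketch of the reviewer's point: with a rank-2 input, the output row count could be derived from the total element count rather than reused from x_dims[0] (the merged InferShape may differ in detail):

auto x_dims = ctx->GetInputDim("X");
PADDLE_ENFORCE_EQ(x_dims.size(), 2U, "Rank of Input(X) should be 2.");
int new_dim = ctx->Attrs().Get<int>("new_dim");
// The total element count is unchanged, so rows_out = rows_in * width_in / new_dim.
int64_t out_rows = x_dims[0] * x_dims[1] / new_dim;
ctx->SetOutputDim("Out", {out_rows, static_cast<int64_t>(new_dim)});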

auto& out_lod = *out->mutable_lod();
out_lod.resize(1);
out_lod[0].clear();
out_lod[0].push_back(0);
Reviewer (Contributor): What if out_width equals in_dims[1]?

Author (@pkuyym): Just do the copy.
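A minimal sketch of the "just do the copy" branch being discussed: when the requested width equals the input width, the data and its LoD pass through unchanged (variable names follow the surrounding kernel fragments and are assumptions):

if (out_width == in_width) {
  // Nothing to recompute: copy the data and reuse the input LoD as-is.
  framework::Copy(*in, context.GetPlace(), out);
  out->set_lod(in->lod());
} else {
  // Recompute the level-0 LoD and copy, as in the fragments quoted above.
}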

p_in_data + in_offset, bytes, dev_ctx.stream());
#endif
}
}
Reviewer (Contributor): From the description of the example, you only need to copy the input to the output and reset out_lod and out_dim; it doesn't have to be this complex.

Author (@pkuyym): Thanks.
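A hedged sketch of the simplification the reviewer describes, assuming the rows are stored contiguously so a single copy plus a rebuilt LoD and dims is enough; this is reconstructed from the fragments in this thread, not the merged file:

void Compute(const framework::ExecutionContext& context) const override {
  auto* in = context.Input<framework::LoDTensor>("X");
  auto* out = context.Output<framework::LoDTensor>("Out");
  int64_t out_width = context.Attr<int>("new_dim");
  int64_t in_width = in->dims()[1];

  // One contiguous copy; Copy also allocates the destination.
  framework::Copy(*in, context.GetPlace(), out);

  // Rebuild the level-0 LoD: each sequence length is rescaled by in_width / out_width.
  auto& in_lod = in->lod();
  auto& out_lod = *out->mutable_lod();
  out_lod.resize(1);
  out_lod[0].resize(in_lod[0].size());
  out_lod[0][0] = 0;
  for (size_t i = 1; i < in_lod[0].size(); ++i) {
    size_t seq_len = in_lod[0][i] - in_lod[0][i - 1];
    out_lod[0][i] = out_lod[0][i - 1] + seq_len * in_width / out_width;
  }
  out->Resize({static_cast<int64_t>(out_lod[0].back()), out_width});
}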

}

out->mutable_data<T>(context.GetPlace());
framework::Copy(*in, context.GetPlace(), out);
Reviewer (Contributor): Line 65 can be moved up to line 40, and out->mutable_data<T>(context.GetPlace()); can be removed.

Author (@pkuyym): It seems Copy will invoke mutable_data on the destination tensor, so line 64 is not necessary.
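To illustrate the point, under the assumption the author states (framework::Copy resizes and allocates the destination internally), the copy alone is sufficient:

// No explicit out->mutable_data<T>(...) needed beforehand: Copy is assumed
// to allocate `out` itself before writing into it.
framework::Copy(*in, context.GetPlace(), out);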

} else {
auto& out_lod = *out->mutable_lod();
out_lod.resize(1);
out_lod[0].clear();
Reviewer (Contributor): push_back effectively increases the container size by one, which causes an automatic reallocation of the allocated storage space if, and only if, the new vector size surpasses the current vector capacity. You can replace out_lod[0].clear(); with out_lod[0].resize(seq_num);.
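A small sketch of the suggestion, sizing the level-0 LoD once instead of clearing it and growing it with push_back; seq_num, in_lod, in_width and out_width are assumed names, and the extra +1 slot accounts for the leading 0 offset:

auto& out_lod = *out->mutable_lod();
out_lod.resize(1);
out_lod[0].resize(seq_num + 1);  // sized once; no reallocation while filling
out_lod[0][0] = 0;
for (size_t i = 0; i < seq_num; ++i) {
  size_t seq_len = in_lod[0][i + 1] - in_lod[0][i];
  out_lod[0][i + 1] = out_lod[0][i] + seq_len * in_width / out_width;
}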

op_desc_ptr->SetOutput(framework::GradVarName("X"), InputGrad("X"));
op_desc_ptr->SetAttrMap(Attrs());
return std::unique_ptr<framework::OpDesc>(op_desc_ptr);
}
Reviewer (Contributor): I don't think you need to override Apply; you can use the default xxxGradOpMaker. You can refer to this, and register the op with REGISTER_OP.

Author (@pkuyym): I think adding a GradOpMaker explicitly is not harmful.

Reviewer (Contributor): Yes, I agree with you, but it seems unnecessary, and sequence_reshape_op should be consistent with the other ops. I think Apply should only be overridden in complex ops such as while_op and recurrent_op, where the default GradOpMaker does not meet the op's needs.

Author (@pkuyym): The default GradOpMaker will make the prototxt contain many unnecessary variables.

Reviewer (Contributor): I see, it is helpful for memory optimization.
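For reference, a hedged reconstruction of what the explicit GradOpMaker might look like, built around the Apply fragment quoted above; the class name and the exact inputs wired in are assumptions. The point of keeping it explicit is that only the variables the gradient op really needs end up in the generated prototxt:

class SequenceReshapeGradOpMaker : public framework::SingleGradOpDescMaker {
 public:
  using framework::SingleGradOpDescMaker::SingleGradOpDescMaker;

 protected:
  std::unique_ptr<framework::OpDesc> Apply() const override {
    auto* op_desc_ptr = new framework::OpDesc();
    op_desc_ptr->SetType("sequence_reshape_grad");
    // Wire in only what the backward op needs, instead of every forward
    // input/output the default maker would forward.
    op_desc_ptr->SetInput("X", Input("X"));
    op_desc_ptr->SetInput(framework::GradVarName("Out"), OutputGrad("Out"));
    op_desc_ptr->SetOutput(framework::GradVarName("X"), InputGrad("X"));
    op_desc_ptr->SetAttrMap(Attrs());
    return std::unique_ptr<framework::OpDesc>(op_desc_ptr);
  }
};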

@chengduoZH (Contributor) left a review: LGTM+

@pkuyym merged commit 4f93331 into PaddlePaddle:develop on Jan 19, 2018