Add row conv operator #6013

Merged

sidgoyal78 merged 37 commits into PaddlePaddle:develop from sidgoyal78:row_conv

Dec 11, 2017

Contributor

sidgoyal78 commented Nov 29, 2017 (edited)

Resolves #5612 by adding an implementation of the row-convolution operator.

A few notes:

The CPU implementation is straightforward for now (it uses unvectorized loops) and can be optimized.
The GPU version is a direct port of the kernels written by @qingqing01 for the earlier version of Paddle (https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/function/RowConvOpGpu.cu). I spent some time looking for improvements but could not find any.
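
To make the "unvectorized loops" remark concrete, here is a minimal sketch of what a naive CPU forward pass for one sequence could look like. It is not the code from this PR: the `Matrix` type and the `RowConvForwardNaive` name are hypothetical, the input is assumed to be a single sequence of shape (T x N) with a filter of shape (C x N), and truncating the lookahead at the end of the sequence is an assumption.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical row-major matrix used only for this sketch:
// element (r, c) lives at data[r * cols + c].
struct Matrix {
  std::size_t rows;
  std::size_t cols;
  std::vector<float> data;
  Matrix(std::size_t r, std::size_t c) : rows(r), cols(c), data(r * c, 0.0f) {}
  float &at(std::size_t r, std::size_t c) { return data[r * cols + c]; }
  float at(std::size_t r, std::size_t c) const { return data[r * cols + c]; }
};

// Naive row convolution over one sequence.
// in: (T x N), filter: (C x N), result: (T x N).
// Each output row k is a per-column weighted sum of input rows k .. k + C - 1.
Matrix RowConvForwardNaive(const Matrix &in, const Matrix &filter) {
  Matrix out(in.rows, in.cols);  // zero-initialized accumulator
  for (std::size_t k = 0; k < in.rows; ++k) {
    for (std::size_t w = 0; w < filter.rows; ++w) {
      // Assumption: the lookahead stops at the end of the sequence.
      if (k + w >= in.rows) break;
      for (std::size_t d = 0; d < in.cols; ++d) {
        out.at(k, d) += filter.at(w, d) * in.at(k + w, d);
      }
    }
  }
  return out;
}
```

The three nested loops (time step, filter row, feature column) are the part that a later optimization pass would try to vectorize.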

sidgoyal78 added 4 commits

November 21, 2017 16:11


          Add initial CPU version


          Modify CPU version

9ecf3e6


          Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

838b161

… row_conv


          Add naive GPU version

65173bc

sidgoyal78 changed the title ~~Add row conv operator~~ Add row conv operator (in progress)

sidgoyal78 added 20 commits

November 28, 2017 17:33


          Remove type error for inp arguments

5b43ad0


          Remove update bug

272a912


          Try size_t* for batch_indices

543e2f3


          Fix variable name

1b581f2


          Fix minor <= bug

ebc4f47


          Add simplest dFilter implementation

640b873


          Add simplest dX grad implementation

261c972


          Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

c425096

… row_conv


          Check += error

6ef6c0b


          Remove arg list error

dcb84af


          Add tid checks

9fed9d7


          Add single thread implementation

0ac4c6f


          Fix input

1d02152


          Add better kernels for forward and dX computation

705cff7


          Modify blockDim and gridDim for dX kernel

5b43f46


          Add forward prop with shared memory

99ae545


          Fix variable name

9cc579d


          Add pointer to sharedmem

d5a0040


          Add dX with shared memory

84ed570


          Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

c9a1f96

… row_conv

qingqing01 self-requested a review

December 1, 2017 02:17

sidgoyal78 added 4 commits

November 30, 2017 19:34


          Modify backprop for cpu version of dX


          Add better dW kernel

56cf035


          Fix minor typo

84926a0


          Fix index in dW

a6d77cf

sidgoyal78 added 7 commits

December 1, 2017 11:56


          Try alt dX for cpu version

c5a793e


          Add improved dW version

d6410e2


          Fix errors

251ac0f


          Fix variable dim for shared mem


          Fix dim indexing in dW improved gpu

29ab076


          Fix documentation

025c32d


          Fix documentation

2fbc0cb

sidgoyal78 changed the title ~~Add row conv operator (in progress)~~ Add row conv operator

qingqing01 reviewed


Contributor

qingqing01 left a comment (edited)

@sidgoyal78 I like your code, including the code in your previous PRs. In my opinion, your code is very beautiful and of high quality.

paddle/operators/row_conv_op.cc Outdated

+              $$
+              out_{i, :} = \sum_{j=i}^{i + context} in_{j,:} \cdot W_{j-i, :}
+              $$

Contributor

qingqing01 Dec 5, 2017

For the doc, there are some comments in
https://github.com/PaddlePaddle/Paddle/blob/develop/paddle/function/RowConvOp.cpp#L97

Contributor Author

sidgoyal78 Dec 5, 2017

Thanks for the pointer, I have included them now.
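
For reference, the lookahead sum in the doc string can also be written with the filter indexed by the offset into the future, w = j - i, which matches the `weights(w, d) * cip_seq(k + w, d)` loop quoted later in this review. This is a sketch under assumptions: T (the sequence length) and C (the number of filter rows) are symbols introduced here, and stopping the sum at the end of the sequence is assumed rather than taken from the quoted doc.

```latex
$$
out_{i,d} = \sum_{w=0}^{\min(C-1,\; T-1-i)} in_{i+w,\,d} \cdot W_{w,\,d},
\qquad 0 \le i < T
$$

% Worked case for T = 3, C = 2 (products are taken per column d):
$$
out_{0,d} = in_{0,d} W_{0,d} + in_{1,d} W_{1,d}, \quad
out_{1,d} = in_{1,d} W_{0,d} + in_{2,d} W_{1,d}, \quad
out_{2,d} = in_{2,d} W_{0,d}
$$
```

Under that assumption, the last row has no future rows left, so only the first filter row contributes to it.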

paddle/operators/row_conv_op.cc Outdated

+                  AddInput("Filter",
+                           "(Tensor), the input(Filter) is a learnable parameter. It "
+                           "is a 2-D tensor with shape (future_context x N), where, "
+                           "future_context is the batch size and N is the data dimension.");

Contributor

qingqing01 Dec 5, 2017

I like your code, and I think the name future_context is good :)

One correction: "future_context is the batch size" should read "future_context is the future context length."

Contributor Author

sidgoyal78 Dec 5, 2017

Fixed, thanks.

paddle/operators/row_conv_op.cc Outdated

+                  auto *Out = context.Output<LoDTensor>("Out");
+                  Out->mutable_data<T>(context.GetPlace());
+                  context.ShareLoD("X", "Out");

Contributor

qingqing01 Dec 5, 2017

Since there is ctx->ShareLoD("X", "Out") in the InferShape, and the previous bug for ShareLoD in InferShape has been fixed, line 123 can be removed.

Contributor Author

sidgoyal78 Dec 5, 2017

Fixed.

paddle/operators/row_conv_op.cc

+                            cot_seq(k, d) = weights(w, d) * cip_seq(k + w, d);
+                          } else {
+                            cot_seq(k, d) += weights(w, d) * cip_seq(k + w, d);
+                          }

Contributor

qingqing01 Dec 5, 2017

Maybe we can use an elementwise multiply and a column-wise sum to remove the for loops at line 145 and line 147. That optimization can be done in the future, so I think the code is OK as it is in this PR.

Contributor Author

sidgoyal78 Dec 5, 2017

Sure.
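
As an illustration of the suggestion above, the sketch below expresses each output row as a column-wise sum of the elementwise product between the filter and a window of input rows. It is not code from this PR: `MulAccumulateRow` and `RowConvByRows` are hypothetical names, the buffers are assumed to be row-major, and plain C++ is used only to show the arithmetic. A real implementation would presumably map the row primitive onto a vectorized tensor expression instead of a hand-written loop.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical row primitive: out_row[d] += a[d] * b[d] for d in [0, n).
// This is the piece a vectorized library call would replace.
inline void MulAccumulateRow(const float *a, const float *b, float *out_row,
                             std::size_t n) {
  for (std::size_t d = 0; d < n; ++d) out_row[d] += a[d] * b[d];
}

// in: (T x N) row-major, filter: (C x N) row-major, result: (T x N) row-major.
// Output row k is the column-wise sum of filter[0..window) * in[k..k+window),
// where the product is elementwise.
std::vector<float> RowConvByRows(const std::vector<float> &in,
                                 const std::vector<float> &filter,
                                 std::size_t T, std::size_t C, std::size_t N) {
  std::vector<float> out(T * N, 0.0f);  // zero-filled, so only += is needed
  for (std::size_t k = 0; k < T; ++k) {
    // Assumption: the window shrinks near the end of the sequence.
    const std::size_t window = std::min(C, T - k);
    for (std::size_t w = 0; w < window; ++w) {
      MulAccumulateRow(filter.data() + w * N, in.data() + (k + w) * N,
                       out.data() + k * N, N);
    }
  }
  return out;
}
```

Starting from a zero-filled output also removes the need for the `=` versus `+=` branch in the quoted loop, at the cost of one extra pass to clear the buffer.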

paddle/operators/row_conv_op.cc Outdated

+                void Compute(const framework::ExecutionContext &context) const override {
+                  auto *X = context.Input<LoDTensor>("X");
+                  auto *Filter = context.Input<Tensor>("Filter");
+                  auto *Out = context.Output<LoDTensor>("Out");

Contributor

qingqing01 Dec 5, 2017

The naming style:
X->x
Filter->filter
Out->out

https://google.github.io/styleguide/cppguide.html#Variable_Names

Contributor Author

sidgoyal78 Dec 5, 2017

Fixed.

Contributor Author

sidgoyal78 Dec 5, 2017

(I still need to fix this in the .cu code.)

sidgoyal78 added 2 commits

December 5, 2017 10:17


          Address review comments

456cf13


          Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

2ae816a

… row_conv

qingqing01 approved these changes


Contributor

qingqing01 left a comment

LGTM.

sidgoyal78 merged commit 4ff6bc1 into PaddlePaddle:develop

kuke added this to Done in Speech in Fluid
