Neon Depthwise Convolution Transpose Function #3792

hedaoyuan · 2017-08-31T16:33:39Z

Add a depthwise convolution transpose function based on ARM-NEON optimization.

NHZlX

I see that NeonDepthwiseConvTranspose supports the stride == 2 case. But why not change it here? https://github.com/hedaoyuan/Paddle/blob/90bf4f60aea012a3eeb819fe4655069d66dbe6e6/paddle/function/neon/NeonDepthwiseConvTranspose.cpp#L102

NHZlX · 2017-09-06T15:30:22Z

paddle/function/neon/NeonDepthwiseConv.h

@@ -474,6 +475,97 @@ struct DepthwiseConvKernel<4, 2> {
  }
 };

+template <class T>


Should we put the padding function into a neon_util file or something similar, in case NEON extensions of other convolutions also use this padding method?

I think it can be moved into neon_util.h once it is actually needed.

hedaoyuan · 2017-09-07T04:11:32Z

I see that NeonDepthwiseConvTranspose supports the stride == 2 case. But why not change it here?

In conv_transpose, the stride is only used to transform the input data (an operation similar to padding). After that transformation, the convolution itself can be treated as having a stride of 1.
https://github.com/vdumoulin/conv_arithmetic
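
To make the point above concrete, here is a minimal sketch for a single channel in 1-D. It is not the code from this PR: the function names `convTransposeDirect` and `convTransposeViaStride1` are hypothetical, and it assumes a non-empty input, `pad <= k - 1`, and a positive output length. The first function follows the scatter definition of a transposed convolution. The second inserts `stride - 1` zeros between input elements, adds a zero border of `k - 1 - pad`, and then runs an ordinary stride-1 convolution with the kernel flipped.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Reference: scatter every input element into the output (the definition of
// a transposed convolution), then crop `pad` elements from both ends.
std::vector<float> convTransposeDirect(const std::vector<float>& x,
                                       const std::vector<float>& w,
                                       std::size_t stride, std::size_t pad) {
  assert(!x.empty() && pad < w.size());
  const std::size_t n = x.size(), k = w.size();
  std::vector<float> full((n - 1) * stride + k, 0.0f);
  for (std::size_t i = 0; i < n; ++i) {
    for (std::size_t j = 0; j < k; ++j) {
      full[i * stride + j] += x[i] * w[j];
    }
  }
  const std::ptrdiff_t crop = static_cast<std::ptrdiff_t>(pad);
  return std::vector<float>(full.begin() + crop, full.end() - crop);
}

// Equivalent form: the stride only shapes the input buffer (zero insertion
// plus a zero border); the convolution loop itself always moves by 1.
std::vector<float> convTransposeViaStride1(const std::vector<float>& x,
                                           const std::vector<float>& w,
                                           std::size_t stride,
                                           std::size_t pad) {
  assert(!x.empty() && pad < w.size());
  const std::size_t n = x.size(), k = w.size();
  const std::size_t border = k - 1 - pad;
  // Dilated and padded input: x[i] lands at border + i * stride.
  std::vector<float> z((n - 1) * stride + 1 + 2 * border, 0.0f);
  for (std::size_t i = 0; i < n; ++i) {
    z[border + i * stride] = x[i];
  }
  // Stride-1 sliding window with the kernel read in reverse order.
  std::vector<float> y(z.size() - k + 1, 0.0f);
  for (std::size_t o = 0; o < y.size(); ++o) {
    for (std::size_t j = 0; j < k; ++j) {
      y[o] += z[o + j] * w[k - 1 - j];
    }
  }
  return y;
}
```

Both functions produce an output of length `(n - 1) * stride + k - 2 * pad`, so only the input preparation depends on the stride, which is the reason the stride-1 kernel can be reused.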

NHZlX · 2017-09-07T04:33:17Z

LGTM

hedaoyuan added 5 commits August 31, 2017 19:57

Refine the neon depthwise convolution code(separate the Function and kernel).

f7e75a0

Refine NeonDepthwiseConv.

4b6b725

[Refine code]Move class Padding into the NeonDepthwiseConv.h.

40d47fa

Add NeonDepthwiseConvTransposeFunction.

840104c

Add stride support 2 for NeonDepthwiseConvTranspose.

90bf4f6

hedaoyuan requested a review from NHZlX September 6, 2017 11:09

NHZlX reviewed Sep 7, 2017

NHZlX approved these changes Sep 7, 2017

hedaoyuan merged commit a8efed0 into PaddlePaddle:develop Sep 7, 2017

hedaoyuan added this to Convolution Optimization in Embedded and Mobile Deployment Sep 15, 2017
