[arm]add gemm + relu6/leakyrelu fusion #2674

chenjiaoAngel · 2019-12-25T03:40:14Z

No description provided.

update

….cc to distinguish between conv3x3s1_depthwise_fp32.cc

it is coped from __gemm_sdot_meta_.h

update code

… test=develop

fix build error in kernels/x86/conv_compute.h

update

Merge branch 'develop' of git://github.com/PaddlePaddle/Paddle-Lite into PaddlePaddle-develop

Merge branch 'conv_pad' of https://github.com/chenjiaoAngel/Paddle-Lite into conv_pad

delete con2d_transpose test, this test can found in test/math/

yiicy · 2020-01-13T12:33:48Z

lite/tests/kernels/CMakeLists.txt

@@ -11,8 +11,6 @@ if((NOT LITE_WITH_OPENCL AND NOT LITE_WITH_FPGA) AND (LITE_WITH_X86 OR LITE_WITH
    lite_cc_test(test_kernel_activation_compute SRCS activation_compute_test.cc DEPS arena_framework ${npu_kernels} ${xpu_kernels} ${x86_kernels} ${cuda_kernels} ${arm_kernels} ${lite_ops} ${host_kernels})
    lite_cc_test(test_kernel_argmax_compute SRCS argmax_compute_test.cc DEPS arena_framework ${x86_kernels} ${cuda_kernels} ${arm_kernels} ${lite_ops} ${host_kernels})
    lite_cc_test(test_kernel_axpy_compute SRCS axpy_compute_test.cc DEPS arena_framework ${x86_kernels} ${cuda_kernels} ${arm_kernels} ${lite_ops} ${host_kernels})
-    lite_cc_test(test_kernel_conv_compute SRCS conv_compute_test.cc DEPS arena_framework ${xpu_kernels} ${npu_kernels} ${x86_kernels} ${cuda_kernels} ${arm_kernels} ${lite_ops} ${host_kernels})
-    lite_cc_test(test_kernel_conv2d_transpose_compute SRCS conv2d_transpose_compute_test.cc DEPS arena_framework ${x86_kernels} ${cuda_kernels} ${arm_kernels} ${lite_ops} ${host_kernels})


这两个单测不跑了？

不跑了，test/math下也有conv_compute和conv_transpose的单测。避免每次修改，要修改两个单测

conv的单测是我最近加的，用来跑npu kernel的，如果是arm kernel会跳过，不会多跑

lite/tests/kernels 目录下的单测最好保留。1、这里面的单测更多是用来测试功能性的东西，比如padding_algorithm，不同参数组合下的结果是否正确。2、这里面的单测会验证operator部分是否正确，而不只是kernel本身是否正确。3、不同平台的单测都可以比较容易的加在这里，只需要改下place就可以，用LITE_WITH_XXX分隔。

lite/tests/math 目录下的单测，我理解是测试同一个OP kernel的多种实现是否正确，应该不要和lite/tests/kernels下的单测合并

比如这样可以很快添加不同平台的单测

嗯，好。我下次提交把这个恢复过来。删除原因是，我现在在加relu6融合，这个需要修改单测，给conv_param.activation_param设值，不然跑单测会挂。不想重复修改单测，所以删除了。

MyPandaShaoxiang · 2020-01-14T07:07:17Z

lite/backends/arm/math/fill_bias_relu.cc

+  "vld1.32 {d10-d11}, [%[din_ptr]]! @ vld1q_f32(din_ptr) \n" \
+  "vld1.32 {d12-d13}, [%[din_ptr]]! @ vld1q_f32(din_ptr) \n" \
+  "vadd.f32 q3, q3, %q[vbias] @ add \n"                      \
+  "vadd.f32 q4, q5, %q[vbias] @ add \n"                      \


这应该是q4,q4?还是就是q4,q5

这应该是q4,q4

MyPandaShaoxiang · 2020-01-14T07:10:37Z

lite/backends/arm/math/fill_bias_relu.cc

+  "vbif q3, q8, q7               @ choose \n"    \
+  "vbif q4, q10, q9              @ choose \n"    \
+  "vbif q5, q12, q11             @ choose \n"    \
+  "vbif q6, q13, q13             @ choose \n"


q6,q14,q13?

MyPandaShaoxiang · 2020-01-14T07:13:26Z

lite/backends/arm/math/fill_bias_relu.cc

+  int remain = channel_size % 16;
+  float32x4_t vzero = vdupq_n_f32(0.f);
+  for (int j = 0; j < channel; j++) {
+  }


空循环？

忘了删除

chenjiaoAngel added 30 commits November 6, 2019 17:30

Merge pull request #10 from PaddlePaddle/develop

3f19a70

update

fix conv 2-pad to 4-pad

c882bdc

fix compute conv shape

be96a73

fix pad, test=develop

e73d3eb

change conv_depthwise_3x3s1_fp.cc name to conv3x3s1p01_depthwise_fp32…

f4cf632

….cc to distinguish between conv3x3s1_depthwise_fp32.cc

delete printf note in conv3x3s1, test=develop

a653861

delete printf note, test=develop

e418016

delete gem_sdot.h, test=develop

2485905

it is coped from __gemm_sdot_meta_.h

update compute padding, test=develop

103e47d

fix padding size, must be 2 or 4. test=develop

0547531

Merge pull request #13 from PaddlePaddle/develop

fb68960

update code

fix format in operators/conv_op.cc, test=develop

85e46ef

change #if 0 to #if 1, test=develop

24dbda6

put 2-pad to 4-pad in AttachImpl, test=develop

762dcf9

fix clang-format error inn tests/math/connv_compute_test, test=develop

236961e

fix x86 test result error, test=develop

89eae2c

add asymmetric padding test case in liite/tests/math/conv_compute.cc,…

a4c2e47

… test=develop

change paddings type to support dynamically modify, test=develop

096cda9

fix x86 build error in connv_compute_test, test=develop

1373c03

fix opencl build error, test=develop

e9321f5

fix oopencl build error, test=develop

4676bd9

fix opencl/conv_compute build error, test=develop

dbc7146

fix opencl/conv_compute build error, test=develop

10fcd9a

fix format in kernels/opencl/conv_computte_ttest,test=develop

e5d933c

Merge branch 'develop' into conv_pad

e4b596b

Merge branch 'develop' into conv_pad

88e3fab

fix build error, test=develop

5337322

fix build error in kernels/x86/conv_compute.h

Merge pull request #20 from PaddlePaddle/develop

53c9624

update

fix ccompute shape error in ooperators/conv_op.h, test=develop

3ba2b26

update coode

e31e21d

Merge branch 'develop' of git://github.com/PaddlePaddle/Paddle-Lite into PaddlePaddle-develop

chenjiaoAngel added 25 commits December 12, 2019 21:23

fix build error in winograd arm, test=develop

2e5c4ce

channge act_param as pointer in conv_block_tuils.h, test=develop

4f9ccbf

fix winograd in no equal 4-padding compute error, test=develop

83cdc82

add conv relu6 and leaky_relu in conv_dw_3x3s2, test=develop

3d5daf9

Merge branch 'develop' into conv_pad

06223b8

fix format, test=develop

8ab3c04

fix format in conv_block_utils, test=develop

e8939ae

move updatePadding from conv_op.cc to conv_op.h, test=develop

9fa4395

fix format conv_op.h, test=develop

22a2031

fix buuilde error in conv_oop.h, test=develop

8df11df

remove flag_relu parameter in conv_3x3_depthwise, test=develop

44b3b06

add conv relu6/lleakyrelu in sgemm, test=develop

21f5490

Merge branch 'develop' into conv_pad

0e6f8de

delete some notes, test=develop

3e78325

fix format, test=develop

adabe36

fix build moobile_android error, test=develop

5843055

pull code

59be1d1

Merge branch 'conv_pad' of https://github.com/chenjiaoAngel/Paddle-Lite into conv_pad

change matmul, conv_transpose, mul and fc in using sgemm, test=develop

ef2ad56

fix build error in sgemm_test.ccc, test=develop

849ea3d

add act_param in gru_utils. test=develop

b922993

fix compute error in conv_dw, test=develop

5bc330b

delete con2d_transpose test, test=develop

f55b68a

delete con2d_transpose test, this test can found in test/math/

fix build error, test=develop

901b450

Merge branch 'develop' into conv_pad

08e5230

Merge branch 'develop' into conv_pad

7b083f6

yiicy reviewed Jan 14, 2020

View reviewed changes

MyPandaShaoxiang requested changes Jan 14, 2020

View reviewed changes

fix compute error， test=develop

4dc4c98

MyPandaShaoxiang approved these changes Jan 14, 2020

View reviewed changes

MyPandaShaoxiang merged commit c0af965 into PaddlePaddle:develop Jan 14, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[arm]add gemm + relu6/leakyrelu fusion #2674

[arm]add gemm + relu6/leakyrelu fusion #2674

chenjiaoAngel commented Dec 25, 2019

yiicy Jan 13, 2020

chenjiaoAngel Jan 14, 2020

zhupengyang Jan 14, 2020

zhupengyang Jan 14, 2020

chenjiaoAngel Jan 14, 2020

MyPandaShaoxiang Jan 14, 2020

chenjiaoAngel Jan 14, 2020

MyPandaShaoxiang Jan 14, 2020

chenjiaoAngel Jan 14, 2020

MyPandaShaoxiang Jan 14, 2020

chenjiaoAngel Jan 14, 2020

[arm]add gemm + relu6/leakyrelu fusion #2674

[arm]add gemm + relu6/leakyrelu fusion #2674

Conversation

chenjiaoAngel commented Dec 25, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment