fuse optimize op transpiler #8940
Conversation
int grid_size = blockDim.x * gridDim.x;
for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < num; i += grid_size) {
  T g_data = g[i];
  T p_data = p[i];
These assignments aren't needed?
Yes, these assignments can be removed.
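For illustration, a minimal sketch of the grid-stride loop with the temporaries dropped; the kernel name, the learning-rate argument lr, and the exact update formula are assumptions (only g, p, and num appear in the diff):

template <typename T>
__global__ void SGDKernel(const T* g, T* p, const T* lr, int num) {
  int grid_size = blockDim.x * gridDim.x;
  for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < num;
       i += grid_size) {
    // Read g[i] and p[i] directly instead of copying them into
    // g_data / p_data first; the compiler keeps them in registers anyway.
    p[i] -= lr[0] * g[i];
  }
}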
auto* grad_data = grads[j]->data<T>();
auto* param_data = params[j]->data<T>();
int param_num = params[j]->numel();
int block = 512;
how is the block number decided?
The block size is decided by the properties of the GPU and by the workload of the CUDA kernel. On most GPUs the maximum number of threads per block is 1024, so 1024 would be the usual choice. But for SGDKernel the size of param_data is sometimes small, e.g. fewer than 200 elements, which is far less than 1024; if we still launched blocks of 1024 threads, most of the GPU resources in each block would be wasted.
So block = 512 here, though that may not be appropriate either.
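For concreteness, a hedged sketch of what the launch site could look like under this choice; SGDKernel, lr_data, and stream are assumptions, not taken from the diff:

int block = 512;                             // fixed block size from the diff
int grid = (param_num + block - 1) / block;  // enough blocks to cover param_num
SGDKernel<T><<<grid, block, 0, stream>>>(grad_data, param_data, lr_data,
                                         param_num);

With the grid-stride loop above, the kernel stays correct even if the grid is later capped to a smaller value; the block size mainly trades occupancy against wasted threads on small parameters.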
template <typename T>
class SGDGroupOpKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
delete?
sgd_group_op.h is just used to analyze the effect of using only one SGD op kernel on the GPU. It doesn't attempt to merge.
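Pieced together from the snippets above, a hedged sketch of what the grouped kernel might look like; the input/output names, the MultiInput/MultiOutput calls, and the device-context access are assumptions, not confirmed by the diff:

template <typename T>
class SGDGroupOpKernel : public framework::OpKernel<T> {
 public:
  void Compute(const framework::ExecutionContext& ctx) const override {
    // One op walks over every (grad, param) pair instead of running a
    // separate sgd_op per parameter.
    auto grads = ctx.MultiInput<framework::Tensor>("Grads");
    auto params = ctx.MultiOutput<framework::Tensor>("ParamOuts");
    auto* lr = ctx.Input<framework::Tensor>("LearningRate");
    auto stream =
        ctx.template device_context<platform::CUDADeviceContext>().stream();
    for (size_t j = 0; j < params.size(); ++j) {
      auto* grad_data = grads[j]->data<T>();
      auto* param_data = params[j]->mutable_data<T>(ctx.GetPlace());
      int param_num = static_cast<int>(params[j]->numel());
      int block = 512;
      int grid = (param_num + block - 1) / block;
      SGDKernel<T><<<grid, block, 0, stream>>>(grad_data, param_data,
                                               lr->data<T>(), param_num);
    }
  }
};

Note that this still launches one kernel per parameter, so per-launch overhead remains; it only removes the per-op framework overhead, which is what the analysis is meant to isolate.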
# limitations under the License.

def fuse_optimize_op(input_program):
Do we have a plan to do the transpiler in C++? Iterating over ops in Python won't scale to large programs.
Looks good in general. One question: how would this method scale to other optimizers, like momentum and adam?
@panyx0718 Other optimizers also need a …
I have done some experiments replacing sgd_op with sgd_group_op; this is the experiment result:
[experiment results table]
issue: #8941