
Add FillOp #3505

Closed
wants to merge 1 commit into from

Conversation

reyoung
Collaborator

@reyoung reyoung commented Aug 15, 2017

* Fill Op will fill a tensor with a specific shape and data each time `Run` is invoked, unless `run_once` is True.
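The intended semantics can be sketched in plain C++ (a hypothetical stand-in for illustration, not the actual Paddle kernel; `Filler` and its members are invented names):

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model of the FillOp behavior described above.
struct Filler {
  std::vector<int> shape;    // attribute "shape"
  std::vector<float> data;   // attribute "data"
  bool run_once = false;     // attribute "run_once"
  bool has_run = false;

  void Run(std::vector<float>* out) {
    if (run_once && has_run) return;  // skip the refill after the first Run
    std::size_t numel = 1;
    for (int d : shape) numel *= static_cast<std::size_t>(d);
    assert(numel == data.size() && "shape must match data length");
    *out = data;  // fill the output tensor with the attribute data
    has_run = true;
  }
};
```

With `run_once` set, a second `Run` leaves the output untouched.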
platform::GPUPlace src_place,
                   const void* src, size_t num) {
  platform::SetDeviceId(src_place.device);
  platform::GpuMemcpySync(dst, src, num, cudaMemcpyDeviceToHost);
Member


Maybe we do not need a synchronous Copy here; Copy works on a specific CUDA stream too. If we really want to synchronize the copy:

Copy(dst_place, dst, src_place, src, num, stream_);
cudaStreamSynchronize(stream_);

For now we only have the default stream (I am fixing that in #3497), so you can pass 0 as the CUDA stream.

Collaborator Author


It is very strange that invoking some of the copy methods in memory.h triggers a link error at compile time.

This is hard to debug for a developer who is not familiar with C++, templates, and memory.{h/cc}.

So we should implement Copy correctly in memory.{h/cc}. Whether to add a stream is the developer's choice.

.SetDefault(false)
.InEnum({true, false});
AddAttr<std::vector<int>>("shape", "The shape of the tensor to fill");
AddAttr<std::vector<T>>("data", "The data to fill the tensor with");
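A rough model of how the `SetDefault`/`InEnum` chain above might behave (the `Attr` type here is a hypothetical stand-in, not Paddle's actual attribute machinery):

```cpp
#include <cassert>
#include <set>

// Hypothetical attribute holder: a default value plus an enum
// constraint, in the spirit of SetDefault(false).InEnum({true, false}).
template <typename T>
struct Attr {
  T value{};
  Attr& SetDefault(T v) {
    value = v;  // record the default
    return *this;
  }
  Attr& InEnum(const std::set<T>& allowed) {
    // Reject values outside the allowed set at registration time.
    assert(allowed.count(value) && "attribute value not in enum");
    return *this;
  }
};
```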
Member


Please have a look at #2917.
There are mainly two ways to load data: the first is loading from a vector or a numpy array; the second is data generated by Paddle itself.
Will we have another method like FeedVariable (caffe2 has FeedBlob)?

Collaborator Author


The fill_op is part of the topology, and it does not conflict with FeedVariable.

Consider the minus operator's gradient: it is a combination of operators, namely

  • An Identity or Copy operator.
  • A Fill operator to fill a scalar with -1, and a Scale operator.
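The combined gradient described above can be sketched with plain vectors (`Identity`, `Fill`, and `Scale` here are hypothetical stand-ins for the operators, not Paddle code): for out = a - b, da is the identity of dout, and db is dout scaled by the filled scalar -1.

```cpp
#include <vector>

// Hypothetical stand-ins for the three operators in the comment.
std::vector<float> Identity(const std::vector<float>& x) { return x; }
std::vector<float> Fill(float v) { return {v}; }  // FillOp: a scalar, here -1
std::vector<float> Scale(const std::vector<float>& x, float s) {
  std::vector<float> y = x;
  for (auto& e : y) e *= s;  // elementwise scale
  return y;
}

// Gradient of out = a - b: da = dout, db = -dout,
// expressed as the composition Identity / Fill / Scale.
void MinusGrad(const std::vector<float>& dout,
               std::vector<float>* da, std::vector<float>* db) {
  *da = Identity(dout);
  *db = Scale(dout, Fill(-1.0f)[0]);
}
```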

Collaborator Author


This is not about loading data; it is about designing the topology.

namespace paddle {
namespace operators {
template <typename T>
class FillOpKernelBase : public framework::OpKernel {
Member

@QiJune QiJune Aug 16, 2017


Maybe the base class FillOpKernelBase is a little too complex; just implementing the data fill directly in FillOpGPUKernel and FillOpCPUKernel would be fine.

Collaborator Author


There are common lines of code shared between the CPU and GPU kernels. Making a base class lets that code be shared.
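One way the shared base class might look, sketched without CUDA so it stays self-contained (the `CopyToDevice` hook and the in-memory "GPU" buffer are illustrative assumptions, not the PR's actual code; real kernels would dispatch to device memory):

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// The base class owns the common work (shape/data validation), and each
// device kernel overrides only the device-specific final copy.
template <typename T>
class FillOpKernelBase {
 public:
  virtual ~FillOpKernelBase() = default;
  void Compute(const std::vector<int>& shape, const std::vector<T>& data,
               std::vector<T>* out) {
    std::size_t numel = 1;
    for (int d : shape) numel *= static_cast<std::size_t>(d);
    assert(numel == data.size() && "shape must match data length");
    CopyToDevice(data, out);  // only this step differs per device
  }

 protected:
  virtual void CopyToDevice(const std::vector<T>& src,
                            std::vector<T>* dst) = 0;
};

template <typename T>
class FillOpCPUKernel : public FillOpKernelBase<T> {
 protected:
  void CopyToDevice(const std::vector<T>& src,
                    std::vector<T>* dst) override {
    *dst = src;  // plain host-to-host copy
  }
};

template <typename T>
class FillOpGPUKernel : public FillOpKernelBase<T> {
 protected:
  void CopyToDevice(const std::vector<T>& src,
                    std::vector<T>* dst) override {
    *dst = src;  // real code would use memory::Copy / GpuMemcpy here
  }
};
```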

@luotao1
Contributor

luotao1 commented Feb 1, 2019

Closing, since fill_op has already been implemented.

@luotao1 luotao1 closed this Feb 1, 2019