[WIP]Implement DeviceContextManager and Ensure only one CUDA stream existing in new framework at now #4218

QiJune · 2017-09-20T05:51:38Z

reyoung · 2017-09-20T17:34:06Z

paddle/framework/tensor.h

   *
   * @note    CopyFrom supports CPU <-> GPU, GPU <-> GPU.
   */
  template <typename T>
-  inline void CopyFrom(const Tensor& src, const platform::Place& dst_place);
+  inline void CopyFrom(const Tensor& src, const platform::Place& dst_place,
+                       bool is_sync = false);


I am not sure, but the easiest way might be for CopyFrom to take a const DeviceContext&?

It's hard for operator developers to pass a DeviceContext to the CopyFrom method. They would have to pass the right DeviceContext, which depends on the src place (CPU or GPU) and the dst place (CPU or GPU).
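
To make the burden described above concrete, here is a minimal sketch of the decision a caller would face for every copy. `PlaceKind`, `Place`, `CopyKind`, `SelectCopyKind`, and `ContextPlaceFor` are hypothetical names invented for this illustration, not Paddle types, and the rule that the destination GPU wins in a GPU-to-GPU copy is an assumption.

```cpp
// Hypothetical stand-ins for illustration; these are not Paddle's types.
enum class PlaceKind { kCPU, kGPU };

struct Place {
  PlaceKind kind;
  int device_id;  // ignored when kind is kCPU
};

enum class CopyKind {
  kHostToHost,
  kHostToDevice,
  kDeviceToHost,
  kDeviceToDevice
};

// Decide which kind of copy a (src, dst) pair requires.
CopyKind SelectCopyKind(const Place& src, const Place& dst) {
  const bool src_gpu = src.kind == PlaceKind::kGPU;
  const bool dst_gpu = dst.kind == PlaceKind::kGPU;
  if (src_gpu && dst_gpu) return CopyKind::kDeviceToDevice;
  if (src_gpu) return CopyKind::kDeviceToHost;
  if (dst_gpu) return CopyKind::kHostToDevice;
  return CopyKind::kHostToHost;
}

// Pick the place whose context would have to drive the copy.
// Assumption: any copy touching a GPU uses a GPU context, and the
// destination GPU is preferred when both sides are on GPUs.
Place ContextPlaceFor(const Place& src, const Place& dst) {
  if (dst.kind == PlaceKind::kGPU) return dst;
  if (src.kind == PlaceKind::kGPU) return src;
  return dst;
}
```

If CopyFrom took a DeviceContext, every operator would need to repeat a selection like `ContextPlaceFor` before each call, which is the objection raised here.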

reyoung

I do not think DeviceContextMgr is necessary. Maybe we could just pass a DeviceContext to Tensor::CopyFrom and make it async?
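
As a rough illustration of what an asynchronous copy with an optional `is_sync` flag could mean, the sketch below models a stream as a queue of deferred tasks. `ToyStream` and this free-standing `CopyFrom` are hypothetical and only mimic the shape of the signature in the diff; the real semantics of `is_sync` are not spelled out in this thread, so treat the behaviour here as an assumption.

```cpp
#include <cstddef>
#include <deque>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical model of a stream: work is queued and runs only when drained.
class ToyStream {
 public:
  void Enqueue(std::function<void()> task) {
    pending_.push_back(std::move(task));
  }

  // Run every queued task in submission order.
  void Synchronize() {
    while (!pending_.empty()) {
      std::function<void()> task = std::move(pending_.front());
      pending_.pop_front();
      task();
    }
  }

  std::size_t pending() const { return pending_.size(); }

 private:
  std::deque<std::function<void()>> pending_;
};

// Queue a copy on the stream. With is_sync == false the copy is only
// scheduled, so src and dst must stay alive until the stream is drained.
// With is_sync == true the call returns after the copy has run.
void CopyFrom(const std::vector<float>& src, std::vector<float>* dst,
              ToyStream* stream, bool is_sync = false) {
  stream->Enqueue([&src, dst] { *dst = src; });
  if (is_sync) stream->Synchronize();
}
```

In this model the default call leaves the destination untouched until `Synchronize()` runs, which is why an asynchronous CopyFrom needs some stream or context to order the work against.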

QiJune · 2017-09-21T00:43:23Z

@reyoung

DeviceContext should not be exposed to users, and users should not, and almost cannot, create the right DeviceContext. If we pass a DeviceContext to CopyFrom, where is that DeviceContext created?
We also use the copy method in pybind between C++ and Python, where we can hardly pass a DeviceContext or expose an API like this:

a = numpy.array([1, 2])
t.set_float(a, device_context)

Anyway, we need to create the DeviceContexts during Paddle's initialization stage and handle the parallel CUDA streams internally.
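
A minimal sketch of that idea, assuming a process-wide manager that owns one context (and therefore one stream) per GPU. Every name here (`FakeStream`, `DeviceContext`, `DeviceContextManager`, `CopyOnManagedStream`) is a hypothetical stand-in rather than the code in this PR, and contexts are created on first use instead of in an explicit initialization step to keep the example short.

```cpp
#include <cstddef>
#include <map>
#include <memory>
#include <mutex>

// Hypothetical stand-in for a CUDA stream handle.
struct FakeStream {
  int id;
};

// Hypothetical context that owns exactly one stream for its device.
class DeviceContext {
 public:
  explicit DeviceContext(int device_id)
      : device_id_(device_id), stream_{device_id} {}
  int device_id() const { return device_id_; }
  const FakeStream& stream() const { return stream_; }

 private:
  int device_id_;
  FakeStream stream_;
};

class DeviceContextManager {
 public:
  static DeviceContextManager& Instance() {
    static DeviceContextManager manager;  // initialized once, thread-safe
    return manager;
  }

  // Return the single context for a GPU, creating it on first use.
  DeviceContext& GetGpuContext(int device_id) {
    std::lock_guard<std::mutex> guard(mutex_);
    auto it = contexts_.find(device_id);
    if (it == contexts_.end()) {
      std::unique_ptr<DeviceContext> ctx(new DeviceContext(device_id));
      it = contexts_.emplace(device_id, std::move(ctx)).first;
    }
    return *it->second;
  }

  std::size_t size() const {
    std::lock_guard<std::mutex> guard(mutex_);
    return contexts_.size();
  }

 private:
  DeviceContextManager() = default;
  DeviceContextManager(const DeviceContextManager&) = delete;
  DeviceContextManager& operator=(const DeviceContextManager&) = delete;

  mutable std::mutex mutex_;
  std::map<int, std::unique_ptr<DeviceContext>> contexts_;
};

// Copy entry point that hides the context from the caller. It returns the
// id of the stream it would use so the behaviour can be inspected.
int CopyOnManagedStream(int gpu_id) {
  DeviceContext& ctx = DeviceContextManager::Instance().GetGpuContext(gpu_id);
  // A real implementation would enqueue the copy on ctx.stream() here.
  return ctx.stream().id;
}
```

Because every lookup for the same device returns the same context, callers such as the pybind layer never construct a DeviceContext themselves, and each GPU ends up with a single stream.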

paddle-bot-old · 2020-05-22T06:38:41Z

Since you haven't replied for a long time, we have closed this issue/pr.
If the problem is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.

QiJune added 3 commits September 19, 2017 13:21

add device_context_manager

1ef16fb

ensure only one cuda stream is used in new framework at now

e4b9f15

refine device_context_manager

0381302

QiJune changed the title ~~[WIP]Implement Device manager and Ensure only one CUDA stream existing in new framework at now~~ [WIP]Implement DeviceContextManager and Ensure only one CUDA stream existing in new framework at now Sep 20, 2017

QiJune added 4 commits September 20, 2017 16:03

remove unused codes

fb19c39

merge baidu/develop

67250a9

fix gpu build error

31a700b

refine code style

ff29c1d

reyoung reviewed Sep 20, 2017

reyoung requested changes Sep 20, 2017

paddle-bot-old bot closed this May 22, 2020
