Fix unstable selected_rows_functor_test.cu #13505

reyoung · 2018-09-20T05:11:02Z

No description provided.

reyoung · 2018-09-20T05:11:31Z

paddle/fluid/operators/math/selected_rows_functor.cu

@@ -107,7 +107,7 @@ struct SelectedRowsAddTensor<platform::CUDADeviceContext, T> {
    PADDLE_ENFORCE_EQ(in1_height, out_dims[0]);

    auto& in1_value = input1.value();
-    framework::Vector<int64_t> in1_rows(input1.rows());


Prevent unnecessary memcpy here.

Not related to this bug.

reyoung · 2018-09-20T05:11:37Z

paddle/fluid/operators/math/selected_rows_functor.cu

@@ -206,7 +206,7 @@ struct SelectedRowsAddToTensor<platform::CUDADeviceContext, T> {
    PADDLE_ENFORCE_EQ(in1_height, in2_dims[0]);

    auto& in1_value = input1.value();
-    framework::Vector<int64_t> in1_rows(input1.rows());


Prevent unnecessary memcpy here.

Not related to this bug.

reyoung · 2018-09-20T05:11:56Z

paddle/fluid/operators/math/selected_rows_functor_test.cu

-  paddle::platform::CUDADeviceContext ctx(gpu_place);
+  paddle::platform::CUDADeviceContext& ctx =
+      *reinterpret_cast<paddle::platform::CUDADeviceContext*>(
+          paddle::platform::DeviceContextPool::Instance().Get(gpu_place));


Use the computation stream.

Why it must use the DeviceContextPool here?

I see. The copy of lod is async and it is in computation stream.

chengduoZH

Cool

chengduoZH · 2018-09-20T05:18:08Z

paddle/fluid/operators/math/selected_rows_functor_test.cu

-  paddle::platform::CUDADeviceContext ctx(gpu_place);
+  paddle::platform::CUDADeviceContext& ctx =
+      *reinterpret_cast<paddle::platform::CUDADeviceContext*>(
+          paddle::platform::DeviceContextPool::Instance().Get(gpu_place));


Why it must use the DeviceContextPool here?

Fix unstable selected_rows_functor_test.cu

b5996fa

reyoung requested review from panyx0718 and chengduoZH September 20, 2018 05:11

reyoung commented Sep 20, 2018

View reviewed changes

chengduoZH approved these changes Sep 20, 2018

View reviewed changes

panyx0718 approved these changes Sep 20, 2018

View reviewed changes

reyoung merged commit f7af695 into PaddlePaddle:develop Sep 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix unstable selected_rows_functor_test.cu #13505

Fix unstable selected_rows_functor_test.cu #13505

reyoung commented Sep 20, 2018

reyoung Sep 20, 2018

reyoung Sep 20, 2018

reyoung Sep 20, 2018

chengduoZH Sep 20, 2018

chengduoZH Sep 20, 2018

chengduoZH left a comment

chengduoZH Sep 20, 2018

Fix unstable selected_rows_functor_test.cu #13505

Fix unstable selected_rows_functor_test.cu #13505

Conversation

reyoung commented Sep 20, 2018

reyoung Sep 20, 2018

Choose a reason for hiding this comment

reyoung Sep 20, 2018

Choose a reason for hiding this comment

reyoung Sep 20, 2018

Choose a reason for hiding this comment

chengduoZH Sep 20, 2018

Choose a reason for hiding this comment

chengduoZH Sep 20, 2018

Choose a reason for hiding this comment

chengduoZH left a comment

Choose a reason for hiding this comment

chengduoZH Sep 20, 2018

Choose a reason for hiding this comment