
Reusing softmax MKLDNN primitives #10576

Merged

Conversation


@jczaja jczaja commented May 10, 2018

The changes presented here introduce the concept of reusing already-created MKLDNN softmax primitives in a specific scenario: when the softmax MKLDNN op is called a second time with the same input/output dims, we can reuse the previously created objects rather than recreate them.
The intention of this change is to speed up the softmax MKLDNN operator (more MKLDNN ops will follow).
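
To illustrate the idea, here is a minimal, self-contained C++ sketch of such a cache, under stated assumptions: DeviceContext, GetBlob, SetBlob, and GetHash are illustrative stand-ins for the framework's per-device cache, not this PR's exact code.

#include <memory>
#include <sstream>
#include <string>
#include <unordered_map>
#include <vector>

struct DeviceContext {
  // Hypothetical per-device cache keyed by a string handle.
  std::unordered_map<std::string, std::shared_ptr<void>> blobs;
  std::shared_ptr<void> GetBlob(const std::string& key) {
    auto it = blobs.find(key);
    return it == blobs.end() ? nullptr : it->second;
  }
  void SetBlob(const std::string& key, std::shared_ptr<void> value) {
    blobs[key] = value;
  }
};

// Derive the cache key from the op name and the input dims, so a second
// call with identical dims finds the previously created primitives.
std::string GetHash(const std::vector<int>& dims, const std::string& op) {
  std::ostringstream key;
  key << op;
  for (int d : dims) key << "-" << d;
  return key.str();
}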

@jczaja jczaja force-pushed the prv-reuse-mkldnn-softmax-primitives branch 4 times, most recently from b0a5d41 to 4b3b20a on May 10, 2018 16:35
@luotao1 luotao1 added the Intel label May 11, 2018

luotao1 commented May 11, 2018

@jczaja please merge the latest code to pass TeamCity. Thanks very much!

@jczaja jczaja force-pushed the prv-reuse-mkldnn-softmax-primitives branch 2 times, most recently from 27167c7 to 50e559b on May 11, 2018 11:07
- Added a hash function inside the MKLDNN softmax op, used as the key for storing primitives in a context (see the sketch after this commit list)

- Style fixes to softmax mkldnn op

- Fixes after review

- Coding style

- Fix to style

- style fixes

- style fix

- style fixes

- Fix to code style check

- Rephrasing a comment
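
As a rough sketch of how the hash key drives reuse (building on the DeviceContext/GetHash sketch above; SoftmaxPrimitive and GetOrCreateSoftmax are hypothetical names, not this PR's real types):

// Hypothetical reuse pattern: look the primitive up by its hash key and
// only create it when the key is not yet in the cache.
struct SoftmaxPrimitive { /* would wrap the MKLDNN softmax primitive */ };

std::shared_ptr<SoftmaxPrimitive> GetOrCreateSoftmax(
    DeviceContext* ctx, const std::vector<int>& dims) {
  const std::string key = GetHash(dims, "softmax");
  auto softmax_p =
      std::static_pointer_cast<SoftmaxPrimitive>(ctx->GetBlob(key));
  if (softmax_p == nullptr) {
    // First call with these dims: create the primitive and cache it.
    softmax_p = std::make_shared<SoftmaxPrimitive>();
    ctx->SetBlob(key, softmax_p);
  }
  // Any later call with the same dims reuses the cached primitive.
  return softmax_p;
}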
@jczaja jczaja force-pushed the prv-reuse-mkldnn-softmax-primitives branch from 50e559b to 7bf00c3 on May 11, 2018 12:31
@kbinias kbinias requested a review from luotao1 May 11, 2018 15:22

jczaja commented May 11, 2018

@luotao1 Thanks very much for the suggestion. It helped.

@luotao1 luotao1 requested a review from tensor-tang May 14, 2018 01:52

@tensor-tang tensor-tang left a comment


LGTM

if (softmax_p == nullptr) {
  // Currently only NC data format is supported
  auto softmax_md =
      MKLDNNMemDesc({softmax_tz}, memory::f32, memory::format::nc);

f32 should depend on T, right?
Maybe this should be enhanced, or at least enforced as float.


It should be enhanced in the next PR, since I find:

BDSHYF000120887:operators luotao02$ grep "memory::f32" *.cc
activation_mkldnn_op.cc:                     ? platform::MKLDNNMemDesc(src_tz, mkldnn::memory::f32,
activation_mkldnn_op.cc:                     : platform::MKLDNNMemDesc(src_tz, mkldnn::memory::f32,
activation_mkldnn_op.cc:                     ? platform::MKLDNNMemDesc(src_tz, mkldnn::memory::f32,
activation_mkldnn_op.cc:                     : platform::MKLDNNMemDesc(src_tz, mkldnn::memory::f32,
pool_mkldnn_op.cc:    auto src_md = platform::MKLDNNMemDesc(src_tz, mkldnn::memory::f32,
pool_mkldnn_op.cc:    auto dst_md = platform::MKLDNNMemDesc(dst_tz, mkldnn::memory::f32,
pool_mkldnn_op.cc:                  {{}, mkldnn::memory::f32, mkldnn::memory::format::nchw},
pool_mkldnn_op.cc:    auto diff_src_md = platform::MKLDNNMemDesc(diff_src_tz, mkldnn::memory::f32,
pool_mkldnn_op.cc:    auto diff_dst_md = platform::MKLDNNMemDesc(diff_dst_tz, mkldnn::memory::f32,
softmax_mkldnn_op.cc:        MKLDNNMemDesc({softmax_tz}, memory::f32, memory::format::nc);
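
A possible shape of that enhancement, sketched here as an assumption rather than this PR's code (MKLDNNGetDataType is a hypothetical helper): derive the MKLDNN data type from the template parameter T instead of hard-coding f32.

#include <mkldnn.hpp>

// Hypothetical helper: map the element type T to the MKLDNN data type.
// Types without a specialization fall back to data_undef so callers can
// reject them (or enforce float explicitly).
template <typename T>
mkldnn::memory::data_type MKLDNNGetDataType() {
  return mkldnn::memory::data_undef;
}
template <>
mkldnn::memory::data_type MKLDNNGetDataType<float>() {
  return mkldnn::memory::f32;
}

// The softmax memory descriptor could then be built as:
//   auto softmax_md = MKLDNNMemDesc({softmax_tz}, MKLDNNGetDataType<T>(),
//                                   memory::format::nc);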

@luotao1 luotao1 merged commit 8c7d2e2 into PaddlePaddle:develop May 14, 2018
@luotao1 luotao1 moved this from Doing to Done in Intel Optimization on Fluid May 14, 2018