
Add mkldnn_softmax #4331

Merged: 5 commits merged into PaddlePaddle:develop on Sep 26, 2017
Conversation

@tensor-tang (Contributor) commented Sep 22, 2017

Solves #4330.

@luotao1 (Contributor) left a comment:

Could the implementations of resetFwd and resetBwd in MKLDNNActivation.h be moved to MKLDNNActivation.cpp? These two functions are too long to belong in a header file.

return Error();
/**
 * @brief Base class of the MKLDNN softmax activation;
 * only the mkldnn forward is implemented, the CPU implementation is used for backward.
@luotao1 (Contributor):
Why isn't the mkldnn backward used here?

@tensor-tang (Contributor, Author):

Because mkldnn has not implemented the softmax backward yet, for now we keep it identical to the CPU implementation.

"mkldnn_" #ACT_TYPE); \
});

/**
* @def DEFINE_MKLDNN_ELTWISE_ACTIVATION
*/
#define DEFINE_MKLDNN_ELTWISE_ACTIVATION(ACT_TYPE, ALPHA, BWD_ALPHA) \
@luotao1 (Contributor):

Could the MKLDNN_ACTIVATION_CLASS_NAME class inherit from MKLDNN_ACTIVATION_CLASS_NAME? That way the following would not need to be duplicated: lines 55, 60, 64-65, and 68-71.

@tensor-tang (Contributor, Author):

OK, it can be simplified a bit further.

@@ -3637,7 +3637,7 @@ void CpuMatrix::oneHotCrossEntropy(Matrix& output, IVector& label) {
   for (size_t i = 0; i < numSamples; ++i, out += dim) {
     CHECK_GE(lbl[i], 0);
     CHECK_LT((size_t)lbl[i], dim);
-    cost[i] = -std::log(out[lbl[i]]);
+    cost[i] = -std::log(std::max(out[lbl[i]], real(FLT_MIN)));
@luotao1 (Contributor):

Where is FLT_MIN defined? Paddle does not have this variable.

@tensor-tang (Contributor, Author):

It is defined in #include <float.h>.

@@ -3652,7 +3652,7 @@ void CpuMatrix::oneHotCrossEntropyBp(Matrix& output, IVector& label) {
   real* grad = getData();
   int* lbl = label.getData();
   for (size_t i = 0; i < numSamples; ++i, out += dim, grad += dim) {
-    grad[lbl[i]] -= 1 / out[lbl[i]];
+    grad[lbl[i]] -= 1 / std::max(out[lbl[i]], real(FLT_MIN));
@luotao1 (Contributor):

Same as above.

@tensor-tang (Contributor, Author):

It is defined in #include <float.h>.

@@ -93,42 +128,21 @@ class MKLDNNEltwiseActivation : public MKLDNNActivation {
return (mkldnn::algorithm)0;
@luotao1 (Contributor):

For lines 119-124, you could use a regular expression to replace mkldnn with eltwise.

@tensor-tang (Contributor, Author):

I'll use a map to simplify it.

@tensor-tang tensor-tang force-pushed the mkldnn_softmax branch 2 times, most recently from c5216eb to d5f5828 Compare September 25, 2017 14:23
if (outputG->useGpu()) {
outputG->softmaxBackward(*outputV);
} else {
SetDevice device(act.deviceId);
@luotao1 (Contributor):

This section looks copied directly from SoftmaxActivation::backward, and lines 193-196 were copied unnecessarily. Could MatrixPtr be used here?

@tensor-tang (Contributor, Author):

They can indeed be removed, thanks. Done.

@tensor-tang tensor-tang force-pushed the mkldnn_softmax branch 2 times, most recently from 007a64a to 043aa8a Compare September 26, 2017 04:37
@luotao1 (Contributor) left a comment:

LGTM

@luotao1 luotao1 merged commit 0cc85d7 into PaddlePaddle:develop Sep 26, 2017
@tensor-tang tensor-tang deleted the mkldnn_softmax branch September 26, 2017 11:57
@tensor-tang tensor-tang moved this from Doing to Done in Optimization on Intel Platform Sep 26, 2017