
MKLDNN layout: Support for sum operator #11102

Merged

Conversation

mozga-intel (Contributor) commented May 31, 2018

Waiting for support of the MKLDNN layout.

Please have a look at this sum operator supported by the MKLDNN layout and assess whether we can keep this design of the code.

This code uses the layout implementation that is available in the last pull request. Therefore some of the functions in this code are not available in this pull request, but are available here.

This version of the code can be merged into the main branch once the pull request with the layout is accepted.

The idea of splitting up the long code was proposed by @luotao1.

This pull request is related to #11040.

@mozga-intel mozga-intel requested a review from luotao1 May 31, 2018 14:42
@mozga-intel mozga-intel mentioned this pull request May 31, 2018
@mozga-intel mozga-intel force-pushed the mozga-intel/Sum_mkldnn_layout branch from da0b3f4 to 6c7c210 Compare June 1, 2018 10:40
@mozga-intel mozga-intel force-pushed the mozga-intel/Sum_mkldnn_layout branch 7 times, most recently from d99777e to fcd0dd1 Compare June 12, 2018 08:42
mozga-intel (Contributor Author):

@luotao1, @tensor-tang The code is ready for the code-review process.

mozga-intel (Contributor Author) commented Jun 12, 2018

@tensor-tang This creates an out-of-place sum_primitive_desc for the sum of n inputs, each multiplied by a scale, with the resulting output_desc memory descriptor.

This code supports only LoDTensor as the input format; otherwise I have rolled back to the naive version of this code. In the near future I will implement support for the other case.
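For illustration, a minimal sketch of how such an out-of-place sum primitive descriptor is built with the MKL-DNN 0.x API used here (the engine, dims, number of inputs, and nchw source format are illustrative assumptions, not the PR's exact code):

    mkldnn::engine eng(mkldnn::engine::cpu, 0);
    mkldnn::memory::dims tz = {2, 3, 4, 5};  // example n, c, h, w
    // One memory primitive descriptor per input (two inputs here).
    auto src_md = mkldnn::memory::desc(tz, mkldnn::memory::data_type::f32,
                                       mkldnn::memory::format::nchw);
    std::vector<mkldnn::memory::primitive_desc> srcs_mpd(
        2, mkldnn::memory::primitive_desc(src_md, eng));
    // Per-input scales; 1.0f everywhere gives a plain sum.
    std::vector<float> scales = {1.0f, 1.0f};
    // format::any lets the library pick the best destination layout.
    auto dst_md = mkldnn::memory::desc(tz, mkldnn::memory::data_type::f32,
                                       mkldnn::memory::format::any);
    // Out-of-place sum: dst = sum_i scales[i] * srcs[i].
    auto sum_pd = mkldnn::sum::primitive_desc(dst_md, scales, srcs_mpd);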

tensor-tang (Contributor) left a comment:

Do we have any unit test for these changes, since the use_mkldnn flag was added?

if (src_tz.size() == 1 && (input_format == memory::format::nchw ||
                           input_format == memory::format::nhwc)) {
  input_format = memory::format::x;
}
Contributor:

What about the else case? Then input_format is not set; should it be undef or any?

mozga-intel (Contributor Author) commented Jun 15, 2018:

Basically, the format is set at least once; I expect the rank of the input to be something other than 1 or 2, mainly 4. The code is triggered in this place.
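In other words, a commented restatement of the guard above (the comments are interpretive, not part of the PR):

    // A 1-D tensor can arrive tagged with a 4-D format (nchw/nhwc)
    // inherited from an upstream primitive; remap it to the 1-D
    // format 'x'. Higher-rank inputs keep the format they carry,
    // so no else branch is needed.
    if (src_tz.size() == 1 && (input_format == memory::format::nchw ||
                               input_format == memory::format::nhwc)) {
      input_format = memory::format::x;
    }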

auto& input0 = in_vars[0]->Get<LoDTensor>();
PADDLE_ENFORCE(input0.layout() == DataLayout::kMKLDNN &&
                   input0.format() != memory::format::format_undef,
               "Wrong layout/format for inputs[0]");
Contributor:

What about the other inputs? Why check only the first one?

Contributor Author:

The other inputs are checked inside the loop (lines 96-98):

auto& input = in_vars[i]->Get<LoDTensor>();
PADDLE_ENFORCE(input.layout() == DataLayout::kMKLDNN &&
                   input.format() != memory::format::format_undef,
               "Wrong layout/format for inputs");
Contributor:

So it will throw an error if the format is not set. But it seems you only set the format of one input; or does it mean you only accept one input?

Contributor Author:

If we want the best performance and a fluent flow in our graph of primitives, the input format should be compatible with the other primitives. When we have MKLDNN primitives, we can assume that the format is always defined as an MKLDNN format.


std::shared_ptr<memory> dst_mem;
if (in_place)
  dst_mem.reset(new memory(sum_pd.dst_primitive_desc()));
Contributor:

Maybe you should keep the style:

if (...) {
  ...
} else {
  ...
}

Contributor Author:

Done.
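The adjusted block then reads roughly like this (the else branch is illustrative; output_data stands in for whatever buffer the non-in-place path binds):

    std::shared_ptr<memory> dst_mem;
    if (in_place) {
      // In-place sum: reuse the destination from the primitive descriptor.
      dst_mem.reset(new memory(sum_pd.dst_primitive_desc()));
    } else {
      // Out-of-place sum: bind the output buffer explicitly.
      dst_mem.reset(new memory(sum_pd.dst_primitive_desc(), output_data));
    }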


auto sum_prim = mkldnn::sum(sum_pd, inputs, *dst_mem);
output_format =
    (memory::format)sum_pd.dst_primitive_desc().desc().data.format;
Contributor:

Maybe you could use a function to get the format from the pd.

Contributor Author:

Done.
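Such a helper might look like the following sketch (GetDstFormat is a hypothetical name, not necessarily the one the PR adds):

    // Hypothetical helper: pull the memory format out of a sum
    // primitive descriptor instead of casting inline at each call site.
    static mkldnn::memory::format GetDstFormat(
        const mkldnn::sum::primitive_desc& sum_pd) {
      return static_cast<mkldnn::memory::format>(
          sum_pd.dst_primitive_desc().desc().data.format);
    }

    // Usage: output_format = GetDstFormat(sum_pd);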


#ifdef PADDLE_WITH_MKLDNN
  if (library == framework::LibraryType::kPlain &&
      platform::CanMKLDNNBeUsed(ctx)) {
Contributor:

Maybe this CanMKLDNNBeUsed could directly return false when PADDLE_WITH_MKLDNN is not defined. Then we would not need to add https://github.com/PaddlePaddle/Paddle/pull/11102/files#diff-a8d897ab383e5245127bb03da3d9830cR74 everywhere.

Contributor Author:

@tensor-tang I think that is a splendid way to get slightly cleaner checking of when the primitive can be executed. If it is not a problem, I will take advantage of this idea and make it a separate pull request.
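A sketch of the reviewer's idea, with the guard folded into the helper; the body shown is an assumption about what CanMKLDNNBeUsed checks, not the actual Paddle implementation:

    bool CanMKLDNNBeUsed(const framework::ExecutionContext& ctx) {
    #ifdef PADDLE_WITH_MKLDNN
      // Assumed check: the op asked for MKLDNN and runs on a CPU place.
      return ctx.Attr<bool>("use_mkldnn") &&
             platform::is_cpu_place(ctx.GetPlace());
    #else
      // Without MKLDNN compiled in, no call site needs its own #ifdef.
      return false;
    #endif
    }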

@mozga-intel mozga-intel force-pushed the mozga-intel/Sum_mkldnn_layout branch 4 times, most recently from 707e7af to 60623a6 Compare June 18, 2018 13:36
mozga-intel (Contributor Author):

@luotao1 Could you continue reviewing this code, please?

luotao1 (Contributor) left a comment:

Could you add a unit test in test_sum_mkldnn_op.py?

@mozga-intel mozga-intel force-pushed the mozga-intel/Sum_mkldnn_layout branch from 60623a6 to 6715524 Compare June 19, 2018 07:31
@mozga-intel mozga-intel force-pushed the mozga-intel/Sum_mkldnn_layout branch from 6715524 to b88cda8 Compare June 19, 2018 10:02
mozga-intel (Contributor Author):

@luotao1 Done.

tensor-tang (Contributor) left a comment:

LGTM. Thanks @mozga-intel.
