
Add aten mkldnn linear operator #19210

Closed
bddppq wants to merge 25 commits

Conversation

@bddppq (Contributor) commented Apr 12, 2019

Stack:
    :white_circle:  #19633 Add is_mkldnn to at::Tensor  💚
    :white_circle:  #19204 Add aten mkldnn conv2d operator  💚
    :white_circle:  #19205 Add aten mkldnn ops: relu, max_pool2d and avg_pool2d  💚
    :white_circle:  #19206 Add aten mkldnn batch_norm operator  💚
    :white_circle:  #19207 Add aten mkldnn add operator  💚
    :white_circle:  #19209 Add aten mkldnn view operator  💚
    :black_circle:  #19210 Add aten mkldnn linear operator  💚
    :white_circle:  #19648 Adjust resnext run script  💚

Pull Request resolved: #19210

Differential Revision: D14901641

@bddppq added the module: mkldnn label (Related to Intel IDEEP or oneDNN (a.k.a. mkldnn) integration) on Apr 12, 2019
Review threads on tools/autograd/derivatives.yaml (outdated, resolved)
Review thread on torch/nn/functional.py (outdated, resolved)
@bddppq (Contributor, Author) commented Apr 24, 2019

@zdevito @suo @ZolotukhinM In this PR I'm changing nn.Linear to directly call the C++ aten::linear (which under the hood dispatches to the same addmm/matmul calls); the difference is that nn.Linear's TorchScript will now capture linear directly instead of addmm/matmul. (In this PR only scripting is changed, tracing is not affected, but I would like to change tracing as well in a follow-up PR.) I think this is the right change, since the TorchScript graph gets simplified from

graph(%x : Tensor,
      %23 : Tensor,
      %24 : Tensor):
  %2 : int = prim::Constant[value=1]()
  %3 : None = prim::Constant()
  %4 : bool = prim::Constant[value=0]()
  %5 : int = prim::Constant[value=2]()
  %9 : int = aten::dim(%x)
  %10 : bool = aten::eq(%9, %5)
  %11 : bool = prim::If(%10)
    block0():
      %12 : bool = aten::__isnot__(%23, %3)
      -> (%12)
    block1():
      -> (%4)
  %ret : Tensor = prim::If(%11)
    block0():
      %bias.2 : Tensor = prim::unchecked_unwrap_optional(%23)
      %15 : Tensor = aten::t(%24)
      %ret.1 : Tensor = aten::addmm(%bias.2, %x, %15, %2, %2)
      -> (%ret.1)
    block1():
      %17 : Tensor = aten::t(%24)
      %output.1 : Tensor = aten::matmul(%x, %17)
      %19 : bool = aten::__isnot__(%23, %3)
      %output : Tensor = prim::If(%19)
        block0():
          %bias.3 : Tensor = prim::unchecked_unwrap_optional(%23)
          %output.2 : Tensor = aten::add_(%output.1, %bias.3, %2)
          -> (%output.2)
        block1():
          -> (%output.1)
      -> (%output)
  return (%ret)

to

graph(%x : Tensor,
      %6 : Tensor,
      %7 : Tensor):
  %5 : Tensor = aten::linear(%x, %7, %6)
  return (%5)

And it makes life easier for other backends like mkldnn to support linear.
However, our concern is that this change might break some compiler passes that specifically pattern match addmm/matmul. Could you give me some pointers to where I can check them?
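
For reference, a quick numeric sanity check (just a sketch, not part of this PR) of the equivalence the graphs above rely on, namely that linear computes the same result as the addmm/matmul decomposition it replaces:

import torch
import torch.nn.functional as F

x = torch.randn(5, 4)   # 2-D input
w = torch.randn(3, 4)   # weight of a Linear(4, 3)
b = torch.randn(3)      # bias

# 2-D inputs take the addmm branch of the old graph
assert torch.allclose(F.linear(x, w, b), torch.addmm(b, x, w.t()))

# higher-dimensional inputs take the matmul + add_ branch
x3 = torch.randn(2, 5, 4)
assert torch.allclose(F.linear(x3, w, b), torch.matmul(x3, w.t()) + b)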

@eellison (Contributor) commented Apr 24, 2019

I'm not sure that we should do this. If you run the scripted function you get the following result:

import torch
import torch.nn.functional as F

@torch.jit.script
def test_linear(x, y):
    return F.linear(x, y)

test_linear(torch.rand([2, 2]), torch.rand([2, 2]))
print(test_linear.graph_for(torch.rand([2, 2]), torch.rand([2, 2])))

Which outputs the graph

graph(%x : Double(*, *),
      %y : Double(*, *)):
  %15 : Double(*, *) = aten::t(%y)
  %output.1 : Double(*, *) = aten::matmul(%x, %15)
  return (%output.1)

What is the end goal here? If we continually create new ops which were previously composed of existing ones, we'll be putting more and more stress on various parts of the system: shape analysis, the graph fuser, TVM, etc.

I don't have good context on mkldnn interop, so there may be valid reasons there. But as far as TorchScript goes, I'm not really convinced.

@zdevito (Contributor) commented Apr 24, 2019

It is totally reasonable that there would be a built-in linear operator. Tons of libraries already have it as an optimized, fused thing. We should add it.

However, @eellison is right: anytime a fused op is added, it is also necessary to change all of the JIT analysis and optimization passes so that they still work in this world. In the case of Linear, this change is almost certainly breaking matrix-multiply optimizations that the JIT does, because of missing formulas for shape propagation, differentiation, and others.

This is an invasive change to a very important operator, so it deserves careful consideration of what might break when its implementation is hidden from the optimization passes.
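
To make that concern concrete, here is a small illustrative sketch (not code from this PR or from any existing JIT pass) of the kind of pattern matching at stake: a pass that scans for aten::addmm/aten::matmul nodes finds them in today's scripted graph, but would find nothing once the graph only records aten::linear.

import torch

@torch.jit.script
def decomposed(x, w, b):
    # the form today's passes see and match on
    return torch.addmm(b, x, w.t())

# Toy stand-in for a pass that looks for matmul-like nodes in the IR.
def find_matmul_like(graph):
    return [n for n in graph.nodes() if n.kind() in ("aten::addmm", "aten::matmul")]

print(find_matmul_like(decomposed.graph))  # finds the aten::addmm node
# If the graph captured a single aten::linear instead, this list would be empty
# and such a pass would silently stop applying.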

@zdevito zdevito removed their request for review April 24, 2019 18:47
@dzhulgakov (Collaborator) commented:

@zdevito - can you please help come up with a change plan for this change in the JIT? As you said, switching to aten::linear does make sense. I guess grepping for aten::addmm should be a good starting point.

One option is to land this PR first, since it switches only scripting, and follow up with a separate PR for tracing and the proper JIT changes. Would that be reasonable?

@bddppq (Contributor, Author) commented Apr 26, 2019

I added a workaround that overrides the forward method of nn.Linear in to_mkldnn. Will do the linear dispatch change in a separate diff to unblock mkldnn.
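
A rough sketch of what such an override could look like (hypothetical: the MkldnnLinear wrapper name and the torch._C._nn.mkldnn_linear binding are assumptions here, not the exact code in this diff):

import torch

class MkldnnLinear(torch.nn.Module):
    """Hypothetical replacement for nn.Linear produced by a to_mkldnn-style conversion."""

    def __init__(self, dense_module):
        super(MkldnnLinear, self).__init__()
        # Keep the parameters in MKL-DNN (opaque) layout up front.
        self.register_buffer('weight', dense_module.weight.detach().to_mkldnn())
        if dense_module.bias is not None:
            self.register_buffer('bias', dense_module.bias.detach().to_mkldnn())
        else:
            self.bias = None

    def forward(self, x):
        # x is expected to already be an MKL-DNN tensor here.
        return torch._C._nn.mkldnn_linear(x, self.weight, self.bias)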

@dzhulgakov (Collaborator) left a review:

As a hack with the follow-up coming later, it looks good.

@@ -40,6 +44,12 @@ Tensor mkldnn_reshape(const Tensor& self, IntArrayRef size) {
return new_with_itensor_mkldnn(std::move(y), self.options());
}

Tensor mkldnn_clone(const Tensor& self) {
  ideep::tensor& src = itensor_from_mkldnn(self);
  // Copy-construct dst from src, then wrap it in a new mkldnn tensor
  // (remainder of the hunk assumed, following mkldnn_reshape above).
  ideep::tensor dst{src};
  return new_with_itensor_mkldnn(std::move(dst), self.options());
}
@dzhulgakov (Collaborator) commented on this hunk:

this does make a copy, right? (it's supposed to)

zdevito pushed a commit to zdevito/ATen that referenced this pull request Apr 26, 2019
Summary: Pull Request resolved: pytorch/pytorch#19210

Reviewed By: dzhulgakov

Differential Revision: D14901641

fbshipit-source-id: 8fa68b9941fd93cea0f313a828cba34c5c81ae11
@facebook-github-bot commented:

This pull request has been merged in c9f380d.

zhangguanheng66 pushed a commit to zhangguanheng66/pytorch that referenced this pull request May 6, 2019
Summary: Pull Request resolved: pytorch#19210

Reviewed By: dzhulgakov

Differential Revision: D14901641

fbshipit-source-id: 8fa68b9941fd93cea0f313a828cba34c5c81ae11
@ezyang ezyang deleted the export-D14901641 branch May 30, 2019 15:56