
keep output type after calling SubgraphRewriter #65453

Closed
wants to merge 2 commits

Conversation

XiaobingSuper
Collaborator

The JIT SubgraphRewriter does not preserve the output type after overwriting the old graph. For example, in profiling mode the old graph carries the old operator's output shapes, but after the old operator is replaced with a new operator via SubgraphRewriter, the tensor shape info is eliminated.

The motivation is that I want to replace the PyTorch convolution with a custom convolution. I first register aten::_convolution as a profiled node so that the input and output shapes are recorded, and then use the graph rewriter to replace it with aten::conv2d, at which point the tensors' shape info is eliminated. I want to use the input sizes to do some pre-processing before replacing aten::conv2d with the custom convolution.
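
For context, the rewrite itself can be set up roughly as in the sketch below (the pattern and replacement strings are illustrative; a real pass would also check that the constant arguments describe a plain, non-transposed 2-D convolution before substituting aten::conv2d):

#include <torch/csrc/jit/ir/ir.h>
#include <torch/csrc/jit/passes/subgraph_rewrite.h>

// Replace aten::_convolution with aten::conv2d via SubgraphRewriter.
// The replacement graph declares the same input list as the pattern so the
// two graphs line up positionally; conv2d simply ignores the extra arguments.
void rewriteConvolutionToConv2d(std::shared_ptr<torch::jit::Graph>& graph) {
  const std::string convolution_pattern = R"IR(
    graph(%input, %weight, %bias, %stride, %padding, %dilation, %transposed,
          %output_padding, %groups, %benchmark, %deterministic, %cudnn_enabled, %allow_tf32):
      %r = aten::_convolution(%input, %weight, %bias, %stride, %padding, %dilation,
                              %transposed, %output_padding, %groups, %benchmark,
                              %deterministic, %cudnn_enabled, %allow_tf32)
      return (%r))IR";
  const std::string conv2d_replacement = R"IR(
    graph(%input, %weight, %bias, %stride, %padding, %dilation, %transposed,
          %output_padding, %groups, %benchmark, %deterministic, %cudnn_enabled, %allow_tf32):
      %r = aten::conv2d(%input, %weight, %bias, %stride, %padding, %dilation, %groups)
      return (%r))IR";

  torch::jit::SubgraphRewriter rewriter;
  rewriter.RegisterRewritePattern(convolution_pattern, conv2d_replacement);
  rewriter.runOnGraph(graph);
}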

Before rewrite:

graph(%self.1 : __torch__.MyModule,
      %x.1 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu)):
  %7 : int = prim::Constant[value=1](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %6 : bool = prim::Constant[value=0](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %5 : bool = prim::Constant[value=1](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %4 : NoneType = prim::Constant()
  %3 : int[] = prim::Constant[value=[1, 1]]()
  %2 : int[] = prim::Constant[value=[0, 0]]()
  %conv : __torch__.torch.nn.modules.conv.Conv2d = prim::GetAttr[name="conv"](%self.1)
  %z : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::clone(%x.1, %4) # jit_test.py:22:0
  %weight : Float(3, 3, 1, 1, strides=[3, 1, 1, 1], requires_grad=0, device=cpu) = prim::GetAttr[name="weight"](%conv)
  %x : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::_convolution(%x.1, %weight, %4, %3, %2, %3, %6, %2, %7, %6, %6, %5, %5), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %16 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::add(%x, %z, %7) # jit_test.py:24:0
  return (%16)

After rewrite using aten::conv2d:

graph(%self.1 : __torch__.MyModule,
      %x.1 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu)):
  %7 : int = prim::Constant[value=1](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %6 : bool = prim::Constant[value=0](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %5 : bool = prim::Constant[value=1](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %4 : NoneType = prim::Constant()
  %3 : int[] = prim::Constant[value=[1, 1]]()
  %2 : int[] = prim::Constant[value=[0, 0]]()
  %conv : __torch__.torch.nn.modules.conv.Conv2d = prim::GetAttr[name="conv"](%self.1)
  %z : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::clone(%x.1, %4) # jit_test.py:22:0
  %weight : Float(3, 3, 1, 1, strides=[3, 1, 1, 1], requires_grad=0, device=cpu) = prim::GetAttr[name="weight"](%conv)
  %18 : Tensor = aten::conv2d(%x.1, %weight, %4, %3, %2, %3, %7)
  %16 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::add(%18, %z, %7) # jit_test.py:24:0
  return (%16)

Expected result after replacing aten::_convolution with aten::conv2d:

graph(%self.1 : __torch__.MyModule,
      %x.1 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu)):
  %7 : int = prim::Constant[value=1](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %6 : bool = prim::Constant[value=0](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %5 : bool = prim::Constant[value=1](), scope: __module.conv # /home/xiaobinz/miniconda3/envs/pytorch-master/lib/python3.6/site-packages/torch/nn/modules/conv.py:443:0
  %4 : NoneType = prim::Constant()
  %3 : int[] = prim::Constant[value=[1, 1]]()
  %2 : int[] = prim::Constant[value=[0, 0]]()
  %conv : __torch__.torch.nn.modules.conv.Conv2d = prim::GetAttr[name="conv"](%self.1)
  %z : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::clone(%x.1, %4) # jit_test.py:22:0
  %weight : Float(3, 3, 1, 1, strides=[3, 1, 1, 1], requires_grad=0, device=cpu) = prim::GetAttr[name="weight"](%conv)
  %18 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::conv2d(%x.1, %weight, %4, %3, %2, %3, %7)
  %16 : Float(2, 3, 20, 20, strides=[1200, 400, 20, 1], requires_grad=0, device=cpu) = aten::add(%18, %z, %7) # jit_test.py:24:0
  return (%16)

@facebook-github-bot added the oncall: jit label Sep 22, 2021
@facebook-github-bot
Contributor

facebook-github-bot commented Sep 22, 2021


💊 CI failures summary and remediations

As of commit d53c4be (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚



@codecov

codecov bot commented Sep 22, 2021

Codecov Report

Merging #65453 (80b9ebc) into master (64d3c73) will decrease coverage by 0.00%.
The diff coverage is n/a.

❗ Current head 80b9ebc differs from pull request most recent head d53c4be. Consider uploading reports for the commit d53c4be to get more accurate results

@@            Coverage Diff             @@
##           master   #65453      +/-   ##
==========================================
- Coverage   66.38%   66.37%   -0.01%     
==========================================
  Files         739      739              
  Lines       94295    94299       +4     
==========================================
- Hits        62594    62592       -2     
- Misses      31701    31707       +6     

@ZolotukhinM

That change looks reasonable, thank you for making it! Do you mind adding a test for this scenario as well? You can put it here: https://github.com/pytorch/pytorch/blob/master/test/cpp/jit/test_subgraph_rewriter.cpp

@XiaobingSuper
Collaborator Author

@ZolotukhinM, do you know how to check whether a graph node has shape info? I can't find an existing test case that checks it.

@ZolotukhinM

We can access the type info from a Value* in the JIT graph. This code can be used as an example:

// Read the dtype and the concrete sizes from a JIT Value* `v`; bail out
// (return c10::nullopt from the enclosing function) when the tensor type
// is missing or incomplete.
auto const& it = v->type()->cast<TensorType>();
c10::ScalarType dtype = c10::ScalarType::Float;
if (!it) {
  return c10::nullopt;
}
if (!it->isComplete()) {
  return c10::nullopt;
}
if (it->scalarType()) {
  // TODO: ideally we should be strict here and return nullopt if the dtype is
  // absent in the JIT IR. We're assuming a default Float dtype for now, until
  // dtype propagation is implemented.
  dtype = *it->scalarType();
}
auto concrete_sizes = it->sizes().concrete_sizes();
if (!concrete_sizes) {
  return c10::nullopt;
}

Alternatively, and I think this can be a better way for testing, we could simply print the IR after the rewrite and see whether the new IR has the shape info. We could then use FileCheck statements to scan the output and search for the shape info in it (when a value has no shape info, it is printed as "%x : Tensor"; when it does have shape info, it is printed like "%x : Float(10, 20)").
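
A minimal sketch of such a check, assuming graph has already been run through the rewriter (the check string is illustrative, taken from the expected IR above rather than from the test that was actually added):

#include <torch/csrc/jit/testing/file_check.h>

// Passes only if the rewritten conv2d output still carries its full tensor type,
// i.e. the line reads "%18 : Float(2, 3, 20, 20, ..., device=cpu) = aten::conv2d(...)"
// rather than "%18 : Tensor = aten::conv2d(...)".
torch::jit::testing::FileCheck()
    .check("device=cpu) = aten::conv2d")
    ->run(*graph);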

@XiaobingSuper
Collaborator Author

@ZolotukhinM, a test case has been added.


@ZolotukhinM left a comment


Awesome, thanks! Do you need help with merging the PR?

@XiaobingSuper
Collaborator Author

@ZolotukhinM, yes, please help merge it, thanks!

@facebook-github-bot
Contributor

@ZolotukhinM has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@ZolotukhinM merged this pull request in 1682722.

@malfet added this to the 1.10.1 milestone Nov 2, 2021
@seemethere
Member

@malfet I see we marked this for inclusion in the 1.10.1 release. Can we link to the issue that this resolves, to verify it fixes a regression?

@malfet
Contributor

malfet commented Dec 8, 2021

@seemethere this PR adds a test for the regression

@XiaobingSuper deleted the jit-rewrite branch December 20, 2021 04:10
Labels: cla signed, Merged, oncall: jit, open source