[quant][graphmode][fx] Support quantization for standalone module #44074
Conversation
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
💊 CI failures summary and remediations
As of commit 7dd90f4 (more details on the Dr. CI page):
💚 💚 Looks good so far! There are no failures yet. 💚 💚
This comment was automatically generated by Dr. CI.
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 731a64a128fb5851c9af779b77fc3426319a94ec Pull Request resolved: #44074
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: afdeb1374c9bf0a528ce404616faadfd5f05f77f Pull Request resolved: #44074
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: ghstack-source-id: 47dae57c936d9bae8e2f83485d7ac7c5ef31fb74 Pull Request resolved: #44074
Codecov Report

```
@@             Coverage Diff              @@
##   gh/jerryzh168/428/base   #44074   +/- ##
=============================================
  Coverage          ?   68.61%
=============================================
  Files             ?      406
  Lines             ?    52072
  Branches          ?        0
=============================================
  Hits              ?    35729
  Misses            ?    16343
  Partials          ?        0
```

Continue to review full report at Codecov.
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
…ule" Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) register_traceable_module_class(StandaloneModule) m = ModelThatUsesStandaloneModule() m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
I think we can also put it in qconfig_dict; would that make sense?
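For concreteness, a sketch of what that could look like; this is the shape the later revision of the commit message below adopts:

```python
# standalone submodules named directly in the same qconfig_dict
# that is passed to prepare_fx
qconfig_dict = {
    "": qconfig,                               # global qconfig
    "standalone_module_name": ["standalone"],  # quantize m.standalone as one unit
}
```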
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) register_traceable_module_class(StandaloneModule) m = ModelThatUsesStandaloneModule() m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) register_traceable_module_class(StandaloneModule) m = ModelThatUsesStandaloneModule() m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) register_traceable_module_class(StandaloneModule) m = ModelThatUsesStandaloneModule() m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
torch/fx/graph_module.py
Outdated
```diff
@@ -194,10 +194,20 @@ def __reduce__(self):
     def __deepcopy__(self, memo):
         fake_mod = torch.nn.Module()
         fake_mod.__dict__ = copy.deepcopy(self.__dict__)
-        return GraphModule(fake_mod, self.graph)
+        graph_module = GraphModule(fake_mod, self.graph)
```
These are changes from #45182; will rebase on master after it lands.
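For context, a hedged sketch of where that change plausibly heads once the rest of #45182 is in; the subclass name, attribute list, and loop below are illustrative assumptions, not the actual code:

```python
import copy
import torch
from torch.fx import GraphModule

class PreservingGraphModule(GraphModule):  # illustrative subclass
    def __deepcopy__(self, memo):
        fake_mod = torch.nn.Module()
        fake_mod.__dict__ = copy.deepcopy(self.__dict__)
        graph_module = GraphModule(fake_mod, self.graph)
        # Hypothetical: re-attach attributes that GraphModule's constructor
        # does not carry over, so subclass markers survive a deepcopy
        # instead of being silently dropped.
        for attr in ("_is_standalone_module",):  # illustrative name
            if hasattr(self, attr):
                setattr(graph_module, attr, getattr(self, attr))
        return graph_module
```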
lgtm! some optional nit comments inline
torch/quantization/fx/quantize.py
Outdated
```diff
@@ -199,7 +206,7 @@ def get_qconfig(module):
         elif node.op == 'call_module':
             self.qconfig_map[node.name] = get_qconfig(self.modules[node.target])

-    def _prepare(self, model, qconfig_dict, inplace, is_dynamic_quant):
+    def _prepare(self, model, qconfig_dict, inplace, is_dynamic_quant, is_child_module):
```
repeating docs is worth it in some cases, at least IMO. Up to you though.
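For example, a repeated doc on the new parameter might read like this; the wording is a sketch, not the PR's actual docstring:

```python
def _prepare(self, model, qconfig_dict, inplace, is_dynamic_quant, is_child_module):
    """Insert observers into `model` in preparation for calibration.

    Args:
        is_child_module: if True, `model` is a standalone submodule being
            prepared on its own rather than the top-level model; it is
            observed as one unit and later stitched back into the parent
            graph by the parent's prepare step.
    """
```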
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
```diff
@@ -0,0 +1,27 @@
+from torch.fx import GraphModule
+
+class ObservedStandaloneGraphModule(GraphModule):
```
@jamesr66a @zdevito is this what we want?
This looks fine to me in the absence of being able to directly customize what `symbolic_trace` returns. I've filed an issue to track that: #45534
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
… module" Summary: Sometimes user need to quantize a submodule as one unit, and this submodule will be lowered to a different backend like accelerator. The submodule will be quantized with the same fx based graph mode quantization functions and will be connected with the rest of the model automatically. APIs: ```python class StandaloneModule(torch.nn.Module): def __init__(self): super().__init__() self.conv = torch.nn.Conv2d(1, 1, 1) def forward(self, x): return self.conv(x) class CustomTracer(Tracer): def is_leaf_module(self, m, module_qualified_name): return (m.__module__.startswith('torch.nn') and not isinstance(m, torch.nn.Sequential)) or \ isinstance(m, StandaloneModule) class ModelThatUsesStandaloneModule(...): def __init__(self): super().__init__() self.standalone = StandaloneModule() def forward(self, x): return self.standalone(x) m = ModelThatUsesStandaloneModule() qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]} m = prepare_fx(m, qconfig_dict) calibrate(m, data) m = convert_fx(m) m.standalone = lower_to_acclerator(m.standalone) ``` Test Plan: Reviewers: Subscribers: Tasks: Tags: Differential Revision: [D23580642](https://our.internmc.facebook.com/intern/diff/D23580642) [ghstack-poisoned]
This pull request has been merged in 5539066.
Stack from ghstack:
Summary:
Sometimes users need to quantize a submodule as one unit, for example so that
the submodule can be lowered to a different backend, such as an accelerator.
The submodule is quantized with the same FX-based graph mode quantization functions
and is connected with the rest of the model automatically.
APIs:
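The API example from the final commit message, reproduced here; `qconfig`, `calibrate`, `data`, and `lower_to_accelerator` stand in for the user's own objects and functions:

```python
class StandaloneModule(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(1, 1, 1)

    def forward(self, x):
        return self.conv(x)

class ModelThatUsesStandaloneModule(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.standalone = StandaloneModule()

    def forward(self, x):
        return self.standalone(x)

m = ModelThatUsesStandaloneModule()
qconfig_dict = {"": qconfig, "standalone_module_name": ["standalone"]}
m = prepare_fx(m, qconfig_dict)
calibrate(m, data)
m = convert_fx(m)
# the standalone submodule can now be lowered separately, e.g. to an accelerator
m.standalone = lower_to_accelerator(m.standalone)
```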
Test Plan:
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D23580642