[quant][graphmode] FP16 quant support - Insert cast operators #40709
Conversation
Summary: Cast to kHalf and back to kFloat before the linear operator to mimic FP16 quant support. Test Plan: python test/test_quantization.py test_convert_dynamic_fp16 [ghstack-poisoned]
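For reference, a minimal eager-mode Python sketch (not part of this PR; `toy_linear_fp16` is an illustrative name) of the numerics the inserted casts are meant to mimic: the weight is round-tripped through half precision before the regular fp32 linear op.

```python
import torch
import torch.nn.functional as F

def toy_linear_fp16(x: torch.Tensor, w: torch.Tensor, b: torch.Tensor) -> torch.Tensor:
    # Mimic FP16 dynamic quantization: cast the weight to half and back to
    # float (what the inserted aten::to ops compute), then run fp32 linear.
    w_fp32 = w.to(torch.float16).to(torch.float32)
    return F.linear(x, w_fp32, b)

x = torch.randn(4, 8)
w = torch.randn(16, 8)
b = torch.randn(16)
out = toy_linear_fp16(x, w, b)
```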
…ors" Summary: Cast to kHalf and back to kFloat before the linear operator to mimic FP16 quant support Test Plan: python test/test_quantization.py test_convert_dynamic_fp16 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
💊 CI failures summary and remediations: As of commit cabf163 (more details on the Dr. CI page): 💚 Looks good so far! There are no failures yet. 💚 This comment was automatically generated by Dr. CI and has been revised 6 times.
@@ -578,11 +589,9 @@ bool is_module(
  const auto& match_vmap = match.values_map;
  Value* relu = match_vmap.at(vmap.at(vname));
  auto type = relu->type()->cast<ClassType>();
this can be removed?
@@ -44,6 +44,9 @@ TORCH_API bool isScalar(Value* v);
// Check if value is the input of the graph
TORCH_API bool hitGraphInput(Value* value);

// Return the module name that corresponds to the value.
TORCH_API c10::optional<std::string> get_module_name(Value* value);
Should we use the same naming convention as the other functions? I just noticed this problem; the filter functions should probably follow the naming convention as well, but we can do that later.
if (quant_type == QuantType::DYNAMIC && isNoopObserver(observer->input(0))) {
  dequant = insertFP16CastOps(g, observer_out);
} else if (
    quant_type == QuantType::DYNAMIC && !isWeight(module, observer_out)) {
  Value* dtype = g->insertGetAttr(self, qparam_names.back());
  std::tie(choose_qparams, quant, dequant) = insertChooseQParamQuantDequant(
      g, observer_out, dtype, at::Symbol::aten(quantize_func));
Probably better to add a comment for the else branch; it accepts both dynamic (activation) quant and static quant.
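For readers of this hunk, a hedged Python sketch of what the two dynamic branches compute numerically; the helper names and the hand-rolled qparam math below are illustrative, not the actual pass implementation.

```python
import torch

def fp16_branch(w: torch.Tensor) -> torch.Tensor:
    # Weight observed by a NoopObserver: the pass inserts casts to kHalf
    # and back to kFloat, i.e. a round trip through half precision.
    return w.to(torch.float16).to(torch.float32)

def int8_dynamic_branch(x: torch.Tensor) -> torch.Tensor:
    # Non-weight value under dynamic quant: qparams are chosen from the
    # tensor at runtime, then the value is quantized and dequantized.
    qmin, qmax = 0, 255
    scale = float(x.max() - x.min()) / (qmax - qmin)
    if scale == 0.0:
        scale = 1.0
    zero_point = int(min(max(qmin - round(float(x.min()) / scale), qmin), qmax))
    xq = torch.quantize_per_tensor(x, scale, zero_point, torch.quint8)
    return xq.dequantize()
```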
…ors" Summary: Cast to kHalf and back to kFloat before the linear operator to mimic FP16 quant support Test Plan: python test/test_quantization.py test_convert_dynamic_fp16 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
…ors" Summary: Cast to kHalf and back to kFloat before the linear operator to mimic FP16 quant support Test Plan: python test/test_quantization.py test_convert_dynamic_fp16 Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]
This pull request has been merged in 55b5ab1.
Stack from ghstack:
Summary:
Cast to kHalf and back to kFloat before the linear operator to mimic FP16 quant support
Test Plan:
python test/test_quantization.py test_convert_dynamic_fp16
Reviewers:
Subscribers:
Tasks:
Tags:
Differential Revision: D22335977
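To exercise the path covered by the test plan above, a hedged usage sketch, assuming the graph-mode entry points of this era (`torch.quantization.quantize_dynamic_jit` and `float16_dynamic_qconfig`); exact names and import locations may differ across releases.

```python
import torch
from torch.quantization import float16_dynamic_qconfig, quantize_dynamic_jit

class M(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.fc = torch.nn.Linear(8, 16)

    def forward(self, x):
        return self.fc(x)

# Script the model, then convert with an FP16 dynamic qconfig; the pass in
# this PR should insert the kHalf/kFloat casts around the linear op.
scripted = torch.jit.script(M())
quantized = quantize_dynamic_jit(scripted, {'': float16_dynamic_qconfig})
print(quantized.graph)
```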