Enable new FP16 and support mixed precision by MO #8514
Conversation
(branch force-pushed from ec7b8b4 to 5aa4471)
(branch force-pushed from 79a3084 to d3e3ce5)
(branch force-pushed from 51dea7e to aa7e390)
@vinograd47 @ArtemySkrebkov-intel @AntonDudchenko, please take a closer look. It may have a critical impact on VPU functionality.
Because of the replacements on the MO side, this PR has a lot of changes. To review it more easily, please look at the separate commits.
model-optimizer/unit_tests/extensions/back/ChangeOutputTypeAttributes_test.py
Can you please generate a package of IRs using this PR for us to test?
(branch force-pushed from 1b1cfcd to 1fed674)
(branch force-pushed from 1fed674 to 0fa00ae)
(branch force-pushed from c615c84 to 911e569)
(branch force-pushed from a2be70b to 3543894)
Please fix the bug exposed in e2e. I will run one more DLB run over the weekend.
Please fix the failure. Also, I need to review these changes one more time.
src/common/transformations/src/transformations/common_optimizations/common_optimizations.cpp
I disagree with the changes under "Replace usage of np.array with mo_array, which forces fp32 dtype if no dtype was passed in MO". Please revert them.
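For context, a minimal sketch of what such an mo_array wrapper would do differently from plain np.array, assuming only the behavior described in the comment above (the body is hypothetical, not MO's actual implementation):

```python
import numpy as np

def mo_array(value, dtype=None):
    """Hypothetical sketch: behaves like np.array, but when no dtype is
    passed and numpy infers float64, the result is downcast to float32
    (the default float type discussed in this thread)."""
    x = np.array(value, dtype=dtype)
    if dtype is None and x.dtype == np.float64:
        x = x.astype(np.float32)
    return x

print(np.array([1.0]).dtype)   # float64 -- numpy's default
print(mo_array([1.0]).dtype)   # float32 -- forced when dtype is omitted
```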
Please apply the review comments.
```python
# Align the data type of the Convolution weights' Const with the data input type.
in_node_1 = node.in_port(1).get_source().node
if in_type_1 in [np.float16, np.float32, np.float64] and in_type_0 != in_type_1 and in_node_1.op == 'Const':
    log.error("Changing Const node '{}' data type from {} to {} for Convolution operation".format(
        in_node_1.soft_get('name', in_node_1.id), in_type_1, in_type_0),
        extra={'is_warning': True})
    convert_const_node_value_type(in_node_1, in_type_0)
```
What cases exactly does this logic cover? Mismatches between the input and weight types?
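For illustration, a numpy-only sketch of the situation the snippet above appears to handle (variable names hypothetical; the real pass operates on MO graph nodes, not arrays): a Convolution whose data input is fp32 while its Const weights are fp16, with the Const converted to match the data input:

```python
import numpy as np

in_type_0 = np.float32                              # data input element type
weights = np.ones((8, 3, 3, 3), dtype=np.float16)   # Const weights, fp16
in_type_1 = weights.dtype

# Convert the Const weights to the data input's floating-point type
# so the Convolution sees a single precision on both inputs.
if in_type_1 in (np.float16, np.float32, np.float64) and in_type_1 != in_type_0:
    weights = weights.astype(in_type_0)

assert weights.dtype == np.float32
```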
In general, it looks great. A few remaining comments are not so relevant if the DLB results come back normal. Let's wait for DLB and merge if it is fine.
* Enable new FP16 format and support mixed precision
* Apply review comments
* Fix issue with fp64 in FakeQuantWithMinMaxVars.py
* Enable decompression Converts fusing for CPU plugin
* Apply review feedback
* Fix code style
* Fix issue with np.full and apply review feedback
* Apply review feedback
* Fix HardSigmoid onnx extractor
* Replace np.arrays that were skipped with mo_array
* Fix compress_quantized_weights_test.py
* Fix import issues
* Apply review feedback and fix type of fusing linops in MO
* Apply review feedback
* Fix types for Mean/Scales and MXNET zeros
* Add RandomUniform_8 to ConvertPrecision
* Fix merge issue
* Fix consts names collision in GPU plugin
Details:
Tickets:
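For background on the compressed-FP16 pattern this PR enables (per the "Enable decompression Converts fusing for CPU plugin" item above), a minimal numpy-only sketch with hypothetical helper names: weights are stored as an fp16 Const followed by a decompression Convert back to fp32, which a plugin may fuse into the consuming operation:

```python
import numpy as np

def compress_weights(weights_fp32):
    """Hypothetical sketch of FP16 weight compression. The real MO pass
    stores the Const in fp16 and inserts a Convert(f16 -> f32) node in
    the IR; here the Convert is modeled as a closure."""
    const_fp16 = weights_fp32.astype(np.float16)    # compressed Const
    def decompress():                               # Convert node analogue
        return const_fp16.astype(np.float32)
    return const_fp16, decompress

w = np.random.rand(16, 8).astype(np.float32)
w_fp16, convert = compress_weights(w)
assert w_fp16.nbytes == w.nbytes // 2               # half the storage
assert convert().dtype == np.float32                # fp32 restored for compute
```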