Quantization tool supporting Conv and MatMul nodes #1892

PhaniShekhar · 2019-03-29T01:26:27Z

Quantization tool for converting an onnx model into quantized onnx model. Currently only conversion of Conv and MatMul nodes is supported.

CLAassistant · 2019-03-29T01:26:34Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

askhade · 2019-04-18T01:03:39Z

tools/quantization/README.md

+# Load the onnx model
+model = onnx.load('path/to/the/model.onnx')
+# Quantize
+quantized_model = quantize(model, per_channel=False, quantization_mode=QuantizationMode.IntegerOps_Dynamic)


Static mode involves more inputs right? like the quantization params… can you include an example for static mode too...

askhade · 2019-04-23T22:13:02Z

tools/quantization/quantize.py

+        max_range = max(abs(rmin), abs(rmax))
+        scale = (float(max_range)*2) / quantize_range
+        zero_point = 0
+        quantized_data = (np.asarray(data) / scale).round().astype('b') #signed byte type


Does this round half to even?

askhade · 2019-04-23T22:18:11Z

tools/quantization/quantize.py

+            S: scale
+            z: zero point
+    '''
+    rmin = min(data)


what happens when range does not include 0? For example is the range is 2-10 then in this case we dont have a unique representation for 0. Can you do rmin = min(min(data), 0) and similar for max...

askhade · 2019-04-23T22:25:37Z

tools/quantization/quantize.py

+        scale_name = weight.name + '_scale'
+        zero_point_name = weight.name + '_zero_point'
+
+        # Remove existing weight initializer


What happens when this input is also being used by another node? This condition should be checked.

askhade

Please address the comments and update the branch by merging with master

Quantization tool supporting Conv and MatMul nodes

5baec93

shinh mentioned this pull request Apr 1, 2019

Quantized ops pfnet-research/chainer-compiler#104

Closed

Changes to support onnx quantization spec. Some fixes.

009d961

PhaniShekhar requested a review from a team as a code owner April 17, 2019 03:07

Add Readme file

e26052d

askhade reviewed Apr 18, 2019

View reviewed changes

askhade reviewed Apr 23, 2019

View reviewed changes

askhade suggested changes Apr 24, 2019

View reviewed changes

askhade closed this Jan 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantization tool supporting Conv and MatMul nodes #1892

Quantization tool supporting Conv and MatMul nodes #1892

PhaniShekhar commented Mar 29, 2019

CLAassistant commented Mar 29, 2019 •

edited

Loading

askhade Apr 18, 2019

askhade Apr 23, 2019

askhade Apr 23, 2019

askhade Apr 23, 2019

askhade left a comment

Quantization tool supporting Conv and MatMul nodes #1892

Quantization tool supporting Conv and MatMul nodes #1892

Conversation

PhaniShekhar commented Mar 29, 2019

CLAassistant commented Mar 29, 2019 • edited Loading

askhade Apr 18, 2019

Choose a reason for hiding this comment

askhade Apr 23, 2019

Choose a reason for hiding this comment

askhade Apr 23, 2019

Choose a reason for hiding this comment

askhade Apr 23, 2019

Choose a reason for hiding this comment

askhade left a comment

Choose a reason for hiding this comment

CLAassistant commented Mar 29, 2019 •

edited

Loading