DynamicQuantizeLinear opset 20 and float 8 #5472
Conversation
Signed-off-by: Xavier Dupre <xadupre@microsoft.com>
We also need to add fp8 support for MatMulInteger to support dynamic quantization for fp8.
Signed-off-by: Xavier Dupre <xadupre@microsoft.com>
The function defined by CUDA's cublasLtMatMul allows more than one option for the output type with the same input types. Since there is no scale for the output, the output type could be float32, float16, or bfloat16. I started to modify QLinearMatMul in PR #5473, which can be seen as a more generic version of MatMulInteger. There is also the transposition to take out of the equation, and cublasLtMatMul only supports
Signed-off-by: Xavier Dupre <xadupre@microsoft.com>
Nit: "convertion" -> "conversion" |
Signed-off-by: Xavier Dupre <xadupre@microsoft.com>
Is this ready for review?
We also see more interest in adding 16-bit support to QuantizeLinear/DequantizeLinear: #3971 (comment). If we do bump DynamicQuantizeLinear in this opset version, it might be a good time to add 16-bit support to QuantizeLinear/DequantizeLinear as well (if it makes sense, it can be done in another PR).
The only thing which would require a larger consensus is the method I used to estimate the scale for float 8. Models are usually trained with float 8 and the scale estimation is part of the training, so it is different from what I came up with.
Cc @gramalingam
Signed-off-by: Xavier Dupre <xadupre@microsoft.com>
* rounding to nearest ties to even.

Data quantization formula is:

```
y = saturate (round (x / y_scale) + y_zero_point)
```

* for saturation, it saturates to [0, 255] if it's uint8, or [-127, 127] if it's int8. Right now only uint8 is supported.
* rounding to nearest ties to even.

y_zero_point must be 0 for any float 8 type.
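A minimal numpy sketch of this formula for the uint8/int8 cases, assuming the saturation bounds listed above (the helper name is illustrative, not code from the PR):

```python
import numpy as np

def quantize(x: np.ndarray, y_scale: float, y_zero_point: int, dtype=np.uint8) -> np.ndarray:
    # Saturation range: [0, 255] for uint8, [-127, 127] for int8.
    qmin, qmax = (0, 255) if dtype == np.uint8 else (-127, 127)
    # np.rint rounds to nearest with ties to even, matching the spec's rounding rule.
    y = np.rint(x / y_scale) + y_zero_point
    return np.clip(y, qmin, qmax).astype(dtype)
```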
But this is a problem: if we are forced to use 0 as the zero point, the above computation of the scale will not guarantee that all values can be reasonably represented as a float 8. We may need to change the computation of the scale as well, using something like "max ( max(x)/qmax, min(x)/qmin )" with some adjustments for rounding etc.
But, better still, if this is already being used in practice, what are people doing?
May need some adjustments to ensure signs are handled correctly as well.
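A rough sketch of the scale computation suggested above for a float 8 type with the zero point fixed at 0. The helper is hypothetical, not code from this PR; qmax stands for the largest finite value of the target float 8 type (e.g. 448 for float8e4m3fn), and the symmetric ranges mean qmin = -qmax:

```python
import numpy as np

def dynamic_float8_scale(x: np.ndarray, qmax: float = 448.0) -> float:
    # With y_zero_point forced to 0, the scale alone must cover both the most
    # positive and the most negative input values. Since qmin = -qmax here,
    # min(x)/qmin reduces to -min(x)/qmax; taking the max keeps the sign right.
    scale = max(np.max(x) / qmax, np.min(x) / -qmax)
    # Guard against an all-zero input producing a zero scale.
    return float(scale) if scale > 0 else 1.0
```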
Description
DynamicQuantizeLinear only supports uint8. This PR adds support for int8 and float 8.
Motivation and Context
The operator is used to dynamically quantize an input.
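For context, a compact numpy sketch of the existing uint8 behaviour this PR extends, following the scale and zero-point formulas in the operator's documentation (the function name is illustrative):

```python
import numpy as np

def dynamic_quantize_linear(x: np.ndarray):
    qmin, qmax = 0, 255  # uint8 quantization range
    # Adjust the data range to include 0 so that 0.0 stays exactly representable.
    x_min, x_max = min(np.min(x), 0.0), max(np.max(x), 0.0)
    y_scale = (x_max - x_min) / (qmax - qmin)
    if y_scale == 0:  # all-zero input; any positive scale works
        y_scale = 1.0
    # Zero point: shift so x_min maps to qmin, then round (ties to even) and saturate.
    y_zero_point = np.uint8(np.clip(np.rint(qmin - x_min / y_scale), qmin, qmax))
    y = np.clip(np.rint(x / y_scale) + y_zero_point, qmin, qmax).astype(np.uint8)
    return y, np.float32(y_scale), y_zero_point
```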