Fix FoldTransposeIntoQuantInit Transformation #78

Merged: 12 commits into fastmachinelearning:main on Oct 23, 2023

Conversation

iksnagreb
Contributor

Intends to fix problems related to the FoldTransposeIntoQuantInit transformation. The transformation should only be applied if the Transpose node actually follows a so-called QuantInit, i.e., a Quant (or BipolarQuant) node whose inputs are all initializers. Currently, the transform is applied even if only some of the inputs have initializers. This breaks shape inference, as the remaining Quant node does not transpose its runtime inputs. This is fixed by making the test for a node being a QuantInit stricter.
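
For illustration, a minimal sketch of the stricter check (not the PR's literal code), assuming the qonnx ModelWrapper API and its get_initializer helper:

    def is_quant_init(node, model):
        # A Quant/BipolarQuant node only counts as a QuantInit if *every*
        # input is backed by an initializer, i.e., it has no runtime inputs
        return node.op_type in {"Quant", "BipolarQuant"} and all(
            model.get_initializer(name) is not None for name in node.input
        )

Only if this predicate holds can the Transpose be folded by transposing all of the node's initializers; otherwise the Transpose must be kept in the graph.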

I have tested this by running the unit tests for QONNX as well as those under tests/transformation over at FINN, and did not observe any issues so far. For more context, please see the following issues: #77, Xilinx/finn#878, Xilinx/finn#892.

maltanar and others added 3 commits September 22, 2023 19:54
Previously, the transform was seemingly applied to all Quant-Transpose
patterns, irrespective of whether *all* the inputs are actually
initializers. This should now be fixed by testing more strictly for the
node being a QuantInit.
@iksnagreb
Contributor Author

Hm, all tests fail? What is going on? It doesn't seem to be my fault?

@iksnagreb
Contributor Author

I have just added a new unit test validating both situations, i.e., keeping and removing the Transpose node, as well as running the InferShapes transformation directly after FoldTransposeIntoQuantInit.
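
Roughly, such a test could look like the following sketch. This is not the PR's actual test; in particular, the import path of FoldTransposeIntoQuantInit and the Quant node domain are assumptions on my part:

    import numpy as np
    from onnx import TensorProto, helper
    from qonnx.core.modelwrapper import ModelWrapper
    from qonnx.transformation.infer_shapes import InferShapes
    # assumed import path for the transformation under test
    from qonnx.transformation.quant_constant_folding import FoldTransposeIntoQuantInit

    def make_quant_transpose_model(runtime_input):
        # Quant(x, scale, zeropoint, bitwidth) followed by a Transpose
        quant = helper.make_node(
            "Quant", ["x", "scale", "zeropoint", "bitwidth"], ["q"],
            domain="qonnx.custom_op.general",
            narrow=0, signed=1, rounding_mode="ROUND",
        )
        transpose = helper.make_node("Transpose", ["q"], ["y"], perm=[0, 2, 3, 1])
        graph = helper.make_graph(
            [quant, transpose], "quant_transpose",
            [helper.make_tensor_value_info("x", TensorProto.FLOAT, [1, 2, 3, 4])],
            [helper.make_tensor_value_info("y", TensorProto.FLOAT, None)],
        )
        model = ModelWrapper(helper.make_model(graph))
        if not runtime_input:
            # all-initializer case: the Quant node is a true QuantInit
            model.set_initializer("x", np.zeros((1, 2, 3, 4), dtype=np.float32))
        model.set_initializer("scale", np.asarray(1.0, dtype=np.float32))
        model.set_initializer("zeropoint", np.asarray(0.0, dtype=np.float32))
        model.set_initializer("bitwidth", np.asarray(8.0, dtype=np.float32))
        return model

    for runtime_input in (True, False):
        model = make_quant_transpose_model(runtime_input)
        model = model.transform(FoldTransposeIntoQuantInit())
        # shape inference must succeed in both situations
        model = model.transform(InferShapes())
        kept = any(n.op_type == "Transpose" for n in model.graph.node)
        # the Transpose survives iff the Quant node has a runtime input
        assert kept == runtime_input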

@iksnagreb
Contributor Author

Hm, again some unrelated tests fail. This time it is some HTTP Error 500: INTERNAL SERVER ERROR. However, all tests passed for the two commits in between, so this seems to be something external?

@maltanar
Collaborator

The previous CI failures were due to an onnxruntime bug that got fixed last week. main is now updated to use the most recent version, which takes care of that, and I have updated this PR accordingly.

The last CI failures seem to be due to some intermittent server failure, re-running was enough to make all tests pass.

Otherwise, the PR looks all good to me - thanks @iksnagreb! I'll only add one comment here about this bit before I hit merge:

                        # Skip transposing the initializer if the number of
                        # dimensions does not match
                        if perm is not None and len(perm) != tensor.ndim:
                            # Note: Soft skip ok or is this an error?
                            continue

The only cases I've previously seen where perm and the tensor disagree on the number of dimensions are when the tensor in question is a scalar (for instance, a global zeropoint value of 0). Those are safe to skip.

In theory, one could take advantage of shape broadcasting to create quantization parameters that are neither scalar nor match the number of dimensions of the target tensor, as this is currently mis-specified in the Quant node spec. I'll update the Quant node spec to permit only the scalar or ndim == tensor ndim cases.
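
To make the two shape cases concrete, a tiny illustration with numpy standing in for the initializer tensors:

    import numpy as np

    perm = [0, 2, 3, 1]               # permutation from the Transpose node (4D)
    scalar_zp = np.asarray(0.0)       # global zeropoint, ndim == 0
    bcast_scale = np.ones((8, 1, 1))  # broadcasts against a 4D tensor, ndim == 3

    # Both fall into the len(perm) != tensor.ndim branch quoted above and are
    # skipped; transposing them would raise, e.g.
    # np.transpose(scalar_zp, perm) -> AxisError
    for tensor in (scalar_zp, bcast_scale):
        assert len(perm) != tensor.ndim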

@maltanar merged commit c966b46 into fastmachinelearning:main on Oct 23, 2023
5 checks passed
@iksnagreb
Contributor Author

Thank you!
