[TFL] Enhance MoveBinaryOpBeforeReshape pattern to enable fusion of 2D bias. #47510

WindQAQ · 2021-03-02T20:19:48Z

Some uses of the EinsumDense layer create a bias of rank 2 or higher. For example,

import tensorflow as tf

# MultiHeadAttention builds its projections with EinsumDense layers.
layer = tf.keras.layers.MultiHeadAttention(num_heads=3, key_dim=5)
target = tf.keras.Input(shape=[8, 16], batch_size=1)
source = tf.keras.Input(shape=[4, 16], batch_size=1)
output_tensor = layer(target, source, return_attention_scores=False)
model = tf.keras.Model([target, source], output_tensor)

This PR reorders Reshape and BinaryOp so that a 2D bias is flattened and can be fused into FullyConnected. In the case above, the bias add can be fused into FullyConnected at locations 0, 2 and 172.
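The reordering relies on the fact that adding a 2D bias after a Reshape gives the same values as adding the flattened bias before the Reshape. The following is a minimal sketch of that equivalence using plain Python lists in place of tensors. The helper names (`reshape_rows`, `add_bias_2d`, `add_bias_1d`) and the shapes are assumptions chosen for illustration; they are not part of the PR or of TFLite.

```python
# Toy illustration only: nested Python lists stand in for tensors.

def reshape_rows(matrix, heads, dim):
    """Reshape [batch, heads*dim] into [batch, heads, dim] (row-major)."""
    return [[row[h * dim:(h + 1) * dim] for h in range(heads)] for row in matrix]

def add_bias_2d(tensor3d, bias2d):
    """Broadcast-add a [heads, dim] bias over a [batch, heads, dim] tensor."""
    return [[[x + b for x, b in zip(vec, bias_vec)]
             for vec, bias_vec in zip(sample, bias2d)]
            for sample in tensor3d]

def add_bias_1d(matrix, bias1d):
    """Broadcast-add a [heads*dim] bias over a [batch, heads*dim] matrix."""
    return [[x + b for x, b in zip(row, bias1d)] for row in matrix]

batch, heads, dim = 2, 3, 5
# Stand-in for a FullyConnected output of shape [batch, heads*dim].
fc_out = [[float(r * heads * dim + c) for c in range(heads * dim)]
          for r in range(batch)]
bias2d = [[0.5 * (h * dim + d) for d in range(dim)] for h in range(heads)]

# Original order: Reshape, then Add with the 2D bias.
reshape_then_add = add_bias_2d(reshape_rows(fc_out, heads, dim), bias2d)

# Reordered: flatten the bias, Add on the 2D output, then Reshape.
bias1d = [b for row in bias2d for b in row]
add_then_reshape = reshape_rows(add_bias_1d(fc_out, bias1d), heads, dim)

assert reshape_then_add == add_then_reshape
```

Once the Add sits directly after the FullyConnected output with a 1D operand, it has the form that an existing bias-fusion pattern can absorb, which is the effect the PR description aims for.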


abattery · 2021-03-02T21:07:44Z

tensorflow/compiler/mlir/lite/tests/optimize.mlir

@@ -642,6 +642,34 @@ func @NotReorderReshapeAddIfHighDim(%arg0: tensor<1x1x1x1x30x96xf32>) -> tensor<
  // CHECK: return %[[rs2]]
 }

+// CHECK-LABEL: @ReorderReshapeAdd2DConst


Could you add test cases for reordering and fusing?

Updated. I followed the test cases of FuseFullyConnectedReshapeAddConst*. Let me know if that is appropriate. Thank you!

abattery · 2021-03-02T22:09:24Z

Thanks for the contributions!

abattery · 2021-03-03T00:46:38Z

Please add additional checks in the new patterns to make sure that the newly generated binary ops will have <=4D inputs since TFLite binary kernels support broadcasting up to 4D inputs.
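The requested check can be pictured as a simple rank guard that runs before the rewrite is applied. This is a hedged sketch in Python, not the actual pattern constraint in the PR: `can_reorder` and `MAX_BROADCAST_RANK` are hypothetical names, and shapes are modeled as plain lists of dimensions.

```python
# Hypothetical guard, assuming shapes are given as lists of dimension sizes.
# The limit of 4 comes from the review comment about broadcasting support.
MAX_BROADCAST_RANK = 4

def can_reorder(lhs_shape, rhs_shape, max_rank=MAX_BROADCAST_RANK):
    """Return True if both operands of the new binary op have rank <= max_rank."""
    return len(lhs_shape) <= max_rank and len(rhs_shape) <= max_rank

# A rank-3 input with a rank-1 bias passes the guard.
low_rank_ok = can_reorder([1, 30, 96], [96])

# A rank-6 input, like the one in NotReorderReshapeAddIfHighDim, is rejected.
high_rank_ok = can_reorder([1, 1, 1, 1, 30, 96], [96])
```

With a guard of this kind, the rewrite is skipped rather than producing a binary op whose operands exceed the supported broadcasting rank.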

abattery · 2021-03-03T00:49:54Z

Also it would be nice to add a test case for >4D input cases.

abattery · 2021-03-03T01:04:12Z

Also, can we limit the reordering to cases where the FullyConnected op appears next to it?

WindQAQ · 2021-03-03T01:44:26Z

> Also, can we limit the reordering to cases where the FullyConnected op appears next to it?

Do you mean this pattern only, or all the other reordering patterns as well?

abattery · 2021-03-03T01:47:39Z

> Do you mean this pattern only, or all the other reordering patterns as well?

Restricting only the newly added patterns is fine for now.
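The restriction agreed on here amounts to checking where the Reshape input comes from before reordering. Below is a minimal sketch on a toy operation graph. The `Op` class, the op kind strings, and `reshape_fed_by_fully_connected` are all hypothetical and only illustrate the idea; the PR itself expresses this as a constraint on the new patterns.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Op:
    """Toy stand-in for an operation: a kind plus the ops that produce its inputs."""
    kind: str
    inputs: List["Op"] = field(default_factory=list)

def reshape_fed_by_fully_connected(binary_op: Op) -> bool:
    """True if the first operand is a reshape whose input is a fully_connected op."""
    if not binary_op.inputs:
        return False
    reshape = binary_op.inputs[0]
    if reshape.kind != "reshape" or not reshape.inputs:
        return False
    return reshape.inputs[0].kind == "fully_connected"

# fully_connected -> reshape -> add(const): eligible for reordering.
fc = Op("fully_connected")
eligible = Op("add", [Op("reshape", [fc]), Op("const")])

# conv_2d -> reshape -> add(const): left untouched by the new patterns.
not_eligible = Op("add", [Op("reshape", [Op("conv_2d")]), Op("const")])

eligible_result = reshape_fed_by_fully_connected(eligible)
not_eligible_result = reshape_fed_by_fully_connected(not_eligible)
```

Limiting the rewrite this way keeps it focused on the case where moving the binary op actually enables bias fusion.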

WindQAQ · 2021-03-03T03:03:35Z

> Also it would be nice to add a test case for >4D input cases.

Because we are restricting the input to be defined by FullyConnected, which usually produces a 2D output, I cannot really find a use case with 4D input. I did include one test for high-dimensional input in e26e2a7. Let me know if it passes all internal checks. Thank you!

google-ml-butler bot added the size:M (CL Change Size: Medium) label Mar 2, 2021

google-ml-butler bot requested a review from joker-eph March 2, 2021 20:19

google-cla bot added the cla: yes label Mar 2, 2021

abattery reviewed Mar 2, 2021


Add some tests for reordering and fusion

22d7167

abattery approved these changes Mar 2, 2021


google-ml-butler bot added the kokoro:force-run (Tests on submitted change) and ready to pull (PR ready for merge process) labels Mar 2, 2021

kokoro-team removed the kokoro:force-run (Tests on submitted change) label Mar 2, 2021

WindQAQ added 2 commits March 2, 2021 18:41

Restrict operands to have at most rank 4

e26e2a7

Restrict input to be defined by FullyConnected

f0c7f60

google-ml-butler bot removed the ready to pull (PR ready for merge process) label Mar 3, 2021

abattery approved these changes Mar 3, 2021


google-ml-butler bot added the kokoro:force-run (Tests on submitted change) and ready to pull (PR ready for merge process) labels Mar 3, 2021

kokoro-team removed the kokoro:force-run (Tests on submitted change) label Mar 3, 2021

rthadur assigned gbaned Mar 3, 2021

rthadur added this to Assigned Reviewer in PR Queue via automation Mar 3, 2021

copybara-service bot merged commit 2812873 into tensorflow:master Mar 3, 2021

PR Queue automation moved this from Assigned Reviewer to Merged Mar 3, 2021
