TF-TRT Improve matrix multiplication conversion and enable dynamic shape mode #47215
Conversation
Thanks for your work!
I have finished reviewing the code, but not the tests yet. Sending out the comments I have so far.
Force-pushed: f9a52b0 → f7fb167
Thanks @bixia1 for the review, I have addressed the issues!
Please remember to rebase and squash.
Force-pushed: f7fb167 → 2fb6a21
Thanks @bixia1 for the additional comments, I have addressed the issues!
Thanks for your work! Just one remaining thing: in one place you said you added a more detailed description, but I couldn't find it by searching for that string in the code.
Force-pushed: 2fb6a21 → 5136ce1
I have fixed the missing comment.
This PR improves the MatMul and BatchMatMul converters.

- An explicit transpose of the input to `IMatrixMultiplyLayer` is not necessary, because `IMatrixMultiplyLayer` can directly pass the transpose flags to the underlying GEMM call, which can use them to access elements with the correct stride without any actual transposition.
- `IFullyConnectedLayer` (FC) usage fixed: FC is preferred over `IMatrixMultiply` because it is expected to give better performance. Moreover, currently only the FC layer supports INT8 precision.
- Fixed conversion of `BatchMatMul` to FC: broadcast now preserves the information about whether each input is a tensor or a weight, so that we can correctly check the FC compatibility condition.
- `BatchMatMul` involves a potential broadcast step. TRT requires that the input tensors have the same rank, with 1s filled in for the dimensions that need to be broadcast. A helper function `BroadcastTensors` was added to make the tensors match in rank. In dynamic shape mode we need shape inference for this step; the `DynamicReshape` function was modified to allow the insertion of multiple singleton dimensions.

Tagging @bixia1 for review and @DEKHTIARJonathan for visibility.
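For intuition, the rank-matching step described above can be sketched in plain Python. Here `pad_to_common_rank` is a hypothetical stand-in for the PR's `BroadcastTensors` helper, which operates on TensorRT tensors and shapes rather than Python lists:

```python
def pad_to_common_rank(shape_a, shape_b):
    """Prepend size-1 dimensions to the lower-rank shape so both shapes
    end up with the same rank, matching TRT's broadcast requirement.
    Purely illustrative; the real helper works on TRT tensor shapes."""
    rank = max(len(shape_a), len(shape_b))
    pad = lambda s: [1] * (rank - len(s)) + list(s)
    return pad(shape_a), pad(shape_b)

print(pad_to_common_rank([3, 4], [2, 3, 4]))  # → ([1, 3, 4], [2, 3, 4])
```

With dynamic shapes the concrete dimension values are unknown at build time, which is why shape inference is needed to construct the padded shape at runtime.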
Tracker: #45481
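To illustrate the first bullet: a GEMM can honor transpose flags simply by changing how it indexes each operand, so no transposed copy is ever materialized. A minimal pure-Python sketch (not the actual TRT/cuBLAS code; `gemm` is a hypothetical name):

```python
def gemm(a, b, transpose_a=False, transpose_b=False):
    """Naive GEMM on lists-of-lists that honors transpose flags by
    swapping the index order used to read each operand, instead of
    building a transposed copy. This mirrors how BLAS-style GEMMs use
    the flags to pick the element-access stride."""
    read_a = (lambda i, k: a[k][i]) if transpose_a else (lambda i, k: a[i][k])
    read_b = (lambda k, j: b[j][k]) if transpose_b else (lambda k, j: b[k][j])
    m = len(a[0]) if transpose_a else len(a)       # rows of op(A)
    kdim = len(a) if transpose_a else len(a[0])    # shared inner dim
    n = len(b) if transpose_b else len(b[0])       # cols of op(B)
    return [[sum(read_a(i, k) * read_b(k, j) for k in range(kdim))
             for j in range(n)] for i in range(m)]

identity = [[1, 0], [0, 1]]
print(gemm([[1, 2], [3, 4]], identity, transpose_a=True))  # → [[1, 3], [2, 4]]
```

This is why the converter can drop an explicit transpose layer before the matrix multiply: the flags achieve the same result with no extra data movement.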