
TF-TRT C++ conversion #52012

Merged

Conversation

tfeher (Contributor) commented Sep 15, 2021

TF-TRT C++ interface to convert models.

Differences compared to the Python API:

  • It is assumed that the input graph is frozen. This assumption will be removed in a follow-up PR.
  • The convert_to_static_engine conversion parameter allows converting dynamic engines to static engines.
  • Currently we do not provide a way to save the engine files as assets.

The steps for model conversion by ConvertAndBuild (a usage sketch follows below):

  1. Inline functions.
  2. Freeze the graph (omitted, since we assume a frozen input graph).
  3. Run Grappler with the TRTOptimizer pass.
  4. Run inference on the graph to provide shape information (only in dynamic shape mode).
  5. Run inference on the graph to build the engines.
  6. Convert the graph_def to have static engines.

(On the Python side, steps 4-5 are done by a separate build function.)
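
A minimal usage sketch of what calling this entry point might look like, assuming the ConvertAndBuild name and the convert_to_static_engine parameter from this description; the parameter struct name (TfTrtConversionParams), the use_dynamic_shape field, the StatusOr return type, and the wrapper function name are assumptions, and the exact signatures in trt_convert_api.h may differ:

#include <string>
#include <vector>

#include "tensorflow/compiler/tf2tensorrt/trt_convert_api.h"
#include "tensorflow/core/framework/graph.pb.h"
#include "tensorflow/core/framework/tensor.h"
#include "tensorflow/core/platform/statusor.h"

// Hypothetical wrapper; field and function signatures may differ from the merged API.
tensorflow::StatusOr<tensorflow::GraphDef> ConvertFrozenModel(
    const tensorflow::GraphDef& frozen_graph_def,
    const std::vector<std::string>& input_names,
    const std::vector<std::string>& output_names,
    const std::vector<std::vector<tensorflow::Tensor>>& build_inputs) {
  tensorflow::tensorrt::TfTrtConversionParams params;
  params.use_dynamic_shape = true;          // dynamic shape mode (enables steps 4-5)
  params.convert_to_static_engine = true;   // step 6: embed static engines in the graph_def
  // build_inputs holds one vector of tensors per graph input; they are used to
  // run the graph for shape collection and engine building (steps 4-5 above).
  return tensorflow::tensorrt::ConvertAndBuild(frozen_graph_def, input_names,
                                               output_names, build_inputs, params);
}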

Related PRs:

google-ml-butler bot added the size:XL (CL Change Size: Extra Large) label on Sep 15, 2021
google-cla bot added the cla: yes label on Sep 15, 2021
google-ml-butler bot added the awaiting review (Pull request awaiting review) label on Sep 15, 2021
gbaned self-assigned this on Sep 15, 2021
gbaned added this to Assigned Reviewer in PR Queue via automation on Sep 15, 2021
bixia1 self-requested a review on September 16, 2021, 16:40
bixia1 added the comp:gpu:tensorrt (Issues specific to TensorRT) label on Sep 16, 2021
bixia1 (Contributor) commented Sep 16, 2021

We can avoid the change to the utilities for freezing the graph by updating our SavedModelBundle with the converted GraphDef. Something like this:

// Point the bundle's MetaGraphDef at the converted GraphDef.
MetaGraphDef* meta_graph_def = &saved_model_bundle->meta_graph_def;
*meta_graph_def->mutable_graph_def() = graph_def;

sanjoy removed their request for review on September 20, 2021, 20:30
tfeher force-pushed the trt_cpp_conversion_example branch 2 times, most recently from afaa5af to d876eae, on September 26, 2021, 22:49
tfeher marked this pull request as ready for review on September 26, 2021, 22:51
tfeher (Contributor, Author) commented Sep 26, 2021

@bixia1 ready for review.

(Resolved review comment on tensorflow/core/BUILD)
bixia1 (Contributor) left a comment

I am still working on reviewing trt_convert.cc.

(Resolved inline comments on tensorflow/core/BUILD, tensorflow/compiler/tf2tensorrt/BUILD, tensorflow/compiler/tf2tensorrt/trt_convert.h, tensorflow/compiler/tf2tensorrt/trt_convert_test.cc, and tensorflow/compiler/tf2tensorrt/trt_convert.cc)
bixia1 (Contributor) left a comment

Still working on trt_convert.cc.

(Resolved inline comments on tensorflow/compiler/tf2tensorrt/trt_convert.cc)
Comment on lines 62 to 66
// We could set item_config.feed_nodes and item_config.fetch_nodes to the
// nodes in the signature def. Alternatively, we could also set collection
// 'train_op'. Grappler can use these to determine the inputs and outputs.
// If none of these are set, then it will use the SignatureDef from
// the MetaGraphDef. See grappler_item_builder.cc for details.
bixia1 (Contributor) commented

If we go with the API where users provide input/output names, we should set up ItemConfig fetch_nodes/feed_nodes.
If we go with the API where users provide a SignatureDef, we need to clear the SignatureDefs in the input meta_graph_def and keep only the one we need.

Also, if we need to run Grappler a few times, we probably want to avoid constructing a GrapplerItem each time by somehow reusing the GrapplerItem; see the TF code here.

tfeher (Contributor, Author) commented Oct 4, 2021

  • I have set up feed_nodes and fetch_nodes (a sketch of that setup follows below).
  • If a previous conversion had RunMetaOptimizer(item, ..., out_graph_def), can we just replace the item's graph, item.graph = out_graph_def, before the next RunMetaOptimizer call? For now we have only a single Grappler pass, but it might matter in the follow-up PR where we need to inline the graph before freezing it.
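
A minimal sketch of what that feed/fetch setup could look like, assuming Grappler's ItemConfig and GrapplerItemFromMetaGraphDef from tensorflow/core/grappler/grappler_item_builder.h; the helper name and the node names below are hypothetical placeholders, not the PR's actual code:

#include <memory>
#include <string>

#include "tensorflow/core/grappler/grappler_item_builder.h"
#include "tensorflow/core/protobuf/meta_graph.pb.h"

// Hypothetical helper illustrating the feed/fetch setup discussed above.
std::unique_ptr<tensorflow::grappler::GrapplerItem> BuildItemWithExplicitFeeds(
    const tensorflow::MetaGraphDef& meta_graph_def) {
  tensorflow::grappler::ItemConfig cfg;
  cfg.feed_nodes.insert("input_tensor");    // graph input node (placeholder name)
  cfg.fetch_nodes.insert("output_tensor");  // graph output node (placeholder name)
  // With explicit feed/fetch nodes, Grappler does not need to fall back to the
  // SignatureDefs stored in the MetaGraphDef (see grappler_item_builder.cc).
  return tensorflow::grappler::GrapplerItemFromMetaGraphDef("tf_graph",
                                                            meta_graph_def, cfg);
}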

(Resolved inline comments on tensorflow/compiler/tf2tensorrt/trt_convert.cc)
tensorflowbutler removed the awaiting review (Pull request awaiting review) label on Oct 4, 2021
tfeher (Contributor, Author) left a comment

Thanks @bixia1 for the review, I have addressed most of the issues. Please have a look.

(Resolved inline comments on tensorflow/core/BUILD, tensorflow/compiler/tf2tensorrt/BUILD, tensorflow/compiler/tf2tensorrt/trt_convert.cc, and tensorflow/compiler/tf2tensorrt/trt_convert.h)
tfeher force-pushed the trt_cpp_conversion_example branch 2 times, most recently from 09cd906 to b02e321, on October 7, 2021, 17:30
tfeher (Contributor, Author) commented Oct 7, 2021

@bixia1 I have removed the circular dependency, and added trt_convert_api as a dependency of trt_op_libs. Let me know if there are other issues that we want to address in this PR.

bixia1 (Contributor) commented Oct 7, 2021

Can you fix the PR description, for example by removing information that is now obsolete? The PR description will become part of the commit message.

sherhut removed their request for review on October 8, 2021, 12:23
tfeher (Contributor, Author) left a comment

Thanks @bixia1 for the review, I have addressed the issues.

(Resolved inline comments on tensorflow/compiler/tf2tensorrt/trt_convert_api.cc and tensorflow/compiler/tf2tensorrt/trt_convert_api.h)
PR Queue automation moved this from Assigned Reviewer to Approved by Reviewer on Oct 11, 2021
google-ml-butler bot added the kokoro:force-run (Tests on submitted change) and ready to pull (PR ready for merge process) labels on Oct 11, 2021
kokoro-team removed the kokoro:force-run (Tests on submitted change) label on Oct 11, 2021
copybara-service bot merged commit e8366cb into tensorflow:master on Oct 11, 2021
google-ml-butler bot removed the ready to pull (PR ready for merge process) label on Oct 11, 2021
Labels: cla: yes; comp:gpu:tensorrt (Issues specific to TensorRT); size:XL (CL Change Size: Extra Large)
Projects: PR Queue (Approved by Reviewer)
Linked issues: none
Participants: 5