[multimodal] add tensorrt tutorial #2987
Conversation
trt_predictor = MultiModalPredictor.load(path=model_path)
trt_predictor.optimize_for_inference()

# Agagin, use first prediction for initialization (e.g., allocating memory)
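The warm-up pattern discussed in this thread can be sketched with a minimal, self-contained example. `DummyPredictor` below is a stand-in for `MultiModalPredictor`, not the real API: its first call simulates the one-time setup cost (memory allocation for PyTorch, engine compilation for ONNX Runtime/TensorRT), which is why the tutorial excludes the first prediction from timing.

```python
import time


class DummyPredictor:
    """Stand-in for MultiModalPredictor: the first predict() call
    simulates one-time setup (memory allocation / engine compilation)."""

    def __init__(self):
        self._warmed_up = False

    def predict(self, batch):
        if not self._warmed_up:
            time.sleep(0.05)  # simulate the one-time setup cost
            self._warmed_up = True
        return [0] * len(batch)


predictor = DummyPredictor()

# First prediction triggers initialization, so run it once before timing.
predictor.predict([1, 2, 3])

start = time.perf_counter()
predictor.predict([1, 2, 3])
elapsed = time.perf_counter() - start
print(f"steady-state latency: {elapsed:.4f}s")
```

Without the warm-up call, the first timed prediction would include setup cost and overstate the steady-state latency.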
Typo agagin?
Maybe give more explanation of why the first prediction is used for initialization. Otherwise, users may wonder why this is necessary.
Typo agagin?
Nice catch. Fixed typo.
Maybe give more explanation of why the first prediction is used for initialization. Otherwise, users may wonder why this is necessary.
This is indicated with an example: allocating memory.
The first call to the model's forward takes more time than the following calls. I'm not sure we can explain this observation as model initialization, since PyTorch uses a dynamic graph and eager execution.
What if the batch size changes between predictions?
In my observation, it automatically re-compiles when the batch_size dimension is larger than the one used at initialization.
Is there any PyTorch documentation regarding this phenomenon?
The re-compile only happens with the TensorRT backend in ONNX Runtime; it is not directly related to PyTorch.
This behavior is not well documented; it may be related to Shape Inference for TensorRT Subgraphs.
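The re-compile behavior described above can be illustrated with a toy model. `FakeTrtEngine` is a hypothetical class that only mimics the observed pattern (it is not the ONNX Runtime or TensorRT API): the engine covers the largest batch size seen so far, and a larger batch forces a rebuild, while smaller batches reuse the existing engine.

```python
class FakeTrtEngine:
    """Toy model of the observed behavior: the engine is built for the
    largest batch size seen so far; a larger batch forces a rebuild."""

    def __init__(self):
        self.max_batch = 0
        self.rebuilds = 0

    def run(self, batch_size):
        if batch_size > self.max_batch:
            self.rebuilds += 1          # re-compile for the new shape
            self.max_batch = batch_size
        return batch_size


engine = FakeTrtEngine()
engine.run(4)   # initial build
engine.run(2)   # smaller batch: reuses the engine
engine.run(8)   # larger batch: triggers a rebuild
print(engine.rebuilds)  # → 2
```

This is why initializing with the largest expected batch size avoids re-compilation during later predictions.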
The above PyTorch module also uses the first prediction as initialization. That's why we use "Again" here.
So, this behavior exists for both PyTorch and ONNX Runtime/TensorRT?
For PyTorch, the first prediction is used for memory allocation; for ONNX Runtime/TensorRT, it is used for 1) a fair comparison and 2) model compilation. (Model compilation actually happens when calling optimize_for_inference().)
LGTM!
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.