
[MultiModal] Remove MultiModalOnnxPredictor and rename export_tensorrt #2983

Merged
merged 9 commits into autogluon:master on Mar 10, 2023

Conversation

liangfu (Collaborator) commented Feb 28, 2023

Issue #, if available:

Description of changes:

  1. Remove MultiModalOnnxPredictor.
  2. Rename export_tensorrt -> optimize_for_inference.
  3. Enable ORT_TENSORRT_ENGINE_CACHE_ENABLE to save engine build time, since TensorRT can take a long time to optimize and build an engine (see the sketch after this list).
  4. Remove the data dependency for optimize_for_inference.
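
For item 3, a minimal sketch of how the TensorRT engine cache can be enabled through onnxruntime's environment variables; the model path and cache directory below are placeholders, and the exact wiring inside optimize_for_inference is not shown:

import os

import onnxruntime as ort

# Enable onnxruntime's TensorRT engine cache so a previously built engine is
# reused on later runs instead of being rebuilt from scratch.
os.environ["ORT_TENSORRT_ENGINE_CACHE_ENABLE"] = "1"
os.environ["ORT_TENSORRT_CACHE_PATH"] = "/tmp/trt_cache"  # placeholder cache directory

# Placeholder ONNX model path; the real model is produced by the exporter.
sess = ort.InferenceSession(
    "model.onnx",
    providers=["TensorrtExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"],
)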

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@liangfu liangfu marked this pull request as ready for review March 4, 2023 03:01
@liangfu liangfu added the model list checked You have updated the model list after modifying multimodal unit tests/docs label Mar 4, 2023
@liangfu liangfu changed the title [MultiModal] Remove MultiModalOnnxPredictor and unify export_onnx and export_tensorrt [MultiModal] Remove MultiModalOnnxPredictor and rename export_tensorrt Mar 4, 2023
github-actions bot commented Mar 6, 2023

Job PR-2983-9c30959 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2983/9c30959/index.html

github-actions bot commented Mar 8, 2023

Job PR-2983-9865cad is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2983/9865cad/index.html

github-actions bot commented Mar 9, 2023

Job PR-2983-60187f7 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2983/60187f7/index.html

"tensorrt package is not installed. The package can be install via `pip install tensorrt`."
)

self.sess = ort.InferenceSession(onnx_model.SerializeToString(), providers=providers)
Contributor commented:
Should the provider argument of OnnxModule be easy-to-remember strings, e.g., tensorrt_gpu, onnx_gpu, tensorrt_cpu, onnx_cpu? Then we could map these strings to the more complex provider names used by ort.InferenceSession.

liangfu (Collaborator, Author) commented Mar 10, 2023:
  1. There wouldn't be a tensorrt_cpu, since TensorRT only targets NVIDIA GPUs.
  2. Regarding onnx_gpu for OnnxModule: AMD also provides a ROCm Execution Provider, which would make an onnx_gpu argument for OnnxModule confusing.

liangfu (Collaborator, Author) commented:
I would say there is a trade-off between easy-to-remember constants and transparency with respect to the onnxruntime configuration.
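
For illustration only, a minimal sketch of the alias mapping the reviewer suggests; the names tensorrt_gpu, onnx_gpu, and onnx_cpu are hypothetical and were not adopted in this PR:

# Hypothetical short names mapped to explicit onnxruntime provider lists,
# each keeping CPUExecutionProvider as the final fallback.
_PROVIDER_ALIASES = {
    "tensorrt_gpu": ["TensorrtExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"],
    "onnx_gpu": ["CUDAExecutionProvider", "CPUExecutionProvider"],
    "onnx_cpu": ["CPUExecutionProvider"],
}


def resolve_providers(providers):
    """Translate a short alias into the provider list expected by ort.InferenceSession."""
    if isinstance(providers, str):
        return _PROVIDER_ALIASES[providers]
    return providers  # already an explicit provider list; pass through unchanged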

tail_df = dataset.test_df.tail(2)

# Load a fresh predictor and optimize it for inference
for providers in [None, ["TensorrtExecutionProvider"], ["CUDAExecutionProvider"], ["CPUExecutionProvider"]]:
Contributor commented:
Is it better to use "TensorrtExecutionProvider" instead of ["TensorrtExecutionProvider"]?

liangfu (Collaborator, Author) commented:
Good question.

The providers argument provides a fallback mechanism, so we can always put the preferred backend at the top of the list. If it is the only item in the list, the fallback mechanism won't take effect at all. For instance, if providers=["TensorrtExecutionProvider"] and the tensorrt package isn't installed on the system, we would raise an error.
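
A minimal sketch of the fallback behavior described above; onnx_path is a placeholder:

import onnxruntime as ort

onnx_path = "model.onnx"  # placeholder path to an exported model

# The preferred provider goes first; onnxruntime falls back to the next entry
# if a provider is not available on the current machine.
providers = ["TensorrtExecutionProvider", "CUDAExecutionProvider", "CPUExecutionProvider"]
sess = ort.InferenceSession(onnx_path, providers=providers)
print(sess.get_providers())  # the providers that were actually registered

# With a single-item list there is nothing to fall back to, so a missing
# TensorRT installation has to be surfaced as an explicit error instead.
if "TensorrtExecutionProvider" not in ort.get_available_providers():
    raise RuntimeError("TensorrtExecutionProvider is not available; install the tensorrt package.")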

"onnxruntime would fallback to CUDAExecutionProvider instead of using TensorrtExecutionProvider."
)
# TODO: Try a better workaround to lazy import tensorrt package.
tensorrt_imported = False
Contributor commented:
Seems removable?

liangfu (Collaborator, Author) commented:
Yes, this might be related to the import order with the onnxruntime package. But it's hard to ensure that onnxruntime hasn't already been imported beforehand.
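
A minimal sketch of the lazy-import workaround being discussed; whether this reliably controls the import order relative to onnxruntime is exactly the open question above:

tensorrt_imported = False


def _lazy_import_tensorrt():
    """Import tensorrt only when the TensorRT execution provider is actually requested."""
    global tensorrt_imported
    if tensorrt_imported:
        return
    try:
        # Importing tensorrt makes its shared libraries visible to onnxruntime.
        import tensorrt  # noqa: F401
    except ImportError:
        raise ImportError(
            "tensorrt package is not installed. The package can be installed via `pip install tensorrt`."
        )
    tensorrt_imported = True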

github-actions bot commented:

Job PR-2983-b3f7a19 is done.
Docs are uploaded to http://autogluon-staging.s3-website-us-west-2.amazonaws.com/PR-2983/b3f7a19/index.html

zhiqiangdon (Contributor) left a comment:

LGTM

@liangfu liangfu merged commit 0c33a4c into autogluon:master Mar 10, 2023
@liangfu liangfu deleted the remove-onnx-predictor-1 branch March 10, 2023 18:47
gradientsky pushed a commit to gradientsky/autogluon that referenced this pull request Mar 10, 2023