[backends] Add functionality to TRT backend #1753

gs-olive · 2023-07-07T16:40:57Z

Add argument parsing for backend arguments to be passed to Torch-TRT
Add capability to specify IR and other Torch-TRT fields via command line interface
Add functionality to compilation path and clean up code

xuzhao9

LGTM. Just curious, would you be interested in setting up a CI to maintain this?

gs-olive · 2023-07-12T18:44:54Z

Thank you for the review. I think we would be interested in setting up a CI to maintain it. @narendasan - what are your thoughts on this?

gs-olive · 2023-07-12T19:13:42Z

@xuzhao9 - what would you need from us to set up a CI to maintain the TRT backend?

xuzhao9 · 2023-07-12T22:26:36Z

@gs-olive Here is the guide on how to develop a CI: https://github.com/pytorch/benchmark/blob/main/userbenchmark/ADDING_USERBENCHMARKS.md

gs-olive · 2023-07-14T20:37:26Z

Hi @xuzhao9 - thanks for the reference. I've added functionality to this PR in a new commit which adds the userbenchmark folder and adapts functionality from the main run.py. I also added a ci.yaml file. I wasn't sure how the benchmarks are invoked in CI, however. My intent was to have this sort of usage:

# Invokes a single model
python run_benchmark.py torch_trt --model resnet18 --precision fp32 --bs 4 --ir ts

# Invokes all models in the directory
python run_benchmark.py torch_trt --precision fp32 --bs 4
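For readers following along, here is a minimal sketch of how flags like the ones above could be parsed. It is not the parser from this PR: the defaults, the help strings, and the pass-through of unrecognized flags to the backend are all assumptions made for illustration.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags shown in the usage lines above."""
    parser = argparse.ArgumentParser(prog="torch_trt")
    # Omitting --model is taken to mean "run every model" (assumed behavior).
    parser.add_argument("--model", default=None, help="Name of a single model to run")
    parser.add_argument("--precision", default="fp32", help="Precision, e.g. fp32")
    parser.add_argument("--bs", type=int, default=None, help="Batch size")
    parser.add_argument("--ir", default=None, help="Torch-TRT IR to use, e.g. ts")
    return parser


def parse(argv):
    """Split argv into known benchmark options and leftover backend flags."""
    # parse_known_args returns (namespace, list_of_unrecognized_strings),
    # so extra backend flags can be forwarded instead of raising an error.
    args, backend_args = build_parser().parse_known_args(argv)
    return args, backend_args
```

Calling `parse(["--model", "resnet18", "--precision", "fp32", "--bs", "4", "--ir", "ts"])` would give a namespace with `model="resnet18"` and `bs=4`, while leaving out `--model` yields `model=None`, which a caller could treat as the "all models" case.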

xuzhao9 · 2023-07-21T14:14:51Z

torchbenchmark/util/backends/trt.py

+
+        trt_input = [
+            torch_tensorrt.Input(shape=input_.shape, dtype=input_.dtype)
+            for input_ in example_inputs


Is it necessary to add a check that example_inputs is a List[Tensor]? Many models take inputs of other types.

One suggestion is to use pytree to traverse the input and cast them to torch_tensorrt.Input. Similar to this:

benchmark/torchbenchmark/util/env_check.py

Line 186 in 9d84f9e

from torch.utils._pytree import tree_map
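The pytree idea can be sketched without torch installed. In the sketch below, `FakeTensor` and `InputSpec` are hypothetical stand-ins for `torch.Tensor` and `torch_tensorrt.Input`, and `tree_map_leaves` is a simplified, hand-written analog of `tree_map` that only handles dicts, lists, and plain tuples. It illustrates the traversal only and is not code from this PR.

```python
from dataclasses import dataclass
from typing import Any, Callable, Tuple


@dataclass(frozen=True)
class FakeTensor:
    """Hypothetical stand-in for torch.Tensor (only shape and dtype)."""
    shape: Tuple[int, ...]
    dtype: str


@dataclass(frozen=True)
class InputSpec:
    """Hypothetical stand-in for torch_tensorrt.Input."""
    shape: Tuple[int, ...]
    dtype: str


def tree_map_leaves(fn: Callable[[Any], Any], tree: Any) -> Any:
    """Apply fn to every leaf of a nested dict/list/tuple, keeping the structure."""
    if isinstance(tree, dict):
        return {key: tree_map_leaves(fn, value) for key, value in tree.items()}
    if isinstance(tree, (list, tuple)):
        # Rebuilds plain lists and tuples; namedtuples are not handled here.
        return type(tree)(tree_map_leaves(fn, value) for value in tree)
    return fn(tree)


def to_input_spec(leaf: Any) -> Any:
    """Convert tensor-like leaves to InputSpec; leave every other leaf untouched."""
    if isinstance(leaf, FakeTensor):
        return InputSpec(shape=tuple(leaf.shape), dtype=leaf.dtype)
    return leaf
```

With real tensors, the same shape would presumably be `tree_map(to_input, example_inputs)`, where `to_input` wraps each `torch.Tensor` leaf in a `torch_tensorrt.Input` and returns non-tensor leaves unchanged.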

Since the different ir choices for Torch-TRT process inputs differently, but all can handle Torch Tensor inputs, I passed the example inputs directly to the compiler instead, so the selected ir can handle the inputs/casting as necessary.

userbenchmark/torch_trt/__init__.py

xuzhao9 · 2023-07-21T14:20:38Z

userbenchmark/torch_trt/run.py

+        all_metrics = {}
+
+        for Model in list_models():
+            metrics = run_single_model(


This code will work for running a single model. However, it won't work to run a batch of models.

This is because there is no isolation between model runs. For example, model 1 might set some global torch configuration (such as torch.backends.cudnn.benchmark) that causes model 2 to run very slowly or even crash. Some models also have a benign "memory leak" that causes no problems in training but does cause problems when benchmarking multiple models in the same process.

We suggest using the ModelTask() approach used by the torch-nightly userbenchmark: https://github.com/pytorch/benchmark/blob/main/userbenchmark/torch-nightly/run.py#L163
It will run each model in an isolated process, and doesn't have the limits mentioned above.
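To make the isolation point concrete, here is a minimal sketch that runs each model in a fresh Python interpreter via `subprocess`. It does not use `ModelTask` or reproduce its API. The worker body, the `FAKE_GLOBAL_FLAG` variable, and the reported fields are hypothetical placeholders for real model loading and timing.

```python
import json
import subprocess
import sys

# Hypothetical worker: a real one would load, compile, and time the model.
# Here it only mutates process-global state and reports whether it saw
# state left behind by a previously run model.
WORKER_SOURCE = """
import json, os, sys
model_name = sys.argv[1]
leaked = os.environ.get("FAKE_GLOBAL_FLAG") is not None
os.environ["FAKE_GLOBAL_FLAG"] = model_name  # stands in for a global torch setting
print(json.dumps({"model": model_name, "saw_leaked_state": leaked}))
"""


def run_isolated(model_name, timeout=60):
    """Run one model's worker in its own process and parse its JSON result."""
    proc = subprocess.run(
        [sys.executable, "-c", WORKER_SOURCE, model_name],
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    if proc.returncode != 0:
        # A crash in one model is recorded instead of aborting the whole sweep.
        return {"model": model_name, "error": proc.stderr.strip() or "nonzero exit"}
    return json.loads(proc.stdout)


def run_all(model_names):
    """Collect per-model results, each produced by a separate process."""
    return {name: run_isolated(name) for name in model_names}
```

Because every worker starts from a clean interpreter, global settings changed by one model and memory it fails to release are gone when that process exits, so they cannot affect the next model.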

xuzhao9 · 2023-07-21T14:55:34Z

I suggest we first remove the ci.yaml file and land this PR, then submit a follow-up PR that fixes the userbenchmark as I commented and adds the CI.

gs-olive · 2023-07-21T17:19:00Z

@xuzhao9 - Thank you for the detailed comments and review. This sounds like a good idea to me; I have removed the ci.yaml file as requested, and I will follow up with a subsequent PR addressing the remaining comments.

xuzhao9

LGTM, please address my inline comment about checking the input type of example_inputs, thanks!

gs-olive · 2023-07-21T20:19:58Z

I have addressed the comment regarding example inputs here: #1753 (comment), with this commit: 8ccf8e6

xuzhao9

LGTM, thanks!

facebook-github-bot · 2023-07-21T20:27:56Z

@xuzhao9 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2023-07-22T00:42:44Z

@xuzhao9 merged this pull request in 30a3c4a.

facebook-github-bot added the cla signed label Jul 7, 2023

gs-olive mentioned this pull request Jul 7, 2023

TorchBench Integration Part 1 pytorch/TensorRT#2076

Closed

xuzhao9 self-requested a review July 7, 2023 20:06

gs-olive force-pushed the trt_backend_improvements branch 3 times, most recently from 23724b7 to 0465851 Compare July 12, 2023 16:59

xuzhao9 approved these changes Jul 12, 2023

[backends] Add functionality to TRT backend

695842c

- Add argument parsing for backend arguments to pass to TRT
- Add capability to specify IR via the command line
- Add functionality to compilation path and clean up code

gs-olive force-pushed the trt_backend_improvements branch from 0465851 to 695842c Compare July 12, 2023 22:24

gs-olive mentioned this pull request Jul 12, 2023

TorchBench Integration Part 3 pytorch/TensorRT#2093

Closed

Add userbenchmark for Torch-TRT

20f2851

- Add functionality to perform benchmarking using the Torch-TRT backend, along with output metrics

gs-olive requested a review from xuzhao9 July 14, 2023 20:37

xuzhao9 reviewed Jul 21, 2023

Remove ci.yaml and minor fix to init

493230e

xuzhao9 approved these changes Jul 21, 2023

fix: Pass example inputs directly to Torch-TRT

8ccf8e6

xuzhao9 approved these changes Jul 21, 2023

xuzhao9 requested a review from frank-wei July 21, 2023 20:27

facebook-github-bot closed this in 30a3c4a Jul 22, 2023

facebook-github-bot added the Merged label Jul 22, 2023
