feat: introduce type propagation infrastructure #2469
Conversation
This PR introduces type propagation infrastructure through PyTorch inference. Inferred output types are used to set the TensorRT engine's output types.
Code conforms to C++ style guidelines
Code conforms to Python style guidelines
@@ -340,6 +340,19 @@ void getSegmentsOutputByRunning(

seg_block.register_inshapes(input_shapes, shape_mode);
seg_block.register_intypes(input_types);

// get output type for each segmented block so this can be used in conversion process
std::vector<at::ScalarType> output_types;
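For context, one plausible way to populate this vector right after the declaration (a hedged sketch, not the exact PR diff; ivalues_maps and raw_outputs follow the surrounding partitioning code, and register_outtypes is an assumed helper analogous to register_intypes):

// Sketch: after the segment has been run eagerly in PyTorch, each output
// IValue carries a concrete dtype that conversion can reuse later.
for (auto& out : seg_block.raw_outputs()) {
  if (ivalues_maps[out].isTensor()) {
    output_types.push_back(ivalues_maps[out].toTensor().scalar_type());
  }
}
seg_block.register_outtypes(output_types); // assumed registration helper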
One thing to check: because we're running this at the end of shape propagation, does this work for fully convertible modules?
From memory, shape prop is only called from BuildHybridGraph->partition(), which I don't think is called if all ops are convertible.
@@ -265,6 +269,7 @@ void MarkOutputs(ConversionCtx* ctx, at::ArrayRef<const torch::jit::Value*> outp
}

if (!setOutput) {
  out_tensor->setType(util::ScalarTypeToTRTDataType(out_types[out_idx++]));
The out_idx needs to be incremented even if one of the outputs is an input (see where setOutput may be set above)
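A minimal sketch of the suggested fix, assuming the variables from the hunk above (not the final PR code):

// Consume one inferred type per graph output so the index stays aligned,
// even when the output aliases an input and setOutput is already true.
auto out_type = out_types[out_idx++];
if (!setOutput) {
  out_tensor->setType(util::ScalarTypeToTRTDataType(out_type));
}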
Overall, this looks good; it just needs a few small fixes and a test case for verification. Additionally, it seems some of the TorchScript test cases are failing due to this change:
ERROR conda.cli.main_run:execute(41): `conda run python -m pytest --junitxml=/tmp/test_results/ts_api_test_results.xml api/` failed. (See above for error)
Fatal Python error: Segmentation fault
Current thread 0x00007f4711145740 (most recent call first):
File "/opt/python/cp38-cp38/lib/python3.8/site-packages/torch_tensorrt/ts/_compiler.py", line 266 in convert_method_to_trt_engine
File "/__w/TensorRT/TensorRT/pytorch/tensorrt/tests/py/ts/api/test_classes.py", line 300 in test_detect_invalid_input_binding
This may be due to the issue raised in the comment above: #2469 (comment)
Description
This PR introduces type propagation infrastructure through PyTorch inference.
Inferred output types are used to set the TensorRT engine's output types.
Related PR: #1853
The previous solution has several limitations.
Checklist: