Supporting 1D deconvolutions in TensorRT #1587

alecgunny · 2021-11-04T00:38:09Z

Description

Deconvolutions (or transposed convolutions depending on who you're asking) aren't supported in more recent versions of TensorRT (which I understand that the operator support matrix explicitly mentions). However, this operation was supported as of TRT 7.2.1, and it became a critical part of a low-latency pipeline my team works on, which is built on top of the 20.11 NGC container release (which runs 7.2.1.6).

Unfortunately, we need to begin moving past Python 3.6, which limits our ability to continue to leverage TensorRT. For reference, in the repo linked to below I've included a repro using the 21.10 container. Are there any plans to extend support to this operator in upcoming releases, and if so is there any rough timeline?

Environment

TensorRT Version: 8.0.3.4
NVIDIA GPU: V100 16GB
NVIDIA Driver Version: 465.19.01
CUDA Version: 11.4
CUDNN Version:
Operating System: Ubuntu
Python Version (if applicable): 3.8
Tensorflow Version (if applicable):
PyTorch Version (if applicable): 1.10
Baremetal or Container (if so, version): Container, 21.10-py3

Relevant Files

https://github.com/alecgunny/trt-env-repro

Steps To Reproduce

Full steps can be found in the README of the attached repo, with logs for both functioning and non-functioning cases included. The main issue seems to be that after the first deconvolution layer, TensorRT throws the error

[network.cpp::setWeightsName::3013] Error Code 1: Internal Error (The given weights is not used in the network!)

which stops the rest of the network build, cutting subsequent layers off.

The text was updated successfully, but these errors were encountered:

pranavm-nvidia · 2021-11-04T14:25:03Z

This should be fixed in TRT 8.2. Could you try out the 8.2 EA release?

alecgunny · 2021-11-05T16:50:44Z

Yes that's great to know, I'm working on building an environment for it now starting from the CUDA 11.4.2.-devel-ubuntu20.04 container. I have things just about working and will update when I am able to run, thanks very much.

Will this be the version released in the 21.11 NGC container? We also use Triton to serve our models at inference time, do you have any insights as to whether these versions will be coordinated in the next release for both containers?

pranavm-nvidia · 2021-11-05T18:27:51Z

@rajeevsrao Do you know which container(s) 8.2 will be part of?

rajeevsrao · 2021-11-05T18:30:08Z

@rajeevsrao Do you know which container(s) 8.2 will be part of?

21.12

alecgunny · 2021-11-09T00:40:38Z

@rajeevsrao got it, thank you

@pranavm-nvidia I tested using 8.2 and the logs indicate that things are working properly. The network is able to build and the inferred shapes match up correctly. I haven't been able to test the outputs to compare for accuracy since I'm having trouble building PyCuda in the container, but its good to know that it seems likely we'll be able to start using TensorRT in our pipeline again come December.

Thanks for your help, closing this issue as resolved.

wilbur-caper · 2021-12-02T03:20:26Z

@rajeevsrao Do you know which container(s) 8.2 will be part of?

21.12

@rajeevsrao hi the 21.12 mean nvcr.io/nvidia/tensorrt:21.12-py3 ? I can't find it

rajeevsrao · 2021-12-02T03:41:49Z

@rajeevsrao Do you know which container(s) 8.2 will be part of?

21.12

@rajeevsrao hi the 21.12 mean nvcr.io/nvidia/tensorrt:21.12-py3 ? I can't find it

@hererookie the monthly containers are usually shipped towards the end of the month. 21.12 will be available sometime near Dec 25th.Since TensorRT 8.2 GA was ready after the 21.11 container was finalized and validated it didn't make it to 21.11.

alecgunny · 2021-12-02T16:39:11Z

I don't know how far outside of your purview this is, but I know there's generally an attempt to align the software versions between concurrent container releases on NGC. Do you have any insight into whether this will be the version of TensorRT that gets released with the 21.12 Triton container?

wilbur-caper · 2021-12-03T02:17:14Z

This should be fixed in TRT 8.2. Could you try out the 8.2 EA release?

@pranavm-nvidia hi ,I meet the same error #1654 ,TensorRT 8.0 still not support 2D deconvolutions ? but on the tensorrt8 operator support matrix explicitly mentions https://github.com/onnx/onnx-tensorrt/blob/8.0-EA/docs/operators.md that support convtranspose 2D and 3D;

alecgunny closed this as completed Nov 9, 2021

zerollzeng mentioned this issue Dec 2, 2021

convert onnx to trt failed(invalid node convtranspose) #1654

Closed

alecgunny mentioned this issue Feb 11, 2022

Newer TensorRT and Triton containers ML4GW/DeepClean#10

Closed

alecgunny mentioned this issue Feb 26, 2022

Use updated Triton stateful backend behavior for snapshotter fastmachinelearning/gw-iaas#35

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Supporting 1D deconvolutions in TensorRT #1587

Supporting 1D deconvolutions in TensorRT #1587

alecgunny commented Nov 4, 2021 •

edited

Loading

pranavm-nvidia commented Nov 4, 2021

alecgunny commented Nov 5, 2021

pranavm-nvidia commented Nov 5, 2021

rajeevsrao commented Nov 5, 2021

alecgunny commented Nov 9, 2021

wilbur-caper commented Dec 2, 2021

rajeevsrao commented Dec 2, 2021

alecgunny commented Dec 2, 2021

wilbur-caper commented Dec 3, 2021

Supporting 1D deconvolutions in TensorRT #1587

Supporting 1D deconvolutions in TensorRT #1587

Comments

alecgunny commented Nov 4, 2021 • edited Loading

Description

Environment

Relevant Files

Steps To Reproduce

pranavm-nvidia commented Nov 4, 2021

alecgunny commented Nov 5, 2021

pranavm-nvidia commented Nov 5, 2021

rajeevsrao commented Nov 5, 2021

alecgunny commented Nov 9, 2021

wilbur-caper commented Dec 2, 2021

rajeevsrao commented Dec 2, 2021

alecgunny commented Dec 2, 2021

wilbur-caper commented Dec 3, 2021

alecgunny commented Nov 4, 2021 •

edited

Loading