-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Description
I need to build Tensorrt 8.0 OSS with CUDA 11.3, and the build files were generated without errors. However on running inference using engine file, I see an error /Tensort8.0/TensorRT/plugin/instanceNormalizationPlugin/instanceNormalizationPlugin.cu (286) - Cuda Error in enqueue: 906 (operation would make the legacy stream depend on a capturing blocking stream)
terminate called after throwing an instance of 'nvinfer1::plugin::CudaError'
what(): std::exception
Aborted (core dumped)
Environment
TensorRT Version: 8.0
NVIDIA GPU: QUADRO RTX 5000
NVIDIA Driver Version: 465.19.01
CUDA Version: 11.3
CUDNN Version: 8.2
Operating System: Ubuntu
Python Version (if applicable): 3.8
Tensorflow Version (if applicable):
PyTorch Version (if applicable): 1.9
Baremetal or Container (if so, version): Baremetal
Command:
./trtexec --loadEngine=/Tensort8.0/TensorRT/build/out/model.engine --useCudaGraph --iterations=1 --avgRuns=1 --plugins=/Tensort8.0/TensorRT/build/out/libnvinfer_plugin.so --verbose
Error:
[05/05/2022-13:02:46] [I] Created input binding for input1 with dimensions 1x3x376x1224
[05/05/2022-13:02:46] [I] Created input binding for input2 with dimensions 1x3x376x1224
[05/05/2022-13:02:46] [I] Created output binding for _up with dimensions 1x2x376x1224
[05/05/2022-13:02:46] [I] Created output binding for _low with dimensions 1x2x47x153
[05/05/2022-13:02:46] [I] Starting inference
[05/05/2022-13:02:46] [E] [TRT] /Tensort8.0/TensorRT/plugin/instanceNormalizationPlugin/instanceNormalizationPlugin.cu (286) - Cuda Error in enqueue: 906 (operation would make the legacy stream depend on a capturing blocking stream)
terminate called after throwing an instance of 'nvinfer1::plugin::CudaError'
what(): std::exception
Aborted (core dumped)
How can I resolve this? Is there a cuda version mismatch? However as per the github page The recommended versions are cuda-11.3.1 + cuDNN-8.2. Kindly help debug this?