
Onnxruntime and TensorRT inference time #1284

@callbarian

Description


TensorRT runs slower than CUDA in the C# environment.

Model: resnet34, pretrained from PyTorch.
Input: 126x3x96x64, run for 72 loops.

The first conv1 layer and the last fc layer were modified.
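For reference, a minimal PyTorch export sketch of the setup described above. The report only says conv1 and fc were modified, so the exact changes (conv1 kernel/stride, 10 output classes) and the file/tensor names are assumptions for illustration:

```python
import torch
import torchvision

model = torchvision.models.resnet34(pretrained=True)
# Assumed modifications; the report does not give the actual ones.
model.conv1 = torch.nn.Conv2d(3, 64, kernel_size=3, stride=1, padding=1, bias=False)
model.fc = torch.nn.Linear(model.fc.in_features, 10)  # assumed class count
model.eval()

dummy = torch.randn(126, 3, 96, 64)  # input shape from the description
torch.onnx.export(model, dummy, "resnet34_modified.onnx",
                  input_names=["input"], output_names=["output"],
                  opset_version=11)
```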

Python
On Ubuntu, TensorRT runs faster than onnxruntime-gpu.
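A minimal Python timing sketch for this comparison (the model filename and input name match the export sketch above and are assumptions, as is the exact measurement loop):

```python
import time
import numpy as np
import onnxruntime as ort

x = np.random.rand(126, 3, 96, 64).astype(np.float32)  # shape from the description

for providers in (["TensorrtExecutionProvider", "CUDAExecutionProvider"],
                  ["CUDAExecutionProvider"]):
    sess = ort.InferenceSession("resnet34_modified.onnx", providers=providers)
    sess.run(None, {"input": x})  # warm-up; TensorRT builds its engine on first run
    start = time.perf_counter()
    for _ in range(72):  # 72 loops, as in the description
        sess.run(None, {"input": x})
    print(providers[0], time.perf_counter() - start, "s")
```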

C#
ONNX Runtime was built from source with --use_tensorrt.
TensorRT runs slower than CUDA; it is about 9 times slower.

Environment

TensorRT Version: 7.2.2.3, 7.2.3.4
NVIDIA GPU: RTX 2080 Ti
NVIDIA Driver Version: 461.92
CUDA Version: 11.1, 11.2
CUDNN Version: 8.0, 8.1, 8.2
Operating System: Windows 10 x64
Python Version (if applicable): 3.6
Tensorflow Version (if applicable):
PyTorch Version (if applicable): 1.7.0
Baremetal or Container (if so, version):

The combinations tested were:
TensorRT 7.2.2.3, cuDNN 8.0, CUDA 11.1
TensorRT 7.2.2.3, cuDNN 8.0, CUDA 11.2
TensorRT 7.2.3.4, cuDNN 8.1, CUDA 11.2
TensorRT 7.2.3.4, cuDNN 8.2, CUDA 11.2

all of which showed slow inference times when running with TensorRT.

Relevant Files

[Nsight Systems trace: TensorRT inference]
This is the Nsight Systems trace of TensorRT inference. I am not sure what causes the delays before the full operation at the end.

[Nsight Systems trace: CUDA inference]
This is the Nsight Systems trace of CUDA inference. Unlike TensorRT, there is very little delay before the full operation at the end.
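If the delay in the TensorRT trace is engine construction, enabling the TensorRT execution provider's engine cache may avoid rebuilding on every session. This is a hedged sketch: the environment variable names below are from the ORT 1.7-era TensorRT EP documentation and should be verified against the build in use:

```python
import os

# Assumption: the pre-inference delay in the TensorRT trace is engine
# construction. Variable names as documented for the ORT 1.7-era TensorRT EP;
# later releases renamed some of them, so check your version's docs.
os.environ["ORT_TENSORRT_ENGINE_CACHE_ENABLE"] = "1"
os.environ["ORT_TENSORRT_CACHE_PATH"] = "./trt_cache"  # hypothetical cache dir

import onnxruntime as ort  # import after setting the variables

sess = ort.InferenceSession("resnet34_modified.onnx",
                            providers=["TensorrtExecutionProvider",
                                       "CUDAExecutionProvider"])
```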

Steps To Reproduce
