Piotrm/resnet50 pyt triton perf fix #953

Merged
nv-kkudrynski merged 2 commits into NVIDIA:master from piotrm-nvidia:piotrm/resnet50_pyt_triton_perf_fix on Jun 16, 2021

@piotrm-nvidia
Contributor

These two changes improve the performance of:

  • ONNX Runtime with the TensorRT execution provider.
  • The TensorRT runtime.

The results in the Triton folder for ResNet50 PyTorch are still valid. The changes make it easier to reproduce those results by following the Quick Start Guide.

The scripts were modified to set the previously missing ORT_TENSORRT_FP16_ENABLE flag for
Triton Inference Server running ONNX Runtime with the TensorRT execution provider.
The ONNX-to-TensorRT converter was fixed to force FP16 precision for
TensorRT networks.
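
For readers who want to see what enabling FP16 for the ONNX Runtime TensorRT execution provider can look like, here is a minimal sketch. It is not the code from this PR: the helper names `ort_trt_fp16_env` and `ort_trt_providers` are hypothetical, and the way the repository's scripts actually pass the flag is an assumption. The sketch only relies on the `ORT_TENSORRT_FP16_ENABLE` environment variable named above and on the `trt_fp16_enable` provider option that ONNX Runtime accepts for `TensorrtExecutionProvider`.

```python
import os


def ort_trt_fp16_env(base_env=None):
    """Return a copy of an environment mapping with FP16 enabled for the
    ONNX Runtime TensorRT execution provider.

    Hypothetical helper for illustration; the PR's scripts may set the
    variable differently (for example in a shell script or container config).
    """
    # Copy so that the caller's mapping (or os.environ) is not mutated.
    env = dict(os.environ if base_env is None else base_env)
    env["ORT_TENSORRT_FP16_ENABLE"] = "1"
    return env


def ort_trt_providers():
    """Return a providers list suitable for
    onnxruntime.InferenceSession(model_path, providers=...).

    This is the in-process alternative to the environment variable:
    the same FP16 switch expressed as a provider option.
    """
    return [
        ("TensorrtExecutionProvider", {"trt_fp16_enable": True}),
        "CUDAExecutionProvider",  # fallback for nodes TensorRT cannot handle
    ]
```

The environment mapping would typically be handed to whatever launches the server process, for example `subprocess.Popen(cmd, env=ort_trt_fp16_env())`, where `cmd` is the server command line used in your setup. Without the flag, the TensorRT execution provider builds FP32 engines by default, which is consistent with the lower performance this PR addresses.
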
@nv-kkudrynski merged commit 5c33a82 into NVIDIA:master on Jun 16, 2021