Accelerating-Inference-in-Tensorflow-using-TensorRT

What is TensorRT?

TensorRT is an optimization tool provided by NVIDIA that applies graph optimization and layer fusion, and finds the fastest implementation of a deep learning model. In other words, TensorRT will optimize our deep learning model so that we expect a faster inference time than the original model (before optimization), such as 5x faster or 2x faster. The bigger model we have, the bigger space for TensorRT to optimize the model. Furthermore, this TensorRT supports all NVIDIA GPU devices, such as 1080Ti, Titan XP for Desktop, and Jetson TX1, TX2 for embedded device.

Library used

Pre-requrement: Install TensorRT by following this tutorial here for Ubuntu dekstop or here for Jetson devices

Tensorflow 1.12
OpenCV 3.4.5.20
Pillow 5.2.0
Numpy 1.15.2
Matplotlib 3.0.0

Visualize the original and optimized graphs

One of the easiest way to do that is using netron here: https://lutzroeder.github.io/netron/

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
model		model
README.md		README.md
TF-TRT FP16.ipynb		TF-TRT FP16.ipynb
TF-TRT FP32.ipynb		TF-TRT FP32.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Accelerating-Inference-in-Tensorflow-using-TensorRT

What is TensorRT?

Library used

Visualize the original and optimized graphs

About

Releases

Packages

Languages

SarthakGarg19/Accelerating-Inference-in-Tensorflow-using-TensorRT

Folders and files

Latest commit

History

Repository files navigation

Accelerating-Inference-in-Tensorflow-using-TensorRT

What is TensorRT?

Library used

Visualize the original and optimized graphs

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages