TensorRT speedup for EfficientDet models. The repo is based on https://github.com/zylo117/Yet-Another-EfficientDet-Pytorch.
Speeding up the EfficientDet models with TensorRT involves five main steps:

1. Modify the original code to support TensorRT speedup (refer to link and link).
2. Convert the PyTorch model to an ONNX file:
   ```
   python torch2onnx.py
   ```
3. Visualize the ONNX file with netron:
   ```
   netron efficientdet-d0.onnx
   ```
4. Convert the ONNX file to a TensorRT engine:
   ```
   bash onnx2trt.sh
   ```
5. Run inference with the TensorRT engine:
   ```
   python trt.py
   ```
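The PyTorch-to-ONNX step (step 2) can be sketched with `torch.onnx.export`. The tiny stand-in module and the file name `model.onnx` below are placeholders, since loading the real EfficientDet-D0 is repo-specific; `torch2onnx.py` does the equivalent with the actual model.

```python
# Minimal sketch of ONNX export, assuming a tiny stand-in module
# instead of the real EfficientDet-D0 network.
import torch
import torch.nn as nn

# Placeholder model; the repo's torch2onnx.py loads EfficientDet-D0 here.
model = nn.Sequential(nn.Conv2d(3, 8, 3, padding=1), nn.ReLU()).eval()

# Dummy input matching the 512x512 input size used in this repo.
dummy = torch.randn(1, 3, 512, 512)

torch.onnx.export(
    model,
    dummy,
    "model.onnx",               # placeholder output path
    input_names=["input"],
    output_names=["output"],
    opset_version=11,           # an opset version TensorRT parsers commonly support
)
```

Fixed input shapes (as above) keep the exported graph simple for TensorRT; dynamic axes can be declared via the `dynamic_axes` argument if variable batch sizes are needed.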
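The contents of `onnx2trt.sh` (step 4) are not shown here; a typical conversion uses TensorRT's `trtexec` tool. The file names below are assumed from the steps above, and the repo's script may use different flags.

```shell
# Hypothetical trtexec invocation; check onnx2trt.sh for the exact flags used.
trtexec --onnx=efficientdet-d0.onnx \
        --saveEngine=efficientdet-d0.trt \
        --fp16   # build a half-precision engine; drop this flag for FP32
```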
On a single RTX 3090 GPU:
| Model | Input size | Inference latency (before) | Inference latency (after) |
|---|---|---|---|
| D0 | 128 | 35 ms | 9 ms |
| D0 | 512 | 39 ms | 25 ms |
Note: The current codebase implements EfficientDet-D0 with input size 512 for TensorRT speedup. Other models or input sizes can be supported by modifying the related code. Feel free to ask questions.