TRTForYolov3

Desc

tensorRT for Yolov3

Test Enviroments

Ubuntu  16.04
TensorRT 5.0.2.6/4.0.1.6
CUDA 9.2

Models

Download the caffe model converted by official model:

Baidu Cloud here pwd: gbue
Google Drive here

If run model trained by yourself, comment the "upsample_param" blocks, and modify the prototxt the last layer as:

layer {
    #the bottoms are the yolo input layers
    bottom: "layer82-conv"
    bottom: "layer94-conv"
    bottom: "layer106-conv"
    top: "yolo-det"
    name: "yolo-det"
    type: "Yolo"
}

It also needs to change the yolo configs in "YoloConfigs.h" if different kernels.

Run Sample

#build source code
git submodule update --init --recursive
mkdir build
cd build && cmake .. && make && make install
cd ..

#for yolov3-608
./install/runYolov3 --caffemodel=./yolov3_608.caffemodel --prototxt=./yolov3_608.prototxt --input=./test.jpg --W=608 --H=608 --class=80

#for fp16
./install/runYolov3 --caffemodel=./yolov3_608.caffemodel --prototxt=./yolov3_608.prototxt --input=./test.jpg --W=608 --H=608 --class=80 --mode=fp16

#for int8 with calibration datasets
./install/runYolov3 --caffemodel=./yolov3_608.caffemodel --prototxt=./yolov3_608.prototxt --input=./test.jpg --W=608 --H=608 --class=80 --mode=int8 --calib=./calib_sample.txt

#for yolov3-416 (need to modify include/YoloConfigs for YoloKernel)
./install/runYolov3 --caffemodel=./yolov3_416.caffemodel --prototxt=./yolov3_416.prototxt --input=./test.jpg --W=416 --H=416 --class=80

Performance

Model	GPU	Mode	Inference Time
Yolov3-416	GTX 1060	Caffe	54.593ms
Yolov3-416	GTX 1060	float32	23.817ms
Yolov3-416	GTX 1060	int8	11.921ms
Yolov3-608	GTX 1060	Caffe	88.489ms
Yolov3-608	GTX 1060	float32	43.965ms
Yolov3-608	GTX 1060	int8	21.638ms
Yolov3-608	GTX 1080 Ti	float32	19.353ms
Yolov3-608	GTX 1080 Ti	int8	9.727ms
Yolov3-416	GTX 1080 Ti	float32	9.677ms
Yolov3-416	GTX 1080 Ti	int8	6.129ms

Eval Result

run above models with appending --evallist=labels.txt

int8 calibration data made from 200 pics selected in val2014 (see scripts dir)

Model	GPU	Mode	dataset	MAP(0.50)	MAP(0.75)
Yolov3-416	GTX 1060	Caffe	COCO val2014	81.76	52.05
Yolov3-416	GTX 1060	float32	COCO val2014	81.93	52.19
Yolov3-416	GTX 1060	int8	COCO val2014	86.41	57.11
Yolov3-416	GTX 1060	Caffe	COCO val2014	80.41	52.33
Yolov3-608	GTX 1060	float32	COCO val2014	80.6	52.43
Yolov3-608	GTX 1060	int8	COCO val2014	85.35	56.88

Notice:

caffe implementation is little different in yolo layer and nms, and it should be the similar result compared to tensorRT fp32.
Int8 mode gets better result in the val dataset, but not certainly in other test data. And exactly it is more often a little worse.

Details About Wrapper

see link TensorRTWrapper

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
include		include
scripts		scripts
tensorRTWrapper @ 757c379		tensorRTWrapper @ 757c379
.gitignore		.gitignore
.gitmodules		.gitmodules
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
calib_sample.txt		calib_sample.txt
dataReader.cpp		dataReader.cpp
eval.cpp		eval.cpp
labels_sample.txt		labels_sample.txt
main.cpp		main.cpp
readme.md		readme.md

License

FengWu1206/TensorRT-Yolov3

Folders and files

Latest commit

History

Repository files navigation

TRTForYolov3

Desc

Test Enviroments

Models

Run Sample

Performance

Eval Result

Details About Wrapper

About

Resources

License

Stars

Watchers

Forks

Languages