[Model] Yolov5/v5lite/v6/v7/v7end2end: CUDA preprocessing #370

wang-xinyu · 2022-10-14T11:21:41Z

PR types

Performance optimization

PR changes

Others - preprocessing

Describe

Add a YOLO CUDA preprocessing utility.
Integrate CUDA preprocessing into Yolov5/v5lite/v6/v7/v7end2end, then test and compare latency.
Update CMake to compile CUDA source files.

CLAassistant · 2022-10-14T11:21:47Z

All committers have signed the CLA.

wang-xinyu · 2022-10-18T12:39:06Z

Latency covers preprocessing, inference, and postprocessing, measured in milliseconds.
Tested on a P40 GPU with TensorRT 8.4.

| Model | Latency, CPU preprocessing (ms) | Latency, CUDA preprocessing (ms) | Latency reduction |
| --- | --- | --- | --- |
| yolov5s | 41 | 28 | 31.7% |
| yolov5lite | 40 | 22 | 45% |
| yolov6s | 25 | 11 | 56% |
| yolov7 | 47 | 32 | 31.9% |
| yolov7_e2e | 27 | 16 | 40.7% |
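For readers checking the last column, the reduction appears to be computed as (CPU latency - CUDA latency) / CPU latency. The helper below is a minimal sketch of that arithmetic; the function name `LatencyReductionPercent` is hypothetical and is not part of this PR.

```cpp
#include <cassert>

// Hypothetical helper (not from the PR): relative latency reduction in
// percent, computed as (cpu_ms - cuda_ms) / cpu_ms * 100.
double LatencyReductionPercent(double cpu_ms, double cuda_ms) {
  assert(cpu_ms > 0.0);  // avoid division by zero
  return (cpu_ms - cuda_ms) / cpu_ms * 100.0;
}
```

For example, `LatencyReductionPercent(41, 28)` gives roughly 31.7, matching the yolov5s row.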

CMakeLists.txt

fastdeploy/vision/detection/contrib/yolov5.cc

wang-xinyu · 2022-10-19T03:31:55Z

The CUDA preprocessing for YOLO resizes the image with a warp-affine transform, which differs slightly from cv::resize().
As a result, the mAP differs slightly.
The mAP (IoU=0.50:0.95 | area=all) results below were measured on coco_val_2017 (5000 images) with the TensorRT model.

| Model | mAP (CPU preprocessing) | mAP (CUDA preprocessing) |
| --- | --- | --- |
| yolov5s | 0.372 | 0.368 |
| yolov6s | 0.424 | 0.418 |
| yolov7 | 0.514 | 0.498 |
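To illustrate what resizing with a warp-affine transform means here, the following is a CPU reference sketch, not the CUDA kernel from this PR. It assumes an aspect-preserving scale with the image centered in the destination, bilinear sampling, and a caller-supplied padding value. All names (`Affine2x3`, `LetterboxDstToSrc`, `WarpAffineBilinear`) are hypothetical, and the sketch ignores half-pixel center offsets and handles a single channel only.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <vector>

// Hypothetical 2x3 affine matrix [a b c; d e f], stored row-major.
struct Affine2x3 { float v[6]; };

// Builds the destination->source matrix for an aspect-preserving resize
// that centers the scaled image inside the destination.
Affine2x3 LetterboxDstToSrc(int src_w, int src_h, int dst_w, int dst_h) {
  assert(src_w > 0 && src_h > 0 && dst_w > 0 && dst_h > 0);
  const float scale = std::min(dst_w / static_cast<float>(src_w),
                               dst_h / static_cast<float>(src_h));
  // Forward map: x_dst = scale * x_src + tx, y_dst = scale * y_src + ty.
  const float tx = (dst_w - scale * src_w) * 0.5f;
  const float ty = (dst_h - scale * src_h) * 0.5f;
  // Inverse map: x_src = (x_dst - tx) / scale, and likewise for y.
  return {{1.0f / scale, 0.0f, -tx / scale,
           0.0f, 1.0f / scale, -ty / scale}};
}

// Warps a single-channel, row-major image into a dst_w x dst_h buffer.
// Destination pixels that map outside the source keep pad_value.
std::vector<uint8_t> WarpAffineBilinear(const std::vector<uint8_t>& src,
                                        int src_w, int src_h,
                                        int dst_w, int dst_h,
                                        uint8_t pad_value) {
  assert(src.size() == static_cast<size_t>(src_w) * src_h);
  const Affine2x3 m = LetterboxDstToSrc(src_w, src_h, dst_w, dst_h);
  std::vector<uint8_t> dst(static_cast<size_t>(dst_w) * dst_h, pad_value);
  for (int dy = 0; dy < dst_h; ++dy) {
    for (int dx = 0; dx < dst_w; ++dx) {
      const float sx = m.v[0] * dx + m.v[1] * dy + m.v[2];
      const float sy = m.v[3] * dx + m.v[4] * dy + m.v[5];
      if (sx < 0 || sx > src_w - 1 || sy < 0 || sy > src_h - 1) continue;
      const int x0 = static_cast<int>(std::floor(sx));
      const int y0 = static_cast<int>(std::floor(sy));
      const int x1 = std::min(x0 + 1, src_w - 1);
      const int y1 = std::min(y0 + 1, src_h - 1);
      const float fx = sx - x0;
      const float fy = sy - y0;
      // Bilinear blend of the four neighbouring source pixels.
      const float top = src[y0 * src_w + x0] * (1 - fx) + src[y0 * src_w + x1] * fx;
      const float bot = src[y1 * src_w + x0] * (1 - fx) + src[y1 * src_w + x1] * fx;
      dst[dy * dst_w + dx] = static_cast<uint8_t>(top * (1 - fy) + bot * fy + 0.5f);
    }
  }
  return dst;
}
```

Because every destination pixel is computed independently from the same small matrix, this formulation maps naturally onto one GPU thread per output pixel. That is a common reason to prefer it for CUDA preprocessing, though the kernel in this PR may differ in its details.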

jiangjiajun

Windows compatibility will be added in the next PR.

wang-xinyu added 3 commits October 14, 2022 19:00

add yolo cuda preprocessing

4ea3407

cmake build cuda src

757c151

yolov5 support cuda preprocessing

4365073

jiangjiajun and others added 11 commits October 14, 2022 19:30

Merge branch 'develop' into cuda_preproc

ef1475e

Merge branch 'PaddlePaddle:develop' into cuda_preproc

07eb593

yolov5 cuda preprocessing configurable

3f7e6ba

Merge branch 'PaddlePaddle:develop' into cuda_preproc

73f5d89

Merge branch 'cuda_preproc' of https://github.com/wang-xinyu/FastDeploy…

010e66d

… into cuda_preproc

yolov5 update get mat data api

e061b35

yolov5 check cuda preprocess args

389da0a

refactor cuda function name

d580801

yolo cuda preprocess padding value configurable

e3d7e1e

yolov5 release cuda memory

fdd6621

cuda preprocess pybind api update

fd3725a

wang-xinyu changed the title ~~Yolo cuda preprocessing util and yolov5 cuda preprocessing~~ [Model]Yolo cuda preprocessing Oct 18, 2022

wang-xinyu and others added 6 commits October 18, 2022 16:02

Merge branch 'develop' into cuda_preproc

1e1d2f3

move use_cuda_preprocessing option to yolov5 model

d43e604

yolov5lite cuda preprocessing

4f4c456

yolov6 cuda preprocessing

4c8f8d2

yolov7 cuda preprocessing

5e10efa

yolov7_e2e cuda preprocessing

1905986

wang-xinyu changed the title ~~[Model]Yolo cuda preprocessing~~ [Model]Yolov5/v5lite/v6/v7/v7end2end: CUDA preprocessing Oct 18, 2022

remove cuda preprocessing in runtime option

8ce4ea6

jiangjiajun requested changes Oct 19, 2022

CMakeLists.txt

fastdeploy/vision/detection/contrib/yolov5.cc

wang-xinyu and others added 3 commits October 19, 2022 11:39

Merge branch 'develop' into cuda_preproc

6a7a81e

refine log and cmake variable name

05c289c

fix model runtime ptr type

2ba76dd

jiangjiajun approved these changes Oct 19, 2022

jiangjiajun changed the title ~~[Model]Yolov5/v5lite/v6/v7/v7end2end: CUDA preprocessing~~ [Model] Yolov5/v5lite/v6/v7/v7end2end: CUDA preprocessing Oct 19, 2022

jiangjiajun merged commit c8d6c82 into PaddlePaddle:develop Oct 19, 2022
