
Speedup the Video Inference by Accelerating data-loading Stage #7832

Merged
merged 13 commits into open-mmlab:dev on May 8, 2022

Conversation

chenxinfeng4
Contributor

Motivation

The video inference was inefficient because the "resize", "padding", "to rgb" and "normalize" operations all run on the CPU. In addition, `img_metas` is recomputed for every frame, which is wasteful. As a result, the CPU workload is very high while the inference speed stays low.

Modification

I moved the video "crop", "resize", "padding" and "to rgb" steps into an ffmpeg-based video reader, which is lightweight and CPU friendly. The reader can also use NVIDIA video decoding when available, to save even more CPU. The "normalize" step is done on the GPU. Altogether, this reduces both the data-loading time and the CPU workload.
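A minimal sketch of the idea (the helper name and the assumption that the reader already outputs resized, padded RGB frames are illustrative, not the PR's exact code):

```python
import numpy as np
import torch

# Sketch only: the ffmpeg-based reader is assumed to have already done
# crop/resize/pad/BGR->RGB on the decoder side, so the remaining per-frame
# CPU work is just the host-to-device copy; normalization runs on the GPU.
def frame_to_gpu(frame_rgb: np.ndarray, mean, std, device='cuda:0'):
    img = torch.from_numpy(frame_rgb).to(device)        # HWC uint8 on GPU
    img = img.permute(2, 0, 1).float()                  # CHW float32
    mean = torch.as_tensor(mean, dtype=torch.float32, device=device).view(-1, 1, 1)
    std = torch.as_tensor(std, dtype=torch.float32, device=device).view(-1, 1, 1)
    return ((img - mean) / std).unsqueeze(0)            # NCHW batch of one
```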

Result

After this modification, the CPU workload drops from >1000% to about 180%, and the inference frame rate improves slightly. The gain is most significant when data loading, rather than the network forward pass, is the bottleneck, as with YOLACT.

@CLAassistant

CLAassistant commented Apr 26, 2022

CLA assistant check
All committers have signed the CLA.

@chenxinfeng4
Contributor Author

You can compare the original video_demo.py with my video_gpuaccel_demo.py; they take similar argument inputs.
To see how much is gained, please delete the post-processing in both scripts, because the post-processing is also CPU heavy:

```python
# post-processing in `video_demo.py` and in my `video_gpuaccel_demo.py`
model.show_result(xxx)
```
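For example, a rough way to time the baseline `video_demo.py` path with the post-processing stripped (the config, checkpoint and video paths below are placeholders, not taken from the PR):

```python
import time

import mmcv
from mmdet.apis import inference_detector, init_detector

# Rough benchmark sketch: run the detector over a video with
# model.show_result() removed, so the measured FPS covers only data loading
# plus the network forward pass.
model = init_detector('configs/yolact/yolact_r50_1x8_coco.py',
                      'checkpoints/yolact_r50.pth', device='cuda:0')
video = mmcv.VideoReader('demo/demo.mp4')

start = time.time()
for frame in video:
    inference_detector(model, frame)   # no model.show_result(...) afterwards
print(f'{len(video) / (time.time() - start):.1f} FPS without post-processing')
```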

@hhaAndroid
Collaborator

@chenxinfeng4 Thank you very much for your contribution.

@chenxinfeng4
Contributor Author

I tested the speed of data loading alone: about 230 FPS at 800% CPU.

```python
with torch.cuda.device(args.device), torch.no_grad():
    for frame_resize, frame_origin in zip(tqdm.tqdm(video_resize), video_origin):
        data = process_img(frame_resize, img_metas)
        continue  # data loading only, no network forward
```

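For context, the full demo would replace the `continue` above with the network forward. A rough sketch of that step, assuming `data` follows MMDetection 2.x's test-time convention of `{'img': [tensor], 'img_metas': [[meta]]}` as produced by the demo's `process_img` helper:

```python
# Sketch only: `model` and `data` come from the loop above; the call follows
# the standard MMDetection 2.x test-time forward (one result per image).
result = model(return_loss=False, rescale=True, **data)[0]
```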

@ZwwWayne
Collaborator

lint failed.

@ZwwWayne ZwwWayne changed the base branch from master to dev April 27, 2022 08:21

@RangiLyu
Member

Please use the pre-commit hook to fix the lint.

@hhaAndroid
Collaborator

@chenxinfeng4 Thanks for your very fast response.

@jbwang1997
Collaborator

LGTM

@chenxinfeng4
Contributor Author

I'm not an expert with git. Why is the PR blocked? What should I do next?

@jbwang1997
Collaborator

> I'm not an expert with git. Why is the PR blocked? What should I do next?

Hello @chenxinfeng4.

Merging is controlled by the maintainers. We will merge this PR as soon as possible. It seems the lint check fails again; please remember to fix it.

Thanks for your quick response.

@ZwwWayne ZwwWayne merged commit b1f40ef into open-mmlab:dev May 8, 2022
ZwwWayne pushed a commit that referenced this pull request Jul 18, 2022
* add a faster inference for video

* Fix typos

* modify typo

* modify the numpy array to torch gpu

* fix lint

* add description

* add documents

* fix typro

* fix lint

* fix lint

* fix lint again

* fix a mistake
ZwwWayne pushed a commit to ZwwWayne/mmdetection that referenced this pull request Jul 19, 2022
SakiRinn pushed a commit to SakiRinn/mmdetection-locount that referenced this pull request Mar 17, 2023