GPU not really utilized well #107

ichitaka · 2021-08-10T19:55:56Z

So I've run a few tests, as I've noticed that PyFeat is quite slow in comparison to something like OpenFace2. Turns out, while the CPU is always utilized, the GPU is not. This seems to be, because the data is loaded 1-by-1 through OpenCV instead of using a proper GPU library for lists of images. I think it makes sense, if a list of images is given to a Detector for detect_image, to load and predict the images batch wise through torch.DataLoaders.

TiankangXie · 2021-08-26T03:09:48Z

Hello @ichitaka ! Sorry for the late reply, but that's definitely an excellent suggestion! Basically, we used frame-by-frame reading because at that time we don't know if you could load videos with the Pytorch data loader. But in the next (soon) release we will address this problem! Thank you so much for the insights!

dexterdev · 2022-01-11T13:34:09Z

@TiankangXie : Any update from your side on this? I use resmasknet model to get emotion scores data from videos. And Py-Feat is extremely slow. Any temporary solutions at least?

TiankangXie · 2022-01-11T18:06:43Z

@dexterdev I am sorry for the late reply. Have you tried to increase the parameter - batch_size, and increase the skip_frames? Currently, these are the two parameters that could affect the processing speed. Increasing the batch_size and skip_frames should reduce the processing time proportionally.

We are reading the frames of the video by frames with the OpenCV cap.read function. This is not optimal and we are working on a major update soon!
Again thank you for the feedbacks!

dexterdev · 2022-01-12T09:53:41Z

No I havent tried those options. And also I was always trying to load all frames. Thank you. Looking forward for the updated Py-Feat :)

ljchang · 2022-08-05T15:41:32Z

Thanks @ichitaka for the excellent suggestion. We have completely refactored our toolbox to use the pytorch data loaders. We can report that there is now dramatic speedups on CPU, but also the ability to use GPUs as well. We are finishing up testing and will be updating our release very shortly (#133).

ichitaka · 2022-08-05T15:44:53Z

Great update!

dexterdev · 2022-08-05T16:04:19Z

Great. I think this is at right time for our requirements. Thanks for the great package and recent upgrade <3

ljchang · 2022-12-14T22:45:14Z

Hopefully this is now addressed in version 0.5.0 with #144

ejolly added the enhancement New feature or request label Mar 12, 2022

ExtReMLapin mentioned this issue Aug 4, 2022

add support for m1 GPU and refactor code to speed up detection with GPUs #133

Closed

ejolly mentioned this issue Dec 14, 2022

Refactor, model fixes, experimental m1 support #144

Merged

13 tasks

ljchang closed this as completed Dec 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPU not really utilized well #107

GPU not really utilized well #107

ichitaka commented Aug 10, 2021

TiankangXie commented Aug 26, 2021

dexterdev commented Jan 11, 2022

TiankangXie commented Jan 11, 2022

dexterdev commented Jan 12, 2022

ljchang commented Aug 5, 2022

ichitaka commented Aug 5, 2022

dexterdev commented Aug 5, 2022

ljchang commented Dec 14, 2022

GPU not really utilized well #107

GPU not really utilized well #107

Comments

ichitaka commented Aug 10, 2021

TiankangXie commented Aug 26, 2021

dexterdev commented Jan 11, 2022

TiankangXie commented Jan 11, 2022

dexterdev commented Jan 12, 2022

ljchang commented Aug 5, 2022

ichitaka commented Aug 5, 2022

dexterdev commented Aug 5, 2022

ljchang commented Dec 14, 2022