
Boost Inference speed #426

Open
arnavmehta7 opened this issue Dec 27, 2022 · 8 comments
Labels: question (Further information is requested)

Comments

@arnavmehta7

❓ Questions

Hi, I am currently using a GPU for inference on audio files, but it takes a long time for longer files of around 20 minutes. GPU VRAM is not an issue, so I was wondering whether there is any way to speed up inference on the GPU.

Thanks

@arnavmehta7 added the question (Further information is requested) label on Dec 27, 2022
@CarlGao4
Contributor

What is your GPU?

@arnavmehta7
Author

It's all in the cloud: an Nvidia A100.

@CarlGao4
Contributor

You can use the --segment argument. Increasing the segment length can make separation faster, at the cost of more memory.
Besides, you can compare the speed against a CPU-only run. For v3 models, processing takes about 0.8 times the audio length for each single model (or each model inside a BagOfModels). For v4 models, it is about 3 times the audio length. (These figures are from tests on my laptop.) On a GPU it should be roughly 20~50 times faster than that; if your speed is already in that range, you have reached the limit of the GPU.
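For reference, here is a minimal sketch of passing those flags from Python, assuming the demucs.separate.main entry point documented in the README; the file name and the segment value are placeholders, and some models cap how large the segment may be:

```python
# Minimal sketch; "my_track.wav" and the segment value are placeholders.
# Assumes the documented demucs.separate.main([...]) entry point.
import demucs.separate

demucs.separate.main([
    "-n", "mdx_extra",   # a v3 bag of models
    "-d", "cuda",        # run on the GPU
    "--segment", "60",   # larger segments are faster but use more memory
    "my_track.wav",
])
```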

@arnavmehta7
Author

I find the v4 models faster than v3; they were about 2-3 minutes quicker on the larger sample.

@arnavmehta7
Author

@CarlGao4 So what is the default value of --segment?

@CarlGao4
Contributor

I find the v4 models faster than v3; they were about 2-3 minutes quicker on the larger sample.

This is because the default v3 models (mdx, mdx_q, mdx_extra, mdx_extra_q) are all bags of models containing 4 single models each, which makes them about 4 times slower than a single model. The default model for v4 (htdemucs) is a single model, so it is faster. If you use htdemucs_ft instead, it will take about 2 times longer than the v3 models.
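To make the difference concrete, a hedged sketch (the file name is a placeholder) comparing a single-model run with the bag-of-models variants mentioned above:

```python
# Sketch comparing model choices; "song.wav" is a placeholder file name.
import demucs.separate

# v4 default: a single model, the fastest of these options
demucs.separate.main(["-n", "htdemucs", "-d", "cuda", "song.wav"])

# v4 fine-tuned: a bag of 4 models, so roughly 4x the work of htdemucs
demucs.separate.main(["-n", "htdemucs_ft", "-d", "cuda", "song.wav"])

# v3 bag (mdx_extra): also runs 4 sub-models per track
demucs.separate.main(["-n", "mdx_extra", "-d", "cuda", "song.wav"])
```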

@CarlGao4
Contributor

@CarlGao4 So what is the default value of --segment?

It depends on the model. For example, all v3 models use 44.
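If you would rather check the shipped value than assume it, here is a rough inspection sketch; the attribute names are an assumption and may differ between Demucs versions:

```python
# Rough sketch: print each sub-model's default segment length.
# Attribute names may differ between Demucs versions.
from demucs.pretrained import get_model
from demucs.apply import BagOfModels

model = get_model("mdx_extra")  # a v3 bag of 4 sub-models
sub_models = model.models if isinstance(model, BagOfModels) else [model]
for m in sub_models:
    print(type(m).__name__, getattr(m, "segment", "unknown"))
```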

@matiaszanolli

Hey there, what is the average inference speed you're getting with your A100?

I've been running some performance tests on an A100 (among other GPUs) and found the inference speed stays at roughly 48 seconds of audio per second with the htdemucs (v4) model (the htdemucs_ft model is about 4 times slower, since it runs a sequence of 4 models instead of a single one). Are you getting close to those speeds?
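One way to reproduce that kind of figure is to time a full run against a track of known duration; a rough sketch below, where the file name and its length are placeholders:

```python
# Rough throughput sketch: seconds of audio processed per wall-clock second.
# "long_track.wav" and its 20-minute duration are placeholders.
import time
import demucs.separate

audio_seconds = 20 * 60
start = time.perf_counter()
demucs.separate.main(["-n", "htdemucs", "-d", "cuda", "long_track.wav"])
elapsed = time.perf_counter() - start
print(f"~{audio_seconds / elapsed:.1f} seconds of audio per second")
```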
