Limit transcription CPU threads #6467

JohnXLivingston · 2024-07-05T15:50:36Z

Describe the problem to be solved

v6.2.0-RC1 comes with an incredible feature: automatic subtitles generation.

But the models that are used can use a lot of CPU and RAM.
I did not see any option to limit their usage (as we can with video transcoding).

Describe the solution you would like

Would it be possible to add some options?

Chocobozzz · 2024-07-10T09:02:12Z

I don't think we can limit RAM but CPU yes there is a threads option

  --threads THREADS     number of threads used for CPU inference (default: 0)

lutangar · 2024-07-18T11:00:16Z

RAM usage largely depends on model size since it must be loaded to RAM (multiplied by the number of runners since models aren't shared in memory).

For example with whisper-ctranslate2 (which uses faster-whisper which is CPU friendly) the models tend to be larger than the one provided for openai-whisper :

large-v3 ~3GB
medium ~1.5GB
tiny ~75MB

Of course, transcript quality will get worse the further you decrease the size.

https://huggingface.co/Systran

JohnXLivingston · 2024-07-18T11:26:08Z

Thanks for the information @lutangar ! Good to know.

Chocobozzz added Type: Feature Request ✨ Component: Subtitles 💬 Component: Runners labels Jul 10, 2024

Chocobozzz changed the title ~~New transcription feature: is it possible to limit CPU and RAM usage?~~ Limit transcription CPU threads Jul 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Limit transcription CPU threads #6467

Limit transcription CPU threads #6467

JohnXLivingston commented Jul 5, 2024

Chocobozzz commented Jul 10, 2024

lutangar commented Jul 18, 2024

JohnXLivingston commented Jul 18, 2024

Limit transcription CPU threads #6467

Limit transcription CPU threads #6467

Comments

JohnXLivingston commented Jul 5, 2024

Describe the problem to be solved

Describe the solution you would like

Chocobozzz commented Jul 10, 2024

lutangar commented Jul 18, 2024

JohnXLivingston commented Jul 18, 2024