Support for Distil-Whisper #533

Open
sanchit-gandhi opened this issue Nov 1, 2023 · 12 comments

Comments

@sanchit-gandhi
Contributor

sanchit-gandhi commented Nov 1, 2023

Hey @guillaumekln! Thanks for this fantastic resource. We're looking at supporting the Distil-Whisper checkpoints in faster-whisper.

The checkpoints are fairly easy to convert: we always pin the number of decoder layers to 2 and load all 32 encoder layers.
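
A minimal conversion sketch using CTranslate2's Python converter API (the model ID, output directory, and copied files are illustrative, and a CTranslate2 release with distil-whisper support is assumed):

```python
from ctranslate2.converters import TransformersConverter

# Convert the Hugging Face checkpoint to the CTranslate2 format that
# faster-whisper loads. float16 quantization roughly halves the model size.
converter = TransformersConverter(
    "distil-whisper/distil-large-v2",
    copy_files=["tokenizer.json"],  # assumes the repo ships a tokenizer.json
)
converter.convert("distil-large-v2-ct2", quantization="float16")
```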

For inference, we found a chunk length of 15-20s to be optimal for the WER of the distilled model; see Table 23 of the paper:

[Screenshot: Table 23 of the Distil-Whisper paper, showing WER as a function of chunk length]

Would you be open to a PR allowing the user to specify the chunk length and also the maximum generation length? This would enable full support of Distil-Whisper in faster-whisper!
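
A sketch of what the requested options might look like on faster-whisper's transcribe call; the chunk_length and max_new_tokens parameter names are hypothetical illustrations of the proposal, not an existing API:

```python
from faster_whisper import WhisperModel

# Load the converted distil-whisper model (output of the conversion sketch above).
model = WhisperModel("distil-large-v2-ct2", device="cuda", compute_type="float16")

segments, info = model.transcribe(
    "audio.wav",
    language="en",       # distil-large-v2 is English-only
    chunk_length=15,     # hypothetical parameter: chunk size in seconds (15-20s optimal per Table 23)
    max_new_tokens=128,  # hypothetical parameter: cap on generated tokens per chunk
)
for segment in segments:
    print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")
```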

@Axbon

Axbon commented Nov 2, 2023

This would be amazing: one step closer to something that feels realtime-ish.

@ozancaglayan
Contributor

Is the distilled large-v2 model still multilingual, or does it lose that attribute due to how the distillation was done?

@AmgadHasan

> Is the distilled large-v2 model still multilingual, or does it lose that attribute due to how the distillation was done?

It was trained on English audio only, so it has most likely lost its multilingual capabilities.

@guilhermehge

> Is the distilled large-v2 model still multilingual, or does it lose that attribute due to how the distillation was done?

From their repo: "Note: Distil-Whisper is currently only available for English speech recognition. Multilingual support will be provided soon."

@hoonlight
Contributor

Waiting for a multilingual model for this task. Looking forward to it!

@martinkallstrom

Subscribing for updates!

@silvacarl2

Can't wait to test it; this will be awesome.

@WikiLucas00

Hi @sanchit-gandhi. FYI, @guillaumekln's account has appeared inactive since September, which coincides with his departure from his former company. I don't know if he plans to continue maintaining this repo, or whether other users such as the OpenNMT devs (cc @vince62s @homink @nguyendc-systran) have ownership of the repo or have forked it. Maybe consider forking it yourself under HF's GitHub namespace?

@guillaumekln
Contributor

Hi, I confirm that I'm no longer actively maintaining this repo, but other people can still move it forward. Please ping @nguyendc-systran to merge changes in faster-whisper. For anything related to CTranslate2, please ping @vince62s.

@BBC-Esq
Contributor

BBC-Esq commented Nov 9, 2023

> Hi, I confirm that I'm no longer actively maintaining this repo, but other people can still move it forward. Please ping @nguyendc-systran to merge changes in faster-whisper. For anything related to CTranslate2, please ping @vince62s.

Great job on CTranslate2 and faster-whisper; I'm glad I came across them a while ago. Good luck in the future!

@WikiLucas00

FYI, distil-whisper should now be supported by CTranslate2: https://github.com/OpenNMT/CTranslate2/releases/tag/v3.21.0

We "just" need to adapt faster-whisper in order to have faster-distil-whisper :)

@metame-none
Contributor

metame-none commented Nov 11, 2023

FYI, I created PR #557 to support distil-whisper; hope it helps.
