Support `text-to-speech` in `pipeline` function and in Optimum #22487

josephrocca · 2023-03-31T08:46:37Z

Feature request

SpeechT5 was recently added to Transformers:

Blog post: https://huggingface.co/blog/speecht5
Spaces demo: https://huggingface.co/spaces/Matthijs/speecht5-tts-demo
Models: https://huggingface.co/mechanicalsea/speecht5-tts

It would be great if text-to-speech could be supported across the Transformers stack.

Motivation

@xenova bumped into this as an issue when trying to get SpeechT5 working in the browser (Transformers.js).

Your contribution

Probably unable to help with this at the moment.

sgugger · 2023-03-31T13:31:22Z

cc @sanchit-gandhi

sanchit-gandhi · 2023-04-04T17:03:17Z

Indeed, a TTS pipeline would be super helpful for running SpeechT5. We're currently planning to wait until we have 1-2 more TTS models in the library before pushing ahead with a TTS pipeline, so that we can verify that the pipeline generalises and gives a benefit over loading a single model + processor.

cc @hollance
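
For readers following along, the "single model + processor" route mentioned above looks roughly like the sketch below for SpeechT5. The function name `synthesize_speecht5` is hypothetical, the `microsoft/speecht5_tts` and `microsoft/speecht5_hifigan` checkpoint names are assumptions taken from the SpeechT5 blog post rather than from this thread, and the all-zeros speaker embedding is only a placeholder to keep the example short.

```python
def synthesize_speecht5(text, speaker_embeddings=None):
    """Return a 1-D waveform tensor for `text` using SpeechT5 (16 kHz audio)."""
    # Imported lazily so that defining this function does not require
    # torch or transformers to be installed.
    import torch
    from transformers import (
        SpeechT5ForTextToSpeech,
        SpeechT5HifiGan,
        SpeechT5Processor,
    )

    # Assumed checkpoint names (not given in this thread).
    processor = SpeechT5Processor.from_pretrained("microsoft/speecht5_tts")
    model = SpeechT5ForTextToSpeech.from_pretrained("microsoft/speecht5_tts")
    vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")

    if speaker_embeddings is None:
        # Placeholder only: SpeechT5 expects a 512-dimensional speaker
        # x-vector. Real embeddings from a speaker encoder give far
        # better audio than zeros.
        speaker_embeddings = torch.zeros((1, 512))

    # Tokenize the text, generate a spectrogram, and let the vocoder
    # turn it into a waveform in a single call.
    inputs = processor(text=text, return_tensors="pt")
    return model.generate_speech(
        inputs["input_ids"], speaker_embeddings, vocoder=vocoder
    )
```

Calling `synthesize_speecht5("Hello from SpeechT5")` downloads the checkpoints on first use. The three separate objects (processor, model, vocoder) plus the speaker embedding are the boilerplate a pipeline would be expected to hide.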

josephrocca · 2023-04-04T17:26:09Z

Any viable contenders for the other 1-2 models? https://paperswithcode.com/task/text-to-speech-synthesis

mayankagarwals · 2023-04-06T16:19:45Z

Hey, I'd be more than happy to take up this task if we can decide on the other 1-2 models

xenova · 2023-04-06T16:51:16Z

> Hey, I'd be more than happy to take up this task if we can decide on the other 1-2 models

We can probably just select the most popular models from the hub: https://huggingface.co/models?pipeline_tag=text-to-speech&sort=downloads

hollance · 2023-04-07T08:32:26Z

There is an open PR for FastSpeech2. I think this is a good new model to add. If anyone is interested in taking that PR to completion, that would be awesome!

xenova · 2023-04-18T02:24:55Z

> Hey, I'd be more than happy to take up this task if we can decide on the other 1-2 models

Let me know if you need any help! I’m excited for this to be added 🔥

xenova · 2023-04-27T22:14:57Z

Here's another model which could fall into the text-to-speech category: #23036

jozefchutka · 2023-04-28T13:38:12Z

Just added one more #23050

bil-ash · 2023-07-22T14:43:28Z

Please add support for the mms-tts model, as mentioned in the above issue, to the TTS pipeline.

xenova · 2023-07-22T14:54:18Z

Good news! This is currently being worked on: #24952 🚀🔥
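
As a rough illustration of what the requested `pipeline` support looks like from the user's side, here is a minimal sketch. The function name `synthesize_with_pipeline` and the default checkpoint are hypothetical choices for this example; the `forward_params` argument and the `audio` / `sampling_rate` output keys reflect the pipeline as I understand it from released Transformers versions, so check them against the version you have installed.

```python
def synthesize_with_pipeline(text, model_id="microsoft/speecht5_tts",
                             speaker_embeddings=None):
    """Return (audio, sampling_rate) produced by a text-to-speech pipeline."""
    # Imported lazily so that defining this function does not require
    # transformers to be installed.
    from transformers import pipeline

    synthesiser = pipeline("text-to-speech", model=model_id)

    if speaker_embeddings is not None:
        # Model-specific inputs (such as SpeechT5 speaker embeddings)
        # are forwarded to the model through `forward_params`.
        result = synthesiser(
            text, forward_params={"speaker_embeddings": speaker_embeddings}
        )
    else:
        result = synthesiser(text)

    # The pipeline returns a dict with the raw waveform and its sample rate.
    return result["audio"], result["sampling_rate"]
```

Compared with loading the processor, model, and vocoder by hand, the caller only chooses a checkpoint and passes text, which is the generalisation benefit discussed earlier in the thread.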

josephrocca mentioned this issue Apr 1, 2023

[Feature request] Add text-to-speech with SpeechT5 xenova/transformers.js#59

Closed

sanchit-gandhi added the "Feature request" and "Core: Pipeline" labels Apr 4, 2023

xenova mentioned this issue Apr 8, 2023

[Question] WavLM support xenova/transformers.js#75

Closed

sanchit-gandhi mentioned this issue May 15, 2023

[New model] 🐸TTS advanced Text-to-Speech #23050

Open

ylacombe mentioned this issue Jul 20, 2023

Add Text-To-Speech pipeline #24952

Merged

bil-ash mentioned this issue Jul 22, 2023

[Feature request] Add support for Massively Multilingual Speech(MMS) model xenova/transformers.js#209

Closed

sanchit-gandhi closed this as completed in #24952 Aug 17, 2023
