[Community Pipeline] Speech to Image 

## Intro

Community Pipelines are introduced in `diffusers==0.4.0` with the idea of allowing the community to quickly add, integrate, and share their custom pipelines on top of `diffusers`. 

You can find a guide about Community Pipelines [here](https://github.com/huggingface/diffusers/issues/841). You can also find all the community examples under [`examples/community/`](https://github.com/huggingface/diffusers/tree/main/examples/community). If you have questions about the Community Pipelines feature, please head to the [parent issue](https://github.com/huggingface/diffusers/issues/841).

## Idea: Speech to Image

You can use a `transformer` `automatic-speech-recognition` such as OpenAI `whisper` to transcribe the text, and pass that to Stable Diffusion. Together, this would create a nice `speech-to-image` pipeline.

Resources
* https://twitter.com/art_zucker/status/1579881525071728642
* https://huggingface.co/docs/transformers/model_doc/whisper

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Community Pipeline] Speech to Image #871

Intro

Idea: Speech to Image

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Community Pipeline] Speech to Image #871

Description

Intro

Idea: Speech to Image

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions