
[Community Pipeline] Speech to Image  #871

@osanseviero

Intro

Community Pipelines were introduced in diffusers==0.4.0 to let the community quickly add, integrate, and share custom pipelines built on top of diffusers.

You can find a guide about Community Pipelines here. You can also find all the community examples under examples/community/. If you have questions about the Community Pipelines feature, please head to the parent issue.

Idea: Speech to Image

You can use a transformers automatic-speech-recognition model such as OpenAI Whisper to transcribe speech into text, and pass that text to Stable Diffusion as the prompt. Together, this would create a nice speech-to-image pipeline.
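
As a rough illustration of the idea, here is a minimal sketch of how the two stages could be chained. It is not a finished community pipeline. The function names `speech_to_image` and `build_default_stages` are hypothetical, and the checkpoint IDs (`openai/whisper-small`, `CompVis/stable-diffusion-v1-4`) and the `cuda` device are assumptions you would likely swap for your own.

```python
from typing import Any, Callable, Tuple


def speech_to_image(
    audio: Any,
    transcribe: Callable[[Any], str],
    generate: Callable[[str], Any],
) -> Tuple[str, Any]:
    """Transcribe `audio` to text, then use that text as the image prompt."""
    prompt = transcribe(audio).strip()
    if not prompt:
        raise ValueError("transcription was empty; nothing to generate from")
    return prompt, generate(prompt)


def build_default_stages(
    asr_model: str = "openai/whisper-small",          # assumed checkpoint
    sd_model: str = "CompVis/stable-diffusion-v1-4",  # assumed checkpoint
    device: str = "cuda",                             # assumed device
):
    """Hypothetical helper that wires Whisper and Stable Diffusion together."""
    # Imported lazily so the glue function above has no heavy dependencies.
    from diffusers import StableDiffusionPipeline
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=asr_model)
    sd = StableDiffusionPipeline.from_pretrained(sd_model).to(device)

    def transcribe(audio: Any) -> str:
        # The ASR pipeline returns a dict containing the transcribed text.
        return asr(audio)["text"]

    def generate(prompt: str) -> Any:
        # The diffusion pipeline output exposes a list of PIL images.
        return sd(prompt).images[0]

    return transcribe, generate


# Possible usage ("sample.wav" is a placeholder path):
#   transcribe, generate = build_default_stages()
#   prompt, image = speech_to_image("sample.wav", transcribe, generate)
#   image.save("speech_to_image.png")
```

Keeping the two stages as plain callables means either model can be replaced without touching the glue code. A real community pipeline would more likely subclass `DiffusionPipeline` and hold both models as components.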

Resources
