Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Enhancement]: generate images from text and show them while playing the book #2824

Open
dadino opened this issue Apr 5, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@dadino
Copy link

dadino commented Apr 5, 2024

Describe the feature/enhancement

This is a crazy idea and we're probably a few years away from being able to do it on our local machines. Treat this request more as a conversation.

When something like this is implemented to generate subtitles from the audio files, we could generate images for each page/minute/chapter/scene/whatever and show them in the player when listening to an audiobook.

When a audiobook is imported and his subtitles are generated, queue image generation using a predefined prompt (something like "an illustration for a book of this scene: %s") that could also be changed by the user (maybe I want a specific style for the illustration).

Since image generation prompts are usually short, I guess we should give all the previous (read) text to an LLM and ask it to "create a short prompt of the current scene/chapter to use in an image generator model, from this text: %s".

The file structure would be pretty simple, a folder with images and a text file with timestamps (like an srt). This could spark a community generated library of "book illustrations". It could also mean that ABS could just be a consumer of this format, not the creator.

Why?

I listen to audiobooks in small chunk of 5-15 minutes and having some context when I start a session would be great.
I also love cool images and having illustration for spaceship battles while listening to Expeditionary Forces, changing with every battle, switching to some tacticool Ruhar soldier when boots hit the ground or some crazy plan takes place on an alien planet, would throw me in book right away.

The Reddit post that sparked this idea for me

@dadino dadino added the enhancement New feature or request label Apr 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant