Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Enhancement]: Adding Transcription/Subtitle Viewing Support #2919

Open
18 tasks
mfcar opened this issue May 4, 2024 · 1 comment
Open
18 tasks

[Enhancement]: Adding Transcription/Subtitle Viewing Support #2919

mfcar opened this issue May 4, 2024 · 1 comment
Labels
enhancement New feature or request

Comments

@mfcar
Copy link
Contributor

mfcar commented May 4, 2024

Describe the feature/enhancement

Transcription/Subtitle support

Summary

Add initial support for transcriptions. Apple now supports transcriptions in podcasts.

Some audiobook files have transcriptions, and currently, we can use tools based on Whisper to transcribe audio to text.

In fact, most software based on Whisper support transcribing audio to text and exporting it as an SRT or VTT file. VTT is a native format for the web, and SRT is a common format for subtitles.

I'm creating this issue to discuss the best way to implement transcription support on the web player. I'm trying to implement some features on the pull request #

Podcast transcription is supported by:

Possible tasks:

Note: I think we need define a standard for multi-language transcriptions. For example use some prefix in the file name like en- for English and es- for Spanish.

UI Ideas on the Web Player:

What's the best way to display the transcription on the web player?

Screenshot 2024-05-04 at 20 47 33
  • Modal.
    • Good: More space to display multiple lines of text. Can float over the UI.
    • Bad: The modal can be intrusive.
  • Lateral Panel. (like the iTunes/Apple Music).
    • Good: More space to display multiple lines of text. Can have a search feature. Better for implement the seek feature (click on the line and seek to the corresponding time).
    • Bad: Take up space on the screen. Not good for small screens.
Screenshot 2024-05-02 at 09 09 47

Related

@mfcar mfcar added the enhancement New feature or request label May 4, 2024
@mfcar mfcar changed the title [Enhancement]: Transcription/Subtitle support [Enhancement]: Transcription/Subtitle Playing support May 4, 2024
@mfcar mfcar changed the title [Enhancement]: Transcription/Subtitle Playing support [Enhancement]: Adding Transcription/Subtitle Viewing Support May 5, 2024
@barolo
Copy link

barolo commented Jun 24, 2024

IMHO, when it comes to the UI you should combine both, the big panel for browsing and the panel below controls perhaps just with the current line but bigger. You have to take into account accessibility, some folks will want it to be resizable.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants