Is it possible to retrieve the timing of the spoken words from gTTS itself? (Like the timings used in SRT subtitle files, though I don't care about the exact format.)
I know there might be hacky ways, such as running speech-to-text on the output or splitting the text into separate words, but I would prefer not to go this route.
Sorry for the delay.
Yeah, that would be pretty difficult given what this library does, which is request a byte stream and save it to a file. It isn't really aware of what's in the resulting data, so there's nothing it could directly do for this.
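For anyone who ends up needing rough timings anyway, here is a minimal sketch of the text-splitting workaround mentioned in the question. It is not something gTTS provides: `estimate_word_timings` is a hypothetical helper, and it assumes you already know the clip's total duration in seconds from some other tool. It also assumes speaking time is proportional to word length, which is only a crude approximation and ignores pauses and punctuation.

```python
def estimate_word_timings(text, total_duration):
    """Spread total_duration (seconds) across words, weighted by word length.

    Hypothetical helper: a rough approximation, not real alignment data.
    """
    words = text.split()
    total_chars = sum(len(w) for w in words)
    if total_chars == 0:
        return []

    timings = []
    start = 0.0
    for word in words:
        # Each word gets a share of the duration proportional to its length.
        length = total_duration * len(word) / total_chars
        timings.append((word, start, start + length))
        start += length
    return timings


# Example: the 2.6 second duration is made up for illustration.
for word, start, end in estimate_word_timings("hello big world", 2.6):
    print(f"{start:.2f} -> {end:.2f}  {word}")
```

Since the estimate never looks at the audio, it will drift on longer texts. Accurate timestamps would need a forced-alignment or speech-to-text step on the saved file.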