Even when the speaker starts talking after 10 sec, Whisper makes the first timestamp start at sec 0. How can I change that? #1130
Unanswered
romain130492
asked this question in
Q&A
You can accomplish this by using word-level timestamps and then rebuilding the subtitle file yourself. I just finished the code for that and will publish it soon.
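Until that code is published, here is a rough sketch of what "rebuilding the file" from word-level timestamps could look like. It is not the code mentioned above, just one possible approach. It assumes you already have a list of words shaped like `{"word": ..., "start": ..., "end": ...}` with times in seconds; `words_to_srt`, `format_srt_time`, `max_gap` and `max_chars` are names made up for this example. Each cue starts at its first word and ends at its last word, so leading silence is not folded into the timestamp.

```python
from typing import Dict, List


def format_srt_time(seconds: float) -> str:
    """Render seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: List[Dict], max_gap: float = 1.0, max_chars: int = 80) -> str:
    """Group word-level timestamps into SRT cues.

    Assumed input shape: [{"word": str, "start": float, "end": float}, ...].
    A new cue starts whenever the pause before a word exceeds max_gap seconds
    or the current cue would grow past max_chars characters.
    """
    cues: List[List[Dict]] = []
    for word in words:
        text = word["word"].strip()
        if not text:
            continue
        item = {"text": text, "start": word["start"], "end": word["end"]}
        if cues:
            current = cues[-1]
            gap = item["start"] - current[-1]["end"]
            length = sum(len(w["text"]) + 1 for w in current) + len(text)
            if gap <= max_gap and length <= max_chars:
                current.append(item)
                continue
        cues.append([item])

    blocks = []
    for index, cue in enumerate(cues, start=1):
        start = format_srt_time(cue[0]["start"])  # first word spoken, not the silence before it
        end = format_srt_time(cue[-1]["end"])
        line = " ".join(w["text"] for w in cue)
        blocks.append(f"{index}\n{start} --> {end}\n{line}\n")
    return "\n".join(blocks)
```

As far as I know, the open-source `whisper` package returns per-word times when you call `transcribe(..., word_timestamps=True)`, and the hosted API can return them with `response_format="verbose_json"` plus `timestamp_granularities=["word"]`. Check the current docs for the exact response shape before relying on this.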
Hello
I'm using Whisper. When a video has a speaker who starts talking at sec 10, the first timestamp I get is at sec 1 instead of sec 10.
Here is my config:
Config: POST v1/audio/transcriptions
Output:
For subtitle 1, it should be 00:00:10,000 --> 00:00:14,000, since no one is talking at all for the first 10 sec.
For subtitle 3, the speaker starts talking again at sec 28, but I'm getting the timestamp at sec 24. Whisper simply includes the silence in the timestamp.
Any idea how I could fix that, maybe using a prompt?
Thanks!