Keyword Timestamp Identification in Milliseconds #542
Unanswered
stevenmills
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
The accuracy on Whisper in a few of my tests has been excellent. One drawback I have is the timestamp ranges provided do not infer enough accuracy. For example I've seen 6 word sentences within the audio take ~1 second to speak, but the range given on the phrase is 7 seconds long. I am in need of the start milliseconds and duration milliseconds of each word. Is there a technical limitation to Whisper being able to provide this information per word?
Beta Was this translation helpful? Give feedback.
All reactions