word-level timestamps #12
Comments
Hi, word-level timestamps are currently not possible. They usually require extensions to the model that are not implemented at this time. |
Thank you for the amazing work on this! |
Just checked out the whisper repo and the word-level timestamp PR has been merged. It would be great indeed to have the same in faster-whisper. Great work! |
I just pushed an experimental branch implementing word-level timestamps! It would be great if you could test it early. Note that I implemented exactly the same logic as openai/whisper, so if you see a strange result and openai/whisper gives the same result, you should report the issue to openai/whisper and not here.
Here's how you can test this today:
Install the development branch of faster-whisper
Install the development build of CTranslate2
Reconvert the model
The model should be converted again with the latest version of CTranslate2, as the configuration needs to be updated with additional information:
Transcribe with word-level timestamps
segments, _ = model.transcribe(audio_path, word_timestamps=True)
for segment in segments:
    print(segment.words)
|
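For anyone testing, printing one line per word can make the timestamps easier to compare against openai/whisper. The following is only a sketch: it assumes each entry in segment.words exposes start, end, and word attributes, and the Word namedtuple and format_words helper are hypothetical stand-ins written for this example, not part of faster-whisper.

```python
from collections import namedtuple

# Hypothetical stand-in for the entries of segment.words.
# The field names (start, end, word) are an assumption.
Word = namedtuple("Word", ["start", "end", "word"])


def format_words(words):
    """Return one '[start -> end] text' line per word, times in seconds."""
    return [f"[{w.start:.2f}s -> {w.end:.2f}s] {w.word.strip()}" for w in words]


# Made-up sample data, used only to show the output format.
sample = [Word(0.0, 0.42, " Hello"), Word(0.42, 0.9, " world")]
for line in format_words(sample):
    print(line)
```

If the attribute names match, calling format_words(segment.words) inside the transcription loop should give the same kind of listing for real audio.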
just tested this with the tiny model and it worked! |
large-v2 seems to work too. Thanks again |
When I tested word timestamps on a bunch of files, I saw this error happening in some corner cases:
|
Thank you for testing! Can you confirm that the same file works without issue in openai/whisper? If yes, is it possible for you to share this input file? |
Hi, I really appreciate you sharing this implementation.
I found it to be very fast with accurate results.
I do not see word-level timestamps in the result. Are word-level timestamps possible?