Transcription timestamp of every word in the output wav2vec2 #2965

Closed
harveenchadha opened this issue Nov 28, 2020 · 4 comments

Comments

@harveenchadha
Contributor

harveenchadha commented Nov 28, 2020

❓ Questions and Help

How can I get the timestamp of each word in a wav2vec transcription?

In ASR, the timestamp of each word within the audio often matters a lot. Is there any mechanism that does this yet?

@alexeib
Contributor

alexeib commented Dec 1, 2020

This is not really supported at the moment. You can try adding the code yourself to infer.py for Viterbi decoding, or check whether the wav2letter decoder supports this (our decoding uses wav2letter decoders). Alternatively, you can try to implement decoding with e.g. Kaldi, which should be straightforward since you can easily get emissions, and get the timestamps from there (if you do this, please contribute back!).
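
A rough sketch of the "get timestamps from the emissions" idea, for illustration only. It does plain greedy (best-path) CTC decoding in pure Python and is not the fairseq, wav2letter, or Kaldi decoder. Everything named here is an assumption: the function `word_timestamps` is hypothetical, the blank symbol is assumed to sit at index 0, `|` is assumed to be the word delimiter, and each emission frame is assumed to cover 20 ms of audio. Check all of these against your own model and dictionary.

```python
from typing import List, Tuple


def word_timestamps(
    emissions: List[List[float]],  # one row of per-symbol scores per frame
    vocab: List[str],              # index -> symbol (assumed layout)
    blank: int = 0,                # assumed index of the CTC blank
    delimiter: str = "|",          # assumed word-boundary symbol
    frame_sec: float = 0.02,       # assumed duration of one frame in seconds
) -> List[Tuple[str, float, float]]:
    """Greedy CTC decode returning (word, start_sec, end_sec) tuples."""
    words: List[Tuple[str, float, float]] = []
    chars: List[str] = []
    start = end = 0
    prev = blank

    def flush() -> None:
        # Close the word collected so far, if there is one.
        if chars:
            words.append(("".join(chars), start * frame_sec, end * frame_sec))
            chars.clear()

    for t, frame in enumerate(emissions):
        # Best-scoring symbol for this frame.
        idx = max(range(len(frame)), key=frame.__getitem__)
        # CTC collapse: a symbol is new only if it differs from the previous frame.
        is_new = idx != blank and idx != prev
        prev = idx
        if idx == blank:
            continue
        sym = vocab[idx]
        if sym == delimiter:
            flush()
            continue
        if is_new:
            if not chars:
                start = t  # first frame of the first character of the word
            chars.append(sym)
        end = t + 1  # last frame (exclusive) on which a character was active

    flush()  # the final word may not be followed by a delimiter
    return words
```

The word boundaries here are only as precise as the frames on which the model happens to emit non-blank symbols, so treat them as approximate. A forced-alignment pass against the final transcript would usually give tighter boundaries.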

@irux

irux commented Mar 24, 2021

@harveenchadha did you solve it?

@harveenchadha
Contributor Author

@irux not yet, but will be working on it.

@pushkal1234

How do I get word timestamps in wav2vec 2.0?
