token level timestep #78

Open
Mddct opened this issue Dec 14, 2020 · 6 comments · Fixed by #86
Labels
enhancement New feature or request

Comments

@Mddct

Mddct commented Dec 14, 2020

Is it possible to output token-level timestamps?

For example:
hello 100-600
world 712-900
...

@nglehuy nglehuy added the question Further information is requested label Dec 15, 2020
@nglehuy
Collaborator

nglehuy commented Dec 15, 2020

@Mddct I don't quite understand your question and example. Currently we tokenize a string (i.e. the label) into a list of characters or a list of subwords.

@Mddct
Author

Mddct commented Dec 15, 2020

Output each label together with its start time and end time in the original wav.

@nglehuy
Collaborator

nglehuy commented Dec 15, 2020

@Mddct Oh, we don't support that feature yet, but I'll look into it. If you have any ideas on how to do that, especially for the RNN transducer, could you post them here?

@nglehuy nglehuy added enhancement New feature or request and removed question Further information is requested labels Dec 15, 2020
@Mddct
Author

Mddct commented Dec 16, 2020 via email

@nglehuy
Collaborator

nglehuy commented Dec 16, 2020

There's this, but it's for CTC; maybe we can apply it with some modification.
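The linked reference is not reproduced in this thread, so the following is only a generic sketch of how token timings are commonly read off a greedy CTC output, not the linked method. It assumes you already have the per-frame argmax token ids, that id `0` is the blank, and that each output frame covers 40 ms of audio; `ctc_token_times`, `blank`, and `frame_ms` are hypothetical names, not part of this repository.

```python
from typing import List, Sequence, Tuple


def ctc_token_times(
    frame_ids: Sequence[int],
    blank: int = 0,          # assumption: blank token id
    frame_ms: float = 40.0,  # assumption: audio covered by one output frame
) -> List[Tuple[int, float, float]]:
    """Collapse per-frame argmax ids into (token_id, start_ms, end_ms) spans."""
    spans: List[Tuple[int, float, float]] = []
    prev = blank
    start = 0
    for t, tok in enumerate(frame_ids):
        if tok != prev:
            # A run of identical non-blank frames just ended: emit it.
            if prev != blank:
                spans.append((prev, start * frame_ms, t * frame_ms))
            start = t
            prev = tok
    # Flush a run that lasts until the final frame.
    if prev != blank:
        spans.append((prev, start * frame_ms, len(frame_ids) * frame_ms))
    return spans


if __name__ == "__main__":
    for token_id, start_ms, end_ms in ctc_token_times([0, 0, 1, 1, 1, 0, 2, 2, 0]):
        print(token_id, f"{start_ms:.0f}-{end_ms:.0f}")
```

CTC outputs tend to be spiky, so the end times from this kind of collapse are rough, and an RNN transducer would need a different bookkeeping of which encoder frame each token was emitted at.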

@Mddct
Author

Mddct commented Dec 16, 2020

@usimarit It seems to take two steps:
1. Calculate mean_start_shift and mean_end_shift.
2. Apply the shifts to each start and end.

I will evaluate the accuracy later. But for other languages or our own corpus, we need to do forced alignment to get the time information.
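A minimal sketch of the two steps above, assuming the shifts are plain averages of (reference minus predicted) over a set of tokens whose true boundaries are known, for example from forced alignment. Only `mean_start_shift` and `mean_end_shift` come from the comment; the function names and the `(start, end)` tuple layout are hypothetical.

```python
from typing import List, Sequence, Tuple

Span = Tuple[float, float]  # (start, end) of one token, in any consistent unit


def mean_shifts(predicted: Sequence[Span], reference: Sequence[Span]) -> Tuple[float, float]:
    """Step 1: average offset between predicted and reference boundaries."""
    if not predicted or len(predicted) != len(reference):
        raise ValueError("need two non-empty, equally long span lists")
    n = len(predicted)
    mean_start_shift = sum(r[0] - p[0] for p, r in zip(predicted, reference)) / n
    mean_end_shift = sum(r[1] - p[1] for p, r in zip(predicted, reference)) / n
    return mean_start_shift, mean_end_shift


def apply_shifts(spans: Sequence[Span], mean_start_shift: float, mean_end_shift: float) -> List[Span]:
    """Step 2: add the mean shifts to every predicted start and end."""
    return [(start + mean_start_shift, end + mean_end_shift) for start, end in spans]


if __name__ == "__main__":
    # Made-up numbers, purely for illustration.
    predicted = [(100.0, 500.0), (700.0, 850.0)]
    reference = [(120.0, 560.0), (740.0, 910.0)]
    start_shift, end_shift = mean_shifts(predicted, reference)
    print(apply_shifts([(100.0, 600.0)], start_shift, end_shift))
```

A single global shift only corrects a systematic bias; it cannot fix per-token errors, which is why the accuracy still needs to be evaluated.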

@nglehuy nglehuy linked a pull request Dec 27, 2020 that will close this issue
2 participants