Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue about alignment between label and frames. #4

Closed
hulianyuyy opened this issue Nov 1, 2021 · 4 comments
Closed

Issue about alignment between label and frames. #4

hulianyuyy opened this issue Nov 1, 2021 · 4 comments

Comments

@hulianyuyy
Copy link

Thanks for your great job. I'm wondering how to draw a picture like Fig.5 in your paper. The key point lies in how to align labels with frames. Could you provide some advice? Thanks in advance!

@ycmin95
Copy link
Collaborator

ycmin95 commented Nov 2, 2021

@hulianyuyy
There are two widely used ways to align the labels with features, the greedy alignment (selecting the most likely gloss at each step) and the dominant alignment (DNF, TMM'19). We adopt the greedy alignment for simplicity and show the network predictions.
The alignment between features and frames is simply based on its temporal receptive field, for example, the temporal receptive field of Subgloss-wise conv1d (C5-P2) is 6, so the logit z_t is corresponding to frames f_[t2, t2+6] (no padding setting).

@hulianyuyy
Copy link
Author

A question that confused me is how to find the ground truth label of frames (as marked in Fig.5 in your paper). As far as we know, the Phoenix Dataset only provide labels in video domain but not precisely assigned to frames.

@ycmin95
Copy link
Collaborator

ycmin95 commented Nov 2, 2021

@hulianyuyy
We mannually labelled several sequences for visualization. (This website may be helpful for Phoenix).

@hulianyuyy
Copy link
Author

Many thanks for you kind reply.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants