Skip to content
Discussion options

You must be logged in to vote

@jongwook I hope the ping is ok

Forget what I asked above, this is the right question:
Given the for loop is at position 2s of a 5s wav file, it takes all 5s audio_features for prediction at any point, right? So, does predicting the token at timepoint 2s also take future audio_features into consideration?
What would happen if the decoder only had access to the current audio_features?

Replies: 2 comments 11 replies

Comment options

You must be logged in to vote
8 replies
@SinanAkkoyun
Comment options

@atyshka
Comment options

@SinanAkkoyun
Comment options

@SinanAkkoyun
Comment options

@atyshka
Comment options

Answer selected by jongwook
Comment options

You must be logged in to vote
3 replies
@jongwook
Comment options

@SinanAkkoyun
Comment options

@jongwook
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants