You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
However, when I use the features provided by this repo,
some video and audio features are misaligned in their context length.
Example is attached below. It is described in the order of "vid, video shape, audio shape".
B3yOejNbNks_210.0_360.0 torch.Size([71, 2816]) torch.Size([70, 2048])
Can you provide how to align these features?
Thank you.
Best regards
The text was updated successfully, but these errors were encountered:
Thanks for your interest in our work. We directly crop the features using the shorter length. For example, in your case, the last row of video features is discarded, so that both video and audio features have 70 vectors.
THank you for the great work.
However, when I use the features provided by this repo,
some video and audio features are misaligned in their context length.
Example is attached below. It is described in the order of "vid, video shape, audio shape".
B3yOejNbNks_210.0_360.0 torch.Size([71, 2816]) torch.Size([70, 2048])
Can you provide how to align these features?
Thank you.
Best regards
The text was updated successfully, but these errors were encountered: