about the feature extractor architecture #36

Closed
fransisca25 opened this issue May 8, 2023 · 6 comments

@fransisca25

Hi, I am sorry to ask a question here, but this is important for my research. I would like to know more about the delta t in the frame-wise features. Neighbouring delta t windows overlap. Could you explain how large the overlap between them is? Or could you point me to the code that implements delta t, so I can check it? Thank you.
Screenshot from 2023-05-08 14-49-30

@ycmin95
Collaborator

ycmin95 commented May 8, 2023

Hi, thanks for your interest in our work, and please feel free to post any questions here.
The delta t here is implemented by conv1d along the temporal dimension with stride=1, and the implementation can be found here. The ablation results on the length of delta t can be found in Table 4 of the main paper.
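
As a minimal sketch of why neighbouring delta t windows overlap, the snippet below lists the frame indices covered by each output step of a single temporal convolution with stride 1. The function name `window_frames`, the kernel size of 5, and the 8-frame input are illustrative assumptions and are not taken from the repository; padding is ignored.

```python
def window_frames(num_frames, kernel_size, stride=1):
    """Return the frame indices seen by each output step of an unpadded 1D convolution."""
    windows = []
    start = 0
    while start + kernel_size <= num_frames:
        windows.append(list(range(start, start + kernel_size)))
        start += stride
    return windows


# Hypothetical example: 8 input frames, kernel size 5, stride 1
windows = window_frames(num_frames=8, kernel_size=5)
for t, frames in enumerate(windows):
    print(t, frames)

# Frames shared by two neighbouring output steps
shared = set(windows[0]) & set(windows[1])
print(len(shared))
```

With stride 1, two neighbouring outputs of a kernel of size K share K - 1 input frames, which is the overlap visible in the printed windows.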

Hope this helps~

@fransisca25
Author

This really helps me. Thank you so much for answering my question!

@atonyo11

Hi @ycmin95, @fransisca25, could you explain how to calculate the delta t, for example for C5-P2-C5-P2? Thank you in advance!

@ycmin95
Collaborator

ycmin95 commented Feb 25, 2024

Hi @atonyo11 ,
The equation for the receptive field calculation is RF_i = (RF_{i-1} - 1) * stride + Ksize, applied from the last layer back to the first, starting from RF_0 = 1.

For C5-P2-C5-P2:

-P2: (1-1)*2+2=2
-C5-P2: (2-1)*1+5=6
-P2-C5-P2: (6-1)*2+2=12
C5-P2-C5-P2: (12-1)*1+5=16
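
The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names `parse_layers`, `receptive_field`, and `output_stride` are hypothetical, `Ck` is read as a convolution with kernel size k and stride 1, and `Pk` as a pooling layer with kernel size k and stride k, which matches the strides used in the worked example. The overlap printed at the end is a derived quantity that ignores padding and sequence boundaries.

```python
def parse_layers(spec):
    """Parse a spec such as 'C5-P2-C5-P2' into (kernel_size, stride) pairs.

    Assumption: 'Ck' is a conv with kernel k and stride 1,
    'Pk' is a pooling layer with kernel k and stride k.
    """
    layers = []
    for token in spec.split("-"):
        kind, size = token[0], int(token[1:])
        if kind == "C":
            layers.append((size, 1))
        elif kind == "P":
            layers.append((size, size))
        else:
            raise ValueError(f"unknown layer token: {token}")
    return layers


def receptive_field(layers):
    """Apply RF = (RF - 1) * stride + kernel_size from the last layer back to the first."""
    rf = 1
    for kernel_size, stride in reversed(layers):
        rf = (rf - 1) * stride + kernel_size
    return rf


def output_stride(layers):
    """Number of input frames between the starts of two neighbouring output steps."""
    total = 1
    for _, stride in layers:
        total *= stride
    return total


layers = parse_layers("C5-P2-C5-P2")
rf = receptive_field(layers)
step = output_stride(layers)
print(rf)                 # receptive field in input frames
print(step)               # frames between neighbouring outputs
print(max(rf - step, 0))  # frames shared by neighbouring outputs
```

Under these assumptions, each output of C5-P2-C5-P2 covers 16 input frames, neighbouring outputs start 4 frames apart, and they therefore share 12 frames.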

Hope this can help you understand~

@atonyo11

@ycmin95 oh, I got it. Thank you very much!

@atonyo11

@ycmin95 I understand that the delta t is the receptive field. In the context of your paper, does that mean the delta t is a number of frames?
