You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I wonder how you get the feature map variances. According to my understanding, you first need to extract representations of all the samples, which should give us a vector with a length of D (let's just fatten the 2d tensor or concatenate all tokens). Then you calculate the variance of each element in this vector over all the samples, which should give us D variances. Finally, you take the mean value of all D variances and get the variance ready to report.
Did I get you correctly? Sorry if I didn't catch up with your existing documentation or description.
Thank you and I'm looking forward to your reply.
Best,
The text was updated successfully, but these errors were encountered:
Yes, you explained correctly. I calculated the variance of the tokens per channel and then averaged them.
I will release the code after re-implementing it. I can't give a definite timeline, but I’ll try hard to release the whole code for the feature map variance by this Friday!
hello,
Thank you for your great work!
I wonder how you get the feature map variances. According to my understanding, you first need to extract representations of all the samples, which should give us a vector with a length of D (let's just fatten the 2d tensor or concatenate all tokens). Then you calculate the variance of each element in this vector over all the samples, which should give us D variances. Finally, you take the mean value of all D variances and get the variance ready to report.
Did I get you correctly? Sorry if I didn't catch up with your existing documentation or description.
Thank you and I'm looking forward to your reply.
Best,
The text was updated successfully, but these errors were encountered: