Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions in the paper #3

Closed
liqiokkk opened this issue May 8, 2021 · 1 comment
Closed

Some questions in the paper #3

liqiokkk opened this issue May 8, 2021 · 1 comment

Comments

@liqiokkk
Copy link

liqiokkk commented May 8, 2021

Hi, Author.
I want to know are EV and EV(hat) equivalent or approximate in the paper?
Are EV(hat)^l in the second half of formula 7 and formula 8 equivalent or approximate?
Thank you, looking forward to your answer.

@xwjabc
Copy link
Contributor

xwjabc commented May 8, 2021

Hi @liqi520, thank you for your interest in our work!
(1) \hat{EV} is an approximation of EV by considering each channel in the query, key and value vectors as internal heads.
(2) \hat{EV}^l in equation (7) and (8) are equivalent.
Hope it helps!

@xwjabc xwjabc closed this as completed Oct 8, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants