New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some questions about your code #7
Comments
Hi, |
But Figure 2 shows that the input of the decoder part should be the output of the CA module? |
Hi, if the CA still preserve the features from SA, it will have two outputs, one for Decoder and one for Aux head. Anyway, they are equal. |
Hi, in your code, I don't see the CA output for Decoder, only for Aux head. And the CA module is not used at test time. |
Hi, kindly mentioned again, there are equal implementation and gain equal performance, we object it as an aux head just for better design for the overall framework. You can stack them in series if need, meanwhile, as an aux head, it will not cause any computation during testing. |
My apologies. Sec. 3.2 mentions that:
Thanks for your time. |
Hi, the encoded tokens T^{K_s} is the output of SA modules, except as the input of decoder head, it's also the input of CA module (see Figure 3(b)). |
Yes, I missed that sentence and misunderstood Figure 2. Sorry again. |
Hi, thank you for your work. But I've noticed some inconsistencies with your paper:
But maybe I misunderstood something? Look forward to your reply.
The text was updated successfully, but these errors were encountered: