You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! Thank you for you code! But I am confused that the input of NAR model is the discrete tokens in the paper while in this repo the input of NAR model are the hidden state of AR model. Besides, the gradients will backward from the NAR model to the AR model. Have I misunderstand it?
The text was updated successfully, but these errors were encountered:
Hi! Thank you for you code! But I am confused that the input of NAR model is the discrete tokens in the paper while in this repo the input of NAR model are the hidden state of AR model. Besides, the gradients will backward from the NAR model to the AR model. Have I misunderstand it?
The text was updated successfully, but these errors were encountered: