Merge internal changes #682
Allow for kv-dim to be different from q-dim in encoder-decoder attention
Note that this matches the behavior of IndexedCachedDataset: https://github.com/fairinternal/fairseq-py/blob/63273e26eaac607b645c7d4ec4c24706b9be39c9/fairseq/data/indexed_dataset.py#L145
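To illustrate the encoder-decoder attention change listed above (kv-dim different from q-dim), here is a minimal single-head sketch in plain Python. It is not fairseq's implementation: the function names, the dimensions, and the random weights are all hypothetical, chosen only to show that queries and keys/values can start in different input sizes as long as each is projected into a shared attention dimension.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def rand_matrix(rows, cols, rng):
    """Hypothetical helper: a rows x cols matrix of uniform random weights."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def cross_attention(query, memory, w_q, w_k, w_v):
    """Single-head scaled dot-product attention.

    query:  (tgt_len, q_dim)  decoder states
    memory: (src_len, kv_dim) encoder states; kv_dim may differ from q_dim
    w_q:    (q_dim, embed_dim)
    w_k:    (kv_dim, embed_dim)
    w_v:    (kv_dim, embed_dim)
    """
    q = matmul(query, w_q)    # (tgt_len, embed_dim)
    k = matmul(memory, w_k)   # (src_len, embed_dim)
    v = matmul(memory, w_v)   # (src_len, embed_dim)
    scale = 1.0 / math.sqrt(len(q[0]))
    k_t = [list(col) for col in zip(*k)]          # (embed_dim, src_len)
    scores = matmul(q, k_t)                       # (tgt_len, src_len)
    weights = [softmax([s * scale for s in row]) for row in scores]
    return matmul(weights, v), weights


# Assumed sizes for illustration only.
rng = random.Random(0)
tgt_len, src_len = 3, 7
q_dim, kv_dim, embed_dim = 4, 6, 5

query = rand_matrix(tgt_len, q_dim, rng)
memory = rand_matrix(src_len, kv_dim, rng)
w_q = rand_matrix(q_dim, embed_dim, rng)
w_k = rand_matrix(kv_dim, embed_dim, rng)
w_v = rand_matrix(kv_dim, embed_dim, rng)

out, weights = cross_attention(query, memory, w_q, w_k, w_v)
```

The only constraint the sketch relies on is that the key and value projections take `kv_dim` inputs while the query projection takes `q_dim` inputs; after projection, everything lives in `embed_dim`.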
@myleott has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: Pull Request resolved: facebookresearch/fairseq#682
Differential Revision: D15147735
Pulled By: myleott
fbshipit-source-id: 4a5f12c0b24591f964fe1f465be3775a67578e79
No description provided.