lchu6 commented Jul 13, 2024

It seems we are currently not repeating the KV heads when GQA (grouped-query attention) is used, so K and V reach SDPA with fewer heads than Q.

This PR fixes the issue by repeating the KV heads before passing K and V to SDPA.
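
For readers unfamiliar with the fix, here is a minimal sketch of what "repeat KV before SDPA" can look like in PyTorch. It is not the diff from this PR: the helper names `repeat_kv` and `gqa_sdpa` and the `(batch, heads, seqlen, head_dim)` layout are assumptions made for illustration only.

```python
import torch
import torch.nn.functional as F


def repeat_kv(kv: torch.Tensor, n_rep: int) -> torch.Tensor:
    """Repeat each KV head n_rep times along the head dimension.

    Hypothetical helper. Assumes kv has shape
    (batch, num_kv_heads, seqlen, head_dim).
    """
    if n_rep == 1:
        return kv
    # repeat_interleave yields head order [0, 0, ..., 1, 1, ...], so
    # consecutive query heads in a group share the same KV head.
    return torch.repeat_interleave(kv, repeats=n_rep, dim=1)


def gqa_sdpa(q: torch.Tensor, k: torch.Tensor, v: torch.Tensor,
             causal: bool = True) -> torch.Tensor:
    """Hypothetical wrapper: expand K/V to match Q's head count, then call SDPA.

    q: (batch, num_heads, seqlen, head_dim)
    k, v: (batch, num_kv_heads, seqlen, head_dim)
    """
    num_heads, num_kv_heads = q.shape[1], k.shape[1]
    assert num_heads % num_kv_heads == 0, "num_heads must be a multiple of num_kv_heads"
    n_rep = num_heads // num_kv_heads
    k = repeat_kv(k, n_rep)
    v = repeat_kv(v, n_rep)
    return F.scaled_dot_product_attention(q, k, v, is_causal=causal)
```

With 8 query heads and 2 KV heads, `n_rep` is 4, so query heads 0 to 3 attend against KV head 0 and query heads 4 to 7 against KV head 1. The repeat materializes extra copies of K and V, which trades memory for compatibility with an SDPA call that expects matching head counts.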

lchu6 commented Jul 13, 2024

@tridao

tridao merged commit 014c094 into state-spaces:main on Jul 16, 2024