Try Multi-Query Attention from PaLM #30
Labels
difficulty/easy
Easy issue to tackle, may take a couple hours
project/model
Related to modeling decisions and implementations
severity/should
Something that should be implemented/fixed
No description provided.