Allocate dQ, dK, and dV as a catted tensor to save a downstream cat in nvFuser. #43
Annotations
1 warning
auto-cc
Node.js 16 actions are deprecated. Please update the following actions to use Node.js 20: Lightning-AI/probot@v5. For more information see: https://github.blog/changelog/2023-09-22-github-actions-transitioning-from-node-16-to-node-20/.
|