Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding Lightning Attention 2 Support #13

Open
James4Ever0 opened this issue Jan 28, 2024 · 0 comments
Open

Adding Lightning Attention 2 Support #13

James4Ever0 opened this issue Jan 28, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@James4Ever0
Copy link

James4Ever0 commented Jan 28, 2024

馃殌 Feature Request

Using newly proposed model architecture Lightning Attention 2 to increase context size and inference speed.

Motivation

Looks promising and easy to implement, only requires Triton and NVIDIA GPU.

Upvote & Fund

  • We're using Polar.sh so you can upvote and help fund this issue.
  • We receive the funding once the issue is completed & confirmed by you.
  • Thank you in advance for helping prioritize & fund our backlog.
Fund with Polar
@James4Ever0 James4Ever0 added the enhancement New feature or request label Jan 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant