Conversation

SageAttention can't be used for training.

Hmm, good point, but it does seem to be training the LoRA: the weights are present and appear to hold real learned values. I will dig a little deeper to see what is actually going on; maybe it is just keeping the initialized weights or something. SpargeAttn might require a significant autotune pass to produce a sparse attention model (see thu-ml/SpargeAttn#6). I was initially exploring adding SpargeAttn but ran into that issue.

OK, it seems the attention parameters are not being adjusted, but the other parameters (like the MLP / feed-forward layers) are updated. I think keeping SageAttention could be fine, as long as we note that it is not meant for training. The backward-pass workarounds that were applied for FlashAttention might make it work in the future.
Requires https://github.com/thu-ml/SageAttention
Actual numbers may differ, since this was not a full training run to estimate from.
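One way to sanity-check the observation above (attention weights staying at initialization while MLP weights update) is to inspect `.grad` on each named parameter after a backward pass. The toy model below is a hypothetical sketch, not the actual trainer: it uses `detach()` to mimic a forward-only attention kernel (like SageAttention, which has no backward), and the module names are made up for illustration.

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """Toy block: a stand-in 'attention' projection plus an MLP layer."""

    def __init__(self, d: int = 8):
        super().__init__()
        self.attn_proj = nn.Linear(d, d)  # hypothetical attention weights
        self.mlp = nn.Linear(d, d)        # hypothetical feed-forward weights

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # detach() simulates a forward-only attention kernel:
        # no gradient flows back into attn_proj's parameters.
        a = self.attn_proj(x).detach()
        return self.mlp(a)

model = Block()
loss = model(torch.randn(4, 8)).sum()
loss.backward()

# Report which parameters actually received gradients this step.
for name, p in model.named_parameters():
    status = "got gradient" if p.grad is not None else "no gradient"
    print(f"{name}: {status}")
```

Running a check like this against the real LoRA modules would show whether the attention-side adapters are silently frozen (gradients `None` or all-zero) while the MLP-side adapters train normally.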