Skip to content

fix: correct Gemma3 rope settings and vram limit propagation#1583

Merged
leejet merged 1 commit into
masterfrom
fix/llm-rope-vram-limit
May 30, 2026
Merged

fix: correct Gemma3 rope settings and vram limit propagation#1583
leejet merged 1 commit into
masterfrom
fix/llm-rope-vram-limit

Conversation

@leejet
Copy link
Copy Markdown
Owner

@leejet leejet commented May 30, 2026

Summary

  • Update Gemma3 12B LLM RoPE handling to use NeoX RoPE with a 131072 context value for query and key tensors.
  • Forward max graph VRAM limits through the LTXAV embedder to both the LLM and projection components.

Related Issue / Discussion

N/A

Additional Information

N/A

Checklist

@leejet leejet merged commit d2797b8 into master May 30, 2026
14 checks passed
wbruna pushed a commit to wbruna/stable-diffusion.cpp that referenced this pull request May 30, 2026
@leejet leejet deleted the fix/llm-rope-vram-limit branch May 31, 2026 17:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant