You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
That does look like the RMSNorm computation, yes. However, RMSNorm was only added to PyTorch in 2.3 or 2.4 I think, so we need to do it manually until our minimum supported torch version catches up!
https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama4/modeling_llama4.py#L118
This is just the rms norm?
The text was updated successfully, but these errors were encountered: