You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When both inputs are bf16, the generated graph converts to f32, performs the multiply, then converts back to bf16. Two questions:
In this case, is the op math type effectively f32 (not bf16)?
If this upcast exists primarily for TPU accuracy/stability, would it be acceptable to gate it behind a flag (e.g., env option) so we can treat that path as a no-op and keep the op in native bf16 when desired?