[XLA:GPU] Disable cuDNN FMHA by default.
cuDNN FMHA dispatches pattern-matched regions to a FlashAttention kernel by default. FlashAttention does not preserve numerics, and is thus an illegal optimization to have on by default.

PiperOrigin-RevId: 631384623
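A rough illustration of the numerics concern, not taken from XLA or cuDNN: the pure-Python sketch below compares a one-pass softmax-weighted sum with a blockwise "online softmax" accumulation in the style FlashAttention is usually described as using. The function names, block size, and input data are hypothetical and chosen only for this example. The two results are mathematically equal, but the different order of floating-point operations means they are not guaranteed to match bit for bit.

```python
import math
import random


def naive_attention_row(scores, values):
    """Reference: softmax(scores) dotted with values, computed over the full row."""
    row_max = max(scores)
    exps = [math.exp(s - row_max) for s in scores]
    denom = sum(exps)
    return sum(e * v for e, v in zip(exps, values)) / denom


def blockwise_attention_row(scores, values, block_size=4):
    """Online softmax: walk the row in blocks, rescaling the running sums
    whenever a new block raises the running maximum."""
    running_max = float("-inf")
    running_denom = 0.0
    running_out = 0.0
    for start in range(0, len(scores), block_size):
        s_blk = scores[start:start + block_size]
        v_blk = values[start:start + block_size]
        new_max = max(running_max, max(s_blk))
        # Rescale previous partial sums to the new maximum.
        # On the first block this is exp(-inf) == 0.0.
        scale = math.exp(running_max - new_max)
        exps = [math.exp(s - new_max) for s in s_blk]
        running_denom = running_denom * scale + sum(exps)
        running_out = running_out * scale + sum(e * v for e, v in zip(exps, v_blk))
        running_max = new_max
    return running_out / running_denom


# Hypothetical input data with a fixed seed so the run is repeatable.
rng = random.Random(0)
scores = [rng.uniform(-5.0, 5.0) for _ in range(64)]
values = [rng.uniform(-5.0, 5.0) for _ in range(64)]

ref = naive_attention_row(scores, values)
blk = blockwise_attention_row(scores, values)

print("reference:", repr(ref))
print("blockwise:", repr(blk))
print("abs diff: ", abs(ref - blk))
```

Any difference here is at rounding-error scale in double precision; with the lower-precision types typically used on GPUs the gap can be larger, which is the kind of change an optimization enabled by default should not introduce silently.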