Skip to content

Fix incorrect cast from BF16 to FP32 in SBGEMM#5712

Merged
martin-frbg merged 1 commit intoOpenMathLib:developfrom
murste01:develop
Mar 27, 2026
Merged

Fix incorrect cast from BF16 to FP32 in SBGEMM#5712
martin-frbg merged 1 commit intoOpenMathLib:developfrom
murste01:develop

Conversation

@murste01
Copy link
Copy Markdown
Contributor

This change fixes a regression in SBGEMM where C is assumed to be BF16, and so unconditionally attempts to cast elements of C from BF16 to FP32 resulting in incorrect outputs when beta=1.

This change fixes a regression in SBGEMM where C is assumed to be BF16,
and so unconditionally casts the output to FP32 resulting in incorrect
outputs when beta=1.
@martin-frbg martin-frbg added this to the 0.3.33 milestone Mar 26, 2026
@martin-frbg martin-frbg merged commit 3c188e4 into OpenMathLib:develop Mar 27, 2026
99 of 102 checks passed
@aditew01
Copy link
Copy Markdown
Contributor

@martin-frbg is there a way we can get a patch release for OpenBLAS?
The current bug in 0.3.32 blocks OpenBLAS upgrade in PyTorch, because it causes a unit test failure.

@martin-frbg
Copy link
Copy Markdown
Collaborator

Yes, I'll try to do 0.3.33 this weekend - though the spurious utest failure bothers me a lot less than the weird DDOT bug that my workaround for the weird Neoverse SDOT bug introduced. Coincidentally the Reference-LAPACK team is committing a flurry of fixes for their upcoming 3.13, so it looks like a good chance to import those as well.

@aditew01
Copy link
Copy Markdown
Contributor

that'll be much appreciated! 🙏

@ChipKerchner
Copy link
Copy Markdown
Contributor

I think this is what I've been pointing out for a while now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants