Ensure FMA optimizations kick in under embedded broadcast #116891

tannergooding · 2025-06-21T20:01:54Z

Follow up to #116804. The logic is correct, but it missed a case where embedded broadcast containment would interfere with the FMA optimization and cause it to be skipped entirely.

dotnet-policy-service · 2025-06-21T20:02:53Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

Copilot

Pull Request Overview

This PR ensures that FMA (Fused Multiply-Add/Subtract) optimizations are correctly applied even when constant vectors are candidates for embedded broadcast. It refactors the intrinsic classification and refines the containment logic to account for negative-zero broadcasts interfering with FMA.

Unified handling of AVX2/AVX512 FMA intrinsics in LowerFusedMultiplyOp
Updated containment logic in ContainCheckHWIntrinsic to skip embedded broadcasts when better FMA transformations exist
Added a new predicate OperIsVectorFusedMultiplyOp to identify FMA intrinsics

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File	Description
src/coreclr/jit/lowerxarch.cpp	Refactored FMA case switches, adjusted operand indexing, and enhanced embedded-broadcast checks
src/coreclr/jit/gentree.h	Declared `OperIsVectorFusedMultiplyOp` to flag FMA intrinsics
src/coreclr/jit/gentree.cpp	Defined `GenTree::OperIsVectorFusedMultiplyOp()` with documentation and intrinsic ID checks

Comments suppressed due to low confidence (2)

src/coreclr/jit/lowerxarch.cpp:9881

New logic skips embedded-broadcast folding for FMA when negative-zero constants are involved. Add targeted unit tests to cover both the ordinary embedded-broadcast path and the skipped path to prevent regressions.

                            containedOperand->IsCnsVec() && node->isEmbeddedBroadcastCompatibleHWIntrinsic(comp);

src/coreclr/jit/lowerxarch.cpp:1509

The operand index was changed from Op(1) to Op(2). Verify that this matches the intended third operand of the FMA node and doesn't introduce an off-by-one reference.

                GenTree* argOp = hwArg->Op(2);

src/coreclr/jit/lowerxarch.cpp

tannergooding · 2025-06-22T00:01:44Z

CC. @dotnet/jit-contrib, small improvement that with some diffs in common math helpers ensuring we can do the valid FMA transformations when embedded broadcast exists

kunalspathak

LGTM

Ensure FMA optimizations kick in under embedded broadcast

66547d6

Copilot AI review requested due to automatic review settings June 21, 2025 20:01

github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jun 21, 2025

dotnet-policy-service bot assigned tannergooding Jun 21, 2025

Copilot AI reviewed Jun 21, 2025

View reviewed changes

src/coreclr/jit/lowerxarch.cpp Show resolved Hide resolved

kunalspathak approved these changes Jun 23, 2025

View reviewed changes

tannergooding merged commit 383f9af into dotnet:main Jun 23, 2025
110 checks passed

tannergooding deleted the fma-emb-broadcast branch June 23, 2025 17:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Ensure FMA optimizations kick in under embedded broadcast #116891

Ensure FMA optimizations kick in under embedded broadcast #116891

Uh oh!

tannergooding commented Jun 21, 2025

Uh oh!

dotnet-policy-service bot commented Jun 21, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

tannergooding commented Jun 22, 2025

Uh oh!

kunalspathak left a comment

Uh oh!

Uh oh!

Uh oh!

Ensure FMA optimizations kick in under embedded broadcast #116891

Ensure FMA optimizations kick in under embedded broadcast #116891

Uh oh!

Conversation

tannergooding commented Jun 21, 2025

Uh oh!

dotnet-policy-service bot commented Jun 21, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

tannergooding commented Jun 22, 2025

Uh oh!

kunalspathak left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!