release/21.x: [X86] Only fold AND/ANDNP back to VSELECT if we know the predicated mask select is legal (#156663) #157047

llvmbot · 2025-09-05T08:30:47Z

Backport 86879d4

Requested by: @RKSimon

llvmbot · 2025-09-05T08:30:53Z

@phoebewang What do you think about merging this PR to the release branch?

llvmbot · 2025-09-05T08:31:23Z

@llvm/pr-subscribers-backend-x86

Author: None (llvmbot)

Changes

Backport 86879d4

Requested by: @RKSimon

Full diff: https://github.com/llvm/llvm-project/pull/157047.diff

2 Files Affected:

(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (+4)
(added) llvm/test/CodeGen/X86/pr156256.ll (+25)

diff --git a/llvm/lib/Target/X86/X86ISelLowering.cpp b/llvm/lib/Target/X86/X86ISelLowering.cpp
index 578519b1cc3c9..86877be48eca8 100644
--- a/llvm/lib/Target/X86/X86ISelLowering.cpp
+++ b/llvm/lib/Target/X86/X86ISelLowering.cpp
@@ -51775,6 +51775,8 @@ static SDValue combineAnd(SDNode *N, SelectionDAG &DAG,
     SDValue X, Y;
     EVT CondVT = VT.changeVectorElementType(MVT::i1);
     if (TLI.isTypeLegal(VT) && TLI.isTypeLegal(CondVT) &&
+        (VT.is512BitVector() || Subtarget.hasVLX()) &&
+        (VT.getScalarSizeInBits() >= 32 || Subtarget.hasBWI()) &&
         sd_match(N, m_And(m_Value(X),
                           m_OneUse(m_SExt(m_AllOf(
                               m_Value(Y), m_SpecificVT(CondVT),
@@ -55329,6 +55331,8 @@ static SDValue combineAndnp(SDNode *N, SelectionDAG &DAG,
     SDValue Src = N0.getOperand(0);
     EVT SrcVT = Src.getValueType();
     if (Src.getOpcode() == ISD::SETCC && SrcVT.getScalarType() == MVT::i1 &&
+        (VT.is512BitVector() || Subtarget.hasVLX()) &&
+        (VT.getScalarSizeInBits() >= 32 || Subtarget.hasBWI()) &&
         TLI.isTypeLegal(SrcVT) && N0.hasOneUse() && Src.hasOneUse())
       return DAG.getSelect(DL, VT, DAG.getNOT(DL, Src, SrcVT), N1,
                            getZeroVector(VT, Subtarget, DAG, DL));
diff --git a/llvm/test/CodeGen/X86/pr156256.ll b/llvm/test/CodeGen/X86/pr156256.ll
new file mode 100644
index 0000000000000..13caa6fee5878
--- /dev/null
+++ b/llvm/test/CodeGen/X86/pr156256.ll
@@ -0,0 +1,25 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 5
+; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512f,+avx512dq | FileCheck %s --check-prefix=AVX512
+; RUN: llc < %s -mtriple=x86_64-- -mattr=+avx512f,+avx512dq,+avx512vl | FileCheck %s --check-prefix=AVX512VL
+
+define <16 x i16> @PR156256(<16 x i32> %a, <16 x i32> %b) {
+; AVX512-LABEL: PR156256:
+; AVX512:       # %bb.0:
+; AVX512-NEXT:    vpcmpnleud %zmm1, %zmm0, %k0
+; AVX512-NEXT:    vpmovm2d %k0, %zmm0
+; AVX512-NEXT:    vpmovdw %zmm0, %ymm0
+; AVX512-NEXT:    vpand {{\.?LCPI[0-9]+_[0-9]+}}(%rip), %ymm0, %ymm0
+; AVX512-NEXT:    retq
+;
+; AVX512VL-LABEL: PR156256:
+; AVX512VL:       # %bb.0:
+; AVX512VL-NEXT:    vpcmpnleud %zmm1, %zmm0, %k0
+; AVX512VL-NEXT:    vpmovm2d %k0, %zmm0
+; AVX512VL-NEXT:    vpmovdw %zmm0, %ymm0
+; AVX512VL-NEXT:    vpandd {{\.?LCPI[0-9]+_[0-9]+}}(%rip){1to8}, %ymm0, %ymm0
+; AVX512VL-NEXT:    retq
+  %icmp = icmp ugt <16 x i32> %a, %b
+  %sext = sext <16 x i1> %icmp to <16 x i16>
+  %and = and <16 x i16> %sext, splat (i16 16256)
+  ret <16 x i16> %and
+}

phoebewang · 2025-09-05T08:45:23Z

@phoebewang What do you think about merging this PR to the release branch?

LGTM.

github-actions · 2025-09-09T08:35:46Z

@RKSimon (or anyone else). If you would like to add a note about this fix in the release notes (completely optional). Please reply to this comment with a one or two sentence description of the fix. When you are done, please add the release:note label to this PR.

…ask select is legal (llvm#156663) By only checking type legality we didn't account for 128/256-bit ops being run on non-AVX512VL targets, or vXi8/i16 ops being run on non-AVX512BW targets This check is cropping up in several places now and I intend to hoist it out into a common helper, but this initial fix needs to be as clean as possible to be back ported to 21.X Fixes llvm#156256 (cherry picked from commit 86879d4)

llvmbot added this to the LLVM 21.x Release milestone Sep 5, 2025

github-project-automation bot added this to LLVM Release Status Sep 5, 2025

github-project-automation bot moved this to Needs Triage in LLVM Release Status Sep 5, 2025

llvmbot requested a review from phoebewang September 5, 2025 08:30

llvmbot added the backend:X86 label Sep 5, 2025

llvmbot mentioned this pull request Sep 5, 2025

LLVM 21.1.0 - ICE on eigen's basicstuff.cpp #156256

Closed

nikic moved this from Needs Triage to Needs Merge in LLVM Release Status Sep 8, 2025

tru force-pushed the issue156256 branch from 5e1bfc8 to 81d3b6e Compare September 9, 2025 08:35

tru merged commit 81d3b6e into llvm:release/21.x Sep 9, 2025
1 check was pending

github-project-automation bot moved this from Needs Merge to Done in LLVM Release Status Sep 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

release/21.x: [X86] Only fold AND/ANDNP back to VSELECT if we know the predicated mask select is legal (#156663) #157047

release/21.x: [X86] Only fold AND/ANDNP back to VSELECT if we know the predicated mask select is legal (#156663) #157047

Uh oh!

llvmbot commented Sep 5, 2025

Uh oh!

llvmbot commented Sep 5, 2025

Uh oh!

llvmbot commented Sep 5, 2025

Uh oh!

phoebewang commented Sep 5, 2025

Uh oh!

Uh oh!

github-actions bot commented Sep 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

release/21.x: [X86] Only fold AND/ANDNP back to VSELECT if we know the predicated mask select is legal (#156663) #157047

release/21.x: [X86] Only fold AND/ANDNP back to VSELECT if we know the predicated mask select is legal (#156663) #157047

Uh oh!

Conversation

llvmbot commented Sep 5, 2025

Uh oh!

llvmbot commented Sep 5, 2025

Uh oh!

llvmbot commented Sep 5, 2025

Uh oh!

phoebewang commented Sep 5, 2025

Uh oh!

Uh oh!

github-actions bot commented Sep 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants