[LLVM][CodeGen][SVE] Refactor isel of 128-bit constant splats. by paulwalker-arm · Pull Request #185652 · llvm/llvm-project

paulwalker-arm · 2026-03-10T13:44:28Z

Rather than lower constant splats that only SVE supports to scalable vectors this patch maintains the use of fixed length vectors but adds isel patterns to select the necessary SVE instructions.

Doing this means we can extend coverage to include SVE operations that take an immediate operand without needing to convert more of the DAG to scalable vectors, which can potentially prevent larger NEON patterns from matching.

Rather than lower constant splats that only SVE supports to scalable vectors this patch maintains the use of fixed length vectors but adds isel patterns to select the necessary SVE instructions. Doing this means we can extend coverage to include SVE operations that take an immediate operand without needing to convert more of the DAG to scalable vectors, which can potentially prevent larger NEON patterns from matching.

llvmbot · 2026-03-10T13:45:03Z

@llvm/pr-subscribers-backend-aarch64

Author: Paul Walker (paulwalker-arm)

Changes

Rather than lower constant splats that only SVE supports to scalable vectors this patch maintains the use of fixed length vectors but adds isel patterns to select the necessary SVE instructions.

Doing this means we can extend coverage to include SVE operations that take an immediate operand without needing to convert more of the DAG to scalable vectors, which can potentially prevent larger NEON patterns from matching.

Full diff: https://github.com/llvm/llvm-project/pull/185652.diff

3 Files Affected:

(modified) llvm/lib/Target/AArch64/AArch64ISelLowering.cpp (+3-4)
(modified) llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td (+3)
(modified) llvm/lib/Target/AArch64/SVEInstrFormats.td (+3)

diff --git a/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp b/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
index cd9de6c729649..20c4ff566defc 100644
--- a/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
+++ b/llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
@@ -16024,10 +16024,9 @@ static SDValue trySVESplat64(SDValue Op, SelectionDAG &DAG,
     return SDValue();
 
   SDLoc DL(Op);
-  SDValue SplatVal = DAG.getSplatVector(MVT::nxv2i64, DL,
-                                        DAG.getConstant(Val64, DL, MVT::i64));
-  SDValue Res = convertFromScalableVector(DAG, MVT::v2i64, SplatVal);
-  return DAG.getNode(AArch64ISD::NVCAST, DL, VT, Res);
+  SDValue SplatVal = DAG.getNode(AArch64ISD::DUP, DL, MVT::v2i64,
+                                 DAG.getConstant(Val64, DL, MVT::i64));
+  return DAG.getNode(AArch64ISD::NVCAST, DL, VT, SplatVal);
 }
 
 static SDValue ConstantBuildVector(SDValue Op, SelectionDAG &DAG,
diff --git a/llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td b/llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
index 273249b9ff44c..fab3dd41e89a1 100644
--- a/llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
+++ b/llvm/lib/Target/AArch64/AArch64SVEInstrInfo.td
@@ -982,6 +982,9 @@ let Predicates = [HasSVE_or_SME] in {
   def : Pat<(nxv2i64 (splat_vector (i64 (SVECpyDupImm64Pat i32:$a, i32:$b)))),
             (DUP_ZI_D $a, $b)>;
 
+  def : Pat<(v2i64 (AArch64dup (i64 (SVECpyDupImm64Pat i32:$a, i32:$b)))),
+            (EXTRACT_SUBREG (DUP_ZI_D $a, $b), zsub)>;
+
   // Duplicate immediate FP into all vector elements.
   def : Pat<(nxv2f16 (splat_vector (f16 fpimm:$val))),
             (DUP_ZR_H (MOVi32imm (bitcast_fpimm_to_i32 f16:$val)))>;
diff --git a/llvm/lib/Target/AArch64/SVEInstrFormats.td b/llvm/lib/Target/AArch64/SVEInstrFormats.td
index 9d988b5654a6e..af1c8676a99ce 100644
--- a/llvm/lib/Target/AArch64/SVEInstrFormats.td
+++ b/llvm/lib/Target/AArch64/SVEInstrFormats.td
@@ -2176,6 +2176,9 @@ multiclass sve_int_dup_mask_imm<string asm> {
             (!cast<Instruction>(NAME) i64:$imm)>;
   def : Pat<(nxv2bf16 (splat_vector (bf16 (SVELogicalBFPImmPat i64:$imm)))),
             (!cast<Instruction>(NAME) i64:$imm)>;
+
+  def : Pat<(v2i64 (AArch64dup (i64 (SVELogicalImm64Pat i64:$imm)))),
+            (EXTRACT_SUBREG (!cast<Instruction>(NAME) i64:$imm), zsub)>;
 }
 
 //===----------------------------------------------------------------------===//

huntergr-arm · 2026-03-10T14:08:32Z

Looks ok overall.

Doing this means we can extend coverage to include SVE operations that take an immediate operand without needing to convert more of the DAG to scalable vectors, which can potentially prevent larger NEON patterns from matching.

Does this mean you expect this PR to enable matching patterns for the immediate form which didn't before? If so, could you please add a test if it's not too convoluted to do so?

paulwalker-arm · 2026-03-10T14:15:40Z

Doing this means we can extend coverage to include SVE operations that take an immediate operand without needing to convert more of the DAG to scalable vectors, which can potentially prevent larger NEON patterns from matching.

Does this mean you expect this PR to enable matching patterns for the immediate form which didn't before? If so, could you please add a test if it's not too convoluted to do so?

No, this patch is pretty much NFC.

david-arm · 2026-03-10T14:19:44Z

-                                        DAG.getConstant(Val64, DL, MVT::i64));
-  SDValue Res = convertFromScalableVector(DAG, MVT::v2i64, SplatVal);
-  return DAG.getNode(AArch64ISD::NVCAST, DL, VT, Res);
+  SDValue SplatVal = DAG.getNode(AArch64ISD::DUP, DL, MVT::v2i64,


Is it possible to write a test that shows the benefit?

I don't believe so. This is purely refactoring at this stage.

david-arm

LGTM!

MacDue

FYI: This looks like it regresses some MUL_ZI patterns.

E.g.,

define <2 x i64> @mul_v2i64(<2 x i64> %a) {
entry:
  %mul = mul <2 x i64> %a, splat (i64 123)
  ret <2 x i64> %mul
}

Used to lower to the immediate form, but it now results in:

	mov	z1.d, #123                  
	ptrue	p0.d, vl2
	mul	z0.d, p0/m, z0.d, z1.d

It looks like the reason is the SVE_1_Op_Imm_Arith_Any_Predicate pattern only expects a splat_vector. Noticed while rebasing #165559.

llvm#185652)" This reverts commit 9ba92ff.

…s. (llvm#185652)" This reverts commit 9940c6c.

paulwalker-arm · 2026-03-12T11:14:11Z

Thanks @MacDue. #186090 adds test coverage and a fix.

…185652) Rather than lower constant splats that only SVE supports to scalable vectors this patch maintains the use of fixed length vectors but adds isel patterns to select the necessary SVE instructions. Doing this means we can extend coverage to include SVE operations that take an immediate operand without needing to convert more of the DAG to scalable vectors, which can potentially prevent larger NEON patterns from matching.

paulwalker-arm requested review from david-arm and huntergr-arm March 10, 2026 13:44

llvmbot added the backend:AArch64 label Mar 10, 2026

david-arm reviewed Mar 10, 2026

View reviewed changes

huntergr-arm approved these changes Mar 10, 2026

View reviewed changes

david-arm approved these changes Mar 11, 2026

View reviewed changes

paulwalker-arm merged commit 9ba92ff into llvm:main Mar 11, 2026
12 checks passed

paulwalker-arm deleted the sve-neon-sized-constants branch March 11, 2026 11:32

MacDue reviewed Mar 11, 2026

View reviewed changes

MacDue added a commit to MacDue/llvm-project that referenced this pull request Mar 11, 2026

Revert "[LLVM][CodeGen][SVE] Refactor isel of 128-bit constant splats. (

9940c6c

llvm#185652)" This reverts commit 9ba92ff.

MacDue added a commit to MacDue/llvm-project that referenced this pull request Mar 11, 2026

Reapply "[LLVM][CodeGen][SVE] Refactor isel of 128-bit constant splat…

f7108c2

…s. (llvm#185652)" This reverts commit 9940c6c.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LLVM][CodeGen][SVE] Refactor isel of 128-bit constant splats.#185652

[LLVM][CodeGen][SVE] Refactor isel of 128-bit constant splats.#185652
paulwalker-arm merged 1 commit intollvm:mainfrom
paulwalker-arm:sve-neon-sized-constants

paulwalker-arm commented Mar 10, 2026

Uh oh!

llvmbot commented Mar 10, 2026

Uh oh!

huntergr-arm commented Mar 10, 2026

Uh oh!

paulwalker-arm commented Mar 10, 2026

Uh oh!

david-arm Mar 10, 2026

Uh oh!

paulwalker-arm Mar 10, 2026

Uh oh!

david-arm left a comment

Uh oh!

Uh oh!

MacDue left a comment

Uh oh!

paulwalker-arm commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

paulwalker-arm commented Mar 10, 2026

Uh oh!

llvmbot commented Mar 10, 2026

Uh oh!

huntergr-arm commented Mar 10, 2026

Uh oh!

paulwalker-arm commented Mar 10, 2026

Uh oh!

david-arm Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

paulwalker-arm Mar 10, 2026

Choose a reason for hiding this comment

Uh oh!

david-arm left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

MacDue left a comment

Choose a reason for hiding this comment

Uh oh!

paulwalker-arm commented Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants