[x86] improve cost model for oversized shuffles #55170

rotateright · 2022-04-28T14:30:20Z

I was trying some examples with https://reviews.llvm.org/D123494 and noticed that AArch64 seems smarter about decomposing shuffle costs via mask:

define void @cross_talk(<8 x i32> %a, <8 x i32> %b) {
  %s = shufflevector <8 x i32> %a, <8 x i32> %b, <8 x i32> <i32 8, i32 0, i32 1, i32 2, i32 3, i32 8, i32 8, i32 8>
  ret void
}

If we don't care about element order, that can be turned into the much simpler (especially for a 128-bit vector target):

define void @identity_and_splat(<8 x i32> %a, <8 x i32> %b) {
  %s = shufflevector <8 x i32> %a, <8 x i32> %b, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 8, i32 8, i32 8, i32 8>
  ret void
}

That transform happens with AArch64, but that doesn't happen with x86 because:

% opt -mtriple=x86_64 -passes="print<cost-model>" -disable-output shufcost.ll 
Printing analysis 'Cost Model Analysis' for function 'cross_talk':
Cost Model: Found an estimated cost of 12 for instruction:   %s = shufflevector <8 x i32> %a, <8 x i32> %b, <8 x i32> <i32 8, i32 0, i32 1, i32 2, i32 3, i32 8, i32 8, i32 8>

Printing analysis 'Cost Model Analysis' for function 'identity_and_splat':
Cost Model: Found an estimated cost of 12 for instruction:   %s = shufflevector <8 x i32> %a, <8 x i32> %b, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 8, i32 8, i32 8, i32 8>

llvmbot · 2022-04-28T14:31:04Z

@llvm/issue-subscribers-backend-x86

fhahn · 2022-04-28T14:34:46Z

Looks like GitHub-actions removes any user provided labels when it adds the new-issue label :(

llvmbot · 2022-04-28T14:35:24Z

@llvm/issue-subscribers-backend-x86

rotateright · 2022-04-28T16:13:53Z

https://reviews.llvm.org/D100486 seems like it would help. It was committed, but it is currently reverted.

rotateright added the backend:X86 label Apr 28, 2022

github-actions bot added new issue and removed backend:X86 labels Apr 28, 2022

fhahn added backend:X86 and removed new issue labels Apr 28, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[x86] improve cost model for oversized shuffles #55170

[x86] improve cost model for oversized shuffles #55170

rotateright commented Apr 28, 2022 •

edited by VoltrexKeyva

llvmbot commented Apr 28, 2022

fhahn commented Apr 28, 2022

llvmbot commented Apr 28, 2022

rotateright commented Apr 28, 2022

[x86] improve cost model for oversized shuffles #55170

[x86] improve cost model for oversized shuffles #55170

Comments

rotateright commented Apr 28, 2022 • edited by VoltrexKeyva

llvmbot commented Apr 28, 2022

fhahn commented Apr 28, 2022

llvmbot commented Apr 28, 2022

rotateright commented Apr 28, 2022

rotateright commented Apr 28, 2022 •

edited by VoltrexKeyva