Poor code generated for whole-vector shifts

|  |  |
| --- | --- |
| Bugzilla Link | [10037](https://llvm.org/bz10037) |
| Resolution | WONTFIX |
| Resolved on | May 31, 2011 11:09 |
| Version | 2.9 |
| OS | Linux |
| Reporter | LLVM Bugzilla Contributor |
| CC | @sunfishcode |

## Extended Description 
Consider this:

define <4 x i32> @&#8203;lshr(<4 x i32> %x) {
  %a = bitcast <4 x i32> %x to i128
  %b = lshr i128 %a, 32
  %c = bitcast i128 %b to <4 x i32>
  ret <4 x i32> %c
}

it codegens to:

        movd    %xmm0, %rcx
        pextrq  $1, %xmm0, %rax
        shrdq   $32, %rax, %rcx
        movd    %rcx, %xmm0
        shrq    $32, %rax
        movd    %rax, %xmm1
        punpcklqdq      %xmm1, %xmm0    # xmm0 = xmm0[0],xmm1[0]

however it could codegen to a simple
        psrldq
since the shift is by a multiple of 8.

The best way to solve this might be to turn this construct into a shufflevector
at the IR level.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Poor code generated for whole-vector shifts #10409

Extended Description

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development


Bugzilla Link	10037
Resolution	WONTFIX
Resolved on	May 31, 2011 11:09
Version	2.9
OS	Linux
Reporter	LLVM Bugzilla Contributor
CC	@sunfishcode

Poor code generated for whole-vector shifts #10409

Description

Extended Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions