[AMDGPU] Add tests for vector rebroadcast. #91322

PeddleSpam · 2024-05-07T12:43:09Z

No description provided.

llvmbot · 2024-05-07T13:21:13Z

@llvm/pr-subscribers-backend-amdgpu

Author: Leon Clark (PeddleSpam)

Changes

Full diff: https://github.com/llvm/llvm-project/pull/91322.diff

1 Files Affected:

(added) llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll (+39)

diff --git a/llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll b/llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll
new file mode 100644
index 0000000000000..50c5dadfcbb15
--- /dev/null
+++ b/llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll
@@ -0,0 +1,39 @@
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx900 -verify-machineinstrs < %s | FileCheck -check-prefix=GFX9 %s
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx1010 -verify-machineinstrs < %s | FileCheck -check-prefix=GFX10 %s
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx1100 -verify-machineinstrs < %s | FileCheck -check-prefix=GFX11 %s
+
+define <4 x float> @rebroadcast_v4f32(ptr addrspace(1) %arg0) {
+; GFX9-LABEL: rebroadcast_v4f32:
+; GFX9:       ; %bb.0: ; %entry
+; GFX9-NEXT:  s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
+; GFX9-NEXT:  global_load_dwordx4 v[0:3], v[0:1], off
+; GFX9-NEXT:  s_waitcnt vmcnt(0)
+; GFX9-NEXT:  v_mov_b32_e32 v0, v1
+; GFX9-NEXT:  v_mov_b32_e32 v2, v1
+; GFX9-NEXT:  v_mov_b32_e32 v3, v1
+; GFX9-NEXT:  s_setpc_b64 s[30:31]
+;
+; GFX10-LABEL: rebroadcast_v4f32:
+; GFX10:       ; %bb.0: ; %entry
+; GFX10-NEXT:  s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
+; GFX10-NEXT:  global_load_dwordx4 v[0:3], v[0:1], off
+; GFX10-NEXT:  s_waitcnt vmcnt(0)
+; GFX10-NEXT:  v_mov_b32_e32 v0, v1
+; GFX10-NEXT:  v_mov_b32_e32 v2, v1
+; GFX10-NEXT:  v_mov_b32_e32 v3, v1
+; GFX10-NEXT:  s_setpc_b64 s[30:31]
+;
+; GFX11-LABEL: rebroadcast_v4f32:
+; GFX11:       ; %bb.0: ; %entry
+; GFX11-NEXT:  s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
+; GFX11-NEXT:  global_load_b128 v[0:3], v[0:1], off
+; GFX11-NEXT:  s_waitcnt vmcnt(0)
+; GFX11-NEXT:  v_mov_b32_e32 v0, v1
+; GFX11-NEXT:  v_mov_b32_e32 v2, v1
+; GFX11-NEXT:  v_mov_b32_e32 v3, v1
+; GFX11-NEXT:  s_setpc_b64 s[30:31]
+entry:
+  %val0 = load <4 x float>, ptr addrspace(1) %arg0
+  %val1 = shufflevector <4 x float> %val0, <4 x float> undef, <4 x i32> <i32 1, i32 1, i32 1, i32 1>
+  ret <4 x float> %val1
+}

arsenm

Can this merge in with another test? Should it test more vector sizes? Probably should link to follow up patch context

llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll

PeddleSpam · 2024-05-10T19:09:21Z

Can this merge in with another test? Should it test more vector sizes? Probably should link to follow up patch context

I've added tests for more vector types/sizes. It's a lot to merge with another file but I can if you'd prefer.

llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll

PeddleSpam requested review from jayfoad, arsenm and bcahoon May 7, 2024 12:43

PeddleSpam marked this pull request as ready for review May 7, 2024 13:20

llvmbot added the backend:AMDGPU label May 7, 2024

arsenm reviewed May 7, 2024

View reviewed changes

llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll Outdated Show resolved Hide resolved

Leon Clark added 2 commits May 10, 2024 17:11

[AMDGPU] Add tests for vector rebroadcast.

03dc079

Address review comments.

95079eb

PeddleSpam force-pushed the shuffle_splat branch from 3e36490 to 95079eb Compare May 10, 2024 17:47

arsenm reviewed May 13, 2024

View reviewed changes

llvm/test/CodeGen/AMDGPU/vector_rebroadcast.ll Outdated Show resolved Hide resolved

Address review comments.

ceeab15

arsenm approved these changes May 13, 2024

View reviewed changes

PeddleSpam merged commit bd67986 into llvm:main May 13, 2024
4 checks passed

PeddleSpam deleted the shuffle_splat branch May 13, 2024 18:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Add tests for vector rebroadcast. #91322

[AMDGPU] Add tests for vector rebroadcast. #91322

PeddleSpam commented May 7, 2024

llvmbot commented May 7, 2024

arsenm left a comment

PeddleSpam commented May 10, 2024

[AMDGPU] Add tests for vector rebroadcast. #91322

[AMDGPU] Add tests for vector rebroadcast. #91322

Conversation

PeddleSpam commented May 7, 2024

llvmbot commented May 7, 2024

arsenm left a comment

Choose a reason for hiding this comment

PeddleSpam commented May 10, 2024