[X86][CodeGen] Not compress EVEX into VEX when R16-R31 is used #73604

KanRobert · 2023-11-28T01:57:39Z

b/c VEX prefix can not encode R16-R31.

llvmbot · 2023-11-28T01:58:08Z

@llvm/pr-subscribers-backend-x86

Author: Shengchen Kan (KanRobert)

Changes

b/c VEX prefix can not encode R16-R31.

Full diff: https://github.com/llvm/llvm-project/pull/73604.diff

2 Files Affected:

(modified) llvm/lib/Target/X86/X86EvexToVex.cpp (+4)
(added) llvm/test/CodeGen/X86/apx/evex-to-vex.ll (+34)

diff --git a/llvm/lib/Target/X86/X86EvexToVex.cpp b/llvm/lib/Target/X86/X86EvexToVex.cpp
index 88366558562c51e..fda6c15fed34db7 100644
--- a/llvm/lib/Target/X86/X86EvexToVex.cpp
+++ b/llvm/lib/Target/X86/X86EvexToVex.cpp
@@ -125,6 +125,10 @@ static bool usesExtendedRegister(const MachineInstr &MI) {
     if (Reg >= X86::YMM16 && Reg <= X86::YMM31)
       return true;
 
+    // Check for GPR with indexes between 16 - 31.
+    if (X86II::isApxExtendedReg(Reg))
+      return true;
+
     return false;
   };
 
diff --git a/llvm/test/CodeGen/X86/apx/evex-to-vex.ll b/llvm/test/CodeGen/X86/apx/evex-to-vex.ll
new file mode 100644
index 000000000000000..51a547c9a3a5131
--- /dev/null
+++ b/llvm/test/CodeGen/X86/apx/evex-to-vex.ll
@@ -0,0 +1,34 @@
+; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
+; Check EVEX is not compressed into VEX when egpr is used.
+; RUN: llc < %s -mtriple=x86_64-unknown-unknown -mattr=+avx512vl,+egpr -show-mc-encoding | FileCheck %s
+
+define void @test_x86_vcvtps2ph_256_m(ptr nocapture %d, <8 x float> %a) nounwind {
+; CHECK-LABEL: test_x86_vcvtps2ph_256_m:
+; CHECK:       # %bb.0: # %entry
+; CHECK-NEXT:    pushq %rbp # encoding: [0x55]
+; CHECK-NEXT:    pushq %r15 # encoding: [0x41,0x57]
+; CHECK-NEXT:    pushq %r14 # encoding: [0x41,0x56]
+; CHECK-NEXT:    pushq %r13 # encoding: [0x41,0x55]
+; CHECK-NEXT:    pushq %r12 # encoding: [0x41,0x54]
+; CHECK-NEXT:    pushq %rbx # encoding: [0x53]
+; CHECK-NEXT:    movq %rdi, %r16 # encoding: [0xd5,0x18,0x89,0xf8]
+; CHECK-NEXT:    #APP
+; CHECK-NEXT:    #NO_APP
+; CHECK-NEXT:    vcvtps2ph $3, %ymm0, (%r16) # encoding: [0x62,0xfb,0x7d,0x28,0x1d,0x00,0x03]
+; CHECK-NEXT:    popq %rbx # encoding: [0x5b]
+; CHECK-NEXT:    popq %r12 # encoding: [0x41,0x5c]
+; CHECK-NEXT:    popq %r13 # encoding: [0x41,0x5d]
+; CHECK-NEXT:    popq %r14 # encoding: [0x41,0x5e]
+; CHECK-NEXT:    popq %r15 # encoding: [0x41,0x5f]
+; CHECK-NEXT:    popq %rbp # encoding: [0x5d]
+; CHECK-NEXT:    vzeroupper # encoding: [0xc5,0xf8,0x77]
+; CHECK-NEXT:    retq # encoding: [0xc3]
+entry:
+  %0 = load i32, ptr %d, align 4
+  call void asm sideeffect "", "~{eax},~{ebx},~{ecx},~{edx},~{edi},~{esi},~{ebp},~{esp},~{r8d},~{r9d},~{r10d},~{r11d},~{r12d},~{r13d},~{r14d},~{r15d},~{dirflag},~{fpsr},~{flags}"()
+  %1 = tail call <8 x i16> @llvm.x86.vcvtps2ph.256(<8 x float> %a, i32 3)
+  store <8 x i16> %1, ptr %d, align 16
+  ret void
+}
+
+declare <8 x i16> @llvm.x86.vcvtps2ph.256(<8 x float>, i32) nounwind readonly

XinWang10

LGTM

KanRobert · 2023-11-28T03:40:22Z

Thanks @XinWang10 !

[X86][CodeGen] Not compress EVEX into VEX when R16-R31 is used

44dc80a

b/c VEX prefix can not encode R16-R31.

llvmbot added the backend:X86 label Nov 28, 2023

KanRobert requested review from RKSimon, phoebewang, e-kud, XinWang10 and topperc November 28, 2023 01:58

XinWang10 approved these changes Nov 28, 2023

View reviewed changes

simplify the test

f5b8133

KanRobert merged commit a3b7b2d into llvm:main Nov 28, 2023
2 of 3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[X86][CodeGen] Not compress EVEX into VEX when R16-R31 is used #73604

[X86][CodeGen] Not compress EVEX into VEX when R16-R31 is used #73604

KanRobert commented Nov 28, 2023

llvmbot commented Nov 28, 2023

XinWang10 left a comment

KanRobert commented Nov 28, 2023

[X86][CodeGen] Not compress EVEX into VEX when R16-R31 is used #73604

[X86][CodeGen] Not compress EVEX into VEX when R16-R31 is used #73604

Conversation

KanRobert commented Nov 28, 2023

llvmbot commented Nov 28, 2023

XinWang10 left a comment

Choose a reason for hiding this comment

KanRobert commented Nov 28, 2023