[AMDGPU] Fix wrong operand value when floating-point value is used as operand of type i16 #84106

shiltian · 2024-03-06T02:56:41Z

According to the "OPF_INV2PI_16" section of the spec, when a floating-point
value is used as an operand of type i16, the 32-bit representation of the
constant, truncated to its 16 LSBs, should be used. Currently we use the FP16
representation directly, which doesn't conform to the spec.

For example, 0.5 is currently encoded as 0x3800, because that is the encoding
of <half 0.5>. Instead, it should be 0x3f000000 truncated to its 16 LSBs,
which is 0x0000.
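
To make the intended truncation concrete, here is a minimal standalone sketch. It is not code from the patch: the helper names f32Bits, i16OperandFromF32 and checkDescriptionExample are hypothetical, and it assumes float is a 32-bit IEEE-754 type on the host.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not part of the patch): reinterpret an f32 value as
// its 32-bit bit pattern. Assumes a 32-bit IEEE-754 float on the host.
inline uint32_t f32Bits(float Value) {
  static_assert(sizeof(float) == sizeof(uint32_t), "expects 32-bit float");
  uint32_t Bits;
  std::memcpy(&Bits, &Value, sizeof(Bits));
  return Bits;
}

// Hypothetical helper: the i16 operand value the description asks for,
// i.e. the 16 least significant bits of the f32 bit pattern.
inline uint16_t i16OperandFromF32(float Value) {
  return static_cast<uint16_t>(f32Bits(Value) & 0xFFFFu);
}

// Checks the 0.5 example from the description above.
inline void checkDescriptionExample() {
  assert(f32Bits(0.5f) == 0x3f000000u);       // f32 encoding of 0.5
  assert(i16OperandFromF32(0.5f) == 0x0000u); // low 16 bits, not 0x3800
}
```

Calling checkDescriptionExample() should pass silently under these assumptions. The real assembler works on APFloat values rather than host floats, so this only illustrates the arithmetic.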

github-actions · 2024-03-06T03:04:35Z

⚠️ C/C++ code formatter, clang-format found issues in your code. ⚠️

You can test this locally with the following command:

git-clang-format --diff 3f7aa042b657671319f994ad3fb7c3eb79a6fe00 57a134c493f8cb2afe1e31a7a4fa4270463706f5 -- llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

View the diff from clang-format here.

diff --git a/llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp b/llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
index 5050aec261..fd8ce9e07f 100644
--- a/llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
+++ b/llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp
@@ -2065,8 +2065,8 @@ bool AMDGPUOperand::isInlinableImm(MVT type) const {
         break;
       }
       return isInlineableLiteralOp16(
-          static_cast<uint16_t>(FPLiteral.bitcastToAPInt().getZExtValue()), type,
-          AsmParser->hasInv2PiInlineImm());
+          static_cast<uint16_t>(FPLiteral.bitcastToAPInt().getZExtValue()),
+          type, AsmParser->hasInv2PiInlineImm());
     }
 
     // Check if single precision literal is inlinable

shiltian · 2024-03-06T05:11:56Z

Is there any script to update those tests? It is quite a nightmare to update them manually.

arsenm · 2024-03-06T09:30:17Z

llvm/lib/Target/AMDGPU/AsmParser/AMDGPUAsmParser.cpp

@@ -2045,9 +2047,26 @@ bool AMDGPUOperand::isInlinableImm(MVT type) const {
      return false;

    if (type.getScalarSizeInBits() == 16) {


This is redundant with the switch over the handled 16-bit types

jayfoad · 2024-03-06T09:37:01Z

llvm/test/MC/AMDGPU/gfx10_asm_vop1.s

 v_cvt_f16_u16_e32 v5, 0.5
-// GFX10: encoding: [0xff,0xa0,0x0a,0x7e,0x00,0x38,0x00,0x00]
+// GFX10: encoding: [0x80,0xa0,0x0a,0x7e]

 v_cvt_f16_u16_e32 v5, -4.0
-// GFX10: encoding: [0xff,0xa0,0x0a,0x7e,0x00,0xc4,0x00,0x00]
+// GFX10: encoding: [0x80,0xa0,0x0a,0x7e]


Note that these two instructions now assemble to identical binary, since the low 16 bits of f32 0.5 and f32 -4.0 are identical. Can you add tests with a more interesting literal, like inv2pi, which has nonzero low 16 bits?
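
The point about identical low bits can be checked with a small standalone sketch. The names lowHalfOfF32, sameI16Operand and checkReviewExample are hypothetical, the literal 0.15915494f is an assumed f32 approximation of 1/(2*pi), and float is assumed to be a 32-bit IEEE-754 type on the host.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper: the low 16 bits of the f32 bit pattern of Value.
// Assumes a 32-bit IEEE-754 float on the host.
inline uint16_t lowHalfOfF32(float Value) {
  static_assert(sizeof(float) == sizeof(uint32_t), "expects 32-bit float");
  uint32_t Bits;
  std::memcpy(&Bits, &Value, sizeof(Bits));
  return static_cast<uint16_t>(Bits & 0xFFFFu);
}

// Hypothetical helper: true when two f32 literals truncate to the same
// 16-bit operand value and therefore cannot be told apart after truncation.
inline bool sameI16Operand(float A, float B) {
  return lowHalfOfF32(A) == lowHalfOfF32(B);
}

inline void checkReviewExample() {
  const float Inv2Pi = 0.15915494f; // assumed f32 approximation of 1/(2*pi)
  assert(sameI16Operand(0.5f, -4.0f)); // 0x3f000000 and 0xc0800000
  assert(lowHalfOfF32(Inv2Pi) != 0);   // low bits survive truncation
}
```

Under these assumptions, 0.5 and -4.0 both truncate to zero while inv2pi does not, which is why a test using inv2pi would exercise the truncation path more meaningfully.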

shiltian · 2024-03-06T15:05:03Z

This patch will be merged into 530f0e6 when it is relanded.

shiltian requested review from rampitec and arsenm March 6, 2024 02:57

shiltian force-pushed the fp-i16 branch from a2eb517 to 719b812 Compare March 6, 2024 03:02

shiltian force-pushed the fp-i16 branch 3 times, most recently from 70996de to e48c168 Compare March 6, 2024 05:00

shiltian force-pushed the fp-i16 branch from e48c168 to 57a134c Compare March 6, 2024 05:11

shiltian requested a review from jayfoad March 6, 2024 05:11

arsenm reviewed Mar 6, 2024

jayfoad reviewed Mar 6, 2024

shiltian closed this Mar 6, 2024
