[x86] Add tan intrinsic part 4 #90503

farzonl · 2024-04-29T17:51:49Z

This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics.
Which was discussed in this RFC:
https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294

If you want an overarching view of how this will all connect see:
#90088

Much of this change was following how G_FSIN and G_FCOS were used.

Changes:

llvm/docs/GlobalISel/GenericOpcode.rst - Document the G_FTAN opcode
llvm/docs/LangRef.rst - Document the tan intrinsic
llvm/include/llvm/Analysis/VecFuncs.def - Associate the tan intrinsic as a vector function similar to the tanf libcall.
llvm/include/llvm/CodeGen/BasicTTIImpl.h - Map the tan intrinsic to ISD::FTAN
llvm/include/llvm/CodeGen/ISDOpcodes.h - Define ISD opcodes for FTAN and STRICT_FTAN
llvm/include/llvm/IR/Intrinsics.td - Create the tan intrinsic
llvm/include/llvm/IR/RuntimeLibcalls.def - Define tan libcall mappings
llvm/include/llvm/Target/GenericOpcodes.td - Define the G_FTAN Opcode
llvm/include/llvm/Support/TargetOpcodes.def - Create a G_FTAN Opcode handler
llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td - Map G_FTAN to ftan
llvm/include/llvm/Target/TargetSelectionDAG.td - Define ftan, strict_ftan, and any_ftan and map them to the ISD opcodes for FTAN and STRICT_FTAN
llvm/lib/Analysis/VectorUtils.cpp - Associate the tan intrinsic as a vector intrinsic
llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp Map the tan intrinsic to G_FTAN Opcode
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp - Add G_FTAN to the list of floating point math operations also associate G_FTAN with the TAN_F runtime lib.
llvm/lib/CodeGen/GlobalISel/Utils.cpp - More floating point math operation common behaviors.
llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp - List the function expansion operations for FTAN and STRICT_FTAN. Also define both opcodes in PromoteNode.
llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp - More FTAN and STRICT_FTAN handling in the legalizer
llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h - Define SoftenFloatRes_FTAN and ExpandFloatRes_FTAN.
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp - Define FTAN as a legal vector operation.
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp - Define FTAN as a legal vector operation.
llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp - define tan as an intrinsic that doesn't return NaN.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp Map LibFunc_tan, LibFunc_tanf, and LibFunc_tanl to ISD::FTAN. Map Intrinsic::tan to ISD::FTAN and add selection dag handling for Intrinsic::tan.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp - Define ftan and strict_ftan names for the equivalent ISD opcodes.
llvm/lib/CodeGen/TargetLoweringBase.cpp -Define a Tan128 libcall and ISD::FTAN as a target lowering action.
llvm/lib/Target/X86/X86ISelLowering.cpp - Add x86_64 lowering for tan intrinsic

resolves #70082

llvmbot · 2024-04-29T18:02:32Z

@llvm/pr-subscribers-llvm-transforms
@llvm/pr-subscribers-llvm-support
@llvm/pr-subscribers-llvm-ir
@llvm/pr-subscribers-backend-x86
@llvm/pr-subscribers-llvm-selectiondag
@llvm/pr-subscribers-llvm-globalisel

@llvm/pr-subscribers-llvm-analysis

Author: Farzon Lotfi (farzonl)

Changes

This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics.
Which was discussed in this RFC:
https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294

If you want an overarching view of how this will all connect see:
#90088

Much of this change was following how G_FSIN and G_FCOS were used.

The X86 Lowering and test cases of this pr should be credited to
https://github.com/junaire
https://reviews.llvm.org/D146905

Changes:

llvm/docs/GlobalISel/GenericOpcode.rst - Document the G_FTAN opcode
llvm/docs/LangRef.rst - Document the tan intrinsic
llvm/include/llvm/Analysis/VecFuncs.def - Associate the tan intrinsic as a vector function similar to the tanf libcall.
llvm/include/llvm/CodeGen/BasicTTIImpl.h - Map the tan intrinsic to ISD::FTAN
llvm/include/llvm/CodeGen/ISDOpcodes.h - Define ISD opcodes for FTAN and STRICT_FTAN
llvm/include/llvm/IR/Intrinsics.td - Create the tan intrinsic
llvm/include/llvm/IR/RuntimeLibcalls.def - Define tan libcall mappings
llvm/include/llvm/Target/GenericOpcodes.td - Define the G_FTAN Opcode
llvm/include/llvm/Support/TargetOpcodes.def - Create a G_FTAN Opcode handler
llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td - Map G_FTAN to ftan
llvm/include/llvm/Target/TargetSelectionDAG.td - Define ftan, strict_ftan, and any_ftan and map them to the ISD opcodes for FTAN and STRICT_FTAN
llvm/lib/Analysis/VectorUtils.cpp - Associate the tan intrinsic as a vector intrinsic
llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp Map the tan intrinsic to G_FTAN Opcode
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp - Add G_FTAN to the list of floating point math operations also associate G_FTAN with the TAN_F runtime lib.
llvm/lib/CodeGen/GlobalISel/Utils.cpp - More floating point math operation common behaviors.
llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp - List the function expansion operations for FTAN and STRICT_FTAN. Also define both opcodes in PromoteNode.
llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp - More FTAN and STRICT_FTAN handling in the legalizer
llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h - Define SoftenFloatRes_FTAN and ExpandFloatRes_FTAN.
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp - Define FTAN as a legal vector operation.
llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp - Define FTAN as a legal vector operation.
llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp - define tan as an intrinsic that doesn't return NaN.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp Map LibFunc_tan, LibFunc_tanf, and LibFunc_tanl to ISD::FTAN. Map Intrinsic::tan to ISD::FTAN and add selection dag handling for Intrinsic::tan.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp - Define ftan and strict_ftan names for the equivalent ISD opcodes.
llvm/lib/CodeGen/TargetLoweringBase.cpp -Define a Tan128 libcall and ISD::FTAN as a target lowering action.
llvm/lib/Target/X86/X86ISelLowering.cpp - Add x86_64 lowering for tan intrinsic

Patch is 40.09 KiB, truncated to 20.00 KiB below, full version: https://github.com/llvm/llvm-project/pull/90503.diff

28 Files Affected:

(modified) llvm/docs/GlobalISel/GenericOpcode.rst (+2-2)
(modified) llvm/docs/LangRef.rst (+37)
(modified) llvm/include/llvm/Analysis/VecFuncs.def (+1)
(modified) llvm/include/llvm/CodeGen/BasicTTIImpl.h (+3)
(modified) llvm/include/llvm/CodeGen/ISDOpcodes.h (+2)
(modified) llvm/include/llvm/IR/Intrinsics.td (+1)
(modified) llvm/include/llvm/IR/RuntimeLibcalls.def (+5)
(modified) llvm/include/llvm/Support/TargetOpcodes.def (+3)
(modified) llvm/include/llvm/Target/GenericOpcodes.td (+7)
(modified) llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td (+1)
(modified) llvm/include/llvm/Target/TargetSelectionDAG.td (+6)
(modified) llvm/lib/Analysis/VectorUtils.cpp (+1)
(modified) llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp (+2)
(modified) llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp (+5)
(modified) llvm/lib/CodeGen/GlobalISel/Utils.cpp (+2)
(modified) llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp (+7)
(modified) llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp (+23)
(modified) llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h (+2)
(modified) llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp (+1)
(modified) llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp (+3)
(modified) llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp (+1)
(modified) llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp (+10)
(modified) llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp (+2)
(modified) llvm/lib/CodeGen/TargetLoweringBase.cpp (+3-1)
(modified) llvm/lib/Target/X86/X86ISelLowering.cpp (+14-9)
(modified) llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir (+3)
(added) llvm/test/CodeGen/X86/llvm.tan.ll (+60)
(modified) llvm/test/CodeGen/X86/vec-libcalls.ll (+202)

diff --git a/llvm/docs/GlobalISel/GenericOpcode.rst b/llvm/docs/GlobalISel/GenericOpcode.rst
index 492d30280f4776..52dc039df7779e 100644
--- a/llvm/docs/GlobalISel/GenericOpcode.rst
+++ b/llvm/docs/GlobalISel/GenericOpcode.rst
@@ -592,8 +592,8 @@ G_FLOG, G_FLOG2, G_FLOG10
 
 Calculate the base-e, base-2, or base-10 respectively.
 
-G_FCEIL, G_FCOS, G_FSIN, G_FSQRT, G_FFLOOR, G_FRINT, G_FNEARBYINT
-^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+G_FCEIL, G_FCOS, G_FSIN, G_FTAN, G_FSQRT, G_FFLOOR, G_FRINT, G_FNEARBYINT
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
 
 These correspond to the standard C functions of the same name.
 
diff --git a/llvm/docs/LangRef.rst b/llvm/docs/LangRef.rst
index 37662f79145d67..ece85481814ede 100644
--- a/llvm/docs/LangRef.rst
+++ b/llvm/docs/LangRef.rst
@@ -15270,6 +15270,43 @@ trapping or setting ``errno``.
 When specified with the fast-math-flag 'afn', the result may be approximated
 using a less accurate calculation.
 
+'``llvm.tan.*``' Intrinsic
+^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+Syntax:
+"""""""
+
+This is an overloaded intrinsic. You can use ``llvm.tan`` on any
+floating-point or vector of floating-point type. Not all targets support
+all types however.
+
+::
+
+      declare float     @llvm.tan.f32(float  %Val)
+      declare double    @llvm.tan.f64(double %Val)
+      declare x86_fp80  @llvm.tan.f80(x86_fp80  %Val)
+      declare fp128     @llvm.tan.f128(fp128 %Val)
+      declare ppc_fp128 @llvm.tan.ppcf128(ppc_fp128  %Val)
+
+Overview:
+"""""""""
+
+The '``llvm.tan.*``' intrinsics return the tangent of the operand.
+
+Arguments:
+""""""""""
+
+The argument and return value are floating-point numbers of the same type.
+
+Semantics:
+""""""""""
+
+Return the same value as a corresponding libm '``tan``' function but without
+trapping or setting ``errno``.
+
+When specified with the fast-math-flag 'afn', the result may be approximated
+using a less accurate calculation.
+
 '``llvm.pow.*``' Intrinsic
 ^^^^^^^^^^^^^^^^^^^^^^^^^^
 
diff --git a/llvm/include/llvm/Analysis/VecFuncs.def b/llvm/include/llvm/Analysis/VecFuncs.def
index 10f1333cf8885c..de94621791f54f 100644
--- a/llvm/include/llvm/Analysis/VecFuncs.def
+++ b/llvm/include/llvm/Analysis/VecFuncs.def
@@ -54,6 +54,7 @@ TLI_DEFINE_VECFUNC("llvm.sin.f32", "vsinf", FIXED(4), "_ZGV_LLVM_N4v")
 TLI_DEFINE_VECFUNC("cosf", "vcosf", FIXED(4), "_ZGV_LLVM_N4v")
 TLI_DEFINE_VECFUNC("llvm.cos.f32", "vcosf", FIXED(4), "_ZGV_LLVM_N4v")
 TLI_DEFINE_VECFUNC("tanf", "vtanf", FIXED(4), "_ZGV_LLVM_N4v")
+TLI_DEFINE_VECFUNC("llvm.tan.f32", "vtanf", FIXED(4), "_ZGV_LLVM_N4v")
 TLI_DEFINE_VECFUNC("asinf", "vasinf", FIXED(4), "_ZGV_LLVM_N4v")
 TLI_DEFINE_VECFUNC("acosf", "vacosf", FIXED(4), "_ZGV_LLVM_N4v")
 TLI_DEFINE_VECFUNC("atanf", "vatanf", FIXED(4), "_ZGV_LLVM_N4v")
diff --git a/llvm/include/llvm/CodeGen/BasicTTIImpl.h b/llvm/include/llvm/CodeGen/BasicTTIImpl.h
index 92b51438b4cb3c..2d950ceb57e10a 100644
--- a/llvm/include/llvm/CodeGen/BasicTTIImpl.h
+++ b/llvm/include/llvm/CodeGen/BasicTTIImpl.h
@@ -1924,6 +1924,9 @@ class BasicTTIImplBase : public TargetTransformInfoImplCRTPBase<T> {
     case Intrinsic::cos:
       ISD = ISD::FCOS;
       break;
+    case Intrinsic::tan:
+      ISD = ISD::FTAN;
+      break;
     case Intrinsic::exp:
       ISD = ISD::FEXP;
       break;
diff --git a/llvm/include/llvm/CodeGen/ISDOpcodes.h b/llvm/include/llvm/CodeGen/ISDOpcodes.h
index 6429947958ee91..c38c83f5b80dfa 100644
--- a/llvm/include/llvm/CodeGen/ISDOpcodes.h
+++ b/llvm/include/llvm/CodeGen/ISDOpcodes.h
@@ -415,6 +415,7 @@ enum NodeType {
   STRICT_FLDEXP,
   STRICT_FSIN,
   STRICT_FCOS,
+  STRICT_FTAN,
   STRICT_FEXP,
   STRICT_FEXP2,
   STRICT_FLOG,
@@ -934,6 +935,7 @@ enum NodeType {
   FCBRT,
   FSIN,
   FCOS,
+  FTAN,
   FPOW,
   FPOWI,
   /// FLDEXP - ldexp, inspired by libm (op0 * 2**op1).
diff --git a/llvm/include/llvm/IR/Intrinsics.td b/llvm/include/llvm/IR/Intrinsics.td
index a2678d69ce4062..c25591facbd3fa 100644
--- a/llvm/include/llvm/IR/Intrinsics.td
+++ b/llvm/include/llvm/IR/Intrinsics.td
@@ -1025,6 +1025,7 @@ let IntrProperties = [IntrNoMem, IntrSpeculatable, IntrWillReturn] in {
   def int_powi : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>, llvm_anyint_ty]>;
   def int_sin  : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
   def int_cos  : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
+  def int_tan  : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
   def int_pow  : DefaultAttrsIntrinsic<[llvm_anyfloat_ty],
                            [LLVMMatchType<0>, LLVMMatchType<0>]>;
   def int_log  : DefaultAttrsIntrinsic<[llvm_anyfloat_ty], [LLVMMatchType<0>]>;
diff --git a/llvm/include/llvm/IR/RuntimeLibcalls.def b/llvm/include/llvm/IR/RuntimeLibcalls.def
index 5e082769fa974c..f4f93bf7ba24c5 100644
--- a/llvm/include/llvm/IR/RuntimeLibcalls.def
+++ b/llvm/include/llvm/IR/RuntimeLibcalls.def
@@ -197,6 +197,11 @@ HANDLE_LIBCALL(COS_F64, "cos")
 HANDLE_LIBCALL(COS_F80, "cosl")
 HANDLE_LIBCALL(COS_F128, "cosl")
 HANDLE_LIBCALL(COS_PPCF128, "cosl")
+HANDLE_LIBCALL(TAN_F32, "tanf")
+HANDLE_LIBCALL(TAN_F64, "tan")
+HANDLE_LIBCALL(TAN_F80, "tanl")
+HANDLE_LIBCALL(TAN_F128,"tanl")
+HANDLE_LIBCALL(TAN_PPCF128, "tanl")
 HANDLE_LIBCALL(SINCOS_F32, nullptr)
 HANDLE_LIBCALL(SINCOS_F64, nullptr)
 HANDLE_LIBCALL(SINCOS_F80, nullptr)
diff --git a/llvm/include/llvm/Support/TargetOpcodes.def b/llvm/include/llvm/Support/TargetOpcodes.def
index cb98f96af522f7..559a588c251482 100644
--- a/llvm/include/llvm/Support/TargetOpcodes.def
+++ b/llvm/include/llvm/Support/TargetOpcodes.def
@@ -781,6 +781,9 @@ HANDLE_TARGET_OPCODE(G_FCOS)
 /// Floating point sine.
 HANDLE_TARGET_OPCODE(G_FSIN)
 
+/// Floating point Tangent.
+HANDLE_TARGET_OPCODE(G_FTAN)
+
 /// Floating point square root.
 HANDLE_TARGET_OPCODE(G_FSQRT)
 
diff --git a/llvm/include/llvm/Target/GenericOpcodes.td b/llvm/include/llvm/Target/GenericOpcodes.td
index 8380d2738d164b..c40498e5542154 100644
--- a/llvm/include/llvm/Target/GenericOpcodes.td
+++ b/llvm/include/llvm/Target/GenericOpcodes.td
@@ -988,6 +988,13 @@ def G_FSIN : GenericInstruction {
   let hasSideEffects = false;
 }
 
+// Floating point tangent of a value.
+def G_FTAN : GenericInstruction {
+  let OutOperandList = (outs type0:$dst);
+  let InOperandList = (ins type0:$src1);
+  let hasSideEffects = false;
+}
+
 // Floating point square root of a value.
 // This returns NaN for negative nonzero values.
 // NOTE: Unlike libm sqrt(), this never sets errno. In all other respects it's
diff --git a/llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td b/llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td
index 8fa0e4b86d6dc9..560d3b434d07d5 100644
--- a/llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td
+++ b/llvm/include/llvm/Target/GlobalISel/SelectionDAGCompat.td
@@ -148,6 +148,7 @@ def : GINodeEquiv<G_BUILD_VECTOR, build_vector>;
 def : GINodeEquiv<G_FCEIL, fceil>;
 def : GINodeEquiv<G_FCOS, fcos>;
 def : GINodeEquiv<G_FSIN, fsin>;
+def : GINodeEquiv<G_FTAN, ftan>;
 def : GINodeEquiv<G_FABS, fabs>;
 def : GINodeEquiv<G_FSQRT, fsqrt>;
 def : GINodeEquiv<G_FFLOOR, ffloor>;
diff --git a/llvm/include/llvm/Target/TargetSelectionDAG.td b/llvm/include/llvm/Target/TargetSelectionDAG.td
index 1684b424e3b442..bf2413c09e4c9a 100644
--- a/llvm/include/llvm/Target/TargetSelectionDAG.td
+++ b/llvm/include/llvm/Target/TargetSelectionDAG.td
@@ -509,6 +509,7 @@ def fneg       : SDNode<"ISD::FNEG"       , SDTFPUnaryOp>;
 def fsqrt      : SDNode<"ISD::FSQRT"      , SDTFPUnaryOp>;
 def fsin       : SDNode<"ISD::FSIN"       , SDTFPUnaryOp>;
 def fcos       : SDNode<"ISD::FCOS"       , SDTFPUnaryOp>;
+def ftan       : SDNode<"ISD::FTAN"       , SDTFPUnaryOp>;
 def fexp2      : SDNode<"ISD::FEXP2"      , SDTFPUnaryOp>;
 def fexp10     : SDNode<"ISD::FEXP10"     , SDTFPUnaryOp>;
 def fpow       : SDNode<"ISD::FPOW"       , SDTFPBinOp>;
@@ -562,6 +563,8 @@ def strict_fsin       : SDNode<"ISD::STRICT_FSIN",
                                SDTFPUnaryOp, [SDNPHasChain]>;
 def strict_fcos       : SDNode<"ISD::STRICT_FCOS",
                                SDTFPUnaryOp, [SDNPHasChain]>;
+def strict_ftan       : SDNode<"ISD::STRICT_FTAN",
+                               SDTFPUnaryOp, [SDNPHasChain]>;
 def strict_fexp2      : SDNode<"ISD::STRICT_FEXP2",
                                SDTFPUnaryOp, [SDNPHasChain]>;
 def strict_fpow       : SDNode<"ISD::STRICT_FPOW",
@@ -1514,6 +1517,9 @@ def any_fsin       : PatFrags<(ops node:$src),
 def any_fcos       : PatFrags<(ops node:$src),
                               [(strict_fcos node:$src),
                                (fcos node:$src)]>;
+def any_ftan       : PatFrags<(ops node:$src),
+                              [(strict_ftan node:$src),
+                               (ftan node:$src)]>;
 def any_fexp2      : PatFrags<(ops node:$src),
                               [(strict_fexp2 node:$src),
                                (fexp2 node:$src)]>;
diff --git a/llvm/lib/Analysis/VectorUtils.cpp b/llvm/lib/Analysis/VectorUtils.cpp
index 917094267d05ae..30728ed5875090 100644
--- a/llvm/lib/Analysis/VectorUtils.cpp
+++ b/llvm/lib/Analysis/VectorUtils.cpp
@@ -68,6 +68,7 @@ bool llvm::isTriviallyVectorizable(Intrinsic::ID ID) {
   case Intrinsic::sqrt: // Begin floating-point.
   case Intrinsic::sin:
   case Intrinsic::cos:
+  case Intrinsic::tan:
   case Intrinsic::exp:
   case Intrinsic::exp2:
   case Intrinsic::log:
diff --git a/llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp b/llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
index e26c6ca3d61692..05c8d54f30c91c 100644
--- a/llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
+++ b/llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp
@@ -1945,6 +1945,8 @@ unsigned IRTranslator::getSimpleIntrinsicOpcode(Intrinsic::ID ID) {
       return TargetOpcode::G_FSIN;
     case Intrinsic::sqrt:
       return TargetOpcode::G_FSQRT;
+    case Intrinsic::tan:
+      return TargetOpcode::G_FTAN;
     case Intrinsic::trunc:
       return TargetOpcode::G_INTRINSIC_TRUNC;
     case Intrinsic::readcyclecounter:
diff --git a/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp b/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
index 6a76ad7f5db749..ca2fb24b5b85c4 100644
--- a/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
+++ b/llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
@@ -448,6 +448,8 @@ static RTLIB::Libcall getRTLibDesc(unsigned Opcode, unsigned Size) {
     RTLIBCASE(SIN_F);
   case TargetOpcode::G_FCOS:
     RTLIBCASE(COS_F);
+  case TargetOpcode::G_FTAN:
+    RTLIBCASE(TAN_F);
   case TargetOpcode::G_FLOG10:
     RTLIBCASE(LOG10_F);
   case TargetOpcode::G_FLOG:
@@ -1036,6 +1038,7 @@ LegalizerHelper::libcall(MachineInstr &MI, LostDebugLocObserver &LocObserver) {
   case TargetOpcode::G_FREM:
   case TargetOpcode::G_FCOS:
   case TargetOpcode::G_FSIN:
+  case TargetOpcode::G_FTAN:
   case TargetOpcode::G_FLOG10:
   case TargetOpcode::G_FLOG:
   case TargetOpcode::G_FLOG2:
@@ -2890,6 +2893,7 @@ LegalizerHelper::widenScalar(MachineInstr &MI, unsigned TypeIdx, LLT WideTy) {
   case TargetOpcode::G_FFLOOR:
   case TargetOpcode::G_FCOS:
   case TargetOpcode::G_FSIN:
+  case TargetOpcode::G_FTAN:
   case TargetOpcode::G_FLOG10:
   case TargetOpcode::G_FLOG:
   case TargetOpcode::G_FLOG2:
@@ -4656,6 +4660,7 @@ LegalizerHelper::fewerElementsVector(MachineInstr &MI, unsigned TypeIdx,
   case G_INTRINSIC_TRUNC:
   case G_FCOS:
   case G_FSIN:
+  case G_FTAN:
   case G_FSQRT:
   case G_BSWAP:
   case G_BITREVERSE:
diff --git a/llvm/lib/CodeGen/GlobalISel/Utils.cpp b/llvm/lib/CodeGen/GlobalISel/Utils.cpp
index 4e3781cb4e9d59..cac4039930c38e 100644
--- a/llvm/lib/CodeGen/GlobalISel/Utils.cpp
+++ b/llvm/lib/CodeGen/GlobalISel/Utils.cpp
@@ -821,6 +821,7 @@ bool llvm::isKnownNeverNaN(Register Val, const MachineRegisterInfo &MRI,
   case TargetOpcode::G_FREM:
   case TargetOpcode::G_FSIN:
   case TargetOpcode::G_FCOS:
+  case TargetOpcode::G_FTAN:
   case TargetOpcode::G_FMA:
   case TargetOpcode::G_FMAD:
     if (SNaN)
@@ -1701,6 +1702,7 @@ bool llvm::isPreISelGenericFloatingPointOpcode(unsigned Opc) {
   case TargetOpcode::G_FREM:
   case TargetOpcode::G_FRINT:
   case TargetOpcode::G_FSIN:
+  case TargetOpcode::G_FTAN:
   case TargetOpcode::G_FSQRT:
   case TargetOpcode::G_FSUB:
   case TargetOpcode::G_INTRINSIC_ROUND:
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
index 46e54b5366d66a..aa3150b58ac689 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeDAG.cpp
@@ -4429,6 +4429,11 @@ void SelectionDAGLegalize::ConvertNodeToLibcall(SDNode *Node) {
                     RTLIB::COS_F80, RTLIB::COS_F128,
                     RTLIB::COS_PPCF128, Results);
     break;
+  case ISD::FTAN:
+  case ISD::STRICT_FTAN:
+    ExpandFPLibCall(Node, RTLIB::TAN_F32, RTLIB::TAN_F64, RTLIB::TAN_F80,
+                    RTLIB::TAN_F128, RTLIB::TAN_PPCF128, Results);
+    break;
   case ISD::FSINCOS:
     // Expand into sincos libcall.
     ExpandSinCosLibCall(Node, Results);
@@ -5373,6 +5378,7 @@ void SelectionDAGLegalize::PromoteNode(SDNode *Node) {
   case ISD::FSQRT:
   case ISD::FSIN:
   case ISD::FCOS:
+  case ISD::FTAN:
   case ISD::FLOG:
   case ISD::FLOG2:
   case ISD::FLOG10:
@@ -5397,6 +5403,7 @@ void SelectionDAGLegalize::PromoteNode(SDNode *Node) {
   case ISD::STRICT_FSQRT:
   case ISD::STRICT_FSIN:
   case ISD::STRICT_FCOS:
+  case ISD::STRICT_FTAN:
   case ISD::STRICT_FLOG:
   case ISD::STRICT_FLOG2:
   case ISD::STRICT_FLOG10:
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
index abe5be76382556..2fa2bdafd46802 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
@@ -131,6 +131,8 @@ void DAGTypeLegalizer::SoftenFloatResult(SDNode *N, unsigned ResNo) {
     case ISD::FSQRT:       R = SoftenFloatRes_FSQRT(N); break;
     case ISD::STRICT_FSUB:
     case ISD::FSUB:        R = SoftenFloatRes_FSUB(N); break;
+    case ISD::STRICT_FTAN:
+    case ISD::FTAN:        R = SoftenFloatRes_FTAN(N); break;
     case ISD::STRICT_FTRUNC:
     case ISD::FTRUNC:      R = SoftenFloatRes_FTRUNC(N); break;
     case ISD::LOAD:        R = SoftenFloatRes_LOAD(N); break;
@@ -773,6 +775,12 @@ SDValue DAGTypeLegalizer::SoftenFloatRes_FSUB(SDNode *N) {
                                                RTLIB::SUB_PPCF128));
 }
 
+SDValue DAGTypeLegalizer::SoftenFloatRes_FTAN(SDNode *N) {
+  return SoftenFloatRes_Unary(
+      N, GetFPLibCall(N->getValueType(0), RTLIB::TAN_F32, RTLIB::TAN_F64,
+                      RTLIB::TAN_F80, RTLIB::TAN_F128, RTLIB::TAN_PPCF128));
+}
+
 SDValue DAGTypeLegalizer::SoftenFloatRes_FTRUNC(SDNode *N) {
   return SoftenFloatRes_Unary(N, GetFPLibCall(N->getValueType(0),
                                               RTLIB::TRUNC_F32,
@@ -1361,6 +1369,10 @@ void DAGTypeLegalizer::ExpandFloatResult(SDNode *N, unsigned ResNo) {
   case ISD::FSQRT:      ExpandFloatRes_FSQRT(N, Lo, Hi); break;
   case ISD::STRICT_FSUB:
   case ISD::FSUB:       ExpandFloatRes_FSUB(N, Lo, Hi); break;
+  case ISD::STRICT_FTAN:
+  case ISD::FTAN:
+    ExpandFloatRes_FTAN(N, Lo, Hi);
+    break;
   case ISD::STRICT_FTRUNC:
   case ISD::FTRUNC:     ExpandFloatRes_FTRUNC(N, Lo, Hi); break;
   case ISD::LOAD:       ExpandFloatRes_LOAD(N, Lo, Hi); break;
@@ -1730,6 +1742,15 @@ void DAGTypeLegalizer::ExpandFloatRes_FSUB(SDNode *N, SDValue &Lo,
                                         RTLIB::SUB_PPCF128), Lo, Hi);
 }
 
+void DAGTypeLegalizer::ExpandFloatRes_FTAN(SDNode *N, SDValue &Lo,
+                                           SDValue &Hi) {
+  ExpandFloatRes_Unary(N,
+                       GetFPLibCall(N->getValueType(0), RTLIB::TAN_F32,
+                                    RTLIB::TAN_F64, RTLIB::TAN_F80,
+                                    RTLIB::TAN_F128, RTLIB::TAN_PPCF128),
+                       Lo, Hi);
+}
+
 void DAGTypeLegalizer::ExpandFloatRes_FTRUNC(SDNode *N,
                                              SDValue &Lo, SDValue &Hi) {
   ExpandFloatRes_Unary(N, GetFPLibCall(N->getValueType(0),
@@ -2423,6 +2444,7 @@ void DAGTypeLegalizer::PromoteFloatResult(SDNode *N, unsigned ResNo) {
     case ISD::FSIN:
     case ISD::FSQRT:
     case ISD::FTRUNC:
+    case ISD::FTAN:
     case ISD::FCANONICALIZE: R = PromoteFloatRes_UnaryOp(N); break;
 
     // Binary FP Operations
@@ -2854,6 +2876,7 @@ void DAGTypeLegalizer::SoftPromoteHalfResult(SDNode *N, unsigned ResNo) {
   case ISD::FSIN:
   case ISD::FSQRT:
   case ISD::FTRUNC:
+  case ISD::FTAN:
   case ISD::FCANONICALIZE: R = SoftPromoteHalfRes_UnaryOp(N); break;
 
   // Binary FP Operations
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h b/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
index 4a2c7b355eb528..3d29e98524a07a 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
@@ -566,6 +566,7 @@ class LLVM_LIBRARY_VISIBILITY DAGTypeLegalizer {
   SDValue SoftenFloatRes_FSIN(SDNode *N);
   SDValue SoftenFloatRes_FSQRT(SDNode *N);
   SDValue SoftenFloatRes_FSUB(SDNode *N);
+  SDValue SoftenFloatRes_FTAN(SDNode *N);
   SDValue SoftenFloatRes_FTRUNC(SDNode *N);
   SDValue SoftenFloatRes_LOAD(SDNode *N);
   SDValue SoftenFloatRes_SELECT(SDNode *N);
@@ -645,6 +646,7 @@ class LLVM_LIBRARY_VISIBILITY DAGTypeLegalizer {
   void ExpandFloatRes_FSIN      (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandFloatRes_FSQRT     (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandFloatRes_FSUB      (SDNode *N, SDValue &Lo, SDValue &Hi);
+  void ExpandFloatRes_FTAN(SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandFloatRes_FTRUNC    (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandFloatRes_LOAD      (SDNode *N, SDValue &Lo, SDValue &Hi);
   void ExpandFloatRes_XINT_TO_FP(SDNode *N, SDValue &Lo, SDValue &Hi);
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp
index 8f87ee8e09393a..18c044f0984fb0 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorOps.cpp
@@ -402,6 +402,7 @@ SDValue VectorLegalizer::LegalizeOp(SDValue Op) {
   case ISD::FSQRT:
   case ISD::FSIN:
   case ISD::FCOS:
+  case ISD::FTAN:
   case ISD::FLDEXP:
   case ISD::FPOWI:
   case ISD::FPOW:
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
index 985c9f16ab97cd..77ccaefc4cb268 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeVectorTypes.cpp
@@ -107,6 +107,7 @@ void DAGTypeLegalizer::ScalarizeVectorResult(SDNode *N, unsigned ResNo) {
   case ISD::FROUNDEVEN:
   case ISD::FSIN:
   case ISD::FSQRT:
+  case ISD::FTAN:
   case ISD::FTRUNC:
   case ISD::SIGN_EXTEND:
   case ISD::SINT_TO_FP:
@@ -1111,6 +1112,7 @@ void DAGTypeLegalizer::SplitVectorResult(SDNode *N, unsigned ResNo) {
   case ISD::VP_FROUNDEVEN:
   case ISD::FSIN:
   case ISD::FSQRT: case ISD::VP_SQRT:
+  case ISD::FTAN:
   case ISD::FTRUNC:
   case ISD::VP_FROUNDTOZERO:
   case ISD::SINT_TO_FP:
@@ -4308,6 +4310,7 @@ void DAGTypeLegalizer::WidenVectorResult(SDNode *N, unsigned ResNo) {
   case ISD::FROUNDEVEN:
   case ISD::FSIN:
   case ISD::FSQRT:
+  case ISD::FTAN:
   case ISD::FTRUNC:
     if (unrollExpandedOp())
       break;
diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
index dfbfaa8c894f55..0041a210ec77c3 100644
--- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAG.cpp
@@ -5269,6 +5269,7 @@ bool SelectionDAG::isKnownNeverNaN(SDValue Op, bool SNaN, unsigned Depth) const
   case ISD::FREM:
   case ISD::FSIN:
   case ISD::FCOS:
+  case ISD::FTAN:
   case ISD::FMA:
   case ISD::FMAD: {
     if (SNaN)
diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
index 5caf868c83a296..8a467f28f99ab2 100644
--- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
@@ -6690,6 +6690,7 @@ void SelectionDAGBuilder::visitIntrinsicCall(const CallInst &I,
   case Intrinsic::fabs:
   case Intrinsic::sin:
   case Intrinsic::cos:
+  case Intri...
[truncated]

tschuett · 2024-04-29T18:24:00Z

Nit: the declare is not necessary anymore. You touched the IRTranslator without a test at, e.g., https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll

farzonl · 2024-04-29T19:01:39Z

Nit: the declare is not necessary anymore. You touched the IRTranslator without a test at, e.g., https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/AArch64/GlobalISel/arm64-irtranslator.ll

The aarch64 backend tag is a misnomer because adding G_FTAN caused the legalizer to complain in this arm64 test case: llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

I could add

declare float @llvm.tan.f32(float)
define float @test_tan_f32(float %x) {
  ; CHECK-LABEL: name:            test_tan_f32
  ; CHECK: %{{[0-9]+}}:_(s32) = G_FTAN %{{[0-9]+}}
  %y = call float @llvm.tan.f32(float %x)
  ret float %y
}

But I feel like this should be added when a tan intrinsic lowering is done for arm64 in llvm/lib/Target/AArch64/AArch64ISelLowering.cpp

Also the llvm/test/CodeGen/X86/GlobalISel/x86_64-irtranslator.ll didn't have any cosine or sine cases so I didn't think to add a tangent one.

tschuett · 2024-04-29T19:08:29Z

Firstly, it is a nit. The IRTranslator translates LLVM-IR into GMIR for GlobalIsel. Testing the IRTranslator is independent of any SDAG changes.

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

llvm/lib/Target/X86/X86ISelLowering.cpp

farzonl · 2024-05-03T19:24:38Z

@RKSimon @topperc When you have time could I get a review. Thank you!

farzonl · 2024-05-08T21:53:54Z

@RKSimon @topperc Sorry to ping again, if you have time this week, I would really appreciate a review! Thanks in advance.

llvm/include/llvm/Analysis/VecFuncs.def

llvm/lib/Target/X86/X86ISelLowering.cpp

llvm/test/CodeGen/X86/llvm.tan.ll

llvm/lib/Target/X86/X86ISelLowering.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

…. NFC. Avoids a problem identified in #90503

…. NFC. Avoids a problem identified in llvm#90503

…trinsic (#93314)" (#93721) This reverts commit b15a0a3. This should undo PR: #93314 will need to re-open #91421 wait for #90503 to land

farzonl · 2024-05-30T16:46:28Z

@RKSimon all issues you raised have been addressed can I get another review?
@topperc There is one outstanding issue I want clarification on via setOperationAction usage in Expand cases, all other issues are addressed.

@junaire

copy over X86ISelLowering and x86 test cases from @junaire's https://reviews.llvm.org/D146905 fix test case run clang format fix unit test and documentation

github-actions · 2024-05-31T00:39:34Z

✅ With the latest revision this PR passed the C/C++ code formatter.

…nd for f32\f64 cases

…trinsic (llvm#93314)" (llvm#93721) This reverts commit b15a0a3. This should undo PR: llvm#93314 will need to re-open llvm#91421 wait for llvm#90503 to land

llvm/test/CodeGen/X86/llvm.tan.ll

farzonl · 2024-06-03T19:03:03Z

@RKSimon @topperc there are no more outstanding issues on this PR. Please let me know if this is ready for final review.

RKSimon

LGTM - cheers

farzonl marked this pull request as ready for review April 29, 2024 18:02

llvmbot added backend:AArch64 backend:X86 llvm:globalisel llvm:support llvm:SelectionDAG SelectionDAGISel as well llvm:ir llvm:analysis labels Apr 29, 2024

farzonl requested review from topperc and RKSimon April 29, 2024 18:07

tschuett reviewed Apr 29, 2024

View reviewed changes

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir Outdated Show resolved Hide resolved

topperc reviewed Apr 30, 2024

View reviewed changes

llvm/lib/Target/X86/X86ISelLowering.cpp Outdated Show resolved Hide resolved

farzonl mentioned this pull request May 7, 2024

[clang][hlsl] Add tan intrinsic part 1 #90276

Merged

farzonl force-pushed the add-tan-intrinsic-x86 branch from 1cfa415 to 1f6e3e5 Compare May 8, 2024 05:01

topperc reviewed May 8, 2024

View reviewed changes

llvm/include/llvm/Analysis/VecFuncs.def Show resolved Hide resolved

llvm/lib/Target/X86/X86ISelLowering.cpp Outdated Show resolved Hide resolved

llvm/lib/Target/X86/X86ISelLowering.cpp Outdated Show resolved Hide resolved

RKSimon reviewed May 9, 2024

View reviewed changes

RKSimon added a commit that referenced this pull request May 9, 2024

[DAG] Add clang-format off/on wrappers around compact switch handlers…

df21ee4

…. NFC. Avoids a problem identified in #90503

nhasabni pushed a commit to nhasabni/llvm-project that referenced this pull request May 14, 2024

[DAG] Add clang-format off/on wrappers around compact switch handlers…

8ecfbc2

…. NFC. Avoids a problem identified in llvm#90503

farzonl removed the backend:AArch64 label May 28, 2024

farzonl force-pushed the add-tan-intrinsic-x86 branch from 1f6e3e5 to ae988cd Compare May 28, 2024 17:34

llvmbot added the llvm:transforms label May 28, 2024

This was referenced May 29, 2024

[clang] Add tanf16 builtin and support for tan constrained intrinsic #93314

Merged

Revert "[clang] Add tanf16 builtin and support for tan constrained intrinsic (#93314)" #93721

Merged

farzonl added a commit that referenced this pull request May 29, 2024

Revert "[clang] Add tanf16 builtin and support for tan constrained in…

7348bb2

…trinsic (#93314)" (#93721) This reverts commit b15a0a3. This should undo PR: #93314 will need to re-open #91421 wait for #90503 to land

farzonl added 2 commits May 30, 2024 20:13

start of tan intrinsic

8dcbf0d

copy over X86ISelLowering and x86 test cases from @junaire's https://reviews.llvm.org/D146905 fix test case run clang format fix unit test and documentation

address pr comments

a55e3e6

farzonl force-pushed the add-tan-intrinsic-x86 branch from ae988cd to 6f3ab07 Compare May 31, 2024 00:36

address pr comments remove tan expand cases as redundant in x86 backe…

8eb5829

…nd for f32\f64 cases

farzonl force-pushed the add-tan-intrinsic-x86 branch from 6f3ab07 to 8eb5829 Compare May 31, 2024 00:42

RKSimon reviewed May 31, 2024

View reviewed changes

llvm/test/CodeGen/X86/llvm.tan.ll Outdated Show resolved Hide resolved

fix f80 test case

757d47c

farzonl self-assigned this May 31, 2024

RKSimon approved these changes Jun 3, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[x86] Add tan intrinsic part 4 #90503

[x86] Add tan intrinsic part 4 #90503

farzonl commented Apr 29, 2024 •

edited

llvmbot commented Apr 29, 2024 •

edited

tschuett commented Apr 29, 2024

farzonl commented Apr 29, 2024 •

edited

tschuett commented Apr 29, 2024

farzonl commented May 3, 2024

farzonl commented May 8, 2024

farzonl commented May 30, 2024

github-actions bot commented May 31, 2024 •

edited

farzonl commented Jun 3, 2024

RKSimon left a comment

[x86] Add tan intrinsic part 4 #90503

Are you sure you want to change the base?

[x86] Add tan intrinsic part 4 #90503

Conversation

farzonl commented Apr 29, 2024 • edited

llvmbot commented Apr 29, 2024 • edited

tschuett commented Apr 29, 2024

farzonl commented Apr 29, 2024 • edited

tschuett commented Apr 29, 2024

farzonl commented May 3, 2024

farzonl commented May 8, 2024

farzonl commented May 30, 2024

github-actions bot commented May 31, 2024 • edited

farzonl commented Jun 3, 2024

RKSimon left a comment

Choose a reason for hiding this comment

farzonl commented Apr 29, 2024 •

edited

llvmbot commented Apr 29, 2024 •

edited

farzonl commented Apr 29, 2024 •

edited

github-actions bot commented May 31, 2024 •

edited