AMDGPU stepthomas atomic csub no rtn forms ver2 #1

stepthomas · 2023-10-10T09:27:58Z

[HIP][Clang][Driver] Correctly specify test requirements as Linux + x86 + AMDGPU; temporarily retain targeted XFAILs for Hexagon & PS.
[lldb] [debugserver] Add spaces between sentences in a CMake warning. NFC.
[AMDGPU][GFX11] Do not rewrite V_FMA/FMAC_* to V_FMAAK_F16_t16 on operand legalization. ([AMDGPU][GFX11] Do not rewrite V_FMA/FMAC_* to V_FMAAK_F16_t16 on operand legalization. llvm/llvm-project#66202)
[clang][dataflow] HTML logger: Mark iterations that have converged. ([clang][dataflow] HTML logger: Mark iterations that have converged. llvm/llvm-project#68204)
Revert "[RISCV] Generaize reduction tree matching to all integer reductions ([RISCV] Generaize reduction tree matching to all integer reductions llvm/llvm-project#68014)"
[flang][hlfir] Pass vector subscripted elemental call arg by address ([flang][hlfir] Pass vector subscripted elemental call arg by address llvm/llvm-project#68097)
[libc] Change the GPU to use builtin memory functions ([libc] Change the GPU to use builtin memory functions llvm/llvm-project#68003)
[compiler-rt] Don't redefine LLVM_COMMON_CMAKE_UTILS if it's defined ([compiler-rt] Don't redefine LLVM_COMMON_CMAKE_UTILS if it's defined llvm/llvm-project#66761)
[lldb][FreeBSD] fix i386 size_t error when using LLDB_LOGF ([lldb][FreeBSD] fix i386 size_t error when using LLDB_LOGF llvm/llvm-project#68210)
[lldb][test] Skip platform attach test on Windows
[CodeGen] Respect pointer-overflow sanitizer for void pointers ([CodeGen] Respect pointer-overflow sanitizer for void pointers llvm/llvm-project#67772)
[SLP][NFC]Add a test for reused extracts corner case, NFC.
[HIP] Support compressing device binary ([HIP] Support compressing device binary llvm/llvm-project#67162)
[clang-repl] Disable InterpreterExceptionTest on RISC-V ([clang-repl] Disable InterpreterExceptionTest on RISC-V llvm/llvm-project#68216)
Fix Clang Sphinx build
Fix Sphinx build with incorrect heading levels; NFC
[X86] Add combine tests for pointers of mixed sizes (NFC) ([X86] Add combine tests for pointers of mixed sizes (NFC) llvm/llvm-project#68219)
Add explanatory comment to CODEOWNERS (NFC)
[llvm] Add myself to CODEOWNERS (NFC)
[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst.
[mlir][ArmSME] Split the Op definition (nfc) ([mlir][ArmSME] Split the Op definition (nfc) llvm/llvm-project#67985)
Fix test hip-offload-compress-zlib.hip
[AArch64][PAC] Specify Defs and Uses of PAUTH_(PROLOGUE|EPILOGUE)
[flang][runtime] Added Assign runtime to CUDA build closure. ([flang][runtime] Added Assign runtime to CUDA build closure. llvm/llvm-project#68171)
[libc++] Remove dead code in legacy_debug_handler.cpp ([libc++] Remove dead code in legacy_debug_handler.cpp llvm/llvm-project#68155)
[LLD][COFF] Mark operator== const to avoid ambiguity in C++20. ([LLD][COFF] Mark operator== const to avoid ambiguity in C++20. llvm/llvm-project#68119)
[libc][NFC] Fix -Wdangling-else when compiling libc with gcc >= 7 ([libc][NFC] Fix -Wdangling-else when compiling libc with gcc >= 7 llvm/llvm-project#67833)
[mlir][sparse] fix codegen header ordering of methods into sections ([mlir][sparse] fix codegen header ordering of methods into sections llvm/llvm-project#68175)
[lldb] Mark operator== const to avoid ambiguity in C++20. ([lldb] Mark operator== const to avoid ambiguity in C++20. llvm/llvm-project#68224)
[clang] Choose non-templated ctor as deduction guide unambiguously ([clang] Choose non-templated ctor as deduction guide unambiguously llvm/llvm-project#66487)
[bazel] Port 8d6d4f8
[RISCV] Don't try to form VECREDUCE without vector instructions
[libc++][NFC] Document missing __pstl_merge function in PSTL basis operations
[RISCV] Generaize reduction tree matching to all integer reductions ([RISCV] Generaize reduction tree matching to all integer reductions llvm/llvm-project#68014) (reapply)
Re-generate pow-4.ll in preparation for D141060
Auto-generate test checks for tests affected by D141060
Regenerate test checks for tests affected by D141060
Revert "[clang] Predefined macros for float128 support ([clang] Predefined macros for float128 support llvm/llvm-project#67196)"
[NFC]Rename InstrProf::getFuncName{,orExternalSymbol} to getFuncOrValName{,IfDefined} ([NFC]Rename InstrProf::getFuncName{,orExternalSymbol} to getValueName{,orExternalSymbol} llvm/llvm-project#68240)
[LangRef] Specify NaN behavior more precisely ([LangRef] Specify NaN behavior more precisely llvm/llvm-project#66579)
[LinkerWrapper] Fix resolution of weak symbols during LTO ([LinkerWrapper] Fix resolution of weak symbols during LTO llvm/llvm-project#68215)
[Libomptarget] Make the DeviceRTL configuration globals weak ([Libomptarget] Make the DeviceRTL configuration globals weak llvm/llvm-project#68220)
[Libomptarget] Explicitly pass the OpenMP device libraries to tests ([Libomptarget] Explicitly pass the OpenMP device libraries to tests llvm/llvm-project#68225)
[RISCV][ISel] Fix comment to match direction of predicate in code. NFC. ([RISCV][ISel] Fix comment to match direction of predicate in code. NFC. llvm/llvm-project#68248)
[clang-format][NFC] AlignTokens: Rename Changes[i] to CurrentChange ([clang-format][NFC] AlignTokens: Rename Changes[i] to CurrentChange llvm/llvm-project#68152)
[clang-format][NFC] AlignTokenSequence: Rename Changes[i] to CurrentC…
[libc] Fix typo in long double negative block ([libc] Fix typo in long double negative block llvm/llvm-project#68243)
[lld/ELF] Don't relax R_X86_64_(REX_)GOTPCRELX when offset is too far
[clang] Default x86-64's medium code model -mlarge-data-threshold to 65535 ([clang] Default x86-64's medium code model -mlarge-data-threshold to 65535 llvm/llvm-project#67506)
[clang-format][NFC] AlignTokenSequence: Skip loop iteration
[Libomptarget] Disable AMDGPU complex math test after recent patch
[RISCV] Relax vslide*_vl patterns to allow any mask. NFC ([RISCV] Relax vslide*_vl patterns to allow any mask. NFC llvm/llvm-project#68203)
[RISCV][GlobalISel] Legalize G_FRAME_INDEX ([RISCV][GlobalISel] Legalize G_FRAME_INDEX llvm/llvm-project#67746)
[RISCV] Fix illegal build_vector when lowering double id buildvec on RV32 ([RISCV] Fix illegal build_vector when lowering double id buildvec on RV32 llvm/llvm-project#67017)
[AMDGPU]: Allow combining into v_dot4
[mlir][sparse] Print new syntax ([mlir][sparse] Print new syntax llvm/llvm-project#68130)
[Clang][CodeGen][NFC] Add (broken) test case for GH67937
[libc++][NFC] Fix broken formatting in comment
Introduce and use codegen::createTargetMachineForTriple()
opt: Don't exit when we can't create a TargetMachine
[libc] Add x86-64 stack protector support.
Revert "[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst."
[Clang][CodeGen] Fix use of CXXThisValue with StrictVTablePointers ([Clang][CodeGen] Fix use of CXXThisValue with StrictVTablePointers llvm/llvm-project#68169)
[runtimes] Fix parsing of LIB{CXX,CXXABI,UNWIND}_TEST_PARAMS ([runtimes] Fix parsing of LIB{CXX,CXXABI,UNWIND}_TEST_PARAMS llvm/llvm-project#67691)
[libc++] Explicitly pass execution policies to _LIBCPP_PSTL_CUSTOMIZATION_POINT ([libc++] Explicitly pass execution policies to _LIBCPP_PSTL_CUSTOMIZATION_POINT llvm/llvm-project#68238)
Revert "[DAG] Attempt shl narrowing in SimplifyDemandedBits"
use std::make_unique rather than reset+new
[libc++] Fix implementation of iota_view::size ([libc++] Fix implementation of iota_view::size llvm/llvm-project#67819)
Type: Clarify comment for isIEEELikeFPTy
InstCombine: Add baseline test for SimplifyDemandedFPClass
[Modules] no_undeclared_includes modules (Apple Darwin) don't work the clang modules ([Modules] no_undeclared_includes modules (Apple Darwin) don't work the clang modules llvm/llvm-project#68241)
[Clang][Driver] Add new flags to control IR verification ([Clang][Driver] Add new flags to control IR verification llvm/llvm-project#68172)
DWARFContext: use std::make_unique rather than reset+new
[BOLT] Fix 32-bit overflow in checkOffsets/checkVMA ([BOLT] Fix 32-bit overflow in checkOffsets/checkVMA llvm/llvm-project#68274)
[Clang] Implement the 'counted_by' attribute
[mlir][python] Enable py312. ([mlir][python] Enable py312. llvm/llvm-project#68009)
[Support] Rename HashBuilderImpl to HashBuilder (NFC) ([Support] Rename HashBuilderImpl to HashBuilder (NFC) llvm/llvm-project#68173)
[Support] Rename llvm::support::endianness to llvm::endianness ([Support] Rename llvm::support::endianness to llvm::endianness llvm/llvm-project#68174)
InstCombine: Introduce SimplifyDemandedUseFPClass
[mlgo][coro] Assign coro split-ed functions a FunctionLevel ([mlgo][coro] Assign coro split-ed functions a FunctionLevel llvm/llvm-project#68263)
[mlir][Vector] Add Broadcast -> CastOp reordering to SinkVectorBroadcasting patterns. ([mlir][Vector] Add Broadcast -> CastOp reordering to SinkVectorBroadcasting patterns. llvm/llvm-project#68257)
Revert "[HIP] Support compressing device binary ([HIP] Support compressing device binary llvm/llvm-project#67162)"
[mlgo] Fix state-tracking-coro.ll test
[clang-tidy]: Add TagDecl into LastTagDeclRanges in UseUsingCheck only when it is a definition ([clang-tidy]: Add TagDecl into LastTagDeclRanges in UseUsingCheck only when it is a definition llvm/llvm-project#67639)
[clang][ExprConst] Don't try to evaluate value-dependent DeclRefExprs ([clang][ExprConst] Don't try to evaluate value-dependent DeclRefExprs llvm/llvm-project#67778)
[MLIR][NVGPU] Change name wgmma.descriptor to warpgroup.descriptor (NFC) ([MLIR][NVGPU] Change name wgmma.descriptor to warpgroup.descriptor (NFC) llvm/llvm-project#67526)
[ValueTracking] Return ConstantRange instead of setting limits (NFC)
[SystemZ][z/OS] Update lowerCall ([SystemZ][z/OS] Update lowerCall llvm/llvm-project#68259)
[clang][Interp] Only lazily visit constant globals
[mlir][Transform] NFC - Fix missing field in copy constructor
[clang][Interp] Support LambdaThisCaptures
[ValueTracking] Add SimplifyQuery ctor without TLI (NFC)
[InstSimplify] Add missing const qualifier (NFC)
[mlir] Change the class name of the GenerateWarpgroupDescriptor ([mlir] Change the class name of the GenerateWarpgroupDescriptor llvm/llvm-project#68286)
[mlir][nvgpu] Improve nvgpu->nvvm transformation of warpgroup.mma Op (NFC) ([mlir][nvgpu] Improve nvgpu->nvvm transformation of warpgroup.mma Op (NFC) llvm/llvm-project#67325)
[Clang] Fix constant evaluating a captured variable in a lambda ([Clang] Fix constant evaluating a captured variable in a lambda llvm/llvm-project#68090)
[clang-format][doc] Update the Linux kernel coding style URL
Revert "InstCombine: Introduce SimplifyDemandedUseFPClass"
Reapply [compiler-rt] Check for and use -lunwind when linking with -nodefaultlibs (Reapply [compiler-rt] Check for and use -lunwind when linking with -nodefaultlibs llvm/llvm-project#66584)
[BOLT][RISCV] Implement TLS le/ie relocations ([BOLT][RISCV] Implement TLS le/ie relocations llvm/llvm-project#67112)
[MLIR][NVGPU] Introduce nvgpu.wargroup.mma.store Op for Hopper GPUs ([MLIR][NVGPU] Introduce nvgpu.wargroup.mma.store Op for Hopper GPUs llvm/llvm-project#65441)
[BOLT][RISCV] Handle long tail calls ([BOLT][RISCV] Handle long tail calls llvm/llvm-project#67098)
[Lex] Introduce Preprocessor::LexTokensUntilEOF()
[bazel] fix typo
[mlir][bazel] Fix after d20fbc9
[mlir][bazel] Sort targets list.
[GVN] Remove users from ICF when RAUWing loads
[AArch64] [LoopVectorize] Use either fixed-width or scalable VF when tail-folding ([AArch64][LoopVectorize] Use either fixed-width or scalable VF when tail-folding llvm/llvm-project#67543)
[mlir] Speed up FuncToLLVM using a SymbolTable ([mlir] Speed up FuncToLLVM using a SymbolTable llvm/llvm-project#68082)
[GVN] Fix after llvm@46aac94
[Clang] Handle consteval expression in array bounds expressions ([Clang] Handle consteval expression in array bounds expressions llvm/llvm-project#66222)
[lldb][DWARFASTParserClang][NFCI] Extract DW_AT_data_member_location calculation logic ([lldb][DWARFASTParserClang][NFCI] Extract DW_AT_data_member_location calculation logic llvm/llvm-project#68231)
[mlir] Fix empty-tensor-elimination around self-copies ([mlir]: fix a issue and refine some code (#67977) llvm/llvm-project#68129)
[flang]Pass to add vscale range attribute ([flang]Pass to add vscale range attribute llvm/llvm-project#68103)
[Lex] Handle repl_input_end in Preprocessor::LexTokensUntilEOF()
[CostModel][X86] getShuffleCost - add fallback (to half vector) for bfloat vector shuffle costs
Revert "[X86] Change target of __builtin_ia32_cmp[p|s][s|d] from avx into sse/sse2 ([X86] Change target of __builtin_ia32_cmp[p|s][s|d] from avx into sse/sse2 llvm/llvm-project#67410)"
[AMDGPU][CodeGen] Fold immediates in src1 operands of V_MAD/MAC/FMA/FMAC. ([AMDGPU][CodeGen] Fold immediates in src1 operands of V_MAD/MAC/FMA/FMAC. llvm/llvm-project#68002)
[mlir][docs] Cleanup documentations [NFC] ([mlir][docs] Cleanup documentations [NFC] llvm/llvm-project#67945)
[InstCombine] Add pre-commit tests for [InstCombine] Canonicalize (X +/- Y) & Y into ~X & Y when Y is a power of 2 llvm/llvm-project#67915. NFC.
[mlir][bufferization] Add dump_alias_sets option to transform op ([mlir][bufferization] Add dump_alias_sets option to transform op llvm/llvm-project#68289)
[mlir][tensor][bufferize] tensor.empty bufferizes to allocation ([mlir][tensor][bufferize] tensor.empty bufferizes to allocation llvm/llvm-project#68201)
[C2X] N3007 Type inference for object definitions
Fix LLVM Sphinx build
Inline operator== and operator!= (Inline operator== and operator!= llvm/llvm-project#67958)
[CVP] Add pre-commit cttz/ctpop tests. NFC.
[mlir][MemRef] Add a pattern to simplify `extract_strided_metadata(ca… ([mlir][MemRef] Add a pattern to simplify `extract_strided_metadata(ca… llvm/llvm-project#68291)
[mlir][tensor][bufferize] Reshapes: Fix memory side effects and memory space ([mlir][tensor][bufferize] Reshapes: Fix memory side effects and memory space llvm/llvm-project#68195)
AMDGPU/GlobalISel: Handle mubuf load/store for more types (AMDGPU/GlobalISel: Handle mubuf load/store for more types llvm/llvm-project#68268)
[BitcodeReader] Replace unsupported constexprs in metadata with undef
[Documentation] Fix some invalid references in sphinx documentation ([Documentation] Fix some invalid references in sphinx documentation llvm/llvm-project#68239)
[CVP] Add additional cttz tests. NFC.
Revert "[C2X] N3007 Type inference for object definitions"
Attributor: Add a few nofpclass tests
Attributor: Fix not propagating nofpclass arguments through transitive callers
[SLP][NFC]Add insertsubvector test with small source vector, NFC.
[libc++] Make future_error constructor standard-compliant
[IR]Add NumSrcElts param to is..Mask static function in ShuffleVectorInst.
[libc++] Use correct size for deallocation of arrays in shared_ptr ([libc++] Use correct size for deallocation of arrays in shared_ptr llvm/llvm-project#68233)
[flang]Fix tests broken on non-Aarch64 builds ([flang]Fix tests broken on non-Aarch64 builds llvm/llvm-project#68306)
[llvm] Replace uses of Type::getPointerTo (NFC)
[RISCV][CostModel] VPIntrinsics have same cost as their non-vp counterparts ([RISCV][CostModel] VPIntrinsics have same cost as their non-vp counterparts llvm/llvm-project#67178)
[clang][Sema][NFC] Remove an unnecessary static_cast
Reapply "InstCombine: Introduce SimplifyDemandedUseFPClass"
[clang] Replace uses of Type::getPointerType (NFC)
[InstCombine] Add test coverage for sext/zext boolean additions (NFC)
[mlir][dataflow] Remove early exit in dead code analysis for zero-operand returns ([mlir][dataflow] Remove early exit in dead code analysis for zero-operand returns llvm/llvm-project#68151)
Reland "[HIP] Support compressing device binary"
[DX] Add support for program signatures ([DX] Add support for program signatures llvm/llvm-project#67346)
[mlir][ArmSME] Switch to using custom documentation ([mlir][ArmSME] Switch to using custom documentation llvm/llvm-project#68110)
[clang] Subscribe to DR changes
[Libomptarget] Fix lookup of the libcgpu.a library
[TableGen][GISel] Fix incorrect binding of predicate operands upon PredicateUsesOperands = 1 ([TableGen][GISel] Fix incorrect binding of predicate operands upon PredicateUsesOperands = 1 llvm/llvm-project#68125)
Tli nfc fix mechanism propagating mangled names for tli function mappings ac3 (Tli nfc fix mechanism propagating mangled names for tli function mappings ac3 llvm/llvm-project#67308)
[DX] Fix changed meaning of 'Signature' after [DX] Add support for program signatures llvm/llvm-project#67346
[C2X] N3007 Type inference for object definitions
[JITLink] Some cleanups to EHFrameSupport ([JITLink] Some cleanups to EHFrameSupport llvm/llvm-project#66707)
[Support] Deprecate system_endianness ([Support] Deprecate system_endianness llvm/llvm-project#68279)
Ensure NoTrapAfterNoreturn is false for the wasm backend (Ensure NoTrapAfterNoreturn is false for the wasm backend llvm/llvm-project#65876)
[Flang][OpenMP] NFC: Port three tests with minimal changes to HLFIR flow
[clang-tidy][IncludeCleaner] Fix analysis supression in presence of verbatim spellings ([clang-tidy][IncludeCleaner] Fix analysis supression in presence of verbatim spellings llvm/llvm-project#68185)
[mlir][llvm] Fix elem type passing into getelementptr ([mlir][llvm] Fix elem type passing into getelementptr llvm/llvm-project#68136)
Specified particular known to be good versions of Sphinx and dependencies
[PowerPC] Add the SCV instruction. ([PowerPC] Add the SCV instruction. llvm/llvm-project#68063)
Fix MLIR FuncOp documentation: declaration must be private (NFC)
InstCombine: Handle copysign in SimplifyDemandedFPClass
[libc++] Remove UB in list, forward_list and __hash_table
AMDGPU/GlobalISel: Add global-isel run lines to shrink add/sub test
AMDGPU/GlobalISel: Add test for packed sub selection
Fix the ARM bots
[ValueTracking] Try to infer range of select from true and false values. ([ValueTracking] Try to infer range of select from true and false values. llvm/llvm-project#68256)
[libcxx] replaces SFINAE with requires-expressions in bind_front and bind_back ([libcxx] replaces SFINAE with requires-expressions in bind_front and bind_back llvm/llvm-project#68249)
[RISCV] Use early return to simplify isFPImmLegal [nfc]
[mlir][openacc][NFC] Remove useless OptionalAttr with UnitAttr ([mlir][openacc][NFC] Remove useless OptionalAttr with UnitAttr llvm/llvm-project#68337)
[VectorCombine] foldBitcastShuf - compute scale factors using shuffle type element size instead of element count. NFCI.
[DX] Fix copypasta that caused big-endian failure
[LLVM][DWARF] Add support for monolithic types in .debug_names ([LLVM][DWARF] Add support for monolithic types in .debug_names llvm/llvm-project#68131)
[scudo] Fix the use of ASSERT_CAPABILITY in TSD ([scudo] Fix the use of ASSERT_CAPABILITY in TSD llvm/llvm-project#68273)
Use BlockFrequency type in more places (NFC) (Use BlockFrequency type in more places (NFC) llvm/llvm-project#68266)
Revert "[LLVM][DWARF] Add support for monolithic types in .debug_names ([LLVM][DWARF] Add support for monolithic types in .debug_names llvm/llvm-project#68131)"
Fixes and closes clang::ASTWriter can create a crashing PCH if an incorrect hasErrors value is passed llvm/llvm-project#53952. Setting the ASTHasCompilerErrors member variable correctly based on the PP diagnostics. (Fixes and closes #53952. Setting the ASTHasCompilerErrors member variable correctly based on the PP diagnostics. llvm/llvm-project#68127)
[clang] Correct behavior of LLVM_UNREACHABLE_OPTIMIZE=OFF for Release builds ([clang] Correct behavior of LLVM_UNREACHABLE_OPTIMIZE=OFF for Release builds llvm/llvm-project#68284)
[libc++] Add std::fpclassify overloads for floating-point. ([libc++] Add std::fpclassify overloads for floating-point. llvm/llvm-project#67913)
[flang][openacc] Provide extent in bounds when available ([flang][openacc] Provide extent in bounds when available llvm/llvm-project#68162)
[flang][openacc] Add support for allocatable and pointer arrays in reduction ([flang][openacc] Add support for allocatable and pointer arrays in reduction llvm/llvm-project#68261)
Revert "Fixes and closes clang::ASTWriter can create a crashing PCH if an incorrect hasErrors value is passed llvm/llvm-project#53952. Setting the ASTHasCompilerErrors member variable correctly based on the PP diagnostics. (Fixes and closes #53952. Setting the ASTHasCompilerErrors member variable correctly based on the PP diagnostics. llvm/llvm-project#68127)"
ValueTracking: Use fcAllFlags for unknown value (ValueTracking: Use fcAllFlags for unknown value llvm/llvm-project#66393)
Introduce the initial support for OpenMP kernel language ([OpenMP] Introduce the initial support for OpenMP kernel language llvm/llvm-project#66844)
[OpenMP] Prevent AMDGPU from overriding visibility on DT_nohost variables ([OpenMP] Prevent AMDGPU from overriding visibility on DT_nohost variables llvm/llvm-project#68264)
MachineFunctionPass: Clear properties before running function (MachineFunctionPass: Clear properties before running function llvm/llvm-project#67962)
[mlir][memref] Fix emulate narrow types for strided memref offset ([mlir][memref] Fix emulate narrow types for strided memref offset llvm/llvm-project#68181)
[LLDB] Allow specifying a custom exports file ([LLDB] Allow specifying a custom exports file llvm/llvm-project#68013)
Revert "[mlir][llvm] Fix elem type passing into getelementptr ([mlir][llvm] Fix elem type passing into getelementptr llvm/llvm-project#68136)"
[Support, ADT] Move llvm::endianness to bit.h ([Support, ADT] Move llvm::endianness to bit.h llvm/llvm-project#68280)
BlockFrequencyInfo: Add PrintBlockFreq helper (BlockFrequencyInfo: Add PrintBlockFreq helper llvm/llvm-project#67512)
[clang-format] Fix an error message
[LLD][COFF] Add support for --time-trace ([LLD][COFF] Add support for --time-trace llvm/llvm-project#68236)
[MLIR] NFC. Move remaining affine test cases to its dialect dir ([MLIR] NFC. Move remaining affine test cases to its dialect dir llvm/llvm-project#67921)
[DWARFLinkerParallel] Use llvm::endianness::native (NFC)
[ProfileData] Remove getHostEndianness (NFC)
[BOLT][NFC] Add MCSubtargetInfo to MCPlusBuilder ([BOLT][NFC] Add MCSubtargetInfo to MCPlusBuilder llvm/llvm-project#68223)
[clang][Diagnostics] Add bitfield source range to zero width diags ([clang][Diagnostics] Add bitfield source range to zero width diags llvm/llvm-project#68312)
[BOLT] Improve handling of relocations targeting specific instructions ([BOLT] Improve handling of relocations targeting specific instructions llvm/llvm-project#66395)
[flang][hlfir] Fix c_null_ptr lowering in structure constructors ([flang][hlfir] Fix c_null_ptr lowering in structure constructors llvm/llvm-project#68321)
[mlir][Transform] Add a transform.match.operation_empty op to allow s… ([mlir][Transform] Add a transform.match.operation_empty op to allow s… llvm/llvm-project#68319)
[flang][nfc] replace fir.dispatch_table with more generic fir.type_info ([flang][nfc] replace fir.dispatch_table with more generic fir.type_info llvm/llvm-project#68309)
[AMDGPU] Use correct operand order for shifts ([AMDGPU] Use correct operand order for shifts llvm/llvm-project#68299)
[llvm-rc] add support for MENUEX ([llvm-rc] add support for MENUEX llvm/llvm-project#67464)
[clang][Sema] Only check RVV types if we have them ([clang][Sema] Only check RVV types if we have them llvm/llvm-project#67669)
[NFC][compiler-rt] Fix typo in FuzzedDataProvider.h doc
Reland "AMDGPU: Duplicate instead of COPY constants from VGPR to SGPR (AMDGPU: Duplicate instead of COPY constants from VGPR to SGPR llvm/llvm-project#66882)"
[lldb][DWARFASTParserClang] Check DW_AT_declaration to determine static data members ([lldb][DWARFASTParserClang] Check DW_AT_declaration to determine static data members llvm/llvm-project#68300)
Re-apply "[AArch64] Enable "sink-and-fold" in MachineSink by default ([AArch64] Enable "sink-and-fold" in MachineSink by default llvm/llvm-project#67432)"
[nvvm] use check-next in nvvm-to-llvm test (nfc) ([nvvm] use check-next in nvvm-to-llvm test (nfc) llvm/llvm-project#68326)
Reapply "[clang analysis][thread-safety] Handle return-by-reference..… (Reapply "[clang analysis][thread-safety] Handle return-by-reference..… llvm/llvm-project#68394)
[mlir][transform] Fix handling of transitive include in interpreter. ([mlir][transform] Fix handling of transitive include in interpreter. llvm/llvm-project#67560)
[mlir] Fix unused from after llvm@7876899
[libc++] Recategorize additional instantiations in the dylib as availability macros
[mlir][bazel] Disable test added in llvm@7876899
[InstCombine] Fold comparison of adding two z/sext booleans ([InstCombine] Fold comparison of adding two z/sext booleans llvm/llvm-project#67895)
[libc++] Implement P2614R2 (Deprecate numeric_limits::has_denorm)
[libc++] Bump the clang version the clang-tidy checks are based on ([libc++] Bump the clang version the clang-tidy checks are based on llvm/llvm-project#68318)
[mlir][bufferization] MaterializeInDestinationOp: Support memref destinations ([mlir][bufferization] MaterializeInDestinationOp: Support memref destinations llvm/llvm-project#68074)
[MLIR] NFC. Fix clang-tidy warnings in Affine Utils
Revert "[libc++] Remove UB in list, forward_list and __hash_table"
[lld][ELF][AVR] Add range check for R_AVR_13_PCREL ([lld][ELF][AVR] Add range check for R_AVR_13_PCREL llvm/llvm-project#67636)
[lldb][DWARFASTParserClang][NFC] Fix comment regarding static data member detection ([lldb][DWARFASTParserClang][NFC] Fix comment regarding static data member detection llvm/llvm-project#68405)
[mlir][transform] Allow passing various library files to interpreter. ([mlir][transform] Allow passing various library files to interpreter. llvm/llvm-project#67120)
[VectorCombine][X86] Add additional length changing foldBitcastShuf tests
[VectorCombine] foldBitcastShuf - add support for length changing shuffles
[TTI] improveShuffleKindFromMask - detect SK_ExtractSubvector patterns from SK_PermuteSingleSrc
[PATCH] [llvm] [InstCombine] Canonicalise ADD+GEP
[AArch64][SME] Tile slices to lazy-save/restore should be RDSVL. ([AArch64][SME] Tile slices to lazy-save/restore should be RDSVL. llvm/llvm-project#68403)
Revert "Revert "Fixes and closes clang::ASTWriter can create a crashing PCH if an incorrect hasErrors value is passed llvm/llvm-project#53952. Setting the ASTHasCompilerErrors member variable correctly based on the PP diagnostics. (Fixes and closes #53952. Setting the ASTHasCompilerErrors member variable correctly based on the PP diagnostics. llvm/llvm-project#68127)""
[clang][NFC] Add missing placement-new after Allocate() calls ([clang][NFC] Add missing placement-new after Allocate() calls llvm/llvm-project#68382)
[mlir][bazel] Fix after llvm@6a2071c
[mlir][bazel] Fix after llvm@6a2071c
[TTI] Fix -Wsign-compare in BasicTTIImpl.h (NFC)
[mlir][Transform] Provide a minimal set of utils that allow implementing a simple transform dialect interpreter pass ([mlir][Transform] Provide a minimal set of utils that allow implementing a simple transform dialect interpreter pass llvm/llvm-project#68330)
[clang][CodeGen] Regenerate tests checks after 94795a3
[InstCombine] Add additional pre-commit tests for [InstCombine] Canonicalize (X +/- Y) & Y into ~X & Y when Y is a power of 2 llvm/llvm-project#67915. NFC.
[mlir][bazel] Fix after llvm@ef8c26b
[AArch64][SME] NFC: use update_test_checks.py for sme-pstate(sm|za)-attrs.ll
[InstCombine] Simplify the pattern a ne/eq (zext/sext (a ne/eq c)) ([InstCombine] Simplify the pattern a ne/eq (zext/sext (a ne/eq c)) llvm/llvm-project#65852)
Revert "MachineSink: Fix sinking VGPR def out of a divergent loop"
AMDGPU: Add test for temporal divergence introduced by machine-sink
AMDGPU: Fix temporal divergence introduced by machine-sink (AMDGPU: Fix temporal divergence introduced by machine-sink and performance regression introduced by D155343 llvm/llvm-project#67456)
[mlir][bazel] Fix after llvm@ef8c26b
[mlir] Fix -Wunused-function in TransformInterpreterPassBase.cpp (NFC)
[mlir][bazel] Fix after llvm@ef8c26b
[libc] Enable missing memory tests on the GPU ([libc] Enable missing memory tests on the GPU llvm/llvm-project#68111)
[mlir][VectorOps] Don't fold extract chains that include dynamic indices ([mlir][VectorOps] Don't fold extract chains that include dynamic indices llvm/llvm-project#68333)
[SPIRV] Implement log10 for logical SPIR-V ([SPIRV] Implement log10 for logical SPIR-V llvm/llvm-project#66921)
[flang] Update instructions for a standalone flang build ([flang] Update instructions for a standalone flang build llvm/llvm-project#68361)
[IndVars] Add test for Wrong code at -O2 on x86_64-linux_gnu since 3ddd1ff llvm/llvm-project#68260 (NFC)
[PowerPC] Fix missing kill flag update for XVCVDPSP transformations ([PowerPC] Fix missing kill flag update for XVCVDPSP transformations llvm/llvm-project#67997)
[lldb][Docs] Add section on using QEMU without bridge networking
[OpenMP][OpenMPIRBuilder] Move copyInput to a passed in lambda function and re-order kernel argument load/stores ([OpenMP][OpenMPIRBuilder] Move copyInput to a passed in lambda function and re-order kernel argument load/stores llvm/llvm-project#68124)
[DWARFLinker] Release input DWARF after object has been linked ([DWARFLinker] Release input DWARF after object has been linked llvm/llvm-project#68376)
[llvm][Docs][llvm-cov] Correct list of export options
[GitHub] Add myself to CODEOWNERS for LLDB (NFC)
Fix typo "x84_64" (Fix typo "x84_64" llvm/llvm-project#68419)
[AArch64][BTI] Prevent Machine Scheduler from moving branch targets ([AArch64][BTI] Prevent Machine Scheduler from moving branch targets llvm/llvm-project#68313)
[lldb[test] TestCppUnionStaticMembers.py: XFAIL assertions on windows ([lldb[test] TestCppUnionStaticMembers.py: XFAIL assertions on windows llvm/llvm-project#68408)
[flang][openacc] Do not generate duplicate routine op ([flang][openacc] Do not generate duplicate routine op llvm/llvm-project#68348)
[DebugInfo][SelectionDAG] Add debug info salvaging for TRUNC nodes
[RISCV] Add autogen header to autogen test [nfc]
[AArch64][SME] Add remarks to flag lazy ZA saves, and SMSTART/SMSTOP transitions ([AArch64][SME] Add remarks to flag lazy ZA saves, and SMSTART/SMSTOP transitions llvm/llvm-project#68255)
[AArch64] Fix for misched-branch-targets.mir test
[mlir][vector] Constrain patterns: vector.contract -> vector.outerproduct
[InstCombine] Retain exact instruction name for some cases in SimplifyDemandedUseBits. ([InstCombine] Retain exact instruction name for some cases in SimplifyDemandedUseBits. llvm/llvm-project#68371)
[MLIR][Presburger] Fix reduce bug in Fraction class and add tests ([MLIR][Presburger] Fix reduce bug in Fraction class and add tests llvm/llvm-project#68298)
[SelectionDAG] Fix an unused variable warning
[RISCV] Strip W suffix from ADDIW ([RISCV] Strip W suffix from ADDIW llvm/llvm-project#68425)
[RISCV] Support VLS for VCIX ([RISCV] Support VLS for VCIX llvm/llvm-project#67289)
[lldb] Expose SBPlatform::GetAllProcesses to the SB API ([lldb] Expose SBPlatform::GetAllProcesses to the SB API llvm/llvm-project#68378)
[gn build] Port 8f378ff
[RISCV][GISel] Select G_SELECT ([RISCV][GISel] Select G_SELECT llvm/llvm-project#67614)
[compiler-rt] Allow Fuchsia to use 64-bit allocator for RISCV ([compiler-rt] Allow Fuchsia to use 64-bit allocator for RISCV llvm/llvm-project#68343)
Revert "[RISCV][CostModel] VPIntrinsics have same cost as their non-vp counterparts ([RISCV][CostModel] VPIntrinsics have same cost as their non-vp counterparts llvm/llvm-project#67178)"
[CodeLayout] Faster basic block reordering, ext-tsp ([CodeLayout] Faster basic block reordering, ext-tsp llvm/llvm-project#68275)
Revert "[CodeLayout] Faster basic block reordering, ext-tsp ([CodeLayout] Faster basic block reordering, ext-tsp llvm/llvm-project#68275)"
[StatepointLowering] Precommit test for [StatepointLowering] Take return attributes of gc.result into account llvm/llvm-project#68439
Make -frewrite-includes put an endif at the end of the included text (Make -frewrite-includes put an endif at the end of the included text llvm/llvm-project#67613)
[clang][modules] Remove preloaded SLocEntries from PCM files ([clang][modules] Remove preloaded SLocEntries from PCM files llvm/llvm-project#66962)
[NFC] Change a reference member to pointer
Add -fkeep-system-includes modifier for -E
[Basic] Fix a warning
[mlir] Fix lower_unpack when dynamic dimensions are involved ([mlir] Fix lower_unpack when dynamic dimensions are involved llvm/llvm-project#68423)
[AArch64][SME] Fix generating incorrect TBZ when lowering lazy save. ([AArch64][SME] Fix generating incorrect TBZ when lowering lazy save. llvm/llvm-project#68429)
[mlir][sparse] introduce MapRef, unify conversion/codegen for reader ([mlir][sparse] introduce MapRef, unify conversion/codegen for reader llvm/llvm-project#68360)
add support for riscv64
Revert Wframe-larger-than to 530
[ADT] Add more ArrayRef <-> StringRef conversion functions
[Support] Introduce ThreadSafeAllocator
[ADT] Introduce LazyAtomicPointer
[libc++] Optimize ranges::count for __bit_iterators
[libc++][PSTL] Overhaul exceptions handling
[clang-tidy][libc] Fix namespace check with macro ([clang-tidy][libc] Fix namespace check with macro llvm/llvm-project#68134)
[mlir][sparse] introduce a pass to stage complex sparse operations in… ([mlir][sparse] introduce a pass to stage complex sparse operations in… llvm/llvm-project#68436)
[scudo] Improve the message of region exhaustion ([scudo] Improve the message of region exhaustion llvm/llvm-project#68444)
[llvm][Support] fix convertToSnakeFromCamelCase ([llvm][Support] fix convertToSnakeFromCamelCase llvm/llvm-project#68375)
Add '-p' argument to mkdir in test so that it does not give an error if the directory already exists.
[clang][modules] Move SLocEntry search into ASTReader ([clang][modules] Move SLocEntry search into ASTReader llvm/llvm-project#66966)
Revert "add support for riscv64"
[MachineSink] Fix crash due to use-after-free in a MachineInstr* cache.
[clang] Fix build after 537344f
Fix non-determinism in debuginfo (Fix non-determinism in debuginfo llvm/llvm-project#68332)
[clang] Fix tests build after 537344f
[gn build] Port 5d2a710
[gn build] Port aade746
[gn build] Port d07c3cf
[DWARF] Change to consistently print out abbrev code in .debug_names ([DWARF] Change to consistently print out abbrev code in .debug_names llvm/llvm-project#68353)
[mlir][tosa][linalg] Apply direct tosa -> linalg Conv2D lowering ([mlir][tosa][linalg] Apply direct tosa -> linalg Conv2D lowering llvm/llvm-project#68304)
[libcxxabi] Add missing include statement.
[AMDGPU][IGLP] SingleWaveOpt: Cache DSW Counters from PreRA ([AMDGPU][IGLP] SingleWaveOpt: Cache DSW Counters from PreRA llvm/llvm-project#67759)
[clang][ASTImporter] Fix crash when import VarTemplateDecl in record ([clang][ASTImporter] Fix crash when import VarTemplateDecl in record llvm/llvm-project#67522)
[libc] Fix linking of AMDGPU device runtime control constants for math ([libc] Fix linking of AMDGPU device runtime control constants for math llvm/llvm-project#65676)
Revert "Re-apply "[AArch64] Enable "sink-and-fold" in MachineSink by default ([AArch64] Enable "sink-and-fold" in MachineSink by default llvm/llvm-project#67432)""
Fix machine-sink-cache-invalidation post - 8abb2ac
[mlir][tools] Introduce tblgen-to-irdl tool ([mlir][tools] Introduce tblgen-to-irdl tool llvm/llvm-project#66865)
[clang][Lex][NFC] Make some local variables const
[RISCV][GISel] Add FPR register bank.
Revert "Reapply "[clang analysis][thread-safety] Handle return-by-reference..… (Reapply "[clang analysis][thread-safety] Handle return-by-reference..… llvm/llvm-project#68394)"
[clang] remove ClassScopeFunctionSpecializationDecl ([clang] remove ClassScopeFunctionSpecializationDecl llvm/llvm-project#66636)
[clang] Fix -Wreorder-ctor of DependentFunctionTemplateSpecializationInfo (NFC)
[bazel] Add missing dependency for 5d2a710
LangRef: add missing punctuation (LangRef: add missing punctuation llvm/llvm-project#68471)
[BOLT][RISCV] Fix reloc-tls tests
Fix Clang Sphinx build
[clang][Interp] Emit dummy values for unknown C variables ([clang][Interp] Emit dummy values for unknown C variables llvm/llvm-project#66749)
[Documentation][NFC] Remove invalid language specifiers in markdown code blocks
[clang][Intepr] Fix the build
[clang][NFC] Typo fix in PPC.cpp
[mlir][bufferization] Follow up for [mlir][bufferization] MaterializeInDestinationOp: Support memref destinations llvm/llvm-project#68074 ([mlir][bufferization] Follow up for #68074 llvm/llvm-project#68488)
[RISCV] Add sink-and-fold support for RISC-V. ([RISCV] Add sink-and-fold support for RISC-V. llvm/llvm-project#67602)
[AArch64] Tests for postinc scheduling write operands. NFC
[BOLT] Fix long jump negative offset issue. ([BOLT] Fix long jump negative offset issue. llvm/llvm-project#67132)
[VPlan] Avoid VPTransformState::reset in fixReduction (NFCI).
[clang][Modules] checkModuleIsAvailable should use a const & parameter instead of pointer ([clang][Modules] checkModuleIsAvailable should use a const & parameter instead of pointer llvm/llvm-project#67902)
[RISCV] Support fptoi like ops for fp16 vectors input when only have Zvfhmin ([RISCV] Support fptoi like ops for fp16 vectors input when only have Zvfhmin llvm/llvm-project#67532)
[AArch64][GlobalISel][NFC] Re-generate a test.
[RISCV][NFC] Add base classes of Operand and uimm/simm ([RISCV][NFC] Add base classes of Operand and uimm/simm llvm/llvm-project#68472)
[llvm] Remove "using support::endianness;" (NFC)
[VP] Use the interface of 'getFunctionalIntrinsicID' to get the non-p… ([VP] Use the interface of 'getFunctionalIntrinsicID' to get the non-p… llvm/llvm-project#68508)
[mlir][bufferization] Update empty_tensor_elimination transform op ([mlir][bufferization] Update empty_tensor_elimination transform op llvm/llvm-project#68497)
[clang-tidy][modernize-return-braced-init-list]fix false-positives ([clang-tidy][modernize-return-braced-init-list]fix false-positives llvm/llvm-project#68491)
[Driver] Hook up Haiku ARM support ([Driver] Hook up Haiku ARM support llvm/llvm-project#67222)
[Sparc] Replace CMP instructions with InstAlias (NFCI) ([Sparc] Replace CMP instructions with InstAlias (NFCI) llvm/llvm-project#66859)
[llvm] Drop unaligned from calls to llvm::support::endian::{read,write} (NFC)
[lldb][Docs] Fix typo in debugging lldb doc
[lldb][Docs] Use RST link format in IntelPT doc
[flang][hlfir] use fir.type_info to skip runtime call if nofinal is set ([flang][hlfir] use fir.type_info to skip runtime call if nofinal is set llvm/llvm-project#68397)
[flang] Set func.func arg attributes for procedure designators ([flang] Set func.func arg attributes for procedure designators llvm/llvm-project#68420)
Update MLIR conversion to LLVMFunc to account better for properties (Update MLIR conversion to LLVMFunc to account better for properties llvm/llvm-project#67406)
Use llvm::endianness{,::little,::native} (NFC)
[SPIRV] Fix SPV_KHR_expect_assume support ([SPIRV] Fix SPV_KHR_expect_assume support llvm/llvm-project#67793)
[clang-format][NFC] Make InsertNewlineAtEOF a little more efficient
[AArch64][SME] Zero reserved bytes when allocating a new TPIDR2 object ([AArch64][SME] Zero reserved bytes when allocating a new TPIDR2 object llvm/llvm-project#68411)
[mlir][ArmSVE] Restructure sources to match ArmSME dialect (NFC) ([mlir][ArmSVE] Restructure sources to match ArmSME dialect (NFC) llvm/llvm-project#68399)
Fix Wparentheses warning. NFC.
Fix Wunused-variable warning. NFC.
[Docs] Fix GEP type in example ([Docs] Fix GEP type in example llvm/llvm-project#68533)
[DAG] foldSelectOfBinops - correctly handle select of binops where ResNo != 0
[LV] Cache call vectorization decisions ([LV] Cache call vectorization decisions llvm/llvm-project#66521)
[VectorCombine] Rename foldBitcastShuf -> foldBitcastShuffle. NFC.
[DAG] Remove unused variable 'VT' in DAGCombiner.cpp (NFC)
[mlir][bazel] Fix after 7bbfd2a
[CodeGen] Really renumber slot indexes before register allocation ([CodeGen] Really renumber slot indexes before register allocation llvm/llvm-project#67038)
[MemCpyOpt] Fix the invalid code modification for GEP ([MemCpyOpt] Fix the invalid code modification for GEP llvm/llvm-project#68479)
Revert "[CodeGen] Really renumber slot indexes before register allocation ([CodeGen] Really renumber slot indexes before register allocation llvm/llvm-project#67038)"
[MachineLICM] Relax overlay conservative PHI check ([MachineLICM] Relax overlay conservative PHI check llvm/llvm-project#67186)
[NFS][CodeMoverUtils] Add comment saying not ready for production usage. ([NFS][CodeMoverUtils] Add comment saying not ready for production usage. llvm/llvm-project#68573)
[analyzer][NFC] Remove outdated FIXME comment ([analyzer][NFC] Remove outdated FIXME comment llvm/llvm-project#68211)
[clang-tidy] add namespace qualifier NFC ([clang-tidy] add namespace qualifier NFC llvm/llvm-project#68579)
[VP] IR expansion for bitreverse/bswap ([VP] IR expansion for bitreverse/bswap llvm/llvm-project#68504)
Reapply [Verifier] Sanity check alloca size against DILocalVariable fragment size
Revert "[MachineLICM] Relax overlay conservative PHI check ([MachineLICM] Relax overlay conservative PHI check llvm/llvm-project#67186)" (Revert "[MachineLICM] Relax overlay conservative PHI check (#67186)" llvm/llvm-project#68580)
[IndVars] Add test for phi select exit value with large BTC (NFC)
[SCEV] Don't require positive BTC when non-zero is sufficient
[libc++] LWG 3821 uses_allocator_construction_args should have overload for pair-like ([libc++] LWG 3821 uses_allocator_construction_args should have overload for pair-like llvm/llvm-project#66939)
[OpenMP] Fix setting visibility on declare target variables
[VP][NFC] Add 32-bit test for VP ([VP][NFC] Add 32-bit test for VP llvm/llvm-project#68582)
[OpenMPIRBuilder] Remove wrapper function in createTask, createTeams ([OpenMPIRBuilder] Remove wrapper function in createTask, createTeams llvm/llvm-project#67723)
[mlir][vector] Restore assert and fix typos ([mlir][vector] Restore assert and fix typos llvm/llvm-project#68581)
[ConstantFold] Avoid some uses of ConstantExpr::getSExt() (NFC)
[GlobalISel] Add support for *_fpmode intrinsics
[Sink] Fix bugs of sinking unreachable BB from phi ([Sink] Fix bugs of sinking unreachable BB from phi llvm/llvm-project#68576)
Revert "[SCEV] Don't invalidate past dependency-breaking instructions"
[clang-tidy] Improve ExceptionSpecAnalyzers handling of conditional noexcept expressions ([clang-tidy] Improve ExceptionSpecAnalyzers handling of conditional noexcept expressions llvm/llvm-project#68359)
[AArch64][LoopVectorize] Use upper bound trip count instead of the constant TC when choosing max VF ([AArch64][LoopVectorize] Use upper bound trip count instead of the constant TC when choosing max VF llvm/llvm-project#67697)
[InstCombine] Precommit test for PR68465
[InstCombine] Fold zext-of-icmp with no shift ([InstCombine] Fold zext-of-icmp with no shift llvm/llvm-project#68503)
[mlir][sparse] replace specialized buffer setup with util code ([mlir][sparse] replace specialized buffer setup with util code llvm/llvm-project#68461)
[mlir][tosa] Add verifier for ArgMax operator ([mlir][tosa] Add verifier for ArgMax operator llvm/llvm-project#68410)
[mlir][sparse] move variable into assert to avoid 'unused' error ([mlir][sparse] move variable into assert to avoid 'unused' error llvm/llvm-project#68604)
[flang][openacc] Support allocatable and pointer array in private recipe ([flang][openacc] Support allocatable and pointer array in private recipe llvm/llvm-project#68422)
Revert "[flang][openacc] Support allocatable and pointer array in private recipe ([flang][openacc] Support allocatable and pointer array in private recipe llvm/llvm-project#68422)"
Revert "[mlir][tools] Introduce tblgen-to-irdl tool ([mlir][tools] Introduce tblgen-to-irdl tool llvm/llvm-project#66865)"
[flang][openacc] Support allocatable and pointer array in private recipe
[mlir][nvvm] Introduce elect.sync Op ([mlir][nvvm] Introduce elect.sync Op llvm/llvm-project#68323)
[FrontEnd] Fix a warning
[lldb][NFCI] Remove use of ConstString from FilterRule in StructuredDataDarwinLog ([lldb][NFCI] Remove use of ConstString from FilterRule in StructuredDataDarwinLog llvm/llvm-project#68347)
[flang][openacc] Added acc::RecipeInterface for getting alloca insertion point. ([flang][openacc] Added acc::RecipeInterface for getting alloca insertion point. llvm/llvm-project#68464)
[clang-format][NFC] Annotate more r_braces
Annotate enum r brace
[mlir][arith] Canonicalization patterns for arith.select ([mlir][arith] Canonicalization patterns for arith.select llvm/llvm-project#67809)
[RISCV] Generaize reduction tree matching to fp sum reductions ([RISCV] Generaize reduction tree matching to fp sum reductions llvm/llvm-project#68599)
Revert "[Clang] Implement the 'counted_by' attribute" (Revert "[Clang] Implement the 'counted_by' attribute" llvm/llvm-project#68603)
[NVPTX] Improve lowering of v4i8 ([NVPTX] Improve lowering of v4i8 llvm/llvm-project#67866)
[X86] Add tests for incorrectly optimizing out shuffle used in movmsk; PR67287
[X86] Fix logic for optimizing movmsk(bitcast(shuffle(x))); PR67287
[clang-tidy] Add support for optional parameters in config.
[scudo] Make local cache be agnostic to the type of node in freelist ([scudo] Make local cache be agnostic to the type of node in freelist llvm/llvm-project#67379)
[libcxx] [test] Quote the python executable in the executor ([libcxx] [test] Quote the python executable in the executor llvm/llvm-project#68208)
[LLD] [MinGW] Handle the --dll option ([LLD] [MinGW] Handle the --dll option llvm/llvm-project#68575)
Revert "[compiler-rt] Allow Fuchsia to use 64-bit allocator for RISCV ([compiler-rt] Allow Fuchsia to use 64-bit allocator for RISCV llvm/llvm-project#68343)"
[clang-cl] Document behavior difference of strict aliasing in clang-cl vs clang. ([clang-cl] Document behavior difference of strict aliasing in clang-cl vs clang. llvm/llvm-project#68460)
Revert "[scudo] Make local cache be agnostic to the type of node in f… (Revert "[scudo] Make local cache be agnostic to the type of node in f… llvm/llvm-project#68626)
[mlir][python] generate value builders ([mlir][python] generate value builders llvm/llvm-project#68308)
[Debuginfod] Add \n to llvm-debuginfod-find error
[VectorCombine]Fix a crash during long vector analysis.
[flang][hlfir] address char_convert issues as mentioned in [flang][hlfir] fir.char_convert verification failure llvm/llvm-project#64315 ([flang][hlfir] address char_convert issues as mentioned in #64315 llvm/llvm-project#67570)
Remove LLDB introspection entrypoints from the shim (Remove LLDB introspection entrypoints from the shim llvm/llvm-project#68450)
[mlir][sparse] rename map utility ([mlir][sparse] rename map utility llvm/llvm-project#68611)
[mlir][sparse] add expanded size to API ([mlir][sparse] add expanded size to API llvm/llvm-project#68614)
[ASan][Windows] Fix rip-relative instruction replacement ([ASan][Windows] Fix rip-relative instruction replacement llvm/llvm-project#68432)
[llvm][objdump] Remove support for printing the embedded Bitcode section in MachO files. ([llvm][objdump] Remove support for printing the embedded Bitcode section in MachO files. llvm/llvm-project#68457)
[OpenACC][Bazel] Added OpenACCOpsInterfaces to BUILD.bazel file ([OpenACC][Bazel] Added OpenACCOpsInterfaces to BUILD.bazel file llvm/llvm-project#68639)
[Sanitizer][Docs] Improve docs on building Asan ([Sanitizer][Docs] Improve docs on building Asan llvm/llvm-project#68636)
Reapply "[scudo] Make local cache be agnostic to the type of node in … (Reapply "[scudo] Make local cache be agnostic to the type of node in … llvm/llvm-project#68633)
[Sanitizer][Docs] Reformat CMake invocation in docs
[Github] Add PR author name to subscription email ([Github] Add PR author name to subscription email llvm/llvm-project#68440)
[gn] port 24b0c43
[flang][runtime] Workaround cuda-11.8 compilation issue. ([flang][runtime] Workaround cuda-11.8 compilation issue. llvm/llvm-project#68459)
[lldb] add stop-at-user-entry option to process launch ([lldb] add stop-at-user-entry option to process launch llvm/llvm-project#67019)
[mlir][sparse] Fix errors in doc and tests ([mlir][sparse] Fix errors in doc and tests llvm/llvm-project#68641)
[flang][openacc] Support array with dynamic extent in private recipe ([flang][openacc] Support array with dynamic extent in private recipe llvm/llvm-project#68624)
[mlir][bufferization] Allow cyclic function graphs without tensors ([mlir][bufferization] Allow cyclic function graphs without tensors llvm/llvm-project#68632)
[mlir][bufferization][NFC] Simplify bufferizeOp function signature ([mlir][bufferization][NFC] Simplify bufferizeOp function signature llvm/llvm-project#68625)
[mlir][sparse] Extract StorageSpecifierToLLVMPass from bufferization pipeline ([mlir][sparse] Extract StorageSpecifierToLLVMPass from bufferization pipeline llvm/llvm-project#68635)
Support big endian in llvm-symbolizer's data location dwarf info parser (Support big endian in llvm-symbolizer's data location dwarf info parser llvm/llvm-project#67284)
[X86][NFC]Update test cases after D159250 ([X86][NFC]Update test cases after D159250 llvm/llvm-project#68517)
[C++20] [Modules] Don't emit function bodies which is noinline and av… ([C++20] [Modules] Don't emit function bodies which is noinline and av… llvm/llvm-project#68501)
[X86] Support EGPR (R16-R31) for APX ([X86] Support EGPR (R16-R31) for APX llvm/llvm-project#67702)
[RISCV] Simplify PatSetCC_m and PatFprFprDynFrm_m ([RISCV] Simplify PatSetCC_m and PatFprFprDynFrm_m llvm/llvm-project#68562)
[MLIR][TOSA] Add tosa.slice operation conversion failure scenario ([MLIR][TOSA] Add tosa.slice operation conversion failure scenario llvm/llvm-project#68578)
[mlir] remove some GCC warning GCC warning -Wunused-but-set-parameter in MLIR (<gcc-10 only) llvm/llvm-project#68409 ([mlir] remove some GCC warning #68409 llvm/llvm-project#68528)
[JITLink] Allow pre-existing eh-frame CIE edges on FDEs.
[mlir] Make overloads of SymbolTable::replaceAllSymbolUses consistent. ([mlir] Make overloads of SymbolTable::replaceAllSymbolUses consistent. llvm/llvm-project#68320)
-fsanitize=alignment: check memcpy/memmove arguments (-fsanitize=alignment: check memcpy/memmove arguments llvm/llvm-project#67766)
[clang] Fix several issues in the generated AttrHasAttributeImpl.inc
[Clang] Fix missing diagnostic for non-standard layout type in offsetof ([Clang] Fix missing diagnostic for non-standard layout type in offsetof llvm/llvm-project#65246)
Replace hard coded numbers from 462d583 with regex so the test passes on downstream projects that may define additional opcodes.
[MLIR][TOSA] Remove failed test cases ([MLIR][TOSA] Remove failed test cases llvm/llvm-project#68664)
[clang] [MinGW] Explicitly always pass the -fno-use-init-array ([clang] [MinGW] Explicitly always pass the -fno-use-init-array llvm/llvm-project#68571)
[Aarch64] Materialize immediates with 64-bit ORR + EOR if shorter ([Aarch64] Materialize immediates with 64-bit ORR + EOR if shorter llvm/llvm-project#68287)
[gitattributes] Don't mark all llvm-rc test Inputs as binary ([gitattributes] Don't mark all llvm-rc test Inputs as binary llvm/llvm-project#68583)
[AMDGPU] Use absolute relocations when compiling for AMDPAL and Mesa3D ([AMDGPU] Use absolute relocations when compiling for AMDPAL and Mesa3D llvm/llvm-project#67791)
[clang]Avoid diagnose invalid consteval call for invalid function decl ([clang]Avoid diagnose invalid consteval call for invalid function decl llvm/llvm-project#68646)
[AArch64] Fix postinc operands for Cortex-A510 scheduling
[clang][Interp][NFC] Move int128 tests to their own file
[LVI][CVP] Treat undef like a full range ([LVI][CVP] Treat undef like a full range llvm/llvm-project#68190)
[bazel] fix build for 4790578
[mlir][ArmSVE] Add convert.from/to.svbool intrinsics ([mlir][ArmSVE] Add convert.from/to.svbool intrinsics llvm/llvm-project#68418)
[bazel] fix build for 4790578
[AMDGPU] Add encoding/decoding support for non-result-returning ATOMIC_CSUB instructions

…ability macros Adding additional instantiations to the dylib isn't actually an ABI break as long as programs targeting an older dylib don't start to depend on them. Making additional instantiations a matter of availability allows us to add them without an ABI break. Reviewed By: #libc, ldionne, Mordante Spies: arichardson, ldionne, Mordante, libcxx-commits Differential Revision: https://reviews.llvm.org/D154796

- Add test coverage for sext/zext boolean additions - [InstCombine] Fold comparison of adding two z/sext booleans Fixes llvm#64859.

Reviewed By: #libc, ldionne Spies: ldionne, Mordante, libcxx-commits Differential Revision: https://reviews.llvm.org/D155411

…lvm#68318)

@test

…stinations (llvm#68074) Extend `bufferization.materialize_in_destination` to support memref destinations. This op can now be used to indicate that a tensor computation should materialize in a given buffer (that may have been allocated by another component/runtime). The op still participates in "empty tensor elimination". Example: ```mlir func.func @test(%out: memref<10xf32>) { %t = tensor.empty() : tensor<10xf32> %c = linalg.generic ... outs(%t: tensor<10xf32>) -> tensor<10xf32> bufferization.materialize_in_destination %c in restrict writable %out : (tensor<10xf32>, memref<10xf32>) -> () return } ``` After "empty tensor elimination", the above IR can bufferize without an allocation: ```mlir func.func @test(%out: memref<10xf32>) { linalg.generic ... outs(%out: memref<10xf32>) return } ``` This change also clarifies the meaning of the `restrict` unit attribute on `bufferization.to_tensor` ops.

This reverts commit 0687e4d. Causes LLDB failures: https://reviews.llvm.org/D101206#4653253

Some large AVR programs (for devices without long jump) may exceed 128KiB, and lld should give explicit errors other than generate wrong executables silently.

…mber detection (llvm#68405) Fixes misleading comment introduced in `f74aaca63202cabb512c78fe19196ff348d436a8`

…llvm#67120) The transfrom interpreter accepts an argument to a "library" file with named sequences. This patch exteneds this functionality such that (1) several such individual files are accepted and (2) folders can be passed in, in which all `*.mlir` files are loaded.

…ests Made these TODO instead of negative

…ffles Allow length changing shuffle masks in the "bitcast (shuf V, MaskC) --> shuf (bitcast V), MaskC'" fold. It also exposes some poor shuffle mask detection for extract/insert subvector cases inside improveShuffleKindFromMask First stage towards addressing Issue llvm#67803

…s from SK_PermuteSingleSrc

This patch tries to canonicalise add + gep to gep + gep. Co-authored-by: Paul Walker <paul.walker@arm.com> Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D155688

…m#68403) Instead of RDSVL * RDSVL.

…erErrors member variable correctly based on the PP diagnostics. (llvm#68127)"" This reverts commit a6acf3f and relands a50e63b. The original revert was done by mistake.

…8382) While working on llvm#68377 inspecting `Allocate()` calls, I found out that there are couple of places where we forget to use placement-new to create objects in the allocated memory.

Second try...

/llvm-project/llvm/include/llvm/CodeGen/BasicTTIImpl.h:948:33: error: comparison of integers of different signs: 'size_t' (aka 'unsigned long') and 'int' [-Werror,-Wsign-compare] (Index + Mask.size()) <= NumSrcElts) { ~~~~~~~~~~~~~~~~~~~ ^ ~~~~~~~~~~

…ing a simple transform dialect interpreter pass (llvm#68330)

These were missed as I didn't expect clang codegen to be updated

…ttrs.ll

…lvm#65852) This patch folds the pattern `a ne/eq (zext/sext (a ne/eq c))` into a boolean constant or a compare. Clang vs GCC: https://godbolt.org/z/4ro817WE8 Proof for `zext`: https://alive2.llvm.org/ce/z/6z9NRF Proof for `sext`: https://alive2.llvm.org/ce/z/tv5wuE Fixes llvm#65073.

This reverts commit 3f8ef57.

Introduced by 5b657f5 that moved LICM after AMDGPUCodeGenPrepare. Some instructions are no longer sunk during ir optimizations but in machine-sinking instead. If vgpr instruction used sgpr defined inside the cycle is sunk outside of the cycle we end up with not-handled case of temporal divergence. Add test for theoretical case when SALU instruction (represents uniform value) is sunk outside of the cycle. Add a test when SALU instruction can be sunk if it edits lane mask.

Temporal divergence that was present in input or introduced in IR transforms, like code-sinking or LICM, is handled in SIFixSGPRCopies by changing sgpr source instr to vgpr instr. After 5b657f5, that moved LICM after AMDGPUCodeGenPrepare, machine-sinking can introduce temporal divergence by sinking instructions outside of the cycle. Add isSafeToSink callback in TargetInstrInfo.

…er (llvm#67284) For now, data location expression is hard coded to little endian. We are going to support sanitizers on AIX which is big endian. Support big endian too in the data location expression parser of llvm-symbolizer.

llvm#68501) …ailabl externally A workaround for llvm#60996 As the title suggested, we can avoid emitting available externally functions which is marked as noinline already. Such functions should contribute nothing for optimizations. The update for docs will be sent seperately if this got approved.

1. Map R16-R31 to DWARF registers 130-145. 2. Make R16-R31 caller-saved registers. 3. Make R16-31 allocatable only when feature EGPR is supported 4. Make R16-31 availabe for instructions in legacy maps 0/1 and EVEX space, except XSAVE*/XRSTOR RFC: https://discourse.llvm.org/t/rfc-design-for-apx-feature-egpr-and-ndd-support/73031/4 Explanations for some seemingly unrelated changes: inline-asm-registers.mir, statepoint-invoke-ra-enter-at-end.mir: The immediate (TargetInstrInfo.cpp:1612) used for the regdef/reguse is the encoding for the register class in the enum generated by tablegen. This encoding will change any time a new register class is added. Since the number is part of the input, this means it can become stale. seh-directive-errors.s: R16-R31 makes ".seh_pushreg 17" legal musttail-varargs.ll: It seems some LLVM passes use the number of registers rather the number of allocatable registers as heuristic.

1. Use `Ext.PrimaryVT` in `PatSetCC_m ` 2. Merge `PatFprFprDynFrm` from Zfh/Zhinx two locations into `PatFprFprDynFrm_m`.

…vm#68578) Fixes llvm#68481, In the following scenario, the conversion fails: 1. resultType of tosa.slice is UnrankedTensorType 2. tosa.slice.getsize().size() < resultType.getRank()

This restores the pre-b9383a86b8f behavior. Most platforms / compilers don't add relocations for CIEs, however they're not prohibited and we want objects that contain them to remain loadable.

llvm#68320) This function has several overloads that allow to specify the symbol that should be renamed and the scope for that renaming in different ways. The overloads were inconsistent in the following way (quoted strings are `StringAttr`s, other variables are `Operation *`): * `replaceAllSymbolUses(symbolOp, "new_symbol", scopeOp)` would traverse into the nested regions of `scopeOp` and hence rename the symbol inside of `scopeOp`. * `replaceAllSymbolUses("symbol", "new_symbol", scopeOp)` would *not* traverse into the nested regions of `scopeOp` and hence *not* rename the symbol. The underlying behavior was spread over different places and is somewhat hard to understand. The two overloads above mainly differed by what `collectSymbolScopes` computed, which is itself overloaded. If `scopeOp` is a top-level module, then the overload on `(Operation *, Operation *)`, which is used in the first of the above cases, computes a scope where the body region of the module is the `limit`; however, the overload on `(StringAttr, Operation *)` computed the module op itself as the `limit`. Later, `walkSymbolTable` would walk the body of the module if it was given as a region but it would *not* enter the regions of the module op because that op has a symbol table (which was assumed to be a *different* scope). The fix in this commit is change the behavior of `collectSymbolScopes` such that the `(StringAttr, Operation *)` overload returns a scope for each region in the `limit` argument.

The -fsanitize=alignment implementation follows the model that we allow forming unaligned pointers but disallow accessing unaligned pointers. See [RFC: Enforcing pointer type alignment in Clang](https://lists.llvm.org/pipermail/llvm-dev/2016-January/094012.html) for detail. memcpy is a memory access and we require an `int *` argument to be aligned. Similar to https://reviews.llvm.org/D9673 , emit -fsanitize=alignment check for arguments of builtin memcpy and memmove functions to catch misaligned load like: ``` // Check the alignment of a but ignore the alignment of b void unaligned_load(int *a, void *b) { memcpy(a, b, sizeof(*a)); } ``` For a reference parameter, we emit a -fsanitize=alignment check as well, which can be optimized out by InstCombinePass. We rely on the call site `TCK_ReferenceBinding` check instead. ``` // The alignment check of a will be optimized out. void unaligned_load(int &a, void *b) { memcpy(&a, b, sizeof(a)); } ``` The diagnostic message looks like ``` runtime error: store to misaligned address [[PTR:0x[0-9a-f]*]] for type 'int *' ``` We could use a better message for memcpy, but we don't do it for now as it would require a new check name like misaligned-pointer-use, which is probably not necessary. *RFC: Enforcing pointer type alignment in Clang* is not well documented, but this patch does not intend to change the that. Technically builtin memset functions can be checked for -fsanitize=alignment as well, but it does not seem too useful.

1. The generated file contained a lot of duplicate switch cases, e.g.: ``` switch (Syntax) { case AttributeCommonInfo::Syntax::AS_GNU: return llvm::StringSwitch<int>(Name) ... .Case("error", 1) .Case("warning", 1) .Case("error", 1) .Case("warning", 1) ``` 2. Some attributes were listed in wrong places, e.g.: ``` case AttributeCommonInfo::Syntax::AS_CXX11: { if (ScopeName == "") { return llvm::StringSwitch<int>(Name) ... .Case("warn_unused_result", LangOpts.CPlusPlus11 ? 201907 : 0) ``` `warn_unused_result` is a non-standard attribute and should not be available as [[warn_unused_result]]. 3. Some attributes had the wrong version, e.g.: ``` case AttributeCommonInfo::Syntax::AS_CXX11: { } else if (ScopeName == "gnu") { return llvm::StringSwitch<int>(Name) ... .Case("fallthrough", LangOpts.CPlusPlus11 ? 201603 : 0) ``` [[gnu::fallthrough]] is a non-standard spelling and should not have the standard version. Instead, __has_cpp_attribute should return 1 for it. There is another issue with attributes that share spellings, e.g.: ``` .Case("interrupt", true && (T.getArch() == llvm::Triple::arm || ...) ? 1 : 0) .Case("interrupt", true && (T.getArch() == llvm::Triple::avr) ? 1 : 0) ... .Case("interrupt", true && (T.getArch() == llvm::Triple::riscv32 || ...) ? 1 : 0) ``` As can be seen, __has_attribute(interrupt) would only return true for ARM targets. This patch does not address this issue. Differential Revision: https://reviews.llvm.org/D159393

…tof` (llvm#65246) Fixes llvm#64619 Clang warns diagnostic for non-standard layout types in `offsetof` only if they are in evaluated context. With this patch, you'll also get diagnostic if you use `offsetof` on non-standard layout types in any other contexts

… on downstream projects that may define additional opcodes.

I would put this into the implementation of verify for tosa.slice

…68571) On MinGW targets, the .ctors section is always used for constructors. When using the .ctors section, the constructors need to be emitted in reverse order to get them execute in the right order. (Constructors with a specific priority are sorted separately by the linker later.) In LLVM, in CodeGen/AsmPrinter/AsmPrinter.cpp, there's code that reverses them before writing them out, executed when using the .ctors section. This logic is done whenever TM.Options.UseInitArray is set to false. Thus, make sure to set UseInitArray to false for this target. This fixes llvm#55938.

…vm#68287) A number of useful constants can be encoded with a 64-bit ORR followed by a 64-bit EOR, including all remaining repeated byte patterns, some useful repeated 16-bit patterns, and some irregular masks. This patch prioritizes that encoding over three or four instruction encodings. Encoding with MOV + MOVK or ORR + MOVK is still preferred for fast literal generation and readability respectively. The method devises three candidate values, and checks if both Candidate and (Imm ^ Candidate) are valid logical immediates. If so, Imm is materialized with: ``` ORR Xd, XZR, #(Imm ^ Candidate) EOR Xd, Xd, #(Candidate) ``` The method has been exhaustively tested to ensure it can solve all possible values (excluding 0, ~0, and plain logical immediates, which are handled earlier).

) This allows tooling to properly show diffs for files in the llvm/test/tools/llvm-rc/Inputs directory. Keep the actual icon/cursor/bitmap files marked as binary.

llvm#67791) The primary ISA-independent justification for using PC-relative addressing is that it makes code position-independent and therefore allows sharing of .text pages between processes. When not sharing .text pages, we can use absolute relocations instead, which will possibly prevent a bubble introduced by s_getpc_b64. Co-authored-by: Thomas Symalla <thomas.symalla@amd.com>

llvm#68646) Fixes:llvm#68542 It‘s meaningless to diagnose further error for invalid function declaration.

Similar to D159254, this fixes the order of WriteAdr operands on post/pre-inc loads/stores in the Cortex-A510 scheduling model. I will add the same for other models too, this will be the most impactful due to it being the default cpu scheduling model. Closes llvm#68518

When converting to ConstantRange, we should treat undef like a full range. Fixes llvm#68381.

These will be used in future pass to ensure that loads/stores of masks are legal (as the LLVM backend does not support this for any type smaller than an svbool, which is vector<[16]xi1>). Depends on llvm#68399

for real

…C_CSUB instructions The BUFFER_ATOMIC_CSUB and GLOBAL_ATOMIC_CSUB instructions have encodings for non-value-returning forms, although actually using them isn't supported by hardware. However, these encodings aren't supported by the backend, meaning that they can't even be assembled or disassembled. Add support for the non-returning encodings, but gate actually using them in instruction selection behind a new feature FeatureAtomicCSubNoRtnInsts, which no target uses. This does allow the non-returning instructions to be tested manually and llvm.amdgcn.atomic.csub.ll is extended to cover them. The feature does not gate assembling or disassembling them, this is now not an error, and encoding and decoding tests have been adapted accordingly.

This reverts commit a1e81d2. Revert "Fix test hip-offload-compress-zlib.hip" This reverts commit ba01ce6. Revert due to sanity fail at https://lab.llvm.org/buildbot/#/builders/5/builds/37188 https://lab.llvm.org/buildbot/#/builders/238/builds/5955 /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/clang/lib/Driver/OffloadBundler.cpp:1012:25: runtime error: load of misaligned address 0xaaaae2d90e7c for type 'const uint64_t' (aka 'const unsigned long'), which requires 8 byte alignment 0xaaaae2d90e7c: note: pointer points here bc 00 00 00 94 dc 29 9a 89 fb ca 2b 78 9c 8b 8f 77 f6 71 f4 73 8f f7 77 73 f3 f1 77 74 89 77 0a ^ #0 0xaaaaba125f70 in clang::CompressedOffloadBundle::decompress(llvm::MemoryBuffer const&, bool) /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/clang/lib/Driver/OffloadBundler.cpp:1012:25 #1 0xaaaaba126150 in clang::OffloadBundler::ListBundleIDsInFile(llvm::StringRef, clang::OffloadBundlerConfig const&) /b/sanitizer-aarch64-linux-bootstrap-ubsan/build/llvm-project/clang/lib/Driver/OffloadBundler.cpp:1089:7 Will reland after fixing it.

philnik777 and others added 30 commits October 6, 2023 11:21

[mlir][bazel] Disable test added in llvm@7876899

185e16d

[InstCombine] Fold comparison of adding two z/sext booleans (llvm#67895)

5d8fb47

- Add test coverage for sext/zext boolean additions - [InstCombine] Fold comparison of adding two z/sext booleans Fixes llvm#64859.

[libc++] Implement P2614R2 (Deprecate numeric_limits::has_denorm)

0d7947b

Reviewed By: #libc, ldionne Spies: ldionne, Mordante, libcxx-commits Differential Revision: https://reviews.llvm.org/D155411

[libc++] Bump the clang version the clang-tidy checks are based on (l…

ff843c0

…lvm#68318)

[MLIR] NFC. Fix clang-tidy warnings in Affine Utils

4e888e2

Revert "[libc++] Remove UB in list, forward_list and __hash_table"

b935882

This reverts commit 0687e4d. Causes LLDB failures: https://reviews.llvm.org/D101206#4653253

[lld][ELF][AVR] Add range check for R_AVR_13_PCREL (llvm#67636)

488a62f

Some large AVR programs (for devices without long jump) may exceed 128KiB, and lld should give explicit errors other than generate wrong executables silently.

[lldb][DWARFASTParserClang][NFC] Fix comment regarding static data me…

a233a49

…mber detection (llvm#68405) Fixes misleading comment introduced in `f74aaca63202cabb512c78fe19196ff348d436a8`

[VectorCombine][X86] Add additional length changing foldBitcastShuf t…

3bae69e

…ests Made these TODO instead of negative

[TTI] improveShuffleKindFromMask - detect SK_ExtractSubvector pattern…

a16f646

…s from SK_PermuteSingleSrc

[PATCH] [llvm] [InstCombine] Canonicalise ADD+GEP

e13bed4

This patch tries to canonicalise add + gep to gep + gep. Co-authored-by: Paul Walker <paul.walker@arm.com> Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D155688

[AArch64][SME] Tile slices to lazy-save/restore should be RDSVL. (llv…

ff48816

…m#68403) Instead of RDSVL * RDSVL.

Revert "Revert "Fixes and closes llvm#53952. Setting the ASTHasCompil…

46518a1

…erErrors member variable correctly based on the PP diagnostics. (llvm#68127)"" This reverts commit a6acf3f and relands a50e63b. The original revert was done by mistake.

[clang][NFC] Add missing placement-new after Allocate() calls (llvm#6…

99e6ef3

…8382) While working on llvm#68377 inspecting `Allocate()` calls, I found out that there are couple of places where we forget to use placement-new to create objects in the allocated memory.

[mlir][bazel] Fix after llvm@6a2071c

4e311ea

[mlir][bazel] Fix after llvm@6a2071c

03bdfcc

Second try...

[mlir][Transform] Provide a minimal set of utils that allow implement…

ef8c26b

…ing a simple transform dialect interpreter pass (llvm#68330)

[clang][CodeGen] Regenerate tests checks after 94795a3

32a9c09

These were missed as I didn't expect clang codegen to be updated

[InstCombine] Add additional pre-commit tests for llvm#67915. NFC.

b9edf6d

[mlir][bazel] Fix after llvm@ef8c26b

1cd14ad

[AArch64][SME] NFC: use update_test_checks.py for sme-pstate(sm|za)-a…

0e099fa

…ttrs.ll

Revert "MachineSink: Fix sinking VGPR def out of a divergent loop"

ccf68ab

This reverts commit 3f8ef57.

chenzheng1030 and others added 26 commits October 10, 2023 09:13

[X86][NFC]Update test cases after D159250 (llvm#68517)

057ec76

[RISCV] Simplify PatSetCC_m and PatFprFprDynFrm_m (llvm#68562)

7645df6

1. Use `Ext.PrimaryVT` in `PatSetCC_m ` 2. Merge `PatFprFprDynFrm` from Zfh/Zhinx two locations into `PatFprFprDynFrm_m`.

[MLIR][TOSA] Add tosa.slice operation conversion failure scenario (ll…

9ab732f

…vm#68578) Fixes llvm#68481, In the following scenario, the conversion fails: 1. resultType of tosa.slice is UnrankedTensorType 2. tosa.slice.getsize().size() < resultType.getRank()

[mlir] remove some GCC warning llvm#68409 (llvm#68528)

80815df

[JITLink] Allow pre-existing eh-frame CIE edges on FDEs.

0d0f219

This restores the pre-b9383a86b8f behavior. Most platforms / compilers don't add relocations for CIEs, however they're not prohibited and we want objects that contain them to remain loadable.

Replace hard coded numbers from 462d583 with regex so the test passes…

909087c

… on downstream projects that may define additional opcodes.

[MLIR][TOSA] Remove failed test cases (llvm#68664)

d37056c

I would put this into the implementation of verify for tosa.slice

[gitattributes] Don't mark all llvm-rc test Inputs as binary (llvm#68583

e46822e

) This allows tooling to properly show diffs for files in the llvm/test/tools/llvm-rc/Inputs directory. Keep the actual icon/cursor/bitmap files marked as binary.

[clang]Avoid diagnose invalid consteval call for invalid function decl (

19d1da5

llvm#68646) Fixes:llvm#68542 It‘s meaningless to diagnose further error for invalid function declaration.

[clang][Interp][NFC] Move int128 tests to their own file

3542dd8

[LVI][CVP] Treat undef like a full range (llvm#68190)

8185794

When converting to ConstantRange, we should treat undef like a full range. Fixes llvm#68381.

[bazel] fix build for 4790578

962a049

[mlir][ArmSVE] Add convert.from/to.svbool intrinsics (llvm#68418)

3d70ba6

These will be used in future pass to ensure that loads/stores of masks are legal (as the LLVM backend does not support this for any type smaller than an svbool, which is vector<[16]xi1>). Depends on llvm#68399

[bazel] fix build for 4790578

141ca54

for real

stepthomas closed this Oct 10, 2023

stepthomas deleted the AMDGPU-stepthomas-atomic-csub-no-rtn-forms-ver2 branch October 10, 2023 09:46

stepthomas restored the AMDGPU-stepthomas-atomic-csub-no-rtn-forms-ver2 branch October 10, 2023 10:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AMDGPU stepthomas atomic csub no rtn forms ver2 #1

AMDGPU stepthomas atomic csub no rtn forms ver2 #1

stepthomas commented Oct 10, 2023

AMDGPU stepthomas atomic csub no rtn forms ver2 #1

AMDGPU stepthomas atomic csub no rtn forms ver2 #1

Conversation

stepthomas commented Oct 10, 2023