[AutoBump] Merge with fixes of 360a03c9 (Jan 08) (32)[Only tested MLIR] #502

jorickert · 2025-03-18T11:28:25Z

No description provided.

Tosa v1.0 adds accumulator type attributes to the various convolution operations defined in the spec. Update the dialect and any lit tests to include these attributes. Signed-off-by: Tai Ly <tai.ly@arm.com> Co-authored-by: Tai Ly <tai.ly@arm.com>

…m#121756) Raised in issue llvm#121697

Use combineMetadataForCSE instead of manually enumerating known metadata kinds. This is a typical sinking transform for which combineMetadataForCSE is safe to use (with DoesKMove=true). Part of llvm#121495.

…122065) After 0dedd6f and 03229e7, invalid concept declarations might lack expressions for evaluation and normalization. This could make it crash in certain scenarios, apart from the one of evaluation concepts showed in 03229e7, there's also an issue when checking specializations where the normalization also relies on a non-null expression. This patch prevents that by avoiding building up a type constraint in such situations, thereafter the template parameter wouldn't have a concept specialization of a null expression. With this patch, the assumption in ASTWriterDecl is no longer valid. Namely, HasConstraint and TypeConstraintInitialized must now represent different meanings for both source fidelity and semantic requirements. Fixes llvm#115004 Fixes llvm#121980

…ValueTracking. NFC. (llvm#122064) Needed by llvm#121958.

…l` (llvm#122101) Needed by llvm#71058

Fixes llvm#121965. --------- Co-authored-by: Christian Ulmann <christianulmann@gmail.com> Co-authored-by: Alex Zinenko <git@ozinenko.com>

…s after C++26 for bugprone-unused-local-non-trivial-variable (llvm#121783) Fixed: llvm#121731 According to https://eel.is/c++draft/basic.scope.scope#5, name independent declaration should not be warned as unused

…lvm#121968) Co-authored-by: Kiran Chandramohan <kiranchandramohan@gmail.com>

…122018) catering to platform differences as those calls are not posix.

substition -> substitution in-betweem -> in-between

…essor (llvm#120249) Part 2 (and final part) following llvm#120102 Allows users to do things like: ``` if (o->x.has_value()) { ((*o).x).value(); } ``` where the `->` and `*` are operator overload calls. A user could instead extract the nested optional into a local variable once instead of doing two accessor calls back to back, but currently they are unsure why the code is flagged.

Save the bitwidth value as a `ConstantExpr` with the value set. Remove the `ASTContext` parameter from `getBitWidthValue()`, so the latter simply returns the value from the `ConstantExpr` instead of constant-evaluating the bitwidth expression every time it is called.

These are triggering after b037bce.

This reverts commit 81fc3ad. This breaks some LLDB tests, e.g. SymbolFile/DWARF/x86/no_unique_address-with-bitfields.cpp: lldb: ../llvm-project/clang/lib/AST/Decl.cpp:4604: unsigned int clang::FieldDecl::getBitWidthValue() const: Assertion `isa<ConstantExpr>(getBitWidth())' failed.

The dependency file and the P1689 file are text files, but the open call misses the OF_Text flag. This PR adds the flag. Fixes regressions in test cases ClangScanDeps/modules-extern-unrelated.m and ClangScanDeps/P1689.cppm.

Need to check if the GEP bases are equal and return false early. Also, need to return false if the lookup is too deep, considering bases equal too. Fixes a crash in the assertion.

…e DAP object lifecycle. (llvm#120457)" This reverts commit 0d9cf26. Breaks the lldb-aarch64-windows buildbot.

This is in order to prepare for future MR where we will extend `ReachingDefAnalysis` to stack slots.

…"default<O3>" to allow DOS to correctly evaluate the RUN command Necessary for running update_test_checks.py on windows

The version string can be anything, don't restrict it to digits and dots. It's derived from the resource dir, so just check for that.

I think this is a false positive for a non-capturing lambda, but I can't find anything in the standard that guarantees that these have eternal lifetime.

…m#121366) This PR adds VALID_ELEMENT_ACCESS and VALID_INPUT_RANGE checks for vector<bool>.

/llvm-project/llvm/lib/Support/Timer.cpp:565:74: error: extra ';' outside of a function is incompatible with C++98 [-Werror,-Wc++98-compat-extra-semi] static bool mtg::TrackSpace() { return ManagedTimerGlobals->TrackSpace; }; ^ 1 error generated.

…19826) `RegisterClassInfo::getRegPressureSetLimit` is a wrapper of `TargetRegisterInfo::getRegPressureSetLimit` with some logics to adjust the limit by removing reserved registers. It seems that we shouldn't use `TargetRegisterInfo::getRegPressureSetLimit` directly, just like the comment "This limit must be adjusted dynamically for reserved registers" said. Separate from llvm#118787

When a destroying delete overload is selected, the destructor is not automatically called. Therefore, the destructor can be deleted without causing the program to be ill-formed. Fixes llvm#46818

This PR enables the use of the status register in inline assembly. This is necessary to, for example, set and retrieve the current interrupt mask. --------- Co-authored-by: kirk <knickish@gmail.com>

… <locale> (llvm#122168)

This patch adds a section for OpenMP 6.0 features. --------- Co-authored-by: Joseph Huber <huberjn@outlook.com>

…ning reductions (llvm#122176)

We already custom lower the other 16-bit element type shuffles.

…1961) In DXC, setting the vulkan version automatically sets the target spir-v version to the maximum spir-v version that the vulkan version must support. So for Vulkan 1.2, we set the spir-v version to spirv 1.5 because every implementation of Vulkan 1.2 must support spirv 1.5, but not spir-v 1.6.

This change ensures that base relocations are sorted in the output, aligning with MSVC linker behavior. While input files typically provide sorted relocations, this update guarantees correct sorting even if the input relocations are unordered.

In my patch there, I left a test expectation stale. Tests with `REQUIRES: Z3` never run because no bots check such configurations. Here I'm adjusting the test expectations to meet reality.

Fixes llvm#121407

Rework involves below: - Return unsigned value, the number of div/rem bits actually needed. - Change from AtLeast(SignBits) to MaxDivBits hint. - Use MaxDivBits hint for unsigned case. - Remove unnecessary second early exit. Mostly NFC changes.

@1

Replaces llvm#121886 Fixes llvm#120254 (hopefully 🤞) ## Problem Consider the following example: ```fortran program test real :: x(1) integer :: i !$omp parallel do reduction(+:x) do i = 1,1 x = 1 end do !$omp end parallel do end program ``` The HLFIR+OMP IR for this example looks like this: ```mlir func.func @_QQmain() { ... omp.parallel { %5 = fir.embox %4#0(%3) : (!fir.ref<!fir.array<1xf32>>, !fir.shape<1>) -> !fir.box<!fir.array<1xf32>> %6 = fir.alloca !fir.box<!fir.array<1xf32>> ... omp.wsloop private(@_QFEi_private_ref_i32 %1#0 -> %arg0 : !fir.ref<i32>) reduction(byref @add_reduction_byref_box_1xf32 %6 -> %arg1 : !fir.ref<!fir.box<!fir.array<1xf32>>>) { omp.loop_nest (%arg2) : i32 = (%c1_i32) to (%c1_i32_0) inclusive step (%c1_i32_1) { ... omp.yield } } omp.terminator } return } ``` The problem addressed by this PR is related to: the `alloca` in the `omp.parallel` region + the related `reduction` clause on the `omp.wsloop` op. When we try translate the reduction from MLIR to LLVM, we have to choose an `alloca` insertion point. This happens in `convertOmpWsloop` where at entry to that function, this is what the LLVM module looks like: ```llvm define void @_QQmain() { %tid.addr = alloca i32, align 4 ... entry: %omp_global_thread_num = call i32 @__kmpc_global_thread_num(ptr @1) br label %omp.par.entry omp.par.entry: %tid.addr.local = alloca i32, align 4 ... br label %omp.par.region omp.par.region: br label %omp.par.region1 omp.par.region1: ... %5 = alloca { ptr, i64, i32, i8, i8, i8, i8, [1 x [3 x i64]] }, align 8 ``` Now, when we choose an `alloca` insertion point for the reduction, this is the chosen block `omp.par.entry` (without the changes in this PR). The problem is that the allocation needed for the reduction needs to reference the `%5` SSA value. This results in inserting allocations in `omp.par.entry` that reference allocations in a later block `omp.par.region1` which causes the `Instruction does not dominate all uses!` error. ## Possible solution - take 2: This PR contains a more localized solution than llvm#121886. It makes sure that on entry to `initReductionVars`, the IR builder is at a point where we can starting inserting initialization region; to make things cleaner, we still split the builder insertion point to a dedicated `omp.reduction.init`. This way we avoid splitting after the latest allocation block; which is what causing the issue.

Adds some comments and re-name variables to clarify the usage.

This reverts commit 45d4698. NVPTX fabs & fneg are incompatible with LLVM's semantics as LLVM guarantees the payload of NaNs to stay the same while PTX mangles NaNs. The bad patterns are still in the NVPTX backend and should probably be removed, since this change only exposed the bad behavior.

…types. (llvm#122240) When a vector is instantiated with a pointer type (`T` being `const Foo*`), the inferred annotation becomes `push_back(const Foo*& value [[clang::lifetime_capture_by(this)]])`. For reference parameters, the `lifetime_capture_by` attribute treats the lifetime as referring to the referenced object -- in this case, the **pointer** itself, not the pointee object. In the `push_back`, we copy the pointer's value, which does not establish a reference to the pointer. This behavior is safe and does not capture the pointer's lifetime. The annotation should not be inferred for cases where `T` is a pointer type, as the intended semantics do not align with the annotation. Fixes llvm#121391

The LLVM Discord bot now has the ability to scrape the LLVM calendar & send reminders about upcoming office hours events and sync-ups. Document that here. While I'm in the area, add a note about the bot's ability to @mention people when they're on buildbot blamelists. Related to llvm/Community.o#19

…llvm#121557) Adds an end-to-end test for `tensor.unpack` with dynamic inner tile sizes. While relatively simple (e.g., no vectorization), this example required a few fixes in handling `tensor.unpack` (and similar fixes for `tensor.pack` before that): * llvm#119379, llvm#121393, llvm#121400. The end goal for this test is to incrementally increase its complexity and to work towards scalable tile sizes. Note, this PR complements llvm#115698 in which similar test for `tensor.pack` was added.

One common error seen in CUDA/HIP compilation is: fatal error: 'cmath' file not found which is due to inproper installation of standard C++ libraries. Since it happens with #include_next, users may feel confusing which cmath is not found and how to fix it. Add an error directive to help users resolve this issue.

[AutoBump] Merge with 465a3ce (Jan 09) (33)

FranklandJack and others added 30 commits January 8, 2025 12:12

[lldb][OpenBSD][NFC] Replace tab with spaces (llvm#122041)

30b7da7

[TySan] Added a 'print_stacktrace' flag for more detailed errors (llv…

20d7fa1

…m#121756) Raised in issue llvm#121697

[InstCombine] Use combineMetadataForCSE in phi of loads fold

bfa711a

Use combineMetadataForCSE instead of manually enumerating known metadata kinds. This is a typical sinking transform for which combineMetadataForCSE is safe to use (with DoesKMove=true). Part of llvm#121495.

[TBAA] Add Clang pointer TBAA test with void *.

9fc152d

[ValueTracking] Move getFlippedStrictnessPredicateAndConstant into …

03e7862

…ValueTracking. NFC. (llvm#122064) Needed by llvm#121958.

[CodeGenPrepare] Regenerate test `X86/codegen-prepare-addrmode-sext.l…

7060d2a

…l` (llvm#122101) Needed by llvm#71058

[MLIR] Enable import of non self referential alias scopes (llvm#121987)

1c067a5

Fixes llvm#121965. --------- Co-authored-by: Christian Ulmann <christianulmann@gmail.com> Co-authored-by: Alex Zinenko <git@ozinenko.com>

Fix macho-merged-funcs-dwarf.yaml test on Windows

51d7605

[flang][driver] add option to make all main program variable static (l…

645c1ee

…lvm#121968) Co-authored-by: Kiran Chandramohan <kiranchandramohan@gmail.com>

[compiler-rt][rtsan] adding setlinebuf/setbuffer interception. (llvm#…

7004d68

…122018) catering to platform differences as those calls are not posix.

[UpdateTestChecks][NFC] - Fix typos (llvm#121964)

d07762e

substition -> substitution in-betweem -> in-between

Clean up -Wdangling-assignment-gsl in clang and mlir

35c5e56

These are triggering after b037bce.

[libc++][NFC] Add missing includes in tzdb.cpp

1411a9a

[TBAA] Simplify checks for unnamed struct case, where anyptr is used.

346fad5

[MLIR][GPU] Fix gpu.printf test syntax after f50f969

0d7022e

[SLP]Fix a crash for very long GEP chains

1160994

Need to check if the GEP bases are equal and return false early. Also, need to return false if the lookup is too deep, considering bases equal too. Fixes a crash in the assertion.

Revert "[lldb-dap] Ensure the IO forwarding threads are managed by th…

81898ac

…e DAP object lifecycle. (llvm#120457)" This reverts commit 0d9cf26. Breaks the lldb-aarch64-windows buildbot.

Revert llvm#116331 & llvm#121852 (llvm#122105)

b66f6b2

[ReachingDefAnalysis][NFC] Rename PhysReg to Reg. (llvm#122112)

f37bee1

This is in order to prepare for future MR where we will extend `ReachingDefAnalysis` to stack slots.

[PhaseOrdering][AArch64] block_scaling_decompr_8bit.ll - use -passes=…

322ff42

…"default<O3>" to allow DOS to correctly evaluate the RUN command Necessary for running update_test_checks.py on windows

Make test more lenient for custom clang version strings

fe162be

The version string can be anything, don't restrict it to digits and dots. It's derived from the resource dir, so just check for that.

Fix -Wdangling-assignment-gsl in ClangdLSPServerTests

a3b4d91

I think this is a false positive for a non-capturing lambda, but I can't find anything in the standard that guarantees that these have eternal lifetime.

[libc++] Add missing hardening checks and tests for vector<bool> (llv…

b054289

…m#121366) This PR adds VALID_ELEMENT_ACCESS and VALID_INPUT_RANGE checks for vector<bool>.

DamonFool and others added 26 commits January 9, 2025 20:47

[C++20] Destroying delete and deleted destructors (llvm#118800)

5ff7f47

When a destroying delete overload is selected, the destructor is not automatically called. Therefore, the destructor can be deleted without causing the program to be ill-formed. Fixes llvm#46818

[M68k] implement move to and from sr (llvm#111145)

a0d7749

This PR enables the use of the status register in inline assembly. This is necessary to, for example, set and retrieve the current interrupt mask. --------- Co-authored-by: kirk <knickish@gmail.com>

[libc++] Add __iswctype to the locale base API since it's required by…

81ae668

… <locale> (llvm#122168)

[Clang][OpenMP][Doc] Update OpenMPSupport.rst (llvm#122174)

c85d516

This patch adds a section for OpenMP 6.0 features. --------- Co-authored-by: Joseph Huber <huberjn@outlook.com>

[RISCV][VLOPT] Add getOperandInfo for integer and floating point wide…

f77a7dd

…ning reductions (llvm#122176)

AMDGPU: Custom lower bf16 shuffles (llvm#122252)

d2b78c6

We already custom lower the other 16-bit element type shuffles.

[lldb] Move GetEnvironment function into common code (llvm#122173)

cb04bc0

[RISCV][VLOPT] Add fp-reductions to getOperandInfo (llvm#122151)

550841f

[analyzer][NFC] Update stale test after llvm#97265 (llvm#122272)

aa0191e

In my patch there, I left a test expectation stale. Tests with `REQUIRES: Z3` never run because no bots check such configurations. Here I'm adjusting the test expectations to meet reality.

[Clang] disallow attributes after namespace identifier (llvm#121614)

1a73654

Fixes llvm#121407

[mlir][vector][nfc] Update alignedConversionPrecondition (llvm#122136)

21ba7ae

Adds some comments and re-name variables to clarify the usage.

[clang-tidy][doc] reorder release note

c01ffab

[AutoBump] Merge with fixes of 360a03c (Jan 08)

e04ed7e

[AutoBump] Merge with 465a3ce (Jan 09)

ee87c02

Merge pull request #503 from Xilinx/bump_to_465a3ce9

5546d0e

[AutoBump] Merge with 465a3ce (Jan 09) (33)

Base automatically changed from bump_to_67efbd0b to bump_to_c1d01b2f April 14, 2025 07:44

Base automatically changed from bump_to_c1d01b2f to bump_to_392622d0 April 14, 2025 07:48

jorickert merged commit 5be2d3b into bump_to_392622d0 Apr 14, 2025
2 checks passed

jorickert deleted the bump_to_360a03c9 branch April 14, 2025 07:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AutoBump] Merge with fixes of 360a03c9 (Jan 08) (32)[Only tested MLIR] #502

[AutoBump] Merge with fixes of 360a03c9 (Jan 08) (32)[Only tested MLIR] #502

Uh oh!

jorickert commented Mar 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

95 participants

[AutoBump] Merge with fixes of 360a03c9 (Jan 08) (32)[Only tested MLIR] #502

[AutoBump] Merge with fixes of 360a03c9 (Jan 08) (32)[Only tested MLIR] #502

Uh oh!

Conversation

jorickert commented Mar 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

95 participants