merge main into amd-staging #608

z1-cciauto · 2025-11-17T16:03:52Z

No description provided.

This patch implements DTLTO cache. DTLTO cache is implemented the same way as ThinLTO cache. In fact the same class Cache is used for both of them. Because parameters for codegen are different for DTLTO and ThinLTO (DTLTO codegen is done by invoking clang and its codegen parameters are not fully synchronized with codegen parameters used by LTO backend). The object files generated by DTLTO and ThinLTO might be different and shouldn't be mixed. If ThinLTO and DTLTO share the same cache directory, the cache file won't interfere with each other. I added a couple of test files in cross-project-test/dtlto directory, but if more tests are required for initial implementation, I could add them.

…llvm#167822) This check is introduced in llvm@b284005, but the documentation seems missing from `checkers.rst`.

Test using CTTZ to determine the lowest set bit, clear it and return the index Shows failure to use RMW pattern on the load-btr-store due to additional (but non-interference) uses of the load.

…c` (llvm#166255) Now the files location is used for macro expansions. This provides more accurate location when reporting compilation errors. Move from `getDecomposedExpansionLoc(Loc)` to `getDecomposedLoc(getFileLoc(Loc))` when computing Presumed location.

I've left Sonar by the end of October. For my upcoming contributions, I'll simply use my personal (this) account. I'll remain a Clang Static Analyser maintainer, but I'll likely spend less time on that part as in my new job this falls out of my key responsibilities. From now on, I'm part of the Apple org, but for accessibility, I'll keep using my personal email address for open-source contributions and for the build bots.

Changes: The previous patch had to be reverted to a mismatching-OpType assert in cse. The reduced-test has now been added corresponding to a RVV pointer-induction, and the pointer-induction case has been updated to use createOverflowingBinaryOp. While at it, record VPIRFlags in VPWidenInductionRecipe.

…esumedLoc`" (llvm#168368) Reverts llvm#166255 It broke bots: https://lab.llvm.org/buildbot/#/builders/190/builds/31102

…ons (llvm#168078) Instead of storing a variant with specific types, store parser::Block as the body. Add two access functions to make the traversal of the nest simpler. This will allow storing loop-nest sequences in the future.

Only the fortran source files in flang/test/Lower/OpenACC have been modified. The other files in flang/test will be cleaned up in subsequent commits

Per Intel Architecture Instruction Set Extensions Programming Reference rev. 60 (https://cdrdv2.intel.com/v1/dl/getContent/671368), table 1-2, NVL supports APX and AVX10.2

This patch is a minor NFC-intended refactoring to the way emitting redundant parentheses is prevented. The current implementation pushes and later pops a fake low precedence into the precedence stack when emitting function calls. The new implementation adds a boolean argument to `emitOperand()` that explicity guarantees that the operand is being emitted between some kind of brackets, exempting the method from enforcing correct evaluation order w.r.t precedence and associativity up the expression tree.

)

So setting the environment variable works with the new internal shell. This does not fix all the XRay tests because some of them are using subshells and need to be rewritten to not use subshells.

This does a couple of things: - code that is only useful for `shrink_to_fit` is moved into that function - `shrink_to_fit` is simplified a bit - `__recommend` is renamed to better reflect what the function actually does - `__allocate_long_buffer` asserts that the passed capacity doesn't fit into the SSO

So that they will actually function with the internal shell.

Currently only __builtin_elementwise_sqrt emits contrained fp intrinsic and propagates fp options. This commit adds this support for the rest of elementwise builtins.

Recent commits (7fe0691, 53ddeb4) marked several x86 intrinsics as constexpr in headers without providing the necessary constant evaluation support in the compiler backend. This caused compilation failures when attempting to use these intrinsics in constant expressions. Resolves llvm#166814 Resolves llvm#161203

…rser.cpp (NFC)

) Supports the fixed form syntax which has spaces in between the identifier

…undef, undef) (llvm#165539) This PR adds a new combine to the `post-legalizer-combiner` pass. The new combine checks for vectors being unmerged and subsequently padded with `G_IMPLICIT_DEF` values by building a new vector. If such a case is found, the vector being unmerged is instead just concatenated with a `G_IMPLICIT_DEF` that is as wide as the vector being unmerged. This removes unnecessary `mov` instructions in a few places.

…lvm#168390) This patch adds verification to the `SymbolOpInterface` to enforce the design constraint that symbol operations must not produce SSA results, as documented in [Symbols and SymbolTables](https://mlir.llvm.org/docs/SymbolsAndSymbolTables/#defining-or-declaring-a-symbol). This is a follow-up of llvm#168376

Identified with llvm-use-ranges.

While I am at it, this patch converts one of the loops to use llvm::is_contained. Identified with modernize-loop-convert.

Idx is already of type unsigned. Identified with readability-redundant-casting.

This patch consolidates the grow() logic in DenseMapBase::grow. With this patch, DenseMapBase::grow() creates a temporary grown instance and then lets DenseMap/SmallDenseMap attempt to move the instance back to *this. If it doesn't work, we move again. The "attempt to move" always succeeds for DenseMap. For SmallDenseMap, it succeeds only in the large mode. This is part of the effort outlined in llvm#168255.

This patch removes DenseMap::init and SmallDenseMap::init by inlining them into their call sites and simplifying them. init() is defined as: void init(unsigned InitNumEntries) { auto InitBuckets = BaseT::getMinBucketToReserveForEntries(InitNumEntries); this->initWithExactBucketCount(InitBuckets); } - Constuctors: Now that we have constructors that allocate the exact number of buckets (as opposed to the number of key/value pairs), init() does too much. Once we convert the number of key/value pairs to the number of buckets, we can call the constructors that take the exact number of buckets. - init(0) in the move assignment operators simplifies down to: initWithExactBucketCount(0) - shrink_and_clear() computes the number of buckets to have after the clear operation. As such, we should call initWithExactBucketCount, not init. Otherwise, we would end up adding "load factor padding" on top of NewNumBuckets: NextPowerOf2(NewNumBuckets * 4 / 3 + 1) All in all, init() doesn't bring any value in the current setup. This patch is part of the effort outlined in llvm#168255.

z1-cciauto · 2025-11-17T16:06:27Z

PSDB Link: https://compiler-ci.amd.com/job/compiler-psdb-amd-staging/2836

romanova-ekaterina and others added 29 commits November 17, 2025 04:24

[NFC][analyzer] Add missing documentation for decodeValueOfObjCType (…

c2ddaaa

…llvm#167822) This check is introduced in llvm@b284005, but the documentation seems missing from `checkers.rst`.

[X86] bittest-big-integer.ll - add BLSR style pattern test (llvm#168356)

515924f

Test using CTTZ to determine the lowest set bit, clear it and return the index Shows failure to use RMW pattern on the load-btr-store due to additional (but non-interference) uses of the load.

[mlir][bazel] Fix build after llvm#167848. (llvm#168366)

ae2fec0

[mlir][amdgpu] Fix documentation and verifiers (llvm#167369)

e468ea3

Revert "[clang][SourceManager] Use getFileLoc when computing `getPr…

fd1bdfd

…esumedLoc`" (llvm#168368) Reverts llvm#166255 It broke bots: https://lab.llvm.org/buildbot/#/builders/190/builds/31102

[flang][NFC] Strip trailing whitespace from tests (5 of N)

29e7b4f

Only the fortran source files in flang/test/Lower/OpenACC have been modified. The other files in flang/test will be cleaned up in subsequent commits

[X86] Enable APX and AVX10.2 on NVL (llvm#168061)

b6fd3c6

Per Intel Architecture Instruction Set Extensions Programming Reference rev. 60 (https://cdrdv2.intel.com/v1/dl/getContent/671368), table 1-2, NVL supports APX and AVX10.2

[llvm][RISCV] Support splat and vp_splat for zvfbfa codegen (llvm#167920

9fe0a70

)

[XRay] Prefix setting XRAY_OPTIONS with env

53e3f8e

So setting the environment variable works with the new internal shell. This does not fix all the XRay tests because some of them are using subshells and need to be rewritten to not use subshells.

[XRay] Rewrite tests to not use subshells

c7a9be8

So that they will actually function with the internal shell.

[clang] Support constrained fp elementwise builtins (llvm#166905)

e9743e2

Currently only __builtin_elementwise_sqrt emits contrained fp intrinsic and propagates fp options. This commit adds this support for the rest of elementwise builtins.

[MLIR] Apply clang-tidy fixes for readability-identifier-naming in Pa…

17cbb48

…rser.cpp (NFC)

[Flang] [OpenMP] Add support for spaces in between the name (llvm#168311

38811be

) Supports the fixed form syntax which has spaces in between the identifier

[Option] Use llvm::is_contained (NFC) (llvm#168295)

498a01d

Identified with llvm-use-ranges.

[TargetParser] Use range-based for loops (llvm#168296)

99bf41c

While I am at it, this patch converts one of the loops to use llvm::is_contained. Identified with modernize-loop-convert.

[IPO] Remove a redundant cast (NFC) (llvm#168297)

bf21156

Idx is already of type unsigned. Identified with readability-redundant-casting.

merge main into amd-staging

5a5462d

z1-cciauto requested a review from krzysz00 as a code owner November 17, 2025 16:03

z1-cciauto requested a review from kuhar as a code owner November 17, 2025 16:03

z1-cciauto requested a review from a team November 17, 2025 16:04

ronlieb approved these changes Nov 17, 2025

View reviewed changes

ronlieb merged commit 7d6a25a into amd-staging Nov 17, 2025
15 checks passed

ronlieb deleted the upstream_merge_202511171103 branch November 17, 2025 19:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

merge main into amd-staging #608

merge main into amd-staging #608

Uh oh!

z1-cciauto commented Nov 17, 2025

Uh oh!

z1-cciauto commented Nov 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

26 participants

merge main into amd-staging #608

merge main into amd-staging #608

Uh oh!

Conversation

z1-cciauto commented Nov 17, 2025

Uh oh!

z1-cciauto commented Nov 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

26 participants