[AutoBump] Merge with d0b641b7 (Jan 14) (40) #511

jorickert · 2025-03-18T15:32:44Z

No description provided.

Skip function declarations for instrumentation. Fixes llvm#122467

Add Matcher `m_Undef` Fixes: llvm#122439

…lvm#122507) Internal testing shows improvements in some SPEC HPC benchmarks with this change.

… KnownBits Under certain circumstances, lowering of other instructions can result in computeKnownBits being able to detect a constant that it couldn't previously. Fixes llvm#122580

The inlining code for llvm funcs seems to have needlessly forbidden inlining of private (e.g. non-cloning) symbols.

Co-authored-by: Oleksandr "Alex" Zinenko <ftynse@gmail.com>

Fixes the test introduced in llvm#111145. It would also make sense to throw an error when the user attempts to use a move-from-sr on an unsupported architecture. Currently the encoder generates garbage instructions for a 68000 because the AsmMatcher is able to match the move against a MOV16rr

…e paths contain `..` (llvm#121323) `makeAbsolute` will not normalize path. When getting parent folder, `..` will go into the subfolder instead of the parent folder.

…122595)

…vm#121350) If we have a CSEL instruction that depends on the flags set by a (SUBS x c) instruction and the true and/or false expression is (add (add x y) -c), we can reassociate the latter expression to (add (SUBS x c) y) and save one instruction. Proof for the basic transformation: https://alive2.llvm.org/ce/z/-337Pb We can extend this transformation for slightly different constants. For example, if we have (add (add x y) -(c-1)) and a the comparison x <u c, we can transform the comparison to x <=u c-1 to eliminate the comparison instruction, too. Similarly, we can transform (x == 0) to (x <u 1). Proofs for the transformations that alter the constants: https://alive2.llvm.org/ce/z/3nVqgR Fixes llvm#119606.

With range and undef metadata on a call we can have vector AssertZExt generated on a target with no vector operations. The AssertZExt needs to scalarize to a normal `AssertZext tin, ValueType`. I have added AssertSext too, although I do not have a test case. Fixes llvm#110374

) This adds a test line and updates a comment.

We want special handing for IGLP instructions in the scheduler but they should still be treated like they have side effects by other passes. Add a target hook to the ScheduleDAGInstrs DAG builder so that we have more control over this.

Providing the character that we failed on is helpful for figuring out what's going wrong in the tzdb.

The body of the loop only applies to wide induction recipes, skip any other header phi recipes up-frond

This patch fixes: llvm/lib/Target/AMDGPU/AMDGPUIGroupLP.cpp:255:18: error: private field 'DAG' is not used [-Werror,-Wunused-private-field]

…m#122552) - **[InstSimpify] Add tests for simplifying `(xor (sub C_Mask, X), C_Mask)`; NFC** - **[InstSimpify] Simplifying `(xor (sub C_Mask, X), C_Mask)` -> `X`** Helps address regressions with folding `clz(Pow2)`. Proof: https://alive2.llvm.org/ce/z/zGwUBp

…ng CR for `ct{t,l}z` (llvm#122548)

Note that PointerUnion::{is,get} have been soft deprecated in PointerUnion.h: // FIXME: Replace the uses of is(), get() and dyn_cast() with // isa<T>, cast<T> and the llvm::dyn_cast<T> I'm not touching PointerUnion::dyn_cast for now because it's a bit complicated; we could blindly migrate it to dyn_cast_if_present, but we should probably use dyn_cast when the operand is known to be non-null.

This patch makes the metrics job also detect failures in individual steps. This is necessary now that we are setting continue-on-error in the premerge jobs to prevent sending out unnecessary email to detect what jobs actually fail.

…release note (llvm#122594) <img width="1137" alt="image" src="https://github.com/user-attachments/assets/25433743-2c19-422a-93c5-3edfc1bb7a3f" />

This adds a test that consists of compiling `#include <...>`, pretty much alone, for each public header file in each different language mode (`-std=...` compiler switch) with -Werror and many warnings enabled. There are several headers that have bugs when used alone, and many more headers that have bugs in certain language modes. So for now, compiling the new tests is gated on the cmake switch -DLLVM_LIBC_BUILD_HEADER_TESTS=ON. When all the bugs are fixed, the switch will be removed so future regressions don't land.

Original commit: eb63cd6 Previously reverted due to non-negligible compile-time impact in stage1-ReleaseLTO-g scenario. The issue has been addressed by always reusing previously computed MemorySSA results, and request new ones only when `isMemorySSAEnabled` is set. Co-authored-by: Momchil Velikov <momchil.velikov@arm.com>

) This was an especially challenging escape hatch because it directly forced the use of a specific X-macro structure and prevented any other form of TableGen emission. The problematic feature that motivated this is a case where a builtin's prototype can't be represented in the mini-language used by TableGen. Instead of adding a complete custom entry for this, this PR just teaches the prototype handling to do the same thing the X-macros did in this case: emit an empty string and let the Clang builtin handling respond appropriately. This should produce identical results while preserving all the rest of the structured representation in the builtin TableGen code.

…cal declaration and use that for analysis (llvm#107627) This partially fixes llvm#62072 by making sure that re-declarations of a function do not have the effect of removing lifetimebound from the canonical declaration. It doesn't handle the implicit 'this' parameter, but that can be addressed in a separate fix.

…lvm#122643) Use SmallSetVector instead of SmallVector to avoid duplication, so that dead nodes get erased/deleted only once.

Add some release notes for the Minidump work I did over the last few months.

See the comment in Compiler<>::VisitCXXThisExpr. We need to mark the InitList explicitly, so we later know what to refer to when the init chain is active.

This was useful to test metrics before we had an actual workflow, now it generates noise. Signed-off-by: Nathan Gauër <brioche@google.com>

This patch adds the definition of a new entry block argument-defining `host_eval` clause. This is intended to implement the passthrough approach discussed in [this RFC](https://discourse.llvm.org/t/rfc-openmp-dialect-representation-of-num-teams-thread-limit-and-target-spmd/81106), for supporting host-evaluated clauses that apply to operations nested inside of `omp.target`.

This patch adds the `host_eval` clause to the `omp.target` operation. Additionally, it updates its op verifier to make sure all uses of block arguments defined by this clause fall within one of the few cases where they are allowed. MLIR to LLVM IR translation fails on translation of this clause with a not-yet-implemented error.

…wer-buffer-fat-pointers (llvm#120139)

llvm#98426) This partially addresses llvm#98244.

…other)), undef" --> "binop (shuffle), (shuffle)". (llvm#122118) foldPermuteOfBinops currently requires both binop operands to be oneuse shuffles to fold the shuffles across the binop, but there will be cases where its still profitable to fold across the binop with only one foldable shuffle.

…lvm#116050) This patch introduces the `OpenMPIRBuilder::TargetKernelDefaultAttrs` structure used to simplify passing default and constant values for number of teams and threads, and possibly other target kernel-related information in the future. This is used to forward values passed to `createTarget` to `createTargetInit`, which previously used a default unrelated set of values.

As in llvm#121565 we need to mark all stores as mayStore, hasSideEffects is not enough to prevent moving loads past the instructions. And marking the instructions as mayStore is a sensible thing to do on its own.

Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of the FIXME it introduced in SLPVectorizer: the FIXME is bad, and we'd get no testable impact by using CmpPredicate::getMatching here.

This was missed in 43ba97e (llvm#111821) when reindenting after 917ada3 (llvm#80007).

Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of one of the FIXMEs it introduced by replacing a predicate comparison with CmpPredicate::getMatching.

Improve a test by replacing undef with poison, and regenerate it using UpdateTestChecks.

…isn't length changing.

…mode (llvm#116051) This patch introduces a `TargetKernelRuntimeAttrs` structure to hold host-evaluated `num_teams`, `thread_limit`, `num_threads` and trip count values passed to the runtime kernel offloading call. Additionally, kernel type information is used to influence target device code generation and the `IsSPMD` flag is replaced by `ExecFlags`, which provides more granularity.

…117875) This patch copies the target-cpu and target-features attributes of functions containing target regions into the corresponding outlined function holding the target region. This mirrors what is currently being done for all other outlined functions through the `CodeExtractor` in `OpenMPIRBuilder::finalize()`.

This was added in OpenACC PR #511 in the 3.4 branch. From an AST/Sema perspective this is pretty trivial as the infrastructure for 'if' already exists, however the atomic construct needed to be taught to take clauses. This patch does that and adds some testing to do so.

Enna1 and others added 30 commits January 11, 2025 20:15

[TySan] Skip instrumentation for function declarations (llvm#122488)

876fa60

Skip function declarations for instrumentation. Fixes llvm#122467

[SDPatternMatch] Add Matcher m_Undef (llvm#122521)

1d58699

Add Matcher `m_Undef` Fixes: llvm#122439

[flang] Teach omp-map-info-finalization to reuse descriptor allocas (l…

d291e45

…lvm#122507) Internal testing shows improvements in some SPEC HPC benchmarks with this change.

[X86] LowerCTPOP - check if the operand is a constant when collecting…

b622cc6

… KnownBits Under certain circumstances, lowering of other instructions can result in computeKnownBits being able to detect a constant that it couldn't previously. Fixes llvm#122580

[MLIR] Enable inlining for private symbols (llvm#122572)

b306eff

The inlining code for llvm funcs seems to have needlessly forbidden inlining of private (e.g. non-cloning) symbols.

[MLIR] Import LLVM add flag to disable loadalldialects (llvm#122574)

38fcf62

Co-authored-by: Oleksandr "Alex" Zinenko <ftynse@gmail.com>

[clang-tidy] remove never used IgnoreCase in option (llvm#122573)

ae9bf17

[clang-tidy] fix incorrect configuration file path resolving when fil…

0249554

…e paths contain `..` (llvm#121323) `makeAbsolute` will not normalize path. When getting parent folder, `..` will go into the subfolder instead of the parent folder.

[X86] vector popcnt tests - regenerate VPTERNLOG comments

7895343

[X86] vselect-avx.ll - regenerate VPTERNLOG comments

7b18468

[X86] avx512-mask-op.ll - regenerate VPTERNLOG comments

6078815

[X86] avx512-build-vector.ll - regenerate VPTERNLOG comments

70f3732

[clang-tidy][doc] fix incorrectly code snippet in release note (llvm#…

32351b5

…122595)

[win/asan] GetInstructionSize: Add test for 8D A4 24 .... (llvm#119794

9a9e41c

) This adds a test line and updates a comment.

[libc++] Improve diagnostic when failing to parse the tzdb (llvm#122125)

2914ba1

Providing the character that we failed on is helpful for figuring out what's going wrong in the tzdb.

[VPlan] Skip non-induction phi recipes in legalizeAndOptimizeInductions.

7f59b4e

The body of the loop only applies to wide induction recipes, skip any other header phi recipes up-frond

[AMDGPU] Fix a warning

bfe93ae

This patch fixes: llvm/lib/Target/AMDGPU/AMDGPUIGroupLP.cpp:255:18: error: private field 'DAG' is not used [-Werror,-Wunused-private-field]

[ValueTracking] Take into account whether zero is poison when computi…

17ef436

…ng CR for `ct{t,l}z` (llvm#122548)

[TableGen] Avoid repeated hash lookups (NFC) (llvm#122586)

07ff786

[Sema] Avoid repeated hash lookups (NFC) (llvm#122588)

a56eb7c

[CI] Detect step failures in metrics job (llvm#122564)

eabf931

This patch makes the metrics job also detect failures in individual steps. This is necessary now that we are setting continue-on-error in the premerge jobs to prevent sending out unnecessary email to detect what jobs actually fail.

[clang-tidy][doc] combine the clang-tidy itself's change together in …

2c7829e

…release note (llvm#122594) <img width="1137" alt="image" src="https://github.com/user-attachments/assets/25433743-2c19-422a-93c5-3edfc1bb7a3f" />

[Driver] Avoid repeated map lookups (NFC) (llvm#122625)

4f6fabd

antoniofrighetto and others added 26 commits January 14, 2025 10:03

[gn build] Port 42595bd

ea4a879

[InterleavedAccessPass]: Ensure that dead nodes get erased only once (l…

0209739

…lvm#122643) Use SmallSetVector instead of SmallVector to avoid duplication, so that dead nodes get erased/deleted only once.

[llvm][Docs] Add Minidump related LLDB release notes (llvm#122759)

df1a84d

Add some release notes for the Minidump work I did over the last few months.

[clang][bytecode] Change the way we do init chains (llvm#122871)

ac857f9

See the comment in Compiler<>::VisitCXXThisExpr. We need to mark the InitList explicitly, so we later know what to refer to when the init chain is active.

[llvm][Docs] Formatting changes to LLDB release notes

04733fa

[llvm][Docs] Add release note for lldb-server port mapping changes

cfd7e02

[CI] Remove Check Clang Format from watched workflows (llvm#122740)

05f9cdd

This was useful to test metrics before we had an actual workflow, now it generates noise. Signed-off-by: Nathan Gauër <brioche@google.com>

[AMDGPU] Handle nontemporal and amdgpu.last.use metadata in amdgpu-lo…

cc3aab5

…wer-buffer-fat-pointers (llvm#120139)

[clang] Do not allow unorderable features in [[gnu::target{,_clones}]] (

9988309

llvm#98426) This partially addresses llvm#98244.

[AArch64] Add mayStore to more store instructions

ce7c881

As in llvm#121565 we need to mark all stores as mayStore, hasSideEffects is not enough to prevent moving loads past the instructions. And marking the instructions as mayStore is a sensible thing to do on its own.

SLPVectorizer: strip bad FIXME (NFC) (llvm#122888)

0fe8469

Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of the FIXME it introduced in SLPVectorizer: the FIXME is bad, and we'd get no testable impact by using CmpPredicate::getMatching here.

[libcxx] Reindent a section of a CMake file. NFC. (llvm#122800)

b87fdd9

This was missed in 43ba97e (llvm#111821) when reindenting after 917ada3 (llvm#80007).

[llvm-project] Fix typos mutli and mutliple. NFC. (llvm#122880)

e87f94a

VectorCombine: teach foldExtractedCmps about samesign (llvm#122883)

e409204

Follow up on 4a0d53a (PatternMatch: migrate to CmpPredicate) to get rid of one of the FIXMEs it introduced by replacing a predicate comparison with CmpPredicate::getMatching.

LoopVersioning: improve a test, regen with UTC (llvm#122876)

aae2592

Improve a test by replacing undef with poison, and regenerate it using UpdateTestChecks.

[VectorCombine] foldPermuteOfBinops - ensure potential identity mask …

6a9e987

…isn't length changing.

[AutoBump] Merge with d0b641b (Jan 14)

1f0a0e4

Base automatically changed from bump_to_9190e1c0 to bump_to_392622d0 April 14, 2025 07:51

jorickert merged commit 1ee97cd into bump_to_392622d0 Apr 14, 2025
7 checks passed

jorickert deleted the bump_to_d0b641b7 branch April 14, 2025 07:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AutoBump] Merge with d0b641b7 (Jan 14) (40) #511

[AutoBump] Merge with d0b641b7 (Jan 14) (40) #511

Uh oh!

jorickert commented Mar 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

133 participants

[AutoBump] Merge with d0b641b7 (Jan 14) (40) #511

[AutoBump] Merge with d0b641b7 (Jan 14) (40) #511

Uh oh!

Conversation

jorickert commented Mar 18, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

133 participants