[AutoBump] Merge with a8d506b3 (Oct 22) (15) #452

jorickert · 2025-01-14T12:31:24Z

No description provided.

…vm#107740) This patch adds a quantized version of the `linalg.conv2d_nchw_fchw` Op. This is the "channel-first" ordering typically used by PyTorch and others.

…m#112801) I've been getting complaints from users being spammed by -gmodules missing file warnings going out of control because each object file depends on an entire DAG of PCM files that usually are all missing at once. To reduce this problem, this patch does two things: 1. Module now maintains a DenseMap<hash, once> that is used to display each warning only once, based on its actual text. 2. The PCM warning itself is reworded to include less details, such as the DIE offset, which is only useful to LLDB developers, who can get this from the dwarf log if they need it. Because the detail is omitted the hashing from (1) deduplicates the warnings. rdar://138144624

…0738)

…hmetic Update visitSIGN_EXTEND_INREG to call FoldConstantArithmetic instead of getNode.

…04533) Add an minor mode which can be optionally used to run clang-format on save. Formatting before saving works well and is convenient to avoid having to remember to manually run clang format. I've written this as it's own package but it's probably better if the functionality is supported by clang-format.el. See: melpa/melpa#8762

for future combines

…vm#113022)

…tcasts (llvm#112710)" This reverts commit a630771. This caused compilation to hang for Windows/ARM, see llvm#112710 for details.

The new inlay hints have the `DefaultArguments` kind and can be enabled in config similar to other inlay kint kinds.

The personality routine `__xlcxx_personality_v0` in `libc++abi` is hard-coded in the unwinder as the handler for EH in applications generated by the legacy IBM C++ compiler. The symbol is resolved dynamically using `dlopen` to avoid a hard dependency of `libunwind` on `libc++abi` for cases such as non-C++ applications. However, `dlclose` was incorrectly called after `dlsym` succeeded, potentially invalidating the function pointer obtained from `dlsym` when the memory allocated for the `dlopen` is reclaimed. This PR changes to call `dlclose` only when `dlsym` fails.

This allows languages to provide an opinion on whether two symbol contexts are equivalent (i.e. belong to the same function). It is useful to drive the comparisons done by stepping plans that need to ensure symbol contexts obtained from different points in time are actually the same.

Induction users only need to be updated when vectorizing the epilogue. Avoid running fixupIVUsers when vectorizing the main loop during epilogue vectorization.

…a constant value. (llvm#113014) This patch adds support for constant folding for the `ilogb` and `ilogbf` libc functions.

Currently both True/False counts were folded. It lost the information, "It is True or False before folding." It prevented recalling branch counts in merging template instantiations. In `llvm-cov`, a folded branch is shown as: - `[True: n, Folded]` - `[Folded, False n]` In the case If `n` is zero, a branch is reported as "uncovered". This is distinguished from "folded" branch. When folded branches are merged, `Folded` may be dissolved. In the coverage map, either `Counter` is `Zero`. Currently both were `Zero`. Since "partial fold" has been introduced, either case in `switch` is omitted as `Folded`. Each `case:` in `switch` is reported as `[True: n, Folded]`, since `False` count doesn't show meaningful value. When `switch` doesn't have `default:`, `switch (Cond)` is reported as `[Folded, False: n]`, since `True` count was just the sum of `case`(s). `switch` with `default` can be considered as "the statement that doesn't have any `False`(s)".

…les when needed (llvm#109862) This PR adds an OpenMP dialect related pass for FIR/HLFIR which creates `MapInfoOp` instances for certain privatized symbols. For example, if an allocatable variable is used in a private clause attached to a `omp.target` op, then the allocatable variable's descriptor will be needed on the device (e.g. GPU). This descriptor needs to be separately mapped onto the device. This pass creates the necessary `omp.map.info` ops for this.

…lvm#113005)

@mstorsjo

…1)), c2) -> (and (ext V), (and c1, (ext c2))) Noticed while triaging the regression from llvm#112710 noticed by @mstorsjo - don't rely on isConstantIntBuildVectorOrConstantInt+getNode to guarantee constant folding (if it fails to constant fold it will infinite loop), use FoldConstantArithmetic instead.

…lvm#112710) (REAPPLIED) Alter both isConstantIntBuildVectorOrConstantInt + isConstantFPBuildVectorOrConstantFP to return a bool instead of the underlying SDNode, and adjust usage to account for this. Update isConstantIntBuildVectorOrConstantInt to peek though bitcasts when attempting to find a constant, in particular this improves canonicalization of constants to the RHS on commutable instructions. X86 is the beneficiary here as it often bitcasts rematerializable 0/-1 vector constants as vXi32 and bitcasts to the requested type Minor cleanup that helps with llvm#107423 Reapplied after regression fix ba1255d

These headers are not doing anything beyond the system or compiler provided equivalent headers, so there's no real reason to keep them around. Reducing the number of C headers we provide in libc++ simplifies our header layering and reduces the potential for confusion when headers are layered incorrectly.

Fixes: llvm#112836

This patch fixes the build failure seen on z/OS: ``` llvm/clang/include/clang/ASTMatchers/ASTMatchers.h:7212:1: error: unknown type name 'CLANG_ABI' ```

…m#113294) This fixes layering violation introduced in 2fd01d7. The declaration is moved to `SemaTemplateInstantiate` section of `Sema.h`, after the file where it's implemented.

…TQ` (llvm#113295) This patch improves, but doens't fully resolve the layering violation, which stems from relying on Sema. There's one function that needs to convert enumerator to a string (`buildQualifier` in `FixItHintUtils.cpp`), but `Qualifiers::TQ` doesn't offer such function. Even more, the set of enumerators is not complete compared to `DeclSpec::TQ`, so I'm afraid that this would be a functional change.

Enable all valid registers for intrinsics that read from and write to global named registers.

…llvm#111084) - [x] Add a simple canonicalization for `mlir::index::AddOp`.

Recent change applied too strict check for old and src operands match. These shall be compatible, but not necessarily exactly the same. Fixes: SWDEV-493072

PR is to: 1. Simplify test update in llvm#113200 2. Make tests more comprehensive, currently interesting cases looks very basic: ``` ; CHECK-LABEL: @ICmpSGTAllOnes ; CHECK: icmp slt ; CHECK-NOT: call void @__msan_warning ; CHECK: icmp sgt ; CHECK-NOT: call void @__msan_warning ; CHECK: ret i1 ```

Fixes llvm#111212. This grows .text by 5.3% on CTMark, (or 2.6% large internal binary) Perf regressed by 1.6%. We will try to improve in follow up patches. It worth to pay some performance regression to fix correctness to avoid stuff like llvm#111212.

For consistency with other dialects and other CUF passes and files, this patch renames passes CufOpConversion to CUFOpConversion, CufImplicitDeviceGlobal to CUFDeviceGlobal. It also renames the file.

…lvm#113216) Until now debug info was printing the symbols names as-is and that resulted in invalid PTX when the symbols contained characters that are invalid for PTX. E.g. `__PRETTY_FUNCTION.something` Debug info is somewhat disconnected from the symbols themselves, so the regular "NVPTXAssignValidGlobalNames" pass can't easily fix them. As the "plan B" this patch catches printout of debug symbols and fixes them, as needed. One gotcha is that the same code path is used to print the names of debug info sections. Those section names do start with a '.debug'. The dot in those names is nominally illegal in PTX, but the debug section names with a dot are accepted as a special case. The downside of this change is that if someone ever has a `.debug*` symbol that needs to be referred to from the debug info, that label will be passed through as-is, and will still produce broken PTX output. If/when we run into a case where we need it to work, we could consider only passing through specific debug section names, or add a mechanism allowing us to tell section names apart from regular symbols. Fixes llvm#58491

…3376) Reverts llvm#113200 Breaks bots, see llvm#113200

…13378)

When printing a memory operand in MIR, this line https://github.com/llvm/llvm-project/blob/d37bc32a65651e647148236ffb9728ea2e77eac3/llvm/lib/CodeGen/MachineOperand.cpp#L1247 calls this https://github.com/llvm/llvm-project/blob/d37bc32a65651e647148236ffb9728ea2e77eac3/llvm/include/llvm/Support/Alignment.h#L238 which assumes `Rhs` (the size in this case) is positive. But Wasm reference types' size is set to 0: https://github.com/llvm/llvm-project/blob/d37bc32a65651e647148236ffb9728ea2e77eac3/llvm/include/llvm/CodeGen/ValueTypes.td#L326-L328 `getSize() > 0` condition was added with the Wasm reference types support in llvm@46667a1, and it looks it was removed in llvm#84751. This revives the condition so that Wasm reference types will not crash the MIR printer.

…llvm#113379) Reverts llvm#113376 Fixed with llvm#113378

) The convention is to use enum names that match the source spelling (up to upper/lower case), including names with underscores. Remove the special case from unparser, update tests.

ubfx and others added 30 commits October 19, 2024 18:25

[mlir][linalg] Add quantized conv2d operator with FCHW,NCHW order (ll…

02bf3b5

…vm#107740) This patch adds a quantized version of the `linalg.conv2d_nchw_fchw` Op. This is the "channel-first" ordering typically used by PyTorch and others.

[Github] Bump CI compiler version to 19.1.2 (llvm#113016)

f87f3ad

AMDGPU: Handle folding frame indexes into add with immediate (llvm#11…

ef91cd3

…0738)

[X86] X86.td - whitespace cleanup. NFC.

06fce61

[DAG] Move SIGN_EXTEND_INREG constant folding inside FoldConstantArit…

93ec08d

…hmetic Update visitSIGN_EXTEND_INREG to call FoldConstantArithmetic instead of getNode.

[GlobalISel][AArch64] Legalize G_EXTRACT_SUBVECTOR (llvm#112946)

10f6d01

for future combines

[InstCombine] Simplify code with SmallMapVector::operator[] (NFC) (ll…

8819267

…vm#113022)

[lldb] Avoid repeated hash lookups (NFC) (llvm#113024)

ca9f396

[llvm-diff] Avoid repeated hash lookups (NFC) (llvm#113025)

f4136b3

Revert "[DAG] isConstantIntBuildVectorOrConstantInt - peek through bi…

b26df3e

…tcasts (llvm#112710)" This reverts commit a630771. This caused compilation to hang for Windows/ARM, see llvm#112710 for details.

[clangd] Add inlay hints for default function arguments (llvm#95712)

2eb1699

The new inlay hints have the `DefaultArguments` kind and can be enabled in config similar to other inlay kint kinds.

[LV] Fixup IV users only once during epilogue vectorization. (NFC)

2deb3a2

Induction users only need to be updated when vectorizing the epilogue. Avoid running fixupIVUsers when vectorizing the main loop during epilogue vectorization.

[ELF] Pass Ctx & to Defined & CommonSymbol

fe8af49

[VPlan] Mark unreachable code path when retrieving the scalar PH. (NFCI)

cba5c77

[ConstantFold] Fold ilogb and ilogbf when the input parameter is …

1336e3d

…a constant value. (llvm#113014) This patch adds support for constant folding for the `ilogb` and `ilogbf` libc functions.

[ELF] Pass Ctx & to Symbol::getVA

861bd36

[ELF] Remove error-prone RelocationBaseSection::classof

5d928ff

[ELF] Pass Ctx &

e6625a2

[libc][math][c23] Update newhdrgen for new _Float16 math functions (l…

e6c0143

…lvm#113005)

[ARM] Add reduced regression test for infinite-loop due to llvm#112710

94cddcf

[gn build] Port aa7f377

5a47d48

aeubanks and others added 28 commits October 22, 2024 17:32

[gn build] Port 1295d2e

2cbe7c2

[gn build] Port 470a599

c08a5c1

[gn build] Port 6512a8d

24293e6

[gn build] Port 6c4267f

50df49e

[gn build] Port b8930cd

62cb1a3

[lld][WebAssembly] Improve -v/-V/--version flag compat (llvm#113204)

0764e55

Fixes: llvm#112836

[docs] Add link to newest CoC transparency report

383bd05

[RISCV][NFC] Move RISCVISD::TAIL beside RISCVISD::CALL

9b9c2a0

Define CLANG_ABI and LLVM_ABI for z/OS (llvm#113333)

9edb4f7

This patch fixes the build failure seen on z/OS: ``` llvm/clang/include/clang/ASTMatchers/ASTMatchers.h:7212:1: error: unknown type name 'CLANG_ABI' ```

[clang][NFC] Move concepts::createSubstDiagAt from AST to Sema (llv…

8536c2e

…m#113294) This fixes layering violation introduced in 2fd01d7. The declaration is moved to `SemaTemplateInstantiate` section of `Sema.h`, after the file where it's implemented.

[ORC] Fix LLJITWithRemoteDebugging example after db21bd4.

e575483

[PowerPC] Expand global named register support (llvm#112603)

06d1929

Enable all valid registers for intrinsics that read from and write to global named registers.

[RISCV][NFC] Document F64 ISD Nodes

78e026f

[NFC] Fix missplaced comment

00b47b9

[MLIR] Add folding constants canonicalization for mlir::index::AddOp. (…

7191ced

…llvm#111084) - [x] Add a simple canonicalization for `mlir::index::AddOp`.

[flang][cuda] Fix module registration (llvm#113358)

cc4926a

[AMDGPU] Relax __builtin_amdgcn_update_dpp sema check (llvm#113341)

03fef62

Recent change applied too strict check for old and src operands match. These shall be compatible, but not necessarily exactly the same. Fixes: SWDEV-493072

Renaming Cuf passes to CUF (llvm#113351)

f1e59dc

For consistency with other dialects and other CUF passes and files, this patch renames passes CufOpConversion to CUFOpConversion, CufImplicitDeviceGlobal to CUFDeviceGlobal. It also renames the file.

Revert "[msan] Switch to -msan-handle-icmp-exact my default" (llvm#11…

c3aa8b7

…3376) Reverts llvm#113200 Breaks bots, see llvm#113200

[NFC][msan] Workaround arg evaluation order diff GCC vs Clang (llvm#1…

71792dc

…13378)

Revert "Revert "[msan] Switch to -msan-handle-icmp-exact my default"" (…

c77d8ed

…llvm#113379) Reverts llvm#113376 Fixed with llvm#113378

[flang][OpenMP] Rename enum OmpxHold to Ompx_Hold in parser (llvm#113366

a8d506b

) The convention is to use enum names that match the source spelling (up to upper/lower case), including names with underscores. Remove the special case from unparser, update tests.

[AutoBump] Merge with a8d506b (Oct 22)

1b28c53

jorickert merged commit d3ab502 into bump_to_0a17bdfc Jan 24, 2025
4 of 5 checks passed

jorickert deleted the bump_to_a8d506b3 branch January 24, 2025 08:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[AutoBump] Merge with a8d506b3 (Oct 22) (15) #452

[AutoBump] Merge with a8d506b3 (Oct 22) (15) #452

Uh oh!

jorickert commented Jan 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

133 participants

[AutoBump] Merge with a8d506b3 (Oct 22) (15) #452

[AutoBump] Merge with a8d506b3 (Oct 22) (15) #452

Uh oh!

Conversation

jorickert commented Jan 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

133 participants