LLVM and SPIRV-LLVM-Translator pulldown (WW41 2024) #15669

iclsrc · 2024-10-10T22:23:31Z

LLVM: llvm/llvm-project@2f50b28
SPIRV-LLVM-Translator: KhronosGroup/SPIRV-LLVM-Translator@d3e72db7e0d74f4

This reverts commit cf02d8b.

This reverts commit 2383bc8.

This reverts commit b4a8e87.

…Error (NFC) (#106774)" This reverts commit 06939fa.

…rotocol" This reverts commit a7c1745.

Fix windows test after #108921.

…epCandidate() (#109212) These are helper functions to be used by the vectorizer's dependency graph.

@firstmoonlight

Resolve #94928 This PR adds `if (TD->getTemplateDecl())` to prevent `InnerD` becoming `nullptr`, suggested by @firstmoonlight. I also add `-ast-dump-decl-types` option and declare type `CHECK` to the testcase `clang/test/AST/ast-dump-concepts.cpp`. --------- Co-authored-by: Aaron Ballman <aaron@aaronballman.com>

This patch improves the documentation for JITLink by fixing some typos, correcting indentations and fixing out-dated code examples.

…uild. (#109078)" (#109207) `std::complex` operators do not work for the CUDA device compilation of F18 runtime. This change makes use of `cuda::std::complex` from `libcudacxx`. `cuda::std::complex` does not have specializations for `long double`, so the change is accompanied with a clean-up for `long double` usage. Additional change on top of #109078 is to use `cuda::std::complex` only for the device compilation, otherwise the host compilation fails because `libcudacxx` may not support `long double` specialization at all (depending on the compiler).

…109176) The API is present, and we even have a test for it, but it isn't documented so no one probably knows you can set requirements for your scripted commands. This just adds docs and uses it appropriately in the `framestats` example command.

… is marked Promote. We have a special check that tries to determine if vector FP operations are supported for the type to determine whether to scalarize or not. If FP arithmetic would be promoted, don't unroll. This improves Zvfhmin codegen on RISC-V.

Check that the destination of G_EXTRACT_SUBVECTOR is smaller than the source. Improve wording of error messages.

-Improve messages. -Remove redundant checks that are handled in generic code. -Add check that the subvector is smaller than the vector. -Add checks that subvector is smaller than the vector.

This revision adds vector predication smax, smin, umax and umin intrinsic ops.

Fixes #108589.

…ariable (#109213) This patch adds new runtime entry points that perform the simple allocation/deallocation of module allocatable variable with cuda attributes. When the allocation is initiated on the host, the descriptor on the device is synchronized. Both descriptors point to the same data on the device. This is the first PR of a stack.

… (#109195) Change RegisterBankEmitter to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089

Convert `cuf.allocate` and `cuf.deallocate` to the runtime entry points added in #109213 Was reviewed in llvm/llvm-project#109214 but the parent branch was closed for some reason.

Added tests to the validator and fixed issues stemming from the previous skipping over BBs with single successors - which is incorrect. That would be now picked by added tests where the assertions are expected to be triggered.

…ntable callsites (#109184) Reinforcing properties ensured at instrumentation time.

Example: https://lab.llvm.org/buildbot/#/builders/169/builds/3381 The CI allowed the `llvm::append_range` instantiation, but on the other hand it's quite unnecessary here.

The code was passing a physical register directly to getPressureSets which expects a register unit. Fix this by looping over the register units and calling getPressureSets for each of them. Found while trying to add a RegisterUnit class to stop storing register units in `Register`. 0 is a valid register unit but not a valid Register.

Change variable name `o` to `OS` to match definition, and `ClName` to `ClassName` for better clarity. Cache RegBank reference in the class and do no pass around class members to functions.

…r (#108094) Make sure there is no data transfer generated when a device variable is used in these intrinsic functions.

…(#109234)

…er (#109194) Change PseudoLoweringEmitter to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089

…109189) Change InstrInfoEmitter to use const RecordKeeper. This is a part of effort to have better const correctness in TableGen backends: https://discourse.llvm.org/t/psa-planned-changes-to-tablegen-getallderiveddefinitions-api-potential-downstream-breakages/81089

@jasonmolenda

…8663) macOS 10.15 added a "full" x86_64 GPR thread state flavor, equivalent to the normal one but with DS, ES, SS, and GSbase added. This flavor can only be used with processes that install a custom LDT (functionality that was also added in 10.15 and is used by apps like Wine to execute 32-bit code). Along with allowing DS, ES, SS, and GSbase to be viewed/modified, using the full flavor is necessary when debugging a thread executing 32-bit code. If thread_set_state() is used with the regular thread state flavor, the kernel resets CS to the 64-bit code segment (see [set_thread_state64()](https://github.com/apple-oss-distributions/xnu/blob/94d3b452840153a99b38a3a9659680b2a006908e/osfmk/i386/pcb.c#L723), which makes debugging impossible. There's no way to detect whether the full flavor is available, try to use it and fall back to the regular one if it's not available. A downside is that this patch exposes the DS, ES, SS, and GSbase registers for all x86_64 processes, even though they are not populated unless the full thread state is available. I'm not sure if there's a way to tell LLDB that a register is unavailable. The classic GDB `g` command [allows returning `x`](https://sourceware.org/gdb/current/onlinedocs/gdb.html/Packets.html#Packets) to denote unavailable registers, but it seems like the debug server uses newer commands like `jThreadsInfo` and I'm not sure if those have the same support. Fixes #57591 (also filed as Apple FB11464104) @jasonmolenda

jsji · 2024-10-16T21:00:23Z

This is ready for review.

Use Basic Blocker Iterator instead @intel/dpcpp-kernel-fusion-reviewers
Update CodeSectionINTEL tests due to EntryPoint customization @intel/dpcpp-spirv-reviewers
[bindless images][Cuda] XFAIL mipmap_read_3D due to type mismatch @intel/bindless-images-reviewers

See #15727

Before b7b28e7, AreSupportedUsers will skip MemTransferInst, it may cause unexpected assertion. https://godbolt.org/z/z5d691fj1 In b7b28e7, we start to allow MemTransferInst, we should allow it in adjustByValArgAlignment too. (cherry picked from commit 0bbdc76)

llvm-spirv/test/extensions/INTEL/SPV_INTEL_function_pointers/CodeSectionINTEL/alias.ll

jsji · 2024-10-17T15:56:46Z

@intel/llvm-gatekeepers Please help to issue a /merge. The dev ci and AMD failures are irrelevant, also failing on other PRs.
#15727 is a follow up for bindless image test.

sarnex · 2024-10-17T16:09:33Z

/merge

bb-sycl · 2024-10-17T16:10:02Z

Thu 17 Oct 2024 04:10:01 PM UTC --- Start to merge the commit into sycl branch. It will take several minutes.

bb-sycl · 2024-10-17T16:14:51Z

Thu 17 Oct 2024 04:14:51 PM UTC --- Merge the branch in this PR to base automatically. Will close the PR later.

adrian-prantl and others added 30 commits September 18, 2024 17:28

Revert "[lldb] Store ECError as CloneableECError in Status"

79a69cb

This reverts commit cf02d8b.

Revert "[lldb] Update SocketTestUtilities.cpp to use CloneableECError"

8b456b4

This reverts commit 2383bc8.

Revert "Add noexcept qualifier to placate g++"

2730373

This reverts commit b4a8e87.

Revert "[lldb] Change the implementation of Status to store an llvm::…

cb6d531

…Error (NFC) (#106774)" This reverts commit 06939fa.

Revert "[lldb] Only send "posix" error codes through the gdb-remote p…

6dcde73

…rotocol" This reverts commit a7c1745.

[NFC][sanitizer] Use InitializePlatformEarly() in test (#109224)

4e659c6

Fix windows test after #108921.

[SandboxIR] Add Instruction::isStackSaveRestoreIntrinsic() and isMemD…

1bda7ba

…epCandidate() (#109212) These are helper functions to be used by the vectorizer's dependency graph.

[Doc] Improve documentation for JITLink. (#109163)

8f3fb5d

This patch improves the documentation for JITLink by fixing some typos, correcting indentations and fixing out-dated code examples.

[SandboxIR] Fix unused variable build error

9f5139c

[MachineVerifier] Improve G_EXTRACT_SUBVECTOR checking (#109202)

e494e2a

Check that the destination of G_EXTRACT_SUBVECTOR is smaller than the source. Improve wording of error messages.

[MachineVerifier] Improve checks for G_INSERT_SUBVECTOR. (#109209)

009398b

-Improve messages. -Remove redundant checks that are handled in generic code. -Add check that the subvector is smaller than the vector. -Add checks that subvector is smaller than the vector.

[mlir][LLVMIR] Add more vector predication intrinsic ops (#107663)

87dc3e8

This revision adds vector predication smax, smin, umax and umin intrinsic ops.

[clang-format] Fix regression in BAS_AlwaysBreak for-await (#108634)

c9aa9d5

Fixes #108589.

[flang][cuda] Convert module allocation/deallocation to runtime calls

156035e

Convert `cuf.allocate` and `cuf.deallocate` to the runtime entry points added in #109213 Was reviewed in llvm/llvm-project#109214 but the parent branch was closed for some reason.

[nfc][ctx_prof] Don't try finding callsite annotation for un-instrume…

ee5709b

…ntable callsites (#109184) Reinforcing properties ensured at instrumentation time.

[ctx_prof] Avoid llvm::append_range to fix some build bots

12d9485

Example: https://lab.llvm.org/buildbot/#/builders/169/builds/3381 The CI allowed the `llvm::append_range` instantiation, but on the other hand it's quite unnecessary here.

[NFC] Cleanup RegisterInfoEmitter code (#109199)

0f06f70

Change variable name `o` to `OS` to match definition, and `ClName` to `ClassName` for better clarity. Cache RegBank reference in the class and do no pass around class members to functions.

[flang][cuda][NFC] Add more descriptor inquiry tests for data transfe…

5e1a54b

…r (#108094) Make sure there is no data transfer generated when a device variable is used in these intrinsic functions.

[flang][cuda][NFC] Fix grammar in CanCUDASymbolHasSave function name …

4194e8d

…(#109234)

jsji temporarily deployed to WindowsCILock October 15, 2024 14:10 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 15, 2024 15:11 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 15, 2024 16:47 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 01:31 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 01:32 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 02:25 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 02:39 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 16:36 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 16:37 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 18:32 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 16, 2024 18:40 — with GitHub Actions Inactive

jsji self-assigned this Oct 16, 2024

jsji added 2 commits October 16, 2024 17:03

[bindless images][Cuda] XFAIL mipmap_read_3D due to type mismatch

a9e8cf6

See #15727

jsji force-pushed the llvmspirv_pulldown branch from 457b79d to 6f4c075 Compare October 17, 2024 00:05

jsji temporarily deployed to WindowsCILock October 17, 2024 00:05 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 17, 2024 00:06 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 17, 2024 00:57 — with GitHub Actions Inactive

jsji temporarily deployed to WindowsCILock October 17, 2024 00:58 — with GitHub Actions Inactive

AlexeySachkov reviewed Oct 17, 2024

View reviewed changes

llvm-spirv/test/extensions/INTEL/SPV_INTEL_function_pointers/CodeSectionINTEL/alias.ll Show resolved Hide resolved

jsji requested a review from AlexeySachkov October 17, 2024 14:27

AlexeySachkov approved these changes Oct 17, 2024

View reviewed changes

bb-sycl approved these changes Oct 17, 2024

View reviewed changes

bb-sycl merged commit 20a7cd1 into sycl Oct 17, 2024
24 of 26 checks passed

bader mentioned this pull request Nov 7, 2024

Revert "LLVM and SPIRV-LLVM-Translator pulldown (WW41 2024)" #16023

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLVM and SPIRV-LLVM-Translator pulldown (WW41 2024) #15669

LLVM and SPIRV-LLVM-Translator pulldown (WW41 2024) #15669

Uh oh!

iclsrc commented Oct 10, 2024

Uh oh!

jsji commented Oct 16, 2024 •

edited

Loading

Uh oh!

Uh oh!

jsji commented Oct 17, 2024

Uh oh!

sarnex commented Oct 17, 2024

Uh oh!

bb-sycl commented Oct 17, 2024

Uh oh!

bb-sycl commented Oct 17, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

LLVM and SPIRV-LLVM-Translator pulldown (WW41 2024) #15669

LLVM and SPIRV-LLVM-Translator pulldown (WW41 2024) #15669

Uh oh!

Conversation

iclsrc commented Oct 10, 2024

Uh oh!

jsji commented Oct 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

jsji commented Oct 17, 2024

Uh oh!

sarnex commented Oct 17, 2024

Uh oh!

bb-sycl commented Oct 17, 2024

Uh oh!

bb-sycl commented Oct 17, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

jsji commented Oct 16, 2024 •

edited

Loading