[pull] master from intel:master #21

pull · 2022-11-17T05:11:25Z

See Commits and Changes for more details.

Created by pull[bot]

Can you help keep this open source service alive? 💖 Please sponsor : )

Create a new CSE to remove redundant WaveBallot for performance.

Add EarlyCSE to pass pipeline without generating weird IR patterns that degrade performance

…dify cr0 on debug SIP exit Only modify cr0 on debug SIP exit

Currently flag value was being overriden in code so it was unusable.

Enable MAXNUM by default in IGCVectorizer

…blem in split barrier Fixed problem in split barrier when we are using with regular barrier. Case: splitbarrier.signal() regularbarrier() splitbarrier.wait() was causing the hang due assingning the same ID of the barrier in the regular barrier and split barrier. Now, the split barrier will take other ID than the regular one.

When the destination type is byte (UB or B), destination sunbregnum can be aligned to 2 or 3 of the (DWORD) execution channel.

Enable abort on spills to SIMD16 for more platforms.

Add missing lit for GenSpecificPattern, also align clang fmt.

For subroutine, there is no need add live out dependence of call BB

…at datatype Fix operands alignment issues for SIMD2 instructions with 64b or float datatype

Group the dpas instructions which have no dependence between each others and can be in same macro block in instruction scheduling

…rands alignment issues for SIMD2 instructions with 64b or float datatype Fix operands alignment issues for SIMD2 instructions with 64b or float datatype

Fix issue that align=1 can not be parsed correctly

Changes: * UseNewInlineRaytracing is now a mask that lets user selectively enable new inline raytracing for particular shader type * New regkey AddDummySlotsForNewInlineRaytracing forces increased number of slots required for rayqueries to test if UMD allocated the HW stacks necessary

…the SWSB compilation time when there is subroutine For subroutine, there is no need add live out dependence of call BB

Fix non-determinism in metadata

…a new CSE to remove redundant WaveBallot Create a new CSE to remove redundant WaveBallot for performance.

For subroutine, there is no need add live out dependence of call BB

…failing When adding Opaque Pointers support to JointMatrix I've found that 4 test were failing due to this assert: info: error, assertion failed: bits == elementSize file: Source\IGC\Compiler\Optimizer\OpenCLPasses\PrivateMemory\PrivateMemoryResolution.cpp function: TransposeHelperPrivateMem::handleLoadInst line: 665 Failed Tests (4): SYCL :: Matrix/SG32/joint_matrix_bf16_fill_k_cache_unroll.cpp SYCL :: Matrix/SG32/joint_matrix_bf16_fill_k_cache_unroll_init.cpp SYCL :: Matrix/joint_matrix_bf16_fill_k_cache_unroll.cpp SYCL :: Matrix/joint_matrix_bf16_fill_k_cache_unroll_init.cpp My investigation showed that such resolution path: alloca -> gep -> load used invalid vector elements count value, which caused this assert to fail. To my understanding the reason for this was that we used elementSize saved in "TransposeHelperPrivateMem" instance, But when we were going thru instructions (alloca->gep->load) then they weren't updated, so there was mismatch.

…the SWSB compilation time when there is subroutine For subroutine, there is no need add live out dependence of call BB

Adding internal options: `-cl-intel-disable-sendwarwa, -ze-opt-disable-sendwarwa` to turn off PVCSendWARWA

Cleaned up dead code that's related to patch token binary format deprecation. Removed unused code, adjusted some comments. Most of these changes are related to previous commits that deprecated the format in VC and OCL. Some parts are still to be refactored, this doesn't cover all patch token code.

Create a new CSE to remove redundant WaveBallot for performance.

Upgrade IGC C++ standard from 17 to 20

For subroutine, there is no need add live out dependence of call BB

…at datatype Fix operands alignment issues for SIMD2 instructions with 64b or float datatype

…SIMD16 drop for more platforms Enable abort on spills to SIMD16 for more platforms.

trafico-bot bot added the 🔍 Ready for Review Pull Request is not reviewed yet label Nov 17, 2022

pull bot added ⤵️ pull and removed 🔍 Ready for Review Pull Request is not reviewed yet labels Nov 17, 2022

trafico-bot bot added the 🔍 Ready for Review Pull Request is not reviewed yet label Nov 17, 2022

VPG-SWE-Github force-pushed the master branch from 22f9c81 to 9bbc7c9 Compare November 18, 2022 12:15

VPG-SWE-Github force-pushed the master branch 2 times, most recently from 89e978f to 809f6c4 Compare December 7, 2022 13:19

VPG-SWE-Github force-pushed the master branch 2 times, most recently from a031151 to 0265002 Compare December 21, 2022 19:06

VPG-SWE-Github force-pushed the master branch from b2bc28e to 6bcd3c8 Compare December 23, 2022 14:18

VPG-SWE-Github force-pushed the master branch 2 times, most recently from 659983d to f179289 Compare January 20, 2023 22:46

VPG-SWE-Github force-pushed the master branch from cc7d017 to 66b520c Compare February 17, 2023 14:18

VPG-SWE-Github force-pushed the master branch from d9dd5f9 to be8a568 Compare March 22, 2023 03:07

VPG-SWE-Github force-pushed the master branch from 8a1c26d to 4f5b8c5 Compare April 1, 2023 07:06

VPG-SWE-Github force-pushed the master branch 4 times, most recently from 067fb01 to cc30341 Compare May 24, 2023 12:10

VPG-SWE-Github force-pushed the master branch from ad79b9e to 99a6292 Compare May 27, 2023 13:07

VPG-SWE-Github force-pushed the master branch from 2870a43 to d37bb42 Compare August 10, 2023 13:09

VPG-SWE-Github force-pushed the master branch 3 times, most recently from 7ab2f49 to bd5532c Compare August 30, 2023 12:06

VPG-SWE-Github force-pushed the master branch 4 times, most recently from e5bc891 to 49fed10 Compare October 6, 2023 19:09

VPG-SWE-Github force-pushed the master branch from e166627 to e30ad65 Compare October 18, 2023 20:08

VPG-SWE-Github force-pushed the master branch from af0d1e1 to fb42ffb Compare October 31, 2023 11:04

ichenkai and others added 29 commits July 9, 2025 19:36

Create a new CSE to remove redundant WaveBallot

6bfa6f3

Create a new CSE to remove redundant WaveBallot for performance.

Add EarlyCSE without degrading performance

5927122

Add EarlyCSE to pass pipeline without generating weird IR patterns that degrade performance

[Autobackout][FunctionalRegression]Revert of change: 10520a2: Only mo…

c02e235

…dify cr0 on debug SIP exit Only modify cr0 on debug SIP exit

Fix RegPressureVerbocity flag

c96c687

Currently flag value was being overriden in code so it was unusable.

Enable MAXNUM by default in IGCVectorizer

43a46ca

Enable MAXNUM by default in IGCVectorizer

Relax byte destination restrictions

4107480

When the destination type is byte (UB or B), destination sunbregnum can be aligned to 2 or 3 of the (DWORD) execution channel.

Enable SIMD16 drop for more platforms

cebdde9

Enable abort on spills to SIMD16 for more platforms.

Add lit for GenSpecificPattern w/ Clang Formatting

7f1a010

Add missing lit for GenSpecificPattern, also align clang fmt.

Reduce the SWSB compilation time when there is subroutine

5eee6f4

For subroutine, there is no need add live out dependence of call BB

Fix operands alignment issues for SIMD2 instructions with 64b or flo…

089d12e

…at datatype Fix operands alignment issues for SIMD2 instructions with 64b or float datatype

Add heuristic for DPAS macro building in post-RA scheduling

58e13ae

Group the dpas instructions which have no dependence between each others and can be in same macro block in instruction scheduling

[Autobackout][FunctionalRegression]Revert of change: 089d12e: Fix ope…

a0f4cb0

…rands alignment issues for SIMD2 instructions with 64b or float datatype Fix operands alignment issues for SIMD2 instructions with 64b or float datatype

Fix issue that align=1 can not be parsed correctly

1f2e916

Fix issue that align=1 can not be parsed correctly

Revert "Add EarlyCSE and fix perf regressions"

c621877

[Autobackout][FunctionalRegression]Revert of change: 5eee6f4: Reduce …

9bd31a2

…the SWSB compilation time when there is subroutine For subroutine, there is no need add live out dependence of call BB

Fix non-determinism in metadata

2dd58f4

Fix non-determinism in metadata

[Autobackout][FunctionalRegression]Revert of change: 6bfa6f3: Create …

5e53ab2

…a new CSE to remove redundant WaveBallot Create a new CSE to remove redundant WaveBallot for performance.

Reduce the SWSB compilation time when there is subroutine

16e7042

For subroutine, there is no need add live out dependence of call BB

[Autobackout][FunctionalRegression]Revert of change: 16e7042: Reduce …

599485a

…the SWSB compilation time when there is subroutine For subroutine, there is no need add live out dependence of call BB

Add internal option to turn off PVCSendWARWA

8530f90

Adding internal options: `-cl-intel-disable-sendwarwa, -ze-opt-disable-sendwarwa` to turn off PVCSendWARWA

Create a new CSE to remove redundant WaveBallot

fb2c3b0

Create a new CSE to remove redundant WaveBallot for performance.

Upgrade IGC C++ standard from 17 to 20

957a009

Upgrade IGC C++ standard from 17 to 20

Reduce the SWSB compilation time when there is subroutine

9d7bfe9

For subroutine, there is no need add live out dependence of call BB

Fix operands alignment issues for SIMD2 instructions with 64b or flo…

495e061

…at datatype Fix operands alignment issues for SIMD2 instructions with 64b or float datatype

[Autobackout][FunctionalRegression]Revert of change: cebdde9: Enable …

398538e

…SIMD16 drop for more platforms Enable abort on spills to SIMD16 for more platforms.

pull bot merged commit 398538e into ConnectionMaster:master Jul 15, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[pull] master from intel:master #21

[pull] master from intel:master #21

Uh oh!

pull bot commented Nov 17, 2022 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

[pull] master from intel:master #21

[pull] master from intel:master #21

Uh oh!

Conversation

pull bot commented Nov 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

pull bot commented Nov 17, 2022 •

edited

Loading