[SYCL] Run EarlyCSEPass to eliminate redundant memory ops after SYCLLowerWGLocalMemoryPass #21030

wenju-he · 2026-01-12T02:07:53Z

This fixes a regression from d8ba6b5: a missed optimization when target is SPIR-V.
In SYCL early optimization pipeline, EarlyCSEPass is currently missing after SYCLLowerWGLocalMemoryPass. This PR adds it to VectorizerStartEPCallback which is the first function pass callback following AlwaysInlinerPass.

In pytorch_microbench/kernel-submit-slm-size--1 (Jira GSD-12161), adding EarlyCSEPass can eliminates %4 and replace it with %2:
store ptr addrspace(3) %2, ptr addrspace(3) @__sycl_dynamicLocalMemoryPlaceholder_GV
%4 = load ptr addrspace(3), ptr addrspace(3) @__sycl_dynamicLocalMemoryPlaceholder_GV

…owerWGLocalMemoryPass In SYCL early optimization pipeline, EarlyCSEPass is currently missing after SYCLLowerWGLocalMemoryPass. This PR adds it to VectorizerStartEPCallback which is the first function pass callback following AlwaysInlinerPass. In pytorch_microbench/kernel-submit-slm-size--1 (Jira GSD-12161), adding EarlyCSEPass can eliminates %4 and replace it with %2: store ptr addrspace(3) %2, ptr addrspace(3) @__sycl_dynamicLocalMemoryPlaceholder_GV %4 = load ptr addrspace(3), ptr addrspace(3) @__sycl_dynamicLocalMemoryPlaceholder_GV

wenju-he · 2026-01-13T10:12:44Z

Windows BMG fail Basic/parallel_for_range_roundup.cpp is known issue: #20827

Failed Tests (1):
  SYCL :: Basic/parallel_for_range_roundup.cpp


Testing Time: 182.63s

Total Discovered Tests: 2421
  Unsupported      : 1202 (49.65%)
  Passed           : 1208 (49.90%)
  Expectedly Failed:   10 (0.41%)
  Failed           :    1 (0.04%)

@intel/llvm-gatekeepers please merge, thanks

wenju-he requested review from a team as code owners January 12, 2026 02:07

wenju-he temporarily deployed to WindowsCILock January 12, 2026 02:08 — with GitHub Actions Inactive

wenju-he temporarily deployed to WindowsCILock January 12, 2026 02:37 — with GitHub Actions Inactive

wenju-he had a problem deploying to WindowsCILock January 12, 2026 02:37 — with GitHub Actions Failure

wenju-he temporarily deployed to WindowsCILock January 12, 2026 02:37 — with GitHub Actions Inactive

wenju-he had a problem deploying to WindowsCILock January 12, 2026 05:44 — with GitHub Actions Failure

YuriPlyakhin approved these changes Jan 12, 2026

View reviewed changes

wenju-he had a problem deploying to WindowsCILock January 13, 2026 02:42 — with GitHub Actions Failure

Fznamznon approved these changes Jan 13, 2026

View reviewed changes

uditagarwal97 merged commit bd89e79 into intel:sycl Jan 13, 2026
35 of 38 checks passed

wenju-he deleted the sycl-registerVectorizerStartEPCallback-EarlyCSEPass branch January 13, 2026 22:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL] Run EarlyCSEPass to eliminate redundant memory ops after SYCLLowerWGLocalMemoryPass #21030

[SYCL] Run EarlyCSEPass to eliminate redundant memory ops after SYCLLowerWGLocalMemoryPass #21030

Uh oh!

wenju-he commented Jan 12, 2026 •

edited

Loading

Uh oh!

wenju-he commented Jan 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[SYCL] Run EarlyCSEPass to eliminate redundant memory ops after SYCLLowerWGLocalMemoryPass #21030

[SYCL] Run EarlyCSEPass to eliminate redundant memory ops after SYCLLowerWGLocalMemoryPass #21030

Uh oh!

Conversation

wenju-he commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wenju-he commented Jan 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

wenju-he commented Jan 12, 2026 •

edited

Loading