[Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols #137794

EikanWang · 2024-10-11T17:37:12Z

Stack from ghstack (oldest at bottom):

Intel GPU aten library(libtorch_xpu) utilizes torchgen to generate structure kernels. Currently, the generated structure kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controlled by the CAFFE2_BUILD_MAIN_LIB macro. However, we cannot enable CAFFE2_BUILD_MAIN_LIB for the Intel GPU ATen library naively. Because the macro not only serves for the TORCH_API semantic. It means that the semantic of TORCH_API is symbol hidden.

https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99

Therefore, we need to use TORCH_XPU_API to decorate the produced structure kernels.

cc @voznesenskym @penguinwu @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. [ghstack-poisoned]

pytorch-bot · 2024-10-11T17:37:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137794

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 9361b5a with merge base 41977a0 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / linux-focal-py3.9-clang10 / test (dynamo, 2, 3, linux.2xlarge) (gh) (similar failure)
'test/test_autograd.py::TestAutograd::test_post_accumulate_grad_hook_e2e'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. ghstack-source-id: ef0ca21 Pull Request resolved: #137794

…e structured kernel symbols" Intel GPU aten library(libtorch_xpu) utilizes `torchgen` to generate structure kernels. And the generated structure kernels are decorated by `TORCH_API` to control the visibility, while `TORCH_API` is controlled by the `CAFFE2_BUILD_MAIN_LIB` macro. https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99 Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define` CAFFE2_BUILD_MAIN_LIB` macro to enable it. [ghstack-poisoned]

…e structured kernel symbols" Intel GPU aten library(libtorch_xpu) utilizes `torchgen` to generate structure kernels. Currently, the generated structure kernels are decorated by `TORCH_API` to control the visibility, while `TORCH_API` is controlled by the `CAFFE2_BUILD_MAIN_LIB` macro. However, we cannot enable `CAFFE2_BUILD_MAIN_LIB` for the Intel GPU ATen library naively. Because the macro not only serves for the `TORCH_API` semantic. It means that the semantic of `TORCH_API` is symbol `hidden`. https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99 Therefore, we need to use ` TORCH_XPU_API` to decorate the produced structure kernels. [ghstack-poisoned]

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. ghstack-source-id: c153216 Pull Request resolved: #137794

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. ghstack-source-id: 3628a5e Pull Request resolved: #137794

…e structured kernel symbols" Intel GPU aten library(libtorch_xpu) utilizes `torchgen` to generate structure kernels. Currently, the generated structure kernels are decorated by `TORCH_API` to control the visibility, while `TORCH_API` is controlled by the `CAFFE2_BUILD_MAIN_LIB` macro. However, we cannot enable `CAFFE2_BUILD_MAIN_LIB` for the Intel GPU ATen library naively. Because the macro not only serves for the `TORCH_API` semantic. It means that the semantic of `TORCH_API` is symbol `hidden`. https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99 Therefore, we need to use ` TORCH_XPU_API` to decorate the produced structure kernels. [ghstack-poisoned]

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. ghstack-source-id: 8714c7c Pull Request resolved: #137794

…uctured kernel symbols" Intel GPU aten library(libtorch_xpu) utilizes `torchgen` to generate structure kernels. Currently, the generated structure kernels are decorated by `TORCH_API` to control the visibility, while `TORCH_API` is controlled by the `CAFFE2_BUILD_MAIN_LIB` macro. However, we cannot enable `CAFFE2_BUILD_MAIN_LIB` for the Intel GPU ATen library naively. Because the macro not only serves for the `TORCH_API` semantic. It means that the semantic of `TORCH_API` is symbol `hidden`. https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99 Therefore, we need to use ` TORCH_XPU_API` to decorate the produced structure kernels. [ghstack-poisoned]

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. ghstack-source-id: 8714c7c Pull Request resolved: #137794

EikanWang · 2024-10-14T02:27:30Z

test/inductor/test_torchinductor_opinfo.py

-    # not implemented for 'Boolean'
-    "nn.functiona.unfold": {b8},


The latest torch-xpu-ops has supported boolean for unfolad. Therefore, we update this case for the sake of CI.

…uctured kernel symbols" Intel GPU aten library(libtorch_xpu) utilizes `torchgen` to generate structure kernels. Currently, the generated structure kernels are decorated by `TORCH_API` to control the visibility, while `TORCH_API` is controlled by the `CAFFE2_BUILD_MAIN_LIB` macro. However, we cannot enable `CAFFE2_BUILD_MAIN_LIB` for the Intel GPU ATen library naively. Because the macro not only serves for the `TORCH_API` semantic. It means that the semantic of `TORCH_API` is symbol `hidden`. https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99 Therefore, we need to use ` TORCH_XPU_API` to decorate the produced structure kernels. cc voznesenskym penguinwu jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy yf225 chenyang78 kadeng muchulee8 ColinPeppler amjames desertfire chauhang [ghstack-poisoned]

…nerated kernels are decorated by TORCH_API to control the visibility, while TORCH_API is controled by the CAFFE2_BUILD_MAIN_LIB macro. Intel GPU requires the generated structure kernels to be visible for linking. Therefore, we need to define CAFFE2_BUILD_MAIN_LIB macro to enable it. ghstack-source-id: de0471e Pull Request resolved: #137794

atalman

lgtm

EikanWang · 2024-10-15T12:33:28Z

@pytorchbot merge

pytorchmergebot · 2024-10-15T12:35:26Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…nel symbols (#137794) Intel GPU aten library(libtorch_xpu) utilizes `torchgen` to generate structure kernels. Currently, the generated structure kernels are decorated by `TORCH_API` to control the visibility, while `TORCH_API` is controlled by the `CAFFE2_BUILD_MAIN_LIB` macro. However, we cannot enable `CAFFE2_BUILD_MAIN_LIB` for the Intel GPU ATen library naively. Because the macro not only serves for the `TORCH_API` semantic. It means that the semantic of `TORCH_API` is symbol `hidden`. https://github.com/pytorch/pytorch/blob/main/c10/macros/Export.h#L95-L99 Therefore, we need to use ` TORCH_XPU_API` to decorate the produced structure kernels. Pull Request resolved: #137794 Approved by: https://github.com/atalman ghstack dependencies: #137873

pytorch-bot bot added topic: not user facing topic category labels Oct 11, 2024

EikanWang changed the title ~~[Intel GPU] Fix linking issue due to invisible structured kernel symbols~~ [Intel GPU] Fix linkage issue due to invisible structured kernel symbols Oct 11, 2024

EikanWang added the ciflow/xpu Run XPU CI tasks label Oct 11, 2024

EikanWang requested a review from atalman October 11, 2024 17:41

EikanWang changed the title ~~[Intel GPU] Fix linkage issue due to invisible structured kernel symbols~~ [Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols Oct 11, 2024

pytorchbot added the open source label Oct 11, 2024

EikanWang removed the request for review from atalman October 11, 2024 18:27

EikanWang added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Oct 11, 2024

EikanWang marked this pull request as draft October 11, 2024 18:39

EikanWang changed the title ~~[Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols~~ [WIP][Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols Oct 11, 2024

EikanWang added 2 commits October 12, 2024 06:38

guangyey mentioned this pull request Oct 12, 2024

Update torch-xpu-ops pin commit #137839

Closed

EikanWang mentioned this pull request Oct 12, 2024

Skip test export with fake tensor inputs on cuda devices for Intel GPU #137847

Closed

EikanWang changed the title ~~[WIP][Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols~~ [Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols Oct 12, 2024

EikanWang requested review from atalman and malfet October 12, 2024 14:02

EikanWang marked this pull request as ready for review October 12, 2024 14:02

EikanWang requested a review from gujinghui as a code owner October 12, 2024 14:02

EikanWang mentioned this pull request Oct 14, 2024

Fix Intel GPU test failure due to unsupport bool for unfold #137873

Closed

pytorch-bot bot added the module: inductor label Oct 14, 2024

EikanWang commented Oct 14, 2024

View reviewed changes

atalman approved these changes Oct 14, 2024

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Oct 15, 2024

pytorchmergebot added the merging label Oct 15, 2024

pytorchmergebot added the Merged label Oct 15, 2024

pytorchmergebot closed this in 5689e33 Oct 15, 2024

pytorchmergebot removed the merging label Oct 15, 2024

github-actions bot deleted the gh/EikanWang/74/head branch November 15, 2024 02:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols #137794

[Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols #137794

EikanWang commented Oct 11, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Oct 11, 2024 •

edited

Loading

Uh oh!

EikanWang Oct 14, 2024

Uh oh!

atalman left a comment

Uh oh!

EikanWang commented Oct 15, 2024

Uh oh!

pytorchmergebot commented Oct 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols #137794

[Intel GPU] Fix Windows linkage issue due to invisible structured kernel symbols #137794

Conversation

EikanWang commented Oct 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 11, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/137794

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

EikanWang Oct 14, 2024

Choose a reason for hiding this comment

Uh oh!

atalman left a comment

Choose a reason for hiding this comment

Uh oh!

EikanWang commented Oct 15, 2024

Uh oh!

pytorchmergebot commented Oct 15, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

EikanWang commented Oct 11, 2024 •

edited

Loading

pytorch-bot bot commented Oct 11, 2024 •

edited

Loading