[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1) #74572

sanchitintel · 2022-03-22T19:08:19Z

Description

Relanding #68111
Preview4 PR of this RFC.

On the basis of #50256, the below improvements are included:

The preview4 release branch of the oneDNN Graph API is used
The fuser now works with the profiling graph executor. We have inserted type check nodes to guard the profiled tensor properties.

User API:

The optimization pass is disabled by default. Users could enable it by:

torch.jit.enable_onednn_fusion(True)

Performance:

pytorch/benchmark tool is used to compare the performance:

SkyLake 8180 (1 socket of 28 cores):
SkyLake 8180 (single thread):
- By mapping hardswish to oneDNN Graph, it’s 8% faster than PyTorch JIT (NNC + OFI)
  ** We expect performance gain after mapping transpose, contiguous & view to oneDNN graph ops

Directory structure of the integration code

Fuser-related code are placed under:

torch/csrc/jit/codegen/onednn/

Optimization pass registration is done in:

torch/csrc/jit/passes/onednn_graph_fuser.h

CMake for the integration code is:

caffe2/CMakeLists.txt

Limitations

In this PR, we have only supported the optimization on Linux platform. The support on Windows and MacOS will be enabled as a next step.
We have only optimized the inference use case.

facebook-github-bot · 2022-03-22T19:08:26Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/74572
Need help or want to give feedback on the CI? Visit our office hours

💊 CI failures summary and remediations

As of commit c02ece3 (more details on the Dr. CI page):

4/4 failures introduced in this PR

4 failures not recognized by patterns:

Job	Step	Action
^{pull / linux-xenial-cuda11.3-py3.7-gcc7 / build}	^Unknown	🔁 rerun
^{pull / linux-bionic-rocm4.5-py3.7 / build}	^Unknown	🔁 rerun
^{pull / linux-xenial-py3.7-gcc5.4-mobile-lightweight-dispatch-build / build}	^Unknown	🔁 rerun
^{pull / linux-vulkan-bionic-py3.7-clang9 / build}	^Unknown	🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

malfet · 2022-03-22T21:13:44Z

@sanchitintel to make review easier, can you simply cherry-picked landed commit into the branch and then apply any other changes on top of that?

sanchitintel · 2022-03-22T21:27:10Z

to make review easier, can you simply cherry-picked landed commit into the branch and then apply any other changes on top of that?

Sorry @malfet, please clarify which landed commit you're referring to.
Please confirm if you mean first rebasing PR #68111 with the master branch, and then adding this commit to fix the lite-interpreter build. Thanks!

malfet · 2022-03-22T21:29:16Z

This one: cd17683

Summary: ## Description Preview4 PR of this [RFC](pytorch#49444). On the basis of pytorch#50256, the below improvements are included: - The [preview4 release branch](https://github.com/oneapi-src/oneDNN/releases/tag/graph-v0.4.1) of the oneDNN Graph API is used - The fuser now works with the profiling graph executor. We have inserted type check nodes to guard the profiled tensor properties. ### User API: The optimization pass is disabled by default. Users could enable it by: ``` torch.jit.enable_onednn_fusion(True) ``` ### Performance: [pytorch/benchmark](https://github.com/pytorch/benchmark) tool is used to compare the performance: - SkyLake 8180 (1 socket of 28 cores): ![image](https://user-images.githubusercontent.com/65992142/151162305-05e44425-a24e-4d5e-94e1-743b40b87a8c.png) - SkyLake 8180 (single thread): ![image](https://user-images.githubusercontent.com/65992142/151162528-69f90b79-d08d-46b8-8775-d80a6ccbce8a.png) \* By mapping hardswish to oneDNN Graph, it’s 8% faster than PyTorch JIT (NNC + OFI) \** We expect performance gain after mapping transpose, contiguous & view to oneDNN graph ops ### Directory structure of the integration code Fuser-related code are placed under: ``` torch/csrc/jit/codegen/onednn/ ``` Optimization pass registration is done in: ``` torch/csrc/jit/passes/onednn_graph_fuser.h ``` CMake for the integration code is: ``` caffe2/CMakeLists.txt ``` ## Limitations - In this PR, we have only supported the optimization on Linux platform. The support on Windows and MacOS will be enabled as the next step. - We have only optimized the inference use case. Pull Request resolved: pytorch#68111 Reviewed By: eellison Differential Revision: D34584878 Pulled By: malfet fbshipit-source-id: ce817aa8cc9052ee9ed930c9cf66be83449e61a4

sanchitintel · 2022-03-22T22:37:32Z

Windows build failed while compiling a lite interpreter file (test_jit/CMakeFiles/test_jit.dir/test_lite_interpreter.cpp.obj), but seems to have failed due to an unrelated cause -

Will rebase later to check if the issue got fixed.
Similar failures in other PRs, such as #74586.

sanchitintel · 2022-03-23T01:28:24Z

Closing & reopening as #74596. Thanks!

sanchitintel · 2022-03-23T17:08:37Z

~~Somehow CI is not running on #74596.~~ GitHub Actions outage is over

facebook-github-bot added the cla signed label Mar 22, 2022

facebook-github-bot added the oncall: jit Add this issue/PR to JIT oncall triage queue label Mar 22, 2022

pytorchbot added the open source label Mar 22, 2022

chunyuan-w and others added 2 commits March 22, 2022 14:37

Fix lite-interpreter build

bc4739a

sanchitintel force-pushed the onednn-graph-preview4 branch from 7b7dbfc to bc4739a Compare March 22, 2022 21:47

sanchitintel closed this Mar 23, 2022

sanchitintel reopened this Mar 23, 2022

Merge branch 'pytorch:master' into onednn-graph-preview4

c02ece3

sanchitintel closed this Mar 23, 2022

sanchitintel mentioned this pull request Mar 23, 2022

[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1) #74596

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1) #74572

[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1) #74572

Uh oh!

sanchitintel commented Mar 22, 2022 •

edited

Loading

Uh oh!

facebook-github-bot commented Mar 22, 2022 •

edited

Loading

Uh oh!

malfet commented Mar 22, 2022

Uh oh!

sanchitintel commented Mar 22, 2022 •

edited

Loading

Uh oh!

malfet commented Mar 22, 2022

Uh oh!

sanchitintel commented Mar 22, 2022 •

edited

Loading

Uh oh!

sanchitintel commented Mar 23, 2022

Uh oh!

sanchitintel commented Mar 23, 2022 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1) #74572

[Re-landing 68111] Add JIT graph fuser for oneDNN Graph API (Preview4.1) #74572

Uh oh!

Conversation

sanchitintel commented Mar 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

User API:

Performance:

Directory structure of the integration code

Limitations

Uh oh!

facebook-github-bot commented Mar 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful links

💊 CI failures summary and remediations

4 failures not recognized by patterns:

Uh oh!

malfet commented Mar 22, 2022

Uh oh!

sanchitintel commented Mar 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

malfet commented Mar 22, 2022

Uh oh!

sanchitintel commented Mar 22, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sanchitintel commented Mar 23, 2022

Uh oh!

sanchitintel commented Mar 23, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

sanchitintel commented Mar 22, 2022 •

edited

Loading

facebook-github-bot commented Mar 22, 2022 •

edited

Loading

sanchitintel commented Mar 22, 2022 •

edited

Loading

sanchitintel commented Mar 22, 2022 •

edited

Loading

sanchitintel commented Mar 23, 2022 •

edited

Loading