[Inductor] add config for weight prepacking #93811

Closed
wants to merge 1 commit

Conversation

@Valentine233 (Collaborator) commented Feb 1, 2023

Fixes #93495

Mkldnn weight prepacking may lead to a large memory footprint for some models, such as UniXcoder. In such cases, mkldnn weight prepacking needs to be disabled to avoid running out of memory.

This PR adds a config option to enable or disable mkldnn weight prepacking.
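
A minimal usage sketch (not part of the PR itself), assuming the flag ends up as `cpp.weight_prepack` in `torch/_inductor/config.py` as the review below suggests; the exact attribute name in the merged version may differ, and the function below is purely illustrative:

```python
import torch
import torch._inductor.config as inductor_config

# Assumed flag name/location; check torch/_inductor/config.py in your build.
inductor_config.cpp.weight_prepack = False  # disable mkldnn weight prepacking

@torch.compile
def linear(x, w, b):
    return torch.nn.functional.linear(x, w, b)

x = torch.randn(8, 64)
w = torch.randn(128, 64)
b = torch.randn(128)
out = linear(x, w, b)  # compiled on CPU without mkldnn weight prepacking
```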

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10 @ezyang @soumith @msaroufim @wconstab @ngimel @bdhirsh @mlazos @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @Guobing-Chen @chunyuan-w @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @desertfire

@pytorch-bot bot commented Feb 1, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/93811

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 90cce21:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torch/_inductor/mkldnn.py (outdated review thread, resolved)
@jgong5 (Collaborator) left a comment

On second thought, the prepack happens outside cpp codegen, so adding a `mkldnn_weight_prepack` at the top level sounds better.

@@ -78,6 +78,9 @@

comment_origin = False

# enable mkldnn weight prepacking to get a better performance; may lead to large memory cost
Suggested change:
- # enable mkldnn weight prepacking to get a better performance; may lead to large memory cost
+ # enable mkldnn weight prepacking to get a better performance; may lead to large memory footprint
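
For context, a hedged reconstruction of what the top-level addition under discussion might look like in `torch/_inductor/config.py`; the attribute name follows jgong5's suggestion above, and the default value is an assumption rather than a quote from the diff:

```python
# torch/_inductor/config.py (top-level placement being discussed)

# enable mkldnn weight prepacking to get a better performance;
# may lead to large memory footprint
mkldnn_weight_prepack = True  # assumed name and default
```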

@Valentine233 added the ciflow/trunk (Trigger trunk jobs on your pull request) label Feb 1, 2023
@Chillee (Contributor) left a comment

Can we move this under the cpp section?

@Valentine233 (Collaborator, Author) commented Feb 1, 2023

> Can we move this under the cpp section?

@Chillee This topic was mentioned by Jiong above. Since the weight prepack happens outside cpp codegen, it may not be appropriate to move it under the cpp section.

@Chillee (Contributor) commented Feb 1, 2023

But it's only relevant for CPU codegen, no?

@jgong5 (Collaborator) commented Feb 1, 2023

> But it's only relevant for CPU codegen, no?

Yes. I'm fine with keeping it inside the cpp section.

@Valentine233 (Collaborator, Author)

> But it's only relevant for CPU codegen, no?

@Chillee OK, changed.
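
A sketch of where the flag plausibly lives after this change, nested under the cpp section of `torch/_inductor/config.py`; the attribute name `weight_prepack` and its default are assumptions, not copied from the merged diff:

```python
# torch/_inductor/config.py (sketch; names assumed)

class cpp:
    # enable mkldnn weight prepacking to get a better performance;
    # may lead to large memory footprint
    weight_prepack = True
```

Keeping the flag under `cpp` groups it with the other CPU-only codegen options, which matches the observation above that prepacking is only relevant for CPU codegen.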

@Valentine233 (Collaborator, Author)

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status

ragulpr added a commit to ragulpr/pytorch that referenced this pull request Feb 2, 2023
…n-dev-setup

* origin: (898 commits)
  Move dynamo.optimizations.distributed to backends (pytorch#93408)
  Remove cuda 11.6 from nightly (pytorch#93979)
  Refactor dynamo register_backend/BACKENDS (pytorch#93389)
  Remove cuda 11.6 from CI replace with 11.7 (pytorch#93406)
  [Dynamo] Rename `GuardBuilder.guarded_code` -> `check_fn_manager` (pytorch#93934)
  Revert "Remove CUDA 11.6 from nightly builds (pytorch#93404)"
  Revert "[inductor] fix crash issue when input is a view tensor (pytorch#90150)"
  Basic Validation for FSDP `state_dict` transformations of modules with persistent buffers (pytorch#93396)
  Merge Inductor perf smoke test with other inductor CI tests (pytorch#93395)
  [inductor] Don't import torchvision (pytorch#93027)
  [FSDP][3/N] Refactor `summon_full_params` unit tests (pytorch#92298)
  [FSDP][2/N] `_summon_full_params` -> `_unshard_params` (pytorch#92297)
  Remove CUDA 11.6 from nightly builds (pytorch#93404)
  Mark buffers that reuse other buffers (pytorch#93329)
  Refactor to allow reuse of SchedulerNode.allocate (pytorch#93328)
  retire sparse_mask_helper (pytorch#91714)
  update fbgemm third party (pytorch#93907)
  [inductor] fix crash issue when input is a view tensor (pytorch#90150)
  [Inductor] add config for weight prepacking (pytorch#93811)
  Check for none for NNModuleVariable.__module__ (pytorch#93326)
  ...
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

[Torch2 CPU] torch._inductor.ir: [WARNING] Using FallbackKernel: aten.cumsum (#93495)
7 participants