Add new ops wrapped_linear_prepack and wrapped_quantized_linear_prepacked #134232
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/134232
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (2 Unrelated Failures) As of commit 3c8b8d1 with merge base fee677e:
FLAKY - The following job failed but was likely due to flakiness present on trunk.
BROKEN TRUNK - The following job failed but was present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D61395887
Attention! native_functions.yaml was changed
If you are adding a new function or defaulted argument to native_functions.yaml, you cannot use it from pre-existing Python frontend code until our FC window passes (two weeks). Split your PR into two PRs: one which adds the new C++ functionality, and one that makes use of it from Python, and land them two weeks apart. See https://github.com/pytorch/pytorch/wiki/PyTorch's-Python-Frontend-Backward-and-Forward-Compatibility-Policy#forwards-compatibility-fc for more info.
4a8c269 to 2e8edd2
…cked (pytorch#134232)

Summary:
Pull Request resolved: pytorch#134232

This diff adds two new operators, torch.ops._quantized.wrapped_linear_prepack and torch.ops._quantized.wrapped_quantized_linear_prepacked. Together they form a decomposition of the op torch.ops._quantized.wrapped_quantized_linear added in the previous diff. We decomposed it this way because the packed weight can be computed ahead of time, so we don't need to recompute it in every forward pass in AOTI.

Reviewed By: jerryzh168

Differential Revision: D61395887
2e8edd2 to 3c8b8d1
Looks good. Thanks!
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
- func: wrapped_linear_prepack(Tensor weight, Tensor weight_scale, Tensor weight_zero_point, Tensor bias) -> Tensor
- func: wrapped_quantized_linear_prepacked(Tensor input, Tensor input_scale, Tensor input_zero_point, Tensor packed_weight, Tensor output_scale, Tensor output_zero_point, int out_channel) -> Tensor
You don't need these -- these generate aten::wrapped_linear_prepack ops. It looks like you're only defining _quantized::wrapped_linear_prepack
@albanD, why didn't the public API tests trigger on this? It looks like we added a new torch.wrapped_linear_prepack with no docstring.
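For context, a minimal sketch of how the two schemas above would be invoked from Python through the _quantized namespace (matching the comment that only _quantized::wrapped_linear_prepack is defined). The shapes, dtypes, and scale/zero-point values below are illustrative assumptions, a build with the corresponding quantized backend (e.g. fbgemm) is assumed, and note that per #135401 the ops were later made private by prefixing their names with an underscore, so the names may differ on newer builds:

```python
import torch

# Assumed shapes and dtypes; the real dtype requirements come from the C++ kernels,
# so treat these as placeholders rather than a verified recipe.
out_channels, in_channels = 8, 16
weight = torch.randint(-128, 127, (out_channels, in_channels), dtype=torch.int8)
weight_scale = torch.tensor(0.02)
weight_zero_point = torch.tensor(0, dtype=torch.int32)
bias = torch.zeros(out_channels)

# Prepack once; the packed weight can be cached and reused.
packed_weight = torch.ops._quantized.wrapped_linear_prepack(
    weight, weight_scale, weight_zero_point, bias)

x = torch.randn(4, in_channels)
input_scale = torch.tensor(0.05)
input_zero_point = torch.tensor(0, dtype=torch.int32)
output_scale = torch.tensor(0.1)
output_zero_point = torch.tensor(0, dtype=torch.int32)

# Per-forward call that consumes the prepacked weight.
out = torch.ops._quantized.wrapped_quantized_linear_prepacked(
    x, input_scale, input_zero_point, packed_weight,
    output_scale, output_zero_point, out_channels)
```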
// cpp_custom_type_hack stashes the C++ packed-weight object (held by a
// unique_ptr) inside an at::Tensor so it can be returned through the op's
// Tensor-only schema.
auto ret = cpp_custom_type_hack::create(
    std::move(unique_ptr_wrapper), weight.options());
This is a really bad idea. Are you sure you want this? @huayuli00 @houseroad
…to private by adding _ as prefix (#135401)

Summary:
Pull Request resolved: #135401

In #134232, we added two new ops wrapped_linear_prepack and wrapped_quantized_linear_prepacked. From the review comments and offline discussion, we are changing them to private by adding `_` as a prefix.

Reviewed By: houseroad

Differential Revision: D62325142

Approved by: https://github.com/houseroad
Summary:
This diff adds two new operators, torch.ops._quantized.wrapped_linear_prepack and torch.ops._quantized.wrapped_quantized_linear_prepacked. Together they form a decomposition of the op torch.ops._quantized.wrapped_quantized_linear added in the previous diff.
We decomposed it this way because the packed weight can be computed ahead of time, so we don't need to recompute it in every forward pass in AOTI (see the usage sketch below).
Reviewed By: jerryzh168
Differential Revision: D61395887
cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10
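To illustrate that motivation, a hedged sketch of the intended usage pattern under the schemas in this PR: the prepack op runs once (for example at model construction or AOTI compile time) and only the prepacked linear op runs per forward. The module name, argument handling, and dtypes are hypothetical; this is not the actual AOTI integration.

```python
import torch
from torch import nn

class WrappedQuantizedLinear(nn.Module):
    """Hypothetical wrapper: prepack once in __init__, reuse the packed weight
    in every forward; this is the split the decomposition enables."""

    def __init__(self, weight, weight_scale, weight_zero_point, bias,
                 output_scale, output_zero_point):
        super().__init__()
        # One-time packing step, moved out of the per-forward path.
        self.packed_weight = torch.ops._quantized.wrapped_linear_prepack(
            weight, weight_scale, weight_zero_point, bias)
        self.out_channels = weight.shape[0]
        self.output_scale = output_scale
        self.output_zero_point = output_zero_point

    def forward(self, x, input_scale, input_zero_point):
        # Per-forward call only consumes the already-packed weight.
        return torch.ops._quantized.wrapped_quantized_linear_prepacked(
            x, input_scale, input_zero_point, self.packed_weight,
            self.output_scale, self.output_zero_point, self.out_channels)
```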