update quantization doc: add x86 backend as default backend of server inference #86794
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86794. Note: links to docs will display an error until the docs builds have been completed.
✅ No failures as of commit c3d8613. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
Since x86 is now the default qengine replacing fbgemm, can we only recommend x86 for server CPUs? We can leave a note that fbgemm is still available but not recommended.
docs/source/quantization.rst (Outdated)

@@ -742,30 +748,31 @@ Backend/Hardware Support

 Today, PyTorch supports the following backends for running quantized operators efficiently:

-* x86 CPUs with AVX2 support or higher (without AVX2 some operations have inefficient implementations), via `fbgemm <https://github.com/pytorch/FBGEMM>`_
+* x86 CPUs with AVX2 support or higher (without AVX2 some operations have inefficient implementations), via `x86` to apply the optimization of `fbgemm <https://github.com/pytorch/FBGEMM>`_ and `onednn <https://github.com/oneapi-src/oneDNN>`_ (see the details at `RFC <https://github.com/pytorch/pytorch/issues/83888>`_)
Suggested change:

-* x86 CPUs with AVX2 support or higher (without AVX2 some operations have inefficient implementations), via `x86` to apply the optimization of `fbgemm <https://github.com/pytorch/FBGEMM>`_ and `onednn <https://github.com/oneapi-src/oneDNN>`_ (see the details at `RFC <https://github.com/pytorch/pytorch/issues/83888>`_)
+* x86 CPUs with AVX2 support or higher (without AVX2 some operations have inefficient implementations), via `x86` optimized by `fbgemm <https://github.com/pytorch/FBGEMM>`_ and `onednn <https://github.com/oneapi-src/oneDNN>`_ (see the details at `RFC <https://github.com/pytorch/pytorch/issues/83888>`_)
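For context on the doc line being edited: the unified `x86` qengine can be selected at runtime through `torch.backends.quantized.engine`. A minimal sketch follows; the fallback check is illustrative only, assuming just that `torch.backends.quantized.supported_engines` lists the engines compiled into the current build:

```python
import torch

# Prefer the unified 'x86' qengine (which dispatches to fbgemm/onednn
# kernels) when this PyTorch build supports it; otherwise leave the
# build's default engine untouched.
if "x86" in torch.backends.quantized.supported_engines:
    torch.backends.quantized.engine = "x86"

print(torch.backends.quantized.engine)
```

On an older build where `x86` is absent, the pre-existing engine (for example `fbgemm`) simply remains in effect.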
@pytorchbot rebase
@XiaobingSuper feel free to merge when ready.
@pytorchbot successfully started a rebase job. Check the current status here.
Rebase failed due to Command
Raised by https://github.com/pytorch/pytorch/actions/runs/3493543061
@kit1980 @jerryzh168, code is rebased.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
… inference (pytorch#86794). Pull Request resolved: pytorch#86794. Approved by: https://github.com/jgong5, https://github.com/kit1980