
Overload vec::dequantize to eliminate rounding error for quantized sigmoid #114098

Closed

Conversation

@Xia-Weiwen (Collaborator) commented Nov 20, 2023

Stack from ghstack (oldest at bottom):

Description
Fix #107030
Dequantize X by `(x_val - zp) * scale` instead of `x_val * scale + (-zp * scale)` to eliminate the rounding error.
Currently this overload is used for sigmoid only.
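
To make the fix concrete, here is a minimal scalar sketch (an editorial illustration, not the actual `vec::dequantize` code; `scale` and `zp` below are arbitrary example values) that compares the two formulas over all uint8 inputs:

```cpp
// Scalar sketch contrasting the old and new dequantization formulas.
// scale and zp are hypothetical example values, not taken from the PR.
#include <cmath>
#include <cstdint>
#include <cstdio>

int main() {
  const float scale = 0.1f;   // hypothetical quantization scale
  const int32_t zp = 17;      // hypothetical zero point

  float max_diff = 0.0f;
  for (int32_t q = 0; q <= 255; ++q) {
    // Old form: two independently rounded products are combined.
    const float old_form =
        static_cast<float>(q) * scale + (-static_cast<float>(zp) * scale);
    // New form: the subtraction q - zp is exact in integers, so only the
    // final multiplication is rounded.
    const float new_form = static_cast<float>(q - zp) * scale;
    max_diff = std::fmax(max_diff, std::fabs(old_form - new_form));
  }
  // A nonzero max_diff reflects the extra rounding in the old form; for the
  // quantized sigmoid that small drift can flip a requantized output by 1.
  std::printf("max |old - new| = %.9g\n", max_diff);
  return 0;
}
```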

Performance impact: (benchmark screenshot not reproduced here)
Measured on Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz (Ice Lake)

Test plan
`python test_quantization.py TestQuantizedOps.test_sigmoid_dequantize_rounding_error`

cc @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10


pytorch-bot bot commented Nov 20, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/114098

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit a7ba41b with merge base 5a96a42:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Xia-Weiwen added a commit that referenced this pull request Nov 20, 2023
Overload vec::dequantize to eliminate rounding error for quantized sigmoid

ghstack-source-id: ae0fc902a817baaf4da14943587808f572bf78bc
Pull Request resolved: #114098
@github-actions github-actions bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Nov 20, 2023
@Xia-Weiwen Xia-Weiwen marked this pull request as draft November 20, 2023 06:55
@jgong5 (Collaborator) left a comment

Maybe we need to revisit other callers and fix them too?

@Xia-Weiwen Xia-Weiwen marked this pull request as ready for review November 21, 2023 05:21
@Xia-Weiwen Xia-Weiwen added the intel This tag is for PR from Intel label Nov 21, 2023
@Xia-Weiwen (Collaborator, Author)

> Maybe we need to revisit other callers and fix them too?

Yes. Maybe we can fix them one by one if the same issue occurs for other ops, so that we can minimize the impact.

@Xia-Weiwen (Collaborator, Author)

Hi @jerryzh168 @salilsdesai @kimishpatel @digantdesai @jianyuh Could you please review this PR? Thanks!

@Xia-Weiwen (Collaborator, Author)

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Nov 23, 2023
@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

xunsongh pushed a commit to xunsongh/pytorch that referenced this pull request Nov 24, 2023
Overload vec::dequantize to eliminate rounding error for quantized sigmoid (pytorch#114098)

Pull Request resolved: pytorch#114098
Approved by: https://github.com/jgong5, https://github.com/jerryzh168
@facebook-github-bot facebook-github-bot deleted the gh/Xia-Weiwen/18/head branch November 26, 2023 15:26
@kimishpatel (Contributor)

> Dequantize X by `(x_val - zp) * scale` instead of `x_val * scale + (-zp * scale)` to eliminate rounding error

Why were we doing `x_val * scale + (-zp * scale)` in the first place? That doesn't sound right.

@Xia-Weiwen (Collaborator, Author)

> Dequantize X by `(x_val - zp) * scale` instead of `x_val * scale + (-zp * scale)` to eliminate rounding error
>
> Why were we doing `x_val * scale + (-zp * scale)` in the first place? That doesn't sound right.

I guess it was about making use of FMA (fused multiply-add) to speed up the computation.
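
For illustration, a rough scalar sketch of that guess (an assumption, not confirmed in this thread; `dequantize_fma_style` is a hypothetical name, not the actual vectorized ATen path):

```cpp
// Presumed motivation for the old form: hoist the zero-point term out of the
// loop so each element needs only one fused multiply-add.
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

std::vector<float> dequantize_fma_style(const std::vector<uint8_t>& q,
                                        float scale, int32_t zp) {
  // Rounded once, up front; this pre-rounding is the source of the drift.
  const float neg_zp_premul = -static_cast<float>(zp) * scale;
  std::vector<float> out(q.size());
  for (std::size_t i = 0; i < q.size(); ++i) {
    // One fused op per element against the precomputed constant.
    out[i] = std::fma(static_cast<float>(q[i]), scale, neg_zp_premul);
  }
  return out;
}
```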

@jgong5 (Collaborator) commented Nov 29, 2023

> I guess it was about making use of FMA (fused multiply-add) to speed up the computation.

It is bound by memory access, so I am not sure FMA really matters here.

Labels
ciflow/trunk (Trigger trunk jobs on your pull request), intel (This tag is for PR from Intel), Merged, module: cpu (CPU specific problem, e.g., perf, algorithm), open source, release notes: quantization