[Quant] Implemented 4 bit embedding op support; added corresponding test case #69768
Conversation
💊 CI failures summary and remediations: as of commit b908918, ci.pytorch.org reported 1 failure (more details on the Dr. CI page). This comment was automatically generated by Dr. CI.
…esponding test case" Summary: Support for the 4 embedding operator has been added. The support is analogous to the preexisting support for byte/8bit embedding. A corresponding test case was added to test_quantized_embedding_op.py Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Test plan: In pytorch main dir, execute ``` python test/test_quantization.py TestStaticQuantizedModule.test_embedding_api ``` to run the series of tests, including the newly added test_embedding_4bit function Tasks: T106931792 Tags [ghstack-poisoned]
torch.testing.assert_close(ref, qresult, atol=0.005, rtol=1e-3)

""" Tests the correctness of the quantized 4 bit embedding lookup operator """
@given(num_embeddings=st.integers(10, 100),
is it possible to merge this test with test_embedding_byte?
Also, we don't want to use hypothesis for testing, so it might be good to remove it here as well and change the test to simple for loops (remove @given; see, for example, https://github.com/pytorch/pytorch/blob/master/test/quantization/core/test_quantized_op.py#L1430). This can be done in a separate PR; a rough sketch of the for-loop style follows below.
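For reference, a minimal sketch of what that for-loop style might look like once the two tests are combined. The class name, parameter grids, and bit-width handling below are illustrative assumptions, not the actual contents of test_quantized_embedding_op.py:

```python
import itertools
import unittest

import torch


class TestQuantizedEmbeddingOps(unittest.TestCase):
    def test_embedding(self):
        # Plain parameter grid instead of @given(num_embeddings=st.integers(10, 100), ...)
        num_embeddings_list = [10, 50, 100]
        embedding_dim_list = [8, 16]   # kept even so 4 bit packing is possible
        bit_widths = [8, 4]            # cover both the byte and the 4 bit variants
        for num_embeddings, embedding_dim, bits in itertools.product(
                num_embeddings_list, embedding_dim_list, bit_widths):
            weights = torch.randn(num_embeddings, embedding_dim, dtype=torch.float32)
            # ... quantize, pack, run the quantized lookup for `bits`, and
            # compare against the float reference here ...
            self.assertEqual(weights.shape, (num_embeddings, embedding_dim))


if __name__ == "__main__":
    unittest.main()
```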
Do you mean removing hypothesis for both test_embedding_byte and test_embedding_4bit or just the latter? It was already there in test_embedding_byte, so I thought I was supposed to add it.
And yeah, I can easily combine test_embedding_byte and test_embedding_4bit. Is it acceptable to rename the function to something like test_embedding?
> Do you mean removing hypothesis for both test_embedding_byte and test_embedding_4bit or just the latter? It was already there in test_embedding_byte, so I thought I was supposed to add it.

We can combine the tests and remove hypothesis; it can be done in a separate PR.
Sounds good. I'll put that in PR5
Looks good, had a comment for the test.
…esponding test case" Summary: Support for the 4 embedding operator has been added. The support is analogous to the preexisting support for byte/8bit embedding. A corresponding test case was added to test_quantized_embedding_op.py Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Test plan: In pytorch main dir, execute ``` python test/test_quantization.py TestStaticQuantizedModule.test_embedding_api ``` to run the series of tests, including the newly added test_embedding_4bit function Tasks: T106931792 Tags [ghstack-poisoned]
…esponding test case" Summary: Support for the 4 embedding operator has been added. The support is analogous to the preexisting support for byte/8bit embedding. A corresponding test case was added to test_quantized_embedding_op.py Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Test plan: In pytorch main dir, execute ``` python test/test_quantization.py TestStaticQuantizedModule.test_embedding_api ``` to run the series of tests, including the newly added test_embedding_4bit function Tasks: T106931792 Tags [ghstack-poisoned]
please import the PR with ghimport
…esponding test case" Summary: Support for the 4 embedding operator has been added. The support is analogous to the preexisting support for byte/8bit embedding. A corresponding test case was added to test_quantized_embedding_op.py Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Test plan: In pytorch main dir, execute ``` python test/test_quantization.py TestStaticQuantizedModule.test_embedding_api ``` to run the series of tests, including the newly added test_embedding_4bit function Tasks: T106931792 Tags [ghstack-poisoned]
…esponding test case" Summary: Support for the 4 embedding operator has been added. The support is analogous to the preexisting support for byte/8bit embedding. A corresponding test case was added to test_quantized_embedding_op.py Reviewers: jerryzh168 Subscribers: jerryzh168, supriyar Test plan: In pytorch main dir, execute ``` python test/test_quantization.py TestStaticQuantizedModule.test_embedding_api ``` to run the series of tests, including the newly added test_embedding_4bit function Tasks: T106931792 Tags [ghstack-poisoned]
@dzdang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
…roduced by #69768" Differential Revision: [D33808593](https://our.internmc.facebook.com/intern/diff/D33808593) [ghstack-poisoned]
…69768 (#71387) Summary: Pull Request resolved: pytorch/pytorch#71387 Test Plan: Imported from OSS Reviewed By: albanD Differential Revision: D33808593 Pulled By: dzdang fbshipit-source-id: 3950400dc7506006666fcd055819e9a08a42eda9 (cherry picked from commit 38dc2de)
Stack from ghstack:
Summary: Support for the 4 bit embedding operator has been added. The support is analogous to the preexisting support for byte/8-bit embedding. A corresponding test case was added to test_quantized_embedding_op.py. (A usage sketch follows the description below.)
Reviewers: jerryzh168
Subscribers: jerryzh168, supriyar
Test plan: In the pytorch main dir, execute `python test/test_quantization.py TestStaticQuantizedModule.test_embedding_api` to run the series of tests, including the newly added test_embedding_4bit function.
Tasks: T106931792
Tags
Differential Revision: D33152673
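For orientation, a rough usage sketch of how the new 4 bit op might be exercised, assuming it mirrors the calling convention of the existing byte variant (torch.ops.quantized.embedding_byte plus embedding_bag_byte_prepack). The prepack call, op name, and tolerances below are assumptions for illustration, not taken from this PR's diff:

```python
import torch

num_embeddings, embedding_dim = 10, 16   # embedding_dim kept even for 4 bit packing
weights = torch.randn(num_embeddings, embedding_dim, dtype=torch.float32)

# Assumed prepack step: pack float rows into the 4 bit format with per-row qparams.
packed = torch.ops.quantized.embedding_bag_4bit_prepack(weights)

indices = torch.randint(0, num_embeddings, (5,), dtype=torch.int64)

# Assumed lookup op added by this PR, analogous to torch.ops.quantized.embedding_byte.
qresult = torch.ops.quantized.embedding_4bit(packed, indices)

# Float reference; 4 bit quantization is lossy, so the tolerance here is loose and illustrative.
ref = torch.nn.functional.embedding(indices, weights)
torch.testing.assert_close(ref, qresult, atol=0.5, rtol=0.05)
```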