-
Notifications
You must be signed in to change notification settings - Fork 294
Add support for Int4GroupwisePreshuffleTensor for fbgemm #2421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/2421
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New FailureAs of commit e664a1e with merge base 5a50667 ( NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
565e596
to
65a1373
Compare
65a1373
to
8dcecf4
Compare
44b79dd
to
6ce4c7b
Compare
6ce4c7b
to
027648f
Compare
if importlib.util.find_spec("fbgemm_gpu") is None: | ||
quantize_int4_preshuffle = None | ||
else: | ||
from fbgemm_gpu.experimental.gen_ai.quantize import quantize_int4_preshuffle |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this a prototype API? If yes, should the torchao version also be prototype? what does "experimental" mean in the folder structure here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it is stable and production ready, and used in production. it's just bad naming according to @jwfromm, and they have a plan to get rid of it
ccd2cba
to
69a215e
Compare
Summary: Note: slice is not working yet, others are working Test Plan: python test/dtypes/test_int4_groupwise_preshuffle.py Reviewers: Subscribers: Tasks: Tags: stack-info: PR: #2421, branch: jerryzh168/stack/1
69a215e
to
e664a1e
Compare
Stacked PRs:
transpose_input
from fbgemm configs #2422Add support for Int4GroupwisePreshuffleTensor for fbgemm
Summary:
Note: slice is not working yet, others are working
Test Plan:
python test/dtypes/test_int4_groupwise_preshuffle.py
Reviewers:
Subscribers:
Tasks:
Tags: