
Conversation

jerryzh168
Contributor

Fixes for the int4 quantizers (GPTQ and non-GPTQ) and int8 weight-only quantization

HDCharles and others added 3 commits April 3, 2024 22:09
Summary: int4weightlinear had a bug that caused it not to pad when it should have.

Test Plan: python test/quantization/test_quant_api.py -k "int4wo"
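
To make the padding condition concrete, here is a minimal sketch of the intended behavior, assuming alignment constraints like those used by the int4 kernels; `maybe_pad_weight`, the default `groupsize`/`inner_k_tiles` values, and the exact alignment check are illustrative, not the literal torchao code.

```python
import torch
import torch.nn.functional as F

def find_multiple(n: int, k: int) -> int:
    # Smallest multiple of k that is >= n.
    return n if n % k == 0 else n + k - (n % k)

def maybe_pad_weight(weight: torch.Tensor,
                     groupsize: int = 128,
                     inner_k_tiles: int = 8) -> torch.Tensor:
    # Hypothetical helper: pad in_features (the last dim) only when the
    # int4 kernel's alignment constraints are not already satisfied.
    in_features = weight.shape[-1]
    if in_features % groupsize == 0 and in_features % (inner_k_tiles * 16) == 0:
        return weight  # already aligned, no padding needed
    padded = find_multiple(in_features, 1024)
    return F.pad(weight, (0, padded - in_features))
```

The bug fixed here was the opposite failure mode: the early-return path was taken even when the shape was misaligned, so the kernel received an unpadded weight.
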
* fixing bug in GPTQ

Summary: the shape was always padded, even when padding was not needed.

Test Plan: python test/quantization/test_quant_api.py -k "test_gptq_quantizer_int4wo"
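
This is the mirror image of the previous fix: the GPTQ path padded unconditionally instead of checking first. A sketch of the fixed decision, under the same illustrative assumptions as above:

```python
def find_multiple(n: int, k: int) -> int:
    # Same helper as in the sketch above: smallest multiple of k >= n.
    return n if n % k == 0 else n + k - (n % k)

def padded_in_features(in_features: int,
                       groupsize: int = 128,
                       inner_k_tiles: int = 8) -> int:
    # Hypothetical helper. Before the fix, the GPTQ path effectively did
    #     return find_multiple(in_features, 1024)   # pads unconditionally
    # The fix pads only when the alignment check actually fails.
    if in_features % groupsize != 0 or in_features % (inner_k_tiles * 16) != 0:
        return find_multiple(in_features, 1024)
    return in_features
```
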

* removing extra spaces

Summary: registering fields as buffers so they get picked up by `model.to`.

Test Plan: python test/quantization/test_quant_api.py -k test_int8_wo_quant_save_load
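
For context, registering the quantization tensors as buffers (rather than plain attributes) is what makes `model.to(...)` and `state_dict()` see them, which is exactly what the save/load test exercises. A minimal sketch; the class name and tensor layout are illustrative, not the exact torchao module:

```python
import torch
import torch.nn.functional as F

class Int8WeightOnlyLinear(torch.nn.Module):  # hypothetical name
    def __init__(self, int_weight: torch.Tensor, scales: torch.Tensor):
        super().__init__()
        # Buffers move with model.to(device) and are saved/loaded via
        # state_dict; plain attributes (self.int_weight = int_weight)
        # would be silently left behind on device/dtype moves and
        # dropped from checkpoints.
        self.register_buffer("int_weight", int_weight)  # int8, (out, in)
        self.register_buffer("scales", scales)          # per-row, (out, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Weight-only quantization: dequantize the weight, leave activations as-is.
        return F.linear(x, self.int_weight.to(x.dtype) * self.scales)

lin = Int8WeightOnlyLinear(
    torch.randint(-128, 127, (4, 8), dtype=torch.int8),
    torch.rand(4, 1),
)
print(list(lin.state_dict().keys()))  # ['int_weight', 'scales']
lin.to(torch.float16)  # scales cast to fp16; the int8 weight buffer stays int8
```
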
jerryzh168 merged commit e25c79a into release/v0.1 on Apr 4, 2024
yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request on Dec 9, 2024:
Add information about params-path to README, update spelling of torchat