[Hackability Refactor] Collapse quant_ops into calling sites #1062

Jack-Khuu · 2024-08-26T16:02:18Z

Part of the overall refactoring work to make the repo more accessible, this PR moves quant related files into their respective call sites, flattening the qops file.

Specifically qops.py =>

gguf_loader.py (int4)
quantize.py (int8 + Embedding)

A separate PR will move this file to torchchat/torchchat/utils (Not done in this PR to avoid merge conflicts)

This results in the quantize.py file growing in size, but this file itself will be slimmed via migration to torchao apis

pytorch-bot · 2024-08-26T16:02:21Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1062

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 611b449 with merge base d0d1105 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jerryzh168

lg, please resolve merge conflict

Remove unused quant code

d068731

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 26, 2024

Jack-Khuu changed the title ~~[WIP][Hackability Refactor] Refactor quant code~~ [WIP][Hackability Refactor] Refactor quant code: Clean up + Move under torchchat/torchchat Aug 26, 2024

Jack-Khuu added 3 commits August 26, 2024 09:40

Decompose qops to call sites

0b961fb

Fix import call in gguf_loader

59ad3d4

Lint gguf imports

7cea58a

Jack-Khuu changed the title ~~[WIP][Hackability Refactor] Refactor quant code: Clean up + Move under torchchat/torchchat~~ [WIP][Hackability Refactor] Collapse quant_ops into calling sites Aug 26, 2024

Jack-Khuu changed the title ~~[WIP][Hackability Refactor] Collapse quant_ops into calling sites~~ [Hackability Refactor] Collapse quant_ops into calling sites Aug 26, 2024

Jack-Khuu marked this pull request as ready for review August 26, 2024 17:27

Jack-Khuu requested review from Gasoonjia, byjlw, jerryzh168, larryliu0820, metascroy and vmpuri August 26, 2024 17:27

jerryzh168 approved these changes Aug 27, 2024

View reviewed changes

Merge branch 'main' into quant_refactor

611b449

Jack-Khuu merged commit 9a276c3 into main Aug 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Hackability Refactor] Collapse quant_ops into calling sites #1062

[Hackability Refactor] Collapse quant_ops into calling sites #1062

Uh oh!

Jack-Khuu commented Aug 26, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Aug 26, 2024 •

edited

Loading

Uh oh!

jerryzh168 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Hackability Refactor] Collapse quant_ops into calling sites #1062

[Hackability Refactor] Collapse quant_ops into calling sites #1062

Uh oh!

Conversation

Jack-Khuu commented Aug 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1062

✅ No Failures

Uh oh!

jerryzh168 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Jack-Khuu commented Aug 26, 2024 •

edited

Loading

pytorch-bot bot commented Aug 26, 2024 •

edited

Loading