Skip to content
This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Conversation

@Jack-Khuu
Copy link
Contributor

@Jack-Khuu Jack-Khuu commented Aug 26, 2024

Part of the overall refactoring work to make the repo more accessible, this PR moves quant related files into their respective call sites, flattening the qops file.

Specifically qops.py =>

  • gguf_loader.py (int4)
  • quantize.py (int8 + Embedding)

A separate PR will move this file to torchchat/torchchat/utils (Not done in this PR to avoid merge conflicts)

This results in the quantize.py file growing in size, but this file itself will be slimmed via migration to torchao apis

@pytorch-bot
Copy link

pytorch-bot bot commented Aug 26, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1062

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 611b449 with merge base d0d1105 (image):
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 26, 2024
@Jack-Khuu Jack-Khuu changed the title [WIP][Hackability Refactor] Refactor quant code [WIP][Hackability Refactor] Refactor quant code: Clean up + Move under torchchat/torchchat Aug 26, 2024
@Jack-Khuu Jack-Khuu changed the title [WIP][Hackability Refactor] Refactor quant code: Clean up + Move under torchchat/torchchat [WIP][Hackability Refactor] Collapse quant_ops into calling sites Aug 26, 2024
@Jack-Khuu Jack-Khuu changed the title [WIP][Hackability Refactor] Collapse quant_ops into calling sites [Hackability Refactor] Collapse quant_ops into calling sites Aug 26, 2024
@Jack-Khuu Jack-Khuu marked this pull request as ready for review August 26, 2024 17:27
Copy link
Contributor

@jerryzh168 jerryzh168 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lg, please resolve merge conflict

@Jack-Khuu Jack-Khuu merged commit 9a276c3 into main Aug 28, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants