add more options for loading checkpoints #5823
Conversation
This PR adds support for loading QAT_LoRA checkpoints. It does two things:

- Refactors the existing SpinQuant quantization flow into a separate function, which is also used to load QAT checkpoints, since the two share the same format.
- For QAT_LoRA checkpoints, performs one extra step after quantization: it replaces `Int8DynActInt4WeightLinear` layers with `Int8DynActInt4WeightLinearLoRA`, which contains the LoRA adapter. A sketch of this flow follows below.

Differential Revision: [D63714794](https://our.internmc.facebook.com/intern/diff/D63714794/)
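To make the two steps concrete, here is a minimal, illustrative sketch of the flow. It is not the PR's actual code: `quantize_checkpoint`, `LinearWithLoRA`, and the rank/alpha defaults are hypothetical stand-ins (the real code swaps `Int8DynActInt4WeightLinear` for `Int8DynActInt4WeightLinearLoRA`), but the structure mirrors the description above: a shared quantization entry point, plus an extra layer-swap pass for the LoRA case.

```python
import torch
import torch.nn as nn


class LinearWithLoRA(nn.Module):
    """Illustrative stand-in for Int8DynActInt4WeightLinearLoRA: wraps a
    (quantized) linear layer and adds a low-rank adapter on the side."""

    def __init__(self, base: nn.Linear, rank: int = 16, alpha: float = 32.0):
        super().__init__()
        self.base = base  # frozen, already-quantized linear in the real flow
        self.lora_a = nn.Linear(base.in_features, rank, bias=False)
        self.lora_b = nn.Linear(rank, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)  # adapter is a no-op until trained weights load
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scaling * self.lora_b(self.lora_a(x))


def quantize_checkpoint(model: nn.Module, checkpoint: dict) -> nn.Module:
    """Hypothetical shared entry point: the quantization flow factored out
    of the SpinQuant path so QAT checkpoints (same format) can reuse it."""
    # ... apply Int8 dynamic-activation / Int4 weight quantization here ...
    model.load_state_dict(checkpoint, strict=False)
    return model


def _swap_linears(module: nn.Module, rank: int) -> None:
    # Recursively replace each nn.Linear (Int8DynActInt4WeightLinear in the
    # real code) with the LoRA-wrapped variant, preserving module names.
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            setattr(module, name, LinearWithLoRA(child, rank=rank))
        else:
            _swap_linears(child, rank)


def load_qat_lora_checkpoint(model: nn.Module, checkpoint: dict, rank: int = 16) -> nn.Module:
    """QAT_LoRA path: run the shared quantization flow, then swap linears
    for their LoRA-wrapped variants as the extra step."""
    model = quantize_checkpoint(model, checkpoint)
    _swap_linears(model, rank)
    return model
```

Because the swap happens after quantization, the base weights stay in their quantized form and only the small adapter matrices are added on top, which is the point of keeping the two steps separate.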
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5823
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 96e6198 with merge base 152e22d.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This pull request was exported from Phabricator. Differential Revision: D63714794
This pull request has been merged in 9ff3351.