Support prequant qwen3 #10839
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10839
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: There is 1 currently active SEV. If your PR is affected, please view it below.
❌ 3 New Failures: As of commit 880c99b with merge base d7201ab, the following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
This PR needs a
Have you tested the regular Qwen3 export?
converted_state_dict["output.weight"] = converted_state_dict[
    "tok_embeddings.weight"
]
# If lm_head.weight is not present, assume tied embeddings (e.g., 0.6b and 4b models)
Suggested change:
- # If lm_head.weight is not present, assume tied embeddings (e.g., 0.6b and 4b models)
+ # If lm_head.weight is not present, assume tied embeddings (0.6b, 1.7b, and 4b models)
"tok_embeddings.weight" | ||
] | ||
# If lm_head.weight is not present, assume tied embeddings (e.g., 0.6b and 4b models) | ||
if "lm_head.weight" not in state_dict: |
lm_head is present in the HF checkpoints even if the embeddings are tied; it will just be the same weights as tok_embeddings.
Hmmm, are you sure it's there? I thought when config.tie_word_embeddings = true, it might not be there, but gets materialized during a tie_weights() call on the HF model.
In any case, if it is there, it's covered by the regular loop through keys and this logic is not executed. If it's not there, this sets lm_head's weight to the embeddings.
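For readers following along, a minimal sketch of the fallback being described (dict names come from the diff above; the placeholder values are made up, and the real convert script does more):

# Illustrative sketch of the tied-embedding fallback (not the full convert script).
state_dict = {"model.embed_tokens.weight": "emb"}        # HF checkpoint keys; no "lm_head.weight"
converted_state_dict = {"tok_embeddings.weight": "emb"}  # keys already converted to ExecuTorch names

# If the HF checkpoint has no lm_head.weight (tied embeddings), reuse the
# token-embedding weights for the output projection.
if "lm_head.weight" not in state_dict:
    converted_state_dict["output.weight"] = converted_state_dict[
        "tok_embeddings.weight"
    ]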
Yeah it's here, https://huggingface.co/Qwen/Qwen3-0.6B/tree/main?show_file_info=model.safetensors. Also I remember seeing it while debugging the checkpoint. But sure, in that case could you reword the comment?
Reworded the comment a little.
I'm not convinced by https://huggingface.co/Qwen/Qwen3-0.6B/tree/main?show_file_info=model.safetensors because it's just metadata, and doesn't prove anything about what is stored in the file.
It does not look like lm_head is present in the safetensors when I unpack them locally. Perhaps you looked at the checkpoint after running your script (which copied the embedding tensors into lm_head)?
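For anyone who wants to reproduce that local check, a small sketch using the safetensors package (the file path is an assumption based on a locally downloaded Qwen3-0.6B repo):

# Check whether lm_head.weight is actually stored in the downloaded checkpoint.
from safetensors import safe_open

# Assumed path to the downloaded model.safetensors file.
with safe_open("Qwen3-0.6B/model.safetensors", framework="pt") as f:
    print("lm_head.weight" in f.keys())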
Ok, if you double checked then that's fine!
)
parser.add_argument(
-     "input_dir",
+     "input_dir_or_checkpoint",
Not a big fan of specifying something that could be either a dir or a file. If pytorch_model.bin is the filename for the quantized checkpoint going forward, I'd rather just specify the checkpoint and search for that.
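A minimal sketch of a directory-based interface along those lines, assuming pytorch_model.bin is the checkpoint filename (argument names are illustrative, not the final code):

import argparse
import os

parser = argparse.ArgumentParser()
# Accept the checkpoint directory and look for the known filename inside it.
parser.add_argument("input_dir", help="Directory containing pytorch_model.bin")
args = parser.parse_args()

checkpoint_path = os.path.join(args.input_dir, "pytorch_model.bin")
if not os.path.isfile(checkpoint_path):
    raise FileNotFoundError(f"No pytorch_model.bin found in {args.input_dir}")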
How do you usually download the directories from HF?
e.g. huggingface-cli download ibm-granite/granite-3b-code-instruct-128k gets you the entire directory.
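If you prefer doing it from Python, a rough equivalent is huggingface_hub.snapshot_download (repo id copied from the example above):

# Download the whole HF repo (config, tokenizer, weight shards) into the local
# cache and return the directory path, similar to `huggingface-cli download`.
from huggingface_hub import snapshot_download

local_dir = snapshot_download("ibm-granite/granite-3b-code-instruct-128k")
print(local_dir)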
I changed to use directories.
I can test if there is no CI for it.
Yeah, if you test it and it works, feel free to merge pending comments.
Checked that the convert script works with the original Qwen3 on HF.
As titled