Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14295

DannyYuyang-quic · 2025-09-15T01:31:40Z

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer

Summary

add seq_mse_candidates setting to SmolLM3
fixed a bug in the graph drawer

Test plan

DrawGraph Unit test

python -m backends.qualcomm.tests.test_qnn_delegate TestQNNFloatingPointUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL}  -b build-android -a . --executorch_root .
python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL}  -b build-android -a . --executorch_root .

SmolLM3
script in ./examples/qualcomm/oss_scripts/llama/README.md at SmolLM3 part

… and fixed a bug in the graph drawer

pytorch-bot · 2025-09-15T01:31:44Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14295

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 2 New Failures, 1 Cancelled Job

As of commit 11e794c with merge base eec95d0 ():

NEW FAILURES - The following jobs have failed:

pull / unittest / macos / macos-job (gh)
backends/xnnpack/test/passes/test_channels_last_tagged_reshape.py::TestChannelsLastTaggedReshapePass::test_two_conv_add
pull / unittest-editable / macos / macos-job (gh)
backends/xnnpack/test/recipes/test_xnnpack_recipes.py::TestXnnpackRecipes::test_8a4w_recipe

CANCELLED JOB - The following job was cancelled. Please retry:

pull / test-openvino-linux / linux-job (gh)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

DannyYuyang-quic · 2025-09-15T01:36:14Z

Hi @cccclai, this patch adds the default seq_mse_candidates setting following the landing of PR #12700, and also fixes a directory bug in the graph drawer.
Please have a look, thanks!!

DannyYuyang-quic · 2025-09-15T01:37:07Z

@pytorchbot label "release notes: qualcomm"

facebook-github-bot · 2025-09-15T20:20:19Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this in D82478041.

cccclai · 2025-09-15T20:20:42Z

examples/qualcomm/oss_scripts/llama/__init__.py

    ptq = QuantDtype.use_16a4w_block
    group_size = 32
    masked_softmax = True
+    seq_mse_candidates = 0


What does 0 seq_mse_candidates mean

Thanks for asking the question about seq_mse_candidates setting.
Setting seq_mse_candidates = 0 means that SeqMSE is disabled during quantization.
For more details, can refer:
https://github.com/pytorch/executorch/blob/main/examples/qualcomm/oss_scripts/llama/decoder_utils.py#L367-L375

… and fixed a bug in the graph drawer (pytorch#14295) Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer ### Summary - add seq_mse_candidates setting to SmolLM3 - fixed a bug in the graph drawer ### Test plan DrawGraph Unit test ``` bash python -m backends.qualcomm.tests.test_qnn_delegate TestQNNFloatingPointUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL} -b build-android -a . --executorch_root . python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL} -b build-android -a . --executorch_root . ``` SmolLM3 script in `./examples/qualcomm/oss_scripts/llama/README.md` at SmolLM3 part

DannyYuyang-quic · 2025-09-25T02:38:32Z

@pytorchbot cherry-pick --onto release/1.0 -c critical

… and fixed a bug in the graph drawer (#14295) Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer ### Summary - add seq_mse_candidates setting to SmolLM3 - fixed a bug in the graph drawer ### Test plan DrawGraph Unit test ``` bash python -m backends.qualcomm.tests.test_qnn_delegate TestQNNFloatingPointUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL} -b build-android -a . --executorch_root . python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL} -b build-android -a . --executorch_root . ``` SmolLM3 script in `./examples/qualcomm/oss_scripts/llama/README.md` at SmolLM3 part (cherry picked from commit d61dbb9)

pytorchbot · 2025-09-25T02:40:51Z

Cherry picking #14295

The cherry pick PR is at #14571 and it is recommended to link a critical cherry pick PR with an issue. The following tracker issues are updated:

[v1.0.0] Release Tracker #14288 (comment)

Details for Dev Infra team

Raised by workflow job

DannyYuyang-quic · 2025-09-25T03:25:33Z

This PR fixes the bug in the config for smolLM3 and should be included in release/1.0
Without this fix, an error will occur during smolLM3 model compilation.

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3…

11e794c

… and fixed a bug in the graph drawer

DannyYuyang-quic requested a review from cccclai as a code owner September 15, 2025 01:31

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 15, 2025

pytorch-bot bot added the release notes: qualcomm Changes to the Qualcomm backend delegate label Sep 15, 2025

cccclai reviewed Sep 15, 2025

View reviewed changes

cccclai approved these changes Sep 16, 2025

View reviewed changes

cccclai merged commit d61dbb9 into pytorch:main Sep 16, 2025
131 of 137 checks passed

pytorchbot mentioned this pull request Sep 25, 2025

[v1.0.0] Release Tracker #14288

Open

DannyYuyang-quic mentioned this pull request Sep 25, 2025

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14571

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14295

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14295

Uh oh!

DannyYuyang-quic commented Sep 15, 2025

Uh oh!

pytorch-bot bot commented Sep 15, 2025 •

edited

Loading

Uh oh!

DannyYuyang-quic commented Sep 15, 2025

Uh oh!

DannyYuyang-quic commented Sep 15, 2025

Uh oh!

facebook-github-bot commented Sep 15, 2025

Uh oh!

cccclai Sep 15, 2025

Uh oh!

DannyYuyang-quic Sep 16, 2025

Uh oh!

Uh oh!

DannyYuyang-quic commented Sep 25, 2025

Uh oh!

pytorchbot commented Sep 25, 2025

Uh oh!

DannyYuyang-quic commented Sep 25, 2025

Uh oh!

Uh oh!

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14295

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14295

Uh oh!

Conversation

DannyYuyang-quic commented Sep 15, 2025

Summary

Test plan

Uh oh!

pytorch-bot bot commented Sep 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14295

❌ 2 New Failures, 1 Cancelled Job

Uh oh!

DannyYuyang-quic commented Sep 15, 2025

Uh oh!

DannyYuyang-quic commented Sep 15, 2025

Uh oh!

facebook-github-bot commented Sep 15, 2025

Uh oh!

cccclai Sep 15, 2025

Choose a reason for hiding this comment

Uh oh!

DannyYuyang-quic Sep 16, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

DannyYuyang-quic commented Sep 25, 2025

Uh oh!

pytorchbot commented Sep 25, 2025

Cherry picking #14295

Uh oh!

DannyYuyang-quic commented Sep 25, 2025

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 15, 2025 •

edited

Loading