Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer #14295
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/14295
Note: Links to docs will display an error until the docs builds have been completed.
❌ 2 New Failures, 1 Cancelled Job
As of commit 11e794c with merge base eec95d0:
NEW FAILURES - The following jobs have failed:
CANCELLED JOB - The following job was cancelled. Please retry:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot label "release notes: qualcomm"
ptq = QuantDtype.use_16a4w_block
group_size = 32
masked_softmax = True
seq_mse_candidates = 0
What does `seq_mse_candidates = 0` mean?
Thanks for asking about the `seq_mse_candidates` setting.
Setting `seq_mse_candidates = 0` means that SeqMSE is disabled during quantization.
For more details, see:
https://github.com/pytorch/executorch/blob/main/examples/qualcomm/oss_scripts/llama/decoder_utils.py#L367-L375
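To illustrate the "0 means disabled" convention from the reply above, here is a minimal sketch. It is not the code in `decoder_utils.py`: the helpers `fake_quantize`, `output_mse`, and `pick_scale` are hypothetical, and the sketch assumes that an MSE-based search tries a number of candidate weight scales and keeps the one with the lowest output error on calibration inputs.

```python
# Illustrative sketch only: hypothetical helpers, not the ExecuTorch implementation.

def fake_quantize(weights, scale, qmax=7):
    """Round weights onto a symmetric signed grid (4-bit by default), then map back to floats."""
    return [max(-qmax - 1, min(qmax, round(w / scale))) * scale for w in weights]


def output_mse(weights, scale, calibration_inputs, qmax=7):
    """Mean squared error between float and fake-quantized dot-product outputs."""
    q_weights = fake_quantize(weights, scale, qmax)
    total = 0.0
    for x in calibration_inputs:
        ref = sum(xi * wi for xi, wi in zip(x, weights))
        out = sum(xi * qi for xi, qi in zip(x, q_weights))
        total += (ref - out) ** 2
    return total / len(calibration_inputs)


def pick_scale(weights, calibration_inputs, seq_mse_candidates, qmax=7):
    """Return a weight scale; the candidate search runs only when seq_mse_candidates > 0."""
    max_abs = max(abs(w) for w in weights)
    base = max_abs / qmax if max_abs > 0 else 1.0  # default min-max scale
    if seq_mse_candidates == 0:
        return base  # search disabled: keep the default scale
    best_scale = base
    best_err = output_mse(weights, base, calibration_inputs, qmax)
    for i in range(1, seq_mse_candidates + 1):
        candidate = base * i / seq_mse_candidates  # shrink the clipping range step by step
        err = output_mse(weights, candidate, calibration_inputs, qmax)
        if err < best_err:
            best_scale, best_err = candidate, err
    return best_scale
```

With `seq_mse_candidates = 0` the function returns the default scale without evaluating any candidate, which is the behaviour the reply describes. A positive value trades extra calibration time for a scale whose output error is no worse than the default.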
… and fixed a bug in the graph drawer (pytorch#14295)

Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer

### Summary
- add seq_mse_candidates setting to SmolLM3
- fixed a bug in the graph drawer

### Test plan
DrawGraph Unit test
``` bash
python -m backends.qualcomm.tests.test_qnn_delegate TestQNNFloatingPointUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL} -b build-android -a . --executorch_root .
python -m backends.qualcomm.tests.test_qnn_delegate TestQNNQuantizedUtils.test_qnn_backend_draw_graph -s ${SERIAL_NUM} -m ${SOC_MODEL} -b build-android -a . --executorch_root .
```
SmolLM3 script in `./examples/qualcomm/oss_scripts/llama/README.md` at SmolLM3 part
@pytorchbot cherry-pick --onto release/1.0 -c critical
… and fixed a bug in the graph drawer (#14295) (cherry picked from commit d61dbb9)
Cherry picking #14295
The cherry pick PR is at #14571 and it is recommended to link a critical cherry pick PR with an issue. The following tracker issues are updated:
Details for Dev Infra team: Raised by workflow job
This PR fixes the bug in the config for SmolLM3 and should be included in release/1.0
Qualcomm AI Engine Direct - add seq_mse_candidates setting to SmolLM3 and fixed a bug in the graph drawer
Summary
- add seq_mse_candidates setting to SmolLM3
- fixed a bug in the graph drawer
Test plan
DrawGraph Unit test
SmolLM3 script in `./examples/qualcomm/oss_scripts/llama/README.md` at SmolLM3 part