[ez][ET-VK][glsl-codegen] Use mediump precision for half-precision shader variants by SS-JIA · Pull Request #19287 · pytorch/executorch

SS-JIA · 2026-05-04T23:51:04Z

Stack from ghstack (oldest at bottom):

-> [ez][ET-VK][glsl-codegen] Use mediump precision for half-precision shader variants #19287

The highp default in gen_vulkan_spv.py blocks Mali GPUs from using FP16 ALU because Mali respects the highp precision contract literally. Adreno silently demotes via Qualcomm's relaxed-precision pass, so it was unaffected, but Mali-G715 was running half-precision shaders at FP32 throughput. The mediump qualifier produces SPIR-V with RelaxedPrecision decorations, which Mali's compiler uses to enable f16 packed math. Note: this is a partial fix — texture-storage shaders still declare local vec4 working values, so the speedup is bounded; the follow-up is to make texel_type("half") return f16vec4 and enable the FP16 extension on the texture path.
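To illustrate the idea, here is a minimal sketch of how a shader generator could choose the precision qualifier per dtype variant. It is not the code in gen_vulkan_spv.py: the function names, the dtype strings other than "half", and the way the qualifier is emitted are assumptions made for illustration, and the `PRECISION` macro name is borrowed from the title of the related PR #19292.

```python
# Hypothetical sketch only: these helpers are illustrative and are not
# taken from gen_vulkan_spv.py.

# Assumed mapping from a shader variant's dtype to a GLSL precision qualifier.
# Only the "half" -> "mediump" choice comes from the PR description; every
# other dtype keeps the existing highp default.
PRECISION_BY_DTYPE = {
    "half": "mediump",
}

DEFAULT_PRECISION = "highp"


def precision_for(dtype: str) -> str:
    """Return the precision qualifier to use for a shader variant."""
    return PRECISION_BY_DTYPE.get(dtype, DEFAULT_PRECISION)


def precision_define(dtype: str) -> str:
    """Build a preprocessor line that a GLSL template could consume.

    The PRECISION macro name is an assumption based on the title of PR #19292.
    """
    return f"#define PRECISION {precision_for(dtype)}"


if __name__ == "__main__":
    for dtype in ("half", "float"):
        print(dtype, "->", precision_define(dtype))
```

Keeping the choice in a single lookup means only the half variants change precision, while float variants stay on the highp path.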

Differential Revision: D103759541


pytorch-bot · 2026-05-04T23:51:07Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19287

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

Ubuntu services are down

❌ 1 New Failure, 1 Cancelled Job, 2 Unrelated Failures

As of commit bd5dd01 with merge base a6ee309:

NEW FAILURE - The following job has failed:

Test ARM Backend / test-arm / test-backend-linux (arm_tosa_fp, models) / linux-job (gh)
RuntimeError: Command docker exec -t 2651b0e0740480142dfa9aec35f527d49dea8c5e17cc44c12e625176209fa512 /exec failed with exit code 92

CANCELLED JOB - The following job was cancelled. Please retry:

periodic (gh)

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-05-04T23:51:56Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with `release notes:`. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

meta-cla Bot added the CLA Signed label May 4, 2026

meta-codesync Bot added the fb-exported and meta-exported labels May 4, 2026

jgibson2 mentioned this pull request May 5, 2026

Vulkan: make half-variant GLSL PRECISION configurable #19292

Closed

4 tasks

SS-JIA wants to merge 1 commit into gh/SS-JIA/525/base from gh/SS-JIA/525/head
