[ET-VK] Removing manual unroll in linear shader to improve overall performance. #15110

trivedivivek · 2025-10-14T15:13:27Z

Summary:

Summary

This diff improves the overall performance of the linear shader by removing manual unrolling in the linear_qcsnw_tiled.glsl file.

The changes include:

Removing the [[unroll]] directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance.
Changing the type of mat1 from VEC4_T[TILE_ROWS] to T[TILE_ROWS][4] to better match the access pattern in the loop.

Differential Revision: D84571616

pytorch-bot · 2025-10-14T15:13:32Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15110

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit b19c6be with merge base 4c4f235 ():

NEW FAILURES - The following jobs have failed:

pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t 467757e0e42b6e9ecf8251a91ea6ec5f55098aee39d9278a608725d0a7a785c2 /exec failed with exit code 1
pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh)
RuntimeError: Command docker exec -t ba037b75e03f99f799f02807d822902d6518fc987621f6645caf7647058d0b3e /exec failed with exit code 1
Test CUDA Builds / export-voxtral-cuda-artifact / linux-job (gh)
RuntimeError: Command docker exec -t fd9613a88a8c381283c084a705a66528f2eadd7b30a540c4fd2338bb6af66118 /exec failed with exit code 2

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2025-10-14T15:13:37Z

@trivedivivek has exported this pull request. If you are a Meta employee, you can view the originating Diff in D84571616.

…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Differential Revision: D84571616

…e. (pytorch#15110) Summary: ### Summary This diff improves the overall performance of the linear shader by removing manual unrolling in the `linear_qcsnw_tiled.glsl` file. The changes include: - Removing the `[[unroll]]` directive in the for loop to allow the compiler to automatically unroll the loop, which can lead to better performance. - Changing the type of `mat1` from `VEC4_T[TILE_ROWS]` to `T[TILE_ROWS][4]` to better match the access pattern in the loop. Reviewed By: SS-JIA Differential Revision: D84571616

trivedivivek requested a review from SS-JIA as a code owner October 14, 2025 15:13

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 14, 2025

meta-codesync bot added fb-exported meta-exported labels Oct 14, 2025

mergennachin changed the title ~~Removing manual unroll in linear shader to improve overall performance.~~ [ET-VK] Removing manual unroll in linear shader to improve overall performance. Oct 15, 2025

trivedivivek force-pushed the export-D84571616 branch from 9c3d279 to c1369bc Compare October 15, 2025 14:32

SS-JIA approved these changes Oct 15, 2025

View reviewed changes

trivedivivek force-pushed the export-D84571616 branch from c1369bc to 280b330 Compare October 15, 2025 15:40

trivedivivek added the release notes: vulkan Changes to the Vulkan backend delegate label Oct 15, 2025

trivedivivek force-pushed the export-D84571616 branch from 280b330 to b19c6be Compare October 15, 2025 17:34

meta-codesync bot merged commit f8e199c into pytorch:main Oct 16, 2025
137 of 141 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[ET-VK] Removing manual unroll in linear shader to improve overall performance. #15110

[ET-VK] Removing manual unroll in linear shader to improve overall performance. #15110

Uh oh!

trivedivivek commented Oct 14, 2025

Uh oh!

pytorch-bot bot commented Oct 14, 2025 •

edited

Loading

Uh oh!

meta-codesync bot commented Oct 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[ET-VK] Removing manual unroll in linear shader to improve overall performance. #15110

[ET-VK] Removing manual unroll in linear shader to improve overall performance. #15110

Uh oh!

Conversation

trivedivivek commented Oct 14, 2025

Summary

Uh oh!

pytorch-bot bot commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15110

❌ 3 New Failures

Uh oh!

meta-codesync bot commented Oct 14, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Oct 14, 2025 •

edited

Loading