
Conversation

@pytorchbot
Collaborator

Summary

We're seeing crashes on Android when running XNNPACK-delegated models. I tracked it down to a bug in the alignment calculation for weight cache memory. The calculation casts the void* to a (signed) intptr_t; when the address falls in the upper half of the address space, the value is negative, so the modulo returns a negative remainder and the pointer is advanced too far, leading to an out-of-bounds access.

// Original alignment code in XNNWeightsCache.cpp (see
// https://github.com/pytorch/executorch/blob/cc6cb837d6ac92f52a2d30a405900caf115f0556/backends/xnnpack/runtime/XNNWeightsCache.cpp#L166-L168).
// intptr_t is signed, so the modulo below can be negative for high addresses.
void* maybe_aligned_space = data_container.data();
void* aligned_space = (void*)((intptr_t)maybe_aligned_space + 64 -
    (intptr_t)maybe_aligned_space % 64);

Walking through the numbers I captured in #14831 (a small standalone snippet reproducing this arithmetic follows the list):

  • The raw (unaligned) address of the data buffer is 0xb40000763d4bfa90.
  • The target alignment is 64 bytes.
  • Casting the address to intptr_t gives -5476376639047992688.
    • Mod 64 is -48.
    • The total offset applied is 64 - (-48) = 112.
  • Since the allocation size is N + 64, increasing the start by 112 means the new region extends 48 bytes past the end of the allocation.
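
For illustration only, here is a standalone snippet (not part of the patch) that reproduces this arithmetic, assuming a 64-bit target where intptr_t is 64 bits wide:

#include <cstdint>
#include <cstdio>

int main() {
  // Address observed in #14831, treated as an integer (uint64_t/int64_t stand
  // in for uintptr_t/intptr_t on a 64-bit target).
  std::uint64_t raw = 0xb40000763d4bfa90ull;

  // Signed interpretation (what the buggy code does): the two's-complement
  // value is negative, and C++ integer division truncates toward zero, so
  // the remainder of % 64 is negative too.
  std::int64_t as_signed = static_cast<std::int64_t>(raw);
  std::printf("signed   %% 64 = %lld, offset = %lld\n",
              static_cast<long long>(as_signed % 64),
              static_cast<long long>(64 - as_signed % 64));
  // -> signed   % 64 = -48, offset = 112 (48 bytes past the +64 slack)

  // Unsigned interpretation: % 64 is the actual misalignment.
  std::printf("unsigned %% 64 = %llu, offset = %llu\n",
              static_cast<unsigned long long>(raw % 64),
              static_cast<unsigned long long>(64 - raw % 64));
  // -> unsigned % 64 = 16, offset = 48 (stays inside the +64 slack)
  return 0;
}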

To resolve this, I replaced the alignment code with a call to std::align. Casting to uintptr_t also resolves it, but using the standard library implementation seems less error-prone.
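
For reference, a minimal sketch of the std::align-based approach (illustrative names, not the exact code that landed in XNNWeightsCache.cpp), assuming the buffer is allocated with 64 bytes of slack as described above:

#include <cstddef>
#include <memory>  // std::align

// Illustrative sketch: return a 64-byte-aligned pointer into a buffer that
// holds n + 64 bytes, or nullptr if alignment fails (not expected here, since
// at most 63 bytes of adjustment are ever needed).
void* align64(void* maybe_aligned_space, std::size_t n) {
  void* aligned_space = maybe_aligned_space;
  std::size_t space = n + 64;  // total bytes available in the container
  // std::align bumps aligned_space forward to the next 64-byte boundary and
  // shrinks space accordingly; it never interprets the address as signed.
  return std::align(/*alignment=*/64, /*size=*/n, aligned_space, space);
}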

Test plan

I've validated that the repro in #14831 does not crash with this change.


(cherry picked from commit 7421646)

pytorch-bot bot commented Oct 14, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15090

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

⏳ No Failures, 4 Pending

As of commit 7b1103a with merge base e0dda90:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla bot added the CLA Signed label Oct 14, 2025
@mergennachin mergennachin self-requested a review October 14, 2025 02:27
@GregoryComer GregoryComer merged commit 2897bde into release/1.0 Oct 14, 2025
121 of 124 checks passed
@GregoryComer GregoryComer deleted the cherry-pick-15039-by-pytorch_bot_bot_ branch October 14, 2025 03:57