Fix access to unitialized memory in VSX vector functions #89833

Flamefire · 2022-11-29T09:44:13Z

This results in e.g. failures in TestNNDeviceTypeCPU.test_groupnorm_nhwc_cpu_float32

So simply initialize the stack array with zeroes as expected and done in other implementations

Fixes #32502

cc @VitalyFedyunin @jgong5 @mingfeima @XiaobingSuper @sanchitintel @ashokei @jingxu10

pytorch-bot · 2022-11-29T09:44:16Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89833

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 215ba85:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ezyang

a test would be nice

ezyang · 2022-12-01T18:31:06Z

@pytorchbot merge

pytorchmergebot · 2022-12-01T18:32:44Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2022-12-01T19:43:27Z

Merge failed

Reason: 2 additional jobs have failed, first few of them are: trunk ,trunk / cuda11.6-py3.10-gcc7-sm86 / test (default, 4, 4, linux.g5.4xlarge.nvidia.gpu)

Details for Dev Infra team

Raised by workflow job

Flamefire · 2022-12-02T08:43:39Z

a test would be nice

I'd normally agree but you cannot test for undefined behavior although UBSAN or valgrind may catch this if run on PPC.

It may be possible to add tests for loadu for all datatypes with different load sizes and assert the "not loaded" parts are zero but again they may succeed if the "stack garbage" happens to contain the correct values. I'm also not sure where to add such a test, so I'll leave this as an idea for you.

Also not sure why the test failed:

RuntimeError: [enforce fail at alloc_cpu.cpp:83] err == 0. DefaultCPUAllocator: can't allocate memory: you tried to allocate 9999800000 bytes. Error code 12 (Cannot allocate memory)

That's surely unrelated to this PR.

ezyang · 2022-12-02T15:09:57Z

@pytorchbot rebase

ezyang · 2022-12-02T15:11:03Z

I think when we fixed it in non VSX, we relied on UBSAN to tell us about it. But I guess we don't have UBSAN setup on this platform, so meh

pytorchmergebot · 2022-12-02T15:12:11Z

@pytorchbot successfully started a rebase job. Check the current status here

This results in e.g. failures in TestNNDeviceTypeCPU.test_groupnorm_nhwc_cpu_float32 Fixes pytorch#32502

pytorchmergebot · 2022-12-02T15:12:17Z

Successfully rebased vsx-loadu onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout vsx-loadu && git pull --rebase)

ezyang · 2022-12-02T15:26:07Z

@pytorchbot merge

pytorchmergebot · 2022-12-02T15:28:06Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

This results in e.g. failures in TestNNDeviceTypeCPU.test_groupnorm_nhwc_cpu_float32 So simply initialize the stack array with zeroes as expected and done in other implementations Fixes pytorch#32502 Pull Request resolved: pytorch#89833 Approved by: https://github.com/ezyang

…d values Similar to pytorch#89833 those function may access uninitialized memory leading to undefined behavior/results. Initialize with zeros as done before.

…d values (#122399) Similar to #89833 those function may access uninitialized memory leading to undefined behavior/results. Initialize with zeros as done before. Pull Request resolved: #122399 Approved by: https://github.com/ezyang

github-actions bot added the module: cpu CPU specific problem (e.g., perf, algorithm) label Nov 29, 2022

pytorchbot added the open source label Nov 29, 2022

drisspg requested review from ezyang, colesbury and albanD December 1, 2022 17:44

drisspg added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Dec 1, 2022

ezyang approved these changes Dec 1, 2022

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Dec 1, 2022

ezyang added the topic: bug fixes topic category label Dec 1, 2022

Fix access to unitialized memory in VSX vector functions

215ba85

This results in e.g. failures in TestNNDeviceTypeCPU.test_groupnorm_nhwc_cpu_float32 Fixes pytorch#32502

pytorchmergebot force-pushed the vsx-loadu branch from 6f314bc to 215ba85 Compare December 2, 2022 15:12

pytorchmergebot added the Merged label Dec 2, 2022

pytorchmergebot closed this in 538f627 Dec 2, 2022

Flamefire deleted the vsx-loadu branch December 2, 2022 22:06

Flamefire mentioned this pull request Sep 18, 2023

Fix access to unitialized memory in VSX vector functions for quantized values #109487

Closed

Flamefire mentioned this pull request Mar 21, 2024

Fix access to unitialized memory in VSX vector functions for quantized values #122399

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix access to unitialized memory in VSX vector functions #89833

Fix access to unitialized memory in VSX vector functions #89833

Flamefire commented Nov 29, 2022 •

edited by pytorch-bot bot

pytorch-bot bot commented Nov 29, 2022 •

edited

ezyang left a comment

ezyang commented Dec 1, 2022

pytorchmergebot commented Dec 1, 2022

pytorchmergebot commented Dec 1, 2022

Flamefire commented Dec 2, 2022

ezyang commented Dec 2, 2022

ezyang commented Dec 2, 2022

pytorchmergebot commented Dec 2, 2022

pytorchmergebot commented Dec 2, 2022

ezyang commented Dec 2, 2022

pytorchmergebot commented Dec 2, 2022

Fix access to unitialized memory in VSX vector functions #89833

Fix access to unitialized memory in VSX vector functions #89833

Conversation

Flamefire commented Nov 29, 2022 • edited by pytorch-bot bot

pytorch-bot bot commented Nov 29, 2022 • edited

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/89833

✅ No Failures

ezyang left a comment

Choose a reason for hiding this comment

ezyang commented Dec 1, 2022

pytorchmergebot commented Dec 1, 2022

Merge started

pytorchmergebot commented Dec 1, 2022

Merge failed

Flamefire commented Dec 2, 2022

ezyang commented Dec 2, 2022

ezyang commented Dec 2, 2022

pytorchmergebot commented Dec 2, 2022

pytorchmergebot commented Dec 2, 2022

ezyang commented Dec 2, 2022

pytorchmergebot commented Dec 2, 2022

Merge started

Flamefire commented Nov 29, 2022 •

edited by pytorch-bot bot

pytorch-bot bot commented Nov 29, 2022 •

edited