
[AOTInductor] Add config to allow buffer mutation #126584

Closed
wants to merge 1 commit

Conversation

@muchulee8 (Contributor) commented May 17, 2024

Summary:
Add an additional config to allow buffer mutation.
For data greater than 2 GB, we need to set it as read-only; otherwise, a relocation overflow occurs during linking.
This is a temporary solution, since it does not handle cases that require mutable data greater than 2 GB.

Test Plan: Included in commit.

Differential Revision: D57514729

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @ColinPeppler @amjames @desertfire @chauhang
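
A minimal usage sketch of the new flag (hedged: the config path follows the diff further down; the toy module and the compile entry point are illustrative assumptions, not part of this PR):

```python
import torch
import torch._inductor.config as inductor_config

# Assumption: the new flag lives under aot_inductor, as in the diff in this PR.
# False (the default) keeps constants read-only, which the >2 GB case requires;
# True removes the read-only property so buffers can be mutated at runtime.
inductor_config.aot_inductor.allow_buffer_mutation = True


class Counter(torch.nn.Module):
    """Toy module whose forward() mutates a registered buffer in place."""

    def __init__(self) -> None:
        super().__init__()
        self.register_buffer("step", torch.zeros(1))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        self.step.add_(1)  # in-place buffer mutation
        return x + self.step


# The model would then be compiled through the usual AOTInductor flow
# (e.g. torch._inductor.aot_compile); the exact entry point is an assumption
# and varies across versions.
```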

pytorch-bot (bot) commented May 17, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/126584

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (3 Unrelated Failures)

As of commit ec70e96 with merge base 5fb11cd:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot (Contributor) commented:
This pull request was exported from Phabricator. Differential Revision: D57514729

muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 17, 2024
muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 18, 2024
muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 19, 2024
muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 19, 2024
muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 19, 2024
@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label May 19, 2024
@@ -1924,12 +1924,13 @@ def _compile_consts_linux(consts: bytes) -> str:
run_command_and_check(cmd)
log.debug("aot constant binary command: %s", cmd)

# .data section is between .text and .bss. When the size of .data is large,
# during the linking, the relocation of .text against .bss may overflow.
# Rename it to .ldata so that it won't be in between the .text and .bss section
A reviewer (Contributor) commented on this hunk:
nit - we may still need this comment?

muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 19, 2024
muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 19, 2024
muchulee8 added a commit to muchulee8/pytorch that referenced this pull request May 20, 2024
@@ -744,6 +744,9 @@ class aot_inductor:
# rather than embedded into the data section. Needed to support 1B+ parameter models
force_mmap_weights: bool = False

# flag to allow buffer mutation. This would remove the read-only property from buffers.
allow_buffer_mutation: bool = False
A reviewer (Contributor) commented on this hunk:
Can we do like "0" if is_fbcode() else "1"? (search for other examples in this file)
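
(A hedged sketch of the pattern the reviewer is pointing to; the environment-variable name is hypothetical, and is_fbcode stands in for the helper already used elsewhere in config.py:)

```python
import os


def is_fbcode() -> bool:
    # Stand-in for the existing helper in torch/_inductor/config.py.
    return False


# Hypothetical env-var name, purely to illustrate the '"0" if is_fbcode() else "1"'
# default pattern used by other flags in that file.
allow_buffer_mutation: bool = (
    os.environ.get("AOTINDUCTOR_ALLOW_BUFFER_MUTATION", "0" if is_fbcode() else "1")
    == "1"
)
```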

@muchulee8 (Contributor, Author) replied on May 20, 2024:
I don't think this is the right approach; it's not only about fbcode.
If you look at the log and the earlier comment from the Intel folks, they moved from .data to .ldata because it overflows into .bss. The overflow is already happening in OSS cases; the solution works for them only because their constants fit within .ldata. If even larger constants come in, we will see more failures.
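
(For context, a hedged sketch of the .data → .ldata rename being discussed; the exact objcopy invocation in _compile_consts_linux is an assumption here, not quoted from codecache.py:)

```python
import subprocess


def rename_consts_data_section(consts_o: str) -> None:
    # Move the constants out of .data and into .ldata so they no longer sit
    # between .text and .bss, avoiding the relocation overflow during linking.
    # Flags are an assumption; the real command may differ.
    subprocess.run(
        ["objcopy", "--rename-section", ".data=.ldata", consts_o],
        check=True,
    )
```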

The reviewer (Contributor) responded:
I agree that diverging the behavior is not a good idea. How about this: instead of adding a config here, Inductor can detect whether the model performs buffer mutation (https://github.com/pytorch/pytorch/blob/655038687afd19a4a4c9371b77ff046fd6c84be1/torch/_inductor/lowering.py#L5078C25-L5078C41), then decide which section to use, and add a const_size check that fails explicitly when the size is over the limit. We will work on more comprehensive support in the future, but this will at least give the user a clear error message.
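
(A hedged sketch of that suggestion; mutated_buffers, choose_consts_section, and the 2 GB constant are illustrative assumptions, not the actual Inductor internals:)

```python
MAX_MUTABLE_CONST_SIZE = 2 * 1024**3  # the 2 GB relocation limit discussed above


def choose_consts_section(graph, consts_size: int) -> str:
    # Hypothetical detection of buffer mutation on the lowered graph.
    has_buffer_mutation = bool(getattr(graph, "mutated_buffers", ()))
    if not has_buffer_mutation:
        # Read-only constants can live in a large section outside the
        # .text/.bss relocation range.
        return ".ldata"
    if consts_size > MAX_MUTABLE_CONST_SIZE:
        raise RuntimeError(
            "AOTInductor: buffer mutation currently requires constants "
            f"<= {MAX_MUTABLE_CONST_SIZE} bytes, got {consts_size}."
        )
    # Mutable constants must stay in a writable .data section.
    return ".data"
```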

muchulee8 added a commit that referenced this pull request May 20, 2024
muchulee8 added a commit that referenced this pull request May 20, 2024
ghstack-source-id: d311ef3d306a0d1fb912f5c67bcbfdd0ea5aef4b
Pull Request resolved: #126667
@facebook-github-bot (Contributor) commented:
@pytorchbot merge -f 'Landed internally'

(Initiating merge automatically since Phabricator Diff has merged, using force because this PR might not pass merge_rules.json but landed internally)

@pytorchmergebot (Collaborator) commented:

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f only as a last resort; consider -i/--ignore-current instead to continue the merge while ignoring current failures, which allows currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.
