[dynamo][guards] Use DATA_PTR instead of ID_MATCH for tensors #123302

anijain2305 · 2024-04-03T22:40:12Z

Stack from ghstack (oldest at bottom):

We should use ID_MATCH guards sparingly. When it comes to performance, ID_MATCH is much faster than DATA_PTR_MATCH for Python guards. However, the difference is very small in C++, so it's worth just using DATA_PTR_MATCH.
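To make the difference between the two guard kinds concrete, here is a minimal pure-Python sketch. It is a toy model, not Dynamo's guard implementation: `ToyTensor`, `make_id_guard`, and `make_data_ptr_guard` are hypothetical names invented for illustration, and the type check inside the data-pointer guard is an assumption modelled on the `TYPE_CHECK` mentioned in the forward fix. On a real tensor, `torch.Tensor.data_ptr()` plays the role of `ToyTensor.data_ptr()`.

```python
class ToyTensor:
    """Hypothetical stand-in for a tensor: a Python wrapper around a buffer."""

    def __init__(self, storage):
        self._storage = storage  # underlying buffer (a bytearray in this sketch)

    def data_ptr(self):
        # Stand-in for torch.Tensor.data_ptr(): an integer identifying the buffer.
        return id(self._storage)

    def alias(self):
        # A new Python object that wraps the same underlying buffer.
        return ToyTensor(self._storage)


def make_id_guard(obj):
    # ID_MATCH-style check: passes only for the exact same Python object.
    # The guarded object must stay alive, otherwise its id could be reused.
    expected_id = id(obj)
    return lambda candidate: id(candidate) == expected_id


def make_data_ptr_guard(obj):
    # DATA_PTR_MATCH-style check (assumed shape): same type and same buffer address.
    expected_type = type(obj)
    expected_ptr = obj.data_ptr()
    return lambda candidate: (
        type(candidate) is expected_type and candidate.data_ptr() == expected_ptr
    )


t = ToyTensor(bytearray(16))
same_storage = t.alias()           # different wrapper object, same buffer
other = ToyTensor(bytearray(16))   # different wrapper object, different buffer

id_guard = make_id_guard(t)
ptr_guard = make_data_ptr_guard(t)

for name, candidate in [("t", t), ("same_storage", same_storage), ("other", other)]:
    print(name, "id_guard:", id_guard(candidate), "ptr_guard:", ptr_guard(candidate))
```

In this model the identity guard rejects `same_storage` while the data-pointer guard accepts it, which is the sense in which the data-pointer check is the less restrictive of the two.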

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @chenyang78 @kadeng @chauhang

[ghstack-poisoned]

pytorch-bot · 2024-04-03T22:40:16Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/123302

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 5601f90 with merge base 4732375:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

inductor / rocm6.0-py3.8-inductor / test (inductor, 1, 1, linux.rocm.gpu.2) (gh)
test/distributed/_composable/fsdp/test_fully_shard_training.py::TestFullyShard1DTrainingCore::test_train_parity_multi_group_compile

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 7dabbfd
Pull Request resolved: #123302

Not needed since #122858 has landed

Pull Request resolved: #123303
Approved by: https://github.com/mlazos
ghstack dependencies: #123285, #123302

…cessor (#123396)

Speeds up the guard-overhead microbenchmark by around 10% normalized to main-branch CPP guards

~~~
import torch

@torch.compile(backend="eager")
def fn(x, lst):
    # Each list element is accessed separately, so guard overhead grows with n.
    for l in lst:
        x = x + l
    return x

n = 1000
lst = [i for i in range(n)]
x = torch.randn(4)
print(fn(x, lst))
print("Success")
~~~

Pull Request resolved: #123396
Approved by: https://github.com/jansel
ghstack dependencies: #123285, #123302, #123303

[ghstack-poisoned]

ghstack-source-id: 61b1d7f
Pull Request resolved: #123485

anijain2305 · 2024-04-05T23:00:02Z

Note for oncall @shunting314: if you end up bisecting a regression to this PR, it is forward-fixed in #123485.

For some reason, adding a `TYPE_CHECK` to the DATA_PTR_MATCH guard in #123302 increases optimizer guard overhead for `MT5ForConditionalGeneration` by 10x. There is nothing special about MT5. Since we are going to move to the CPP guards soon, there is no reason to investigate this further. We can use `ID_MATCH` instead of `DATA_PTR_MATCH`. Today neither can be serialized, so there is no reason to prefer one over the other.

Pull Request resolved: #123485
Approved by: https://github.com/mlazos

[dynamo][guards] Use DATA_PTR instead of ID_MATCH for tensors

5601f90

[ghstack-poisoned]

anijain2305 mentioned this pull request Apr 3, 2024

[dynamo][optimizer][guard-overhead] NOT_NONE guard for param.grad instead of TENSOR_MATCH #123285

Closed

pytorch-bot bot added ciflow/inductor module: dynamo labels Apr 3, 2024

anijain2305 added a commit that referenced this pull request Apr 3, 2024

[dynamo][guards] Use DATA_PTR instead of ID_MATCH for tensors

d410f1b

ghstack-source-id: 7dabbfd
Pull Request resolved: #123302

anijain2305 mentioned this pull request Apr 3, 2024

[dynamo][guards] Remove workaround after #122858 #123303

Closed

anijain2305 requested a review from mlazos April 3, 2024 23:19

mlazos approved these changes Apr 3, 2024

pytorchmergebot closed this in 5b45ec8 Apr 4, 2024

pytorchmergebot pushed a commit that referenced this pull request Apr 4, 2024

[dynamo][guards] Remove workaround after #122858 (#123303)

6694628

Not needed since #122858 has landed

Pull Request resolved: #123303
Approved by: https://github.com/mlazos
ghstack dependencies: #123285, #123302

pytorchmergebot added the Merged label Apr 4, 2024

anijain2305 mentioned this pull request Apr 4, 2024

[dynamo][cpp-guards] ListGetItemGuardAccessor and TupleGetItemGuardAccessor #123396

Closed

anijain2305 added a commit that referenced this pull request Apr 5, 2024

[dynamo][guards] Forward fix for #123302

27ee5bd

[ghstack-poisoned]

anijain2305 added a commit that referenced this pull request Apr 5, 2024

[dynamo][guards] Forward fix for #123302

3640141

ghstack-source-id: 61b1d7f
Pull Request resolved: #123485

anijain2305 mentioned this pull request Apr 5, 2024

[dynamo][guards] Forward fix for #123302 #123485

Closed

github-actions bot deleted the gh/anijain2305/260/head branch May 6, 2024 01:52
