[ONNX] Update 'Functionalize' pass to support pre-decomp graph; Drop 'aten_graph' arg for 'DynamoExporter' #99667
Conversation
🔗 Helpful links: see artifacts and rendered test results at hud.pytorch.org/pr/99667. Note: links to docs will display an error until the doc builds have completed.
✅ No failures as of commit 3fc55a6. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
ghstack-source-id: b15d9612edec3ebd2917ad381e5c52de79416d1b. Pull Request resolved: #99667

Summary
- Previously this was required by `tracing_mode=symbolic` for `dynamic` tracing. That argument will be dropped by #99555.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Functionalization currently cannot work properly on an aten-level graph, so it must happen before lowering and decomposition.
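As a rough illustration of what this ordering buys, here is a minimal sketch of functionalization under a `make_fx` trace. It uses the public `torch.func.functionalize` and `make_fx` APIs as stand-ins; it is not the exporter's internal 'Functionalize' pass itself:

```python
# Minimal sketch, assuming the public torch.func.functionalize + make_fx
# APIs; NOT the exporter's actual 'Functionalize' pass implementation.
import torch
from torch.fx.experimental.proxy_tensor import make_fx
from torch.func import functionalize

def fn(x):
    y = x.clone()
    y.add_(1)  # in-place mutation on an intermediate
    return y

# Tracing the functionalized function yields a mutation-free FX graph:
# the in-place aten.add_ is rewritten to an out-of-place aten.add.
gm = make_fx(functionalize(fn))(torch.randn(3))
print(gm.graph)
```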
ghstack-source-id: 6d04ee3f2ee259a6c51cd527c290a357096846f2. Pull Request resolved: #99667

Summary
- Previously this was required by `tracing_mode=symbolic` for `dynamic` tracing. That argument will be dropped by #99555.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Functionalization currently cannot work properly on an aten-level graph, so it must happen before lowering and decomposition.
- Introduce a `ReplaceInplacePostFunctionalization` pass to replace inplace-variant ops with their outplace versions. These ops are created by aten graph lowering and decomposition after functionalization; they won't perform any real mutation, as mutation is expected to have been handled by functionalization already.

Workaround to unblock #99662.
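Only the pass's name and intent come from the summary above; a hypothetical sketch of the core idea (swapping surviving in-place aten overloads for their out-of-place counterparts in an FX graph) could look like the following. The overload-lookup details are assumptions, not the PR's actual implementation:

```python
# Hypothetical sketch of the ReplaceInplacePostFunctionalization idea.
# Assumption: after functionalization, remaining aten ops whose base names
# end in '_' (e.g. aten::add_) have a same-named out-of-place counterpart.
import torch

def replace_inplace_with_outplace(gm: torch.fx.GraphModule) -> torch.fx.GraphModule:
    for node in gm.graph.nodes:
        if node.op != "call_function" or not isinstance(node.target, torch._ops.OpOverload):
            continue
        schema = node.target._schema
        base = schema.name.split("::")[-1]  # e.g. "add_"
        if not base.endswith("_"):
            continue
        packet = getattr(torch.ops.aten, base[:-1], None)  # e.g. torch.ops.aten.add
        if packet is None:
            continue
        # Keep the same overload name ("Tensor", "Scalar", ...); "" means default.
        overload = schema.overload_name or "default"
        node.target = getattr(packet, overload)
    gm.graph.lint()
    gm.recompile()
    return gm
```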
ghstack-source-id: b2ef6bc7f45661f8b0098ea405496a3dc9584e8c. Pull Request resolved: #99667

…orkaround in 'Functionalize' pass"

Summary
- Previously this was required by and entangled with `tracing_mode=symbolic` for `dynamic` tracing. That is resolved by #99555 and its follow-ups.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Updated `Functionalization` to work around #99774 (comment).

Todo
- Training vs eval in dynamo_export: we are effectively exporting all models in training mode by default, but for the sake of this export we are only interested in eval mode. The question is: should we call `model.eval()` in `dynamo_export`? Tests with models containing batch norm fail 'functionalization' in training mode; we are explicitly calling `model.eval()` for these models for now (see the sketch below).
- Merge the decomp and functionalize passes. Both call into `make_fx`; merging potentially increases performance, however it is unclear whether it would result in different behavior.

Workaround to unblock #99662.
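For the training-vs-eval point, the current workaround in the tests amounts to something like this. Illustrative only; the model and the `torch.onnx.dynamo_export` entry point are assumptions for the example, not code from this PR:

```python
# Illustrative only: explicitly switch a batch-norm model to eval mode
# before export, since functionalization currently fails for batch norm
# in training mode. Assumes the torch.onnx.dynamo_export entry point.
import torch

model = torch.nn.Sequential(
    torch.nn.Conv2d(3, 8, kernel_size=3),
    torch.nn.BatchNorm2d(8),
)
model.eval()  # the explicit call discussed in the Todo above

export_output = torch.onnx.dynamo_export(model, torch.randn(1, 3, 16, 16))
```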
Seems to be getting complicated. You might consider moving 'Functionalization' ahead of dropping 'aten_graph' in the title.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
```python
for inpt, input_functional in zip(flat_inputs, flat_inputs_functional):
    if isinstance(input_functional, torch.Tensor):
        # Apply any pending functionalization updates to the wrapper tensor.
        torch._sync(input_functional)
        # Unwrap the functional wrapper to recover the underlying plain tensor.
        inpt_new = torch._from_functional_tensor(input_functional)
```
Why is this inpt_new assigned?
Stack from ghstack (oldest at bottom):

Summary
- Previously this was required by and entangled with `tracing_mode=symbolic` for `dynamic` tracing. That is resolved by "Delete tracing_mode argument to export" (#99555) and its follow-ups.
- Updated `Functionalization` to work around "RuntimeError: Cannot call sizes() on tensor with symbolic sizes/strides w/ dynamo.export, make_fx and functionalize" (#99774 (comment)).

Todo
- Training vs eval in dynamo_export: we are effectively exporting all models in training mode by default, but for the sake of this export we are only interested in eval mode. The question is: should we call `model.eval()` in `dynamo_export`? Tests with models containing batch norm fail 'functionalization' in training mode; we are explicitly calling `model.eval()` for these models for now.
- Merge the decomp and functionalize passes. Both call into `make_fx`; merging potentially increases performance, however it is unclear whether it would result in different behavior.

Fixes #99662. (For the functionalization issue; missing op support is still needed.)
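On the "merge decomp and functionalize" Todo item: since both passes call into `make_fx`, a single trace could in principle apply both at once. A sketch under that assumption follows; `core_aten_decompositions()` is an illustrative stand-in for the exporter's actual decomposition table:

```python
# Sketch of merging the two make_fx-based passes into a single trace.
# Assumption: core_aten_decompositions() approximates the exporter's
# decomposition table; this is not the PR's implementation.
import torch
from torch.fx.experimental.proxy_tensor import make_fx
from torch.func import functionalize
from torch._decomp import core_aten_decompositions

def fn(x):
    y = x.clone()
    y.relu_()  # mutation removed by functionalization
    return y

gm = make_fx(
    functionalize(fn),
    decomposition_table=core_aten_decompositions(),
)(torch.randn(4))
print(gm.graph)  # functionalized and decomposed in one pass
```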