
Conversation

@bobrenjc93 bobrenjc93 commented Sep 18, 2025

Stack from ghstack (oldest at bottom):

Ergonomic improvement that allows sharing symbols without resorting to the complex torch._check paradigm described by @anijain2305 in his recent UED:

Different symbols for KV cached tensors - One property in my case was that the KV cache for different attention blocks had the same seq length, but there is no API to enforce that. The only way is to add torch._check, but torch.compile must trace those functions to instruct the dynamic-shape infra. This required me to change the model code.
Changing model code is not the best experience. Let's see how transformers maintainers react to my PR. Maybe an API with fullgraph=True is a better bet here.
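
For context, here is a minimal sketch of the torch._check paradigm described above (not taken from this PR; the tensor shapes and the attention stub are illustrative). The equality of the two seq-length symbols has to be asserted inside traced model code:

```python
import torch
import torch._dynamo

def attn_block(q, kv_cache_a, kv_cache_b):
    # The only way to tell the dynamic-shape infra that both caches share a
    # seq length is an equality check traced through the model code itself.
    torch._check(kv_cache_a.size(2) == kv_cache_b.size(2))
    return q @ kv_cache_a.transpose(-1, -2), q @ kv_cache_b.transpose(-1, -2)

q = torch.randn(2, 8, 16, 64)
kv_a = torch.randn(2, 8, 128, 64)   # [batch, heads, seq, head_dim]
kv_b = torch.randn(2, 8, 128, 64)
torch._dynamo.mark_dynamic(kv_a, 2)  # today each seq dim gets its own symbol
torch._dynamo.mark_dynamic(kv_b, 2)
out_a, out_b = torch.compile(attn_block)(q, kv_a, kv_b)
```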

cc @ezyang @EikanWang @jgong5 @wenzhe-nrv @voznesenskym @penguinwu @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @jiayisunx @chenyang78 @kadeng @chauhang @amjames @Lucaskabela

pytorch-bot bot commented Sep 18, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163246

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit f7c3ddf with merge base 39450e7:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

bobrenjc93 added a commit that referenced this pull request Sep 18, 2025
ghstack-source-id: b0b8fa6
Pull Request resolved: #163246
@bobrenjc93 bobrenjc93 added the ciflow/trunk and topic: not user facing labels Sep 18, 2025
bobrenjc93 added a commit that referenced this pull request Sep 18, 2025
ghstack-source-id: ee461a2
Pull Request resolved: #163246
bobrenjc93 added a commit that referenced this pull request Sep 18, 2025
ghstack-source-id: f67376d
Pull Request resolved: #163246
bobrenjc93 added a commit that referenced this pull request Sep 18, 2025
ghstack-source-id: 11d6ca8
Pull Request resolved: #163246
@bobrenjc93 bobrenjc93 marked this pull request as ready for review September 18, 2025 21:44
ezyang commented Sep 21, 2025

I think doing it with strings is unwise, as in a large enough codebase it can be difficult to avoid collisions, which will cause extremely strange errors. The only use case for strings is if there is a single global configuration spot for dynamic shapes that applies everywhere, but mark_dynamic works for both the fullgraph and graph-break cases, and it can also propagate unpredictably since it follows data flow. Easy fix: explicitly allocate Dim symbols (similar to how export does it) and then use those to dedupe by object identity.

Speaking of which, why don't we use export's Dim directly? cc @avikchaudhuri
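
For readers unfamiliar with it, export's Dim expresses sharing by object identity roughly like this (standard torch.export usage, independent of this PR; the module and shapes are illustrative):

```python
import torch
from torch.export import Dim, export

class Blocks(torch.nn.Module):
    def forward(self, kv_a, kv_b):
        return kv_a + kv_b

seq = Dim("seq")  # one Dim object reused below => one shared symbol
kv_a = torch.randn(2, 8, 128, 64)
kv_b = torch.randn(2, 8, 128, 64)
ep = export(
    Blocks(),
    (kv_a, kv_b),
    dynamic_shapes={"kv_a": {2: seq}, "kv_b": {2: seq}},
)
print(ep)
```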

@bobrenjc93
Contributor Author

> I think doing it with strings is unwise, as in a large enough codebase it can be difficult to avoid collisions, which will cause extremely strange errors.

Fair point. I assumed this would be such a rare power-user case that it wouldn’t matter in practice, but you’re right that using object identity is cleaner. I did think about the Dims API, though it seemed like a heavy lift for existing models with mark_dynamic infrastructure (for example, ads PT2 wrappers). Since this is a power-user feature anyway, that cost may not be a big deal. I’ll close this PR for now and see if I can get the Dims API working for compile in a more ergonomic way. Worst case, there may be a middle-ground approach where we create dim-like objects and thread them through the mark_dynamic calls.
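
To make that middle ground concrete, a purely speculative sketch follows; the SharedDim handle and the shared= argument are invented for illustration and do not exist in today's mark_dynamic API:

```python
import torch
import torch._dynamo

kv_a = torch.randn(2, 8, 128, 64)
kv_b = torch.randn(2, 8, 128, 64)

# What exists today: each call allocates an independent symbol for dim 2.
torch._dynamo.mark_dynamic(kv_a, 2)
torch._dynamo.mark_dynamic(kv_b, 2)

# Speculative middle ground (does NOT exist in torch): a dim-like handle whose
# object identity ties both dims to a single symbol, threaded through mark_dynamic.
# seq = SharedDim()
# torch._dynamo.mark_dynamic(kv_a, 2, shared=seq)
# torch._dynamo.mark_dynamic(kv_b, 2, shared=seq)
```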

@bobrenjc93 bobrenjc93 closed this Sep 21, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 22, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 22, 2025
pytorchmergebot pushed a commit that referenced this pull request Sep 23, 2025
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
@github-actions github-actions bot deleted the gh/bobrenjc93/564/head branch October 22, 2025 02:15

Labels: ciflow/inductor, ciflow/trunk, fx, module: dynamo, release notes: fx, topic: not user facing
