[inductor] Fix split-scan interaction with multi-kernel #131044

peterbell10 · 2024-07-18T16:48:21Z

Stack from ghstack (oldest at bottom):

This fixes a couple errors that come up when multi-kernel is used with
split-scan.

The split-scan was being marked as a persistent kernel, which allowed
a multi-kernel to be created but this isn't supported. Fix is to
never mark split-scan as persistent.
Benchmark codegen was not handling WorkspaceArg, and would raise a
KeyError during codegen.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang

[ghstack-poisoned]

pytorch-bot · 2024-07-18T16:48:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/131044

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 6831061 with merge base fedae41 ():

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

Lint / lintrunner-noclang / linux-job (gh) (trunk failure)
>>> Lint for torch/_functorch/_aot_autograd/subclass_utils.py:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

peterbell10 · 2024-07-23T18:45:56Z

@shunting314 PTAL

shunting314 · 2024-07-24T00:44:15Z

Thanks for fixing these!

Can you also add a tests in test/inductor/test_kernel_benchmark.py for the handling of WorkspaceArg in kernel benchmarking?

[ghstack-poisoned]

peterbell10 · 2024-07-25T10:06:55Z

@pytorchbot merge

pytorchmergebot · 2024-07-25T10:08:33Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…7724) Persistent kernels are sometimes able to remove intermediate buffers that would otherwise be needed for the non-persistent reduction kernel. This makes multi kernel's codegen more complicated as it needs to drop these extra arguments at runtime after selecting the correct kernel to run. Instead, this PR updates the persistent kernel's `must_keep_buffers` so these aren't dropped during codegen so both kernels have the same signature. Pull Request resolved: #127724 Approved by: https://github.com/shunting314 ghstack dependencies: #131044

Pull Request resolved: #127725 Approved by: https://github.com/lezcano ghstack dependencies: #131044, #127724

Update

e932eb8

[ghstack-poisoned]

pytorch-bot bot added ciflow/inductor module: inductor labels Jul 18, 2024

This was referenced Jul 18, 2024

[inductor] Simplify multi-kernel codegen by unifying kernel args #127724

Closed

[BE] Use assertEqual in MultiKernel tests #127725

Closed

[TESTING] Temporarily enable multi-kernel #130988

Closed

pytorchbot added the open source label Jul 18, 2024

Fix lint

5063d12

[ghstack-poisoned]

peterbell10 requested review from lezcano and shunting314 July 18, 2024 17:54

lezcano removed their request for review July 18, 2024 21:27

peterbell10 added 2 commits July 24, 2024 12:01

Update

5447283

[ghstack-poisoned]

Fix lint

6831061

[ghstack-poisoned]

shunting314 approved these changes Jul 24, 2024

View reviewed changes

peterbell10 added ciflow/trunk Trigger trunk jobs on your pull request release notes: inductor labels Jul 24, 2024

pytorchmergebot added the merging label Jul 25, 2024

pytorchmergebot added the Merged label Jul 25, 2024

pytorchmergebot closed this in 2784b3f Jul 25, 2024

pytorchmergebot removed the merging label Jul 25, 2024

pytorchmergebot pushed a commit that referenced this pull request Jul 26, 2024

[BE] Use assertEqual in MultiKernel tests (#127725)

c92f2a1

Pull Request resolved: #127725 Approved by: https://github.com/lezcano ghstack dependencies: #131044, #127724

henrylhtsang mentioned this pull request Jul 31, 2024

[BE][typing] fix types in common pruning #132309

Closed

github-actions bot deleted the gh/peterbell10/769/head branch August 25, 2024 02:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[inductor] Fix split-scan interaction with multi-kernel #131044

[inductor] Fix split-scan interaction with multi-kernel #131044

Uh oh!

peterbell10 commented Jul 18, 2024 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Jul 18, 2024 •

edited

Loading

Uh oh!

peterbell10 commented Jul 23, 2024

Uh oh!

shunting314 commented Jul 24, 2024

Uh oh!

peterbell10 commented Jul 25, 2024

Uh oh!

pytorchmergebot commented Jul 25, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[inductor] Fix split-scan interaction with multi-kernel #131044

[inductor] Fix split-scan interaction with multi-kernel #131044

Uh oh!

Conversation

peterbell10 commented Jul 18, 2024 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 18, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/131044

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

peterbell10 commented Jul 23, 2024

Uh oh!

shunting314 commented Jul 24, 2024

Uh oh!

peterbell10 commented Jul 25, 2024

Uh oh!

pytorchmergebot commented Jul 25, 2024

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

peterbell10 commented Jul 18, 2024 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Jul 18, 2024 •

edited

Loading