
[Inductor] Cache generated user defined triton kernels on tensor dtype and non tensor parameters #112752

Closed
wants to merge 8 commits

Conversation

…e and non tensor parameters

[ghstack-poisoned]

pytorch-bot bot commented Nov 2, 2023


🧪 See artifacts and rendered test results at hud.pytorch.org/pr/112752


✅ You can merge normally! (3 Unrelated Failures)

As of commit 6cf317d with merge base 7715b47 (image):

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

oulgen added a commit that referenced this pull request Nov 2, 2023
…e and non tensor parameters

ghstack-source-id: 0fa5e4ecdbbd5f82e4bbbe1d04de8424d6fa2f50
Pull Request resolved: #112752

oulgen commented Nov 2, 2023

The cache key needs to include the dtype of every tensor argument and the value of every non-tensor argument, since our Triton autotune implementation assumes every input is implicitly in the "keys" list.
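The idea can be sketched in plain Python (a hypothetical illustration, not Inductor's actual code; `FakeTensor` and `make_cache_key` are made-up names standing in for `torch.Tensor` and the real key construction): tensors contribute only their dtype to the key, while non-tensor arguments contribute their value, because Inductor specializes on all non-tensor inputs.

```python
# Hypothetical sketch of the caching idea described above.
from dataclasses import dataclass

@dataclass
class FakeTensor:
    # stand-in for torch.Tensor in this sketch
    dtype: str

def make_cache_key(kernel_name, args):
    key = [kernel_name]
    for arg in args:
        if isinstance(arg, FakeTensor):
            key.append(("dtype", arg.dtype))  # shape and values are ignored
        else:
            key.append(("value", arg))        # specialize on the exact value
    return tuple(key)

k1 = make_cache_key("add_kernel", [FakeTensor("float32"), 16])
k2 = make_cache_key("add_kernel", [FakeTensor("float32"), 32])
k3 = make_cache_key("add_kernel", [FakeTensor("float16"), 16])
```

Changing either a non-tensor value (`16` to `32`) or a tensor dtype (`float32` to `float16`) produces a different key, so the cached kernel is not reused in those cases.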

@oulgen oulgen added ciflow/trunk Trigger trunk jobs on your pull request topic: not user facing topic category labels Nov 2, 2023
…tensor dtype and non tensor parameters"

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx peterbell10 ipiszy yf225 chenyang78 kadeng muchulee8 aakhundov ColinPeppler

[ghstack-poisoned]
original_name = kernel.__name__

cache_key = [original_name]

Can we run into a (rather weird, but possible) edge case where the user defines two functionally different Triton kernels in different scopes with the same name and the same things added to the cache_key below?

@oulgen oulgen Nov 3, 2023


This caching is for the same FX graph, so I don't think it is possible for the user to define two different kernels with the same name there.


I don't think the kernel name is enough for a cache key.

from file1 import my_cool_triton_kernel as k1
from file2 import my_cool_triton_kernel as k2

or

import file1
import file2
...
file1.call_my_cool_triton_kernel(...)
file2.call_my_cool_triton_kernel(...)
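The collision described above is easy to reproduce in plain Python (a hypothetical sketch; `make_kernel` is a made-up factory): two distinct function objects can share `__name__`, so a key built from the name alone would alias them, while an identity-based key keeps them apart.

```python
# Two functionally different kernels that share a name.
def make_kernel(scale):
    def my_cool_triton_kernel(x):
        return x * scale
    return my_cool_triton_kernel

k1 = make_kernel(2)
k2 = make_kernel(3)

assert k1.__name__ == k2.__name__   # a name-only cache key would collide...
assert k1 is not k2                 # ...even though the functions differ
assert k1(10) != k2(10)             # and produce different results
```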

@aakhundov

... since our triton autotune implementation assumes every input is implicitly in "keys" list.

Does our autotune implementation not assume that the key doesn't exist? If the key is taken into account, how is it used? Thanks!


oulgen commented Nov 3, 2023

Does our autotune implementation not assume that the key doesn't exist? If the key is taken into account, how is it used? Thanks!

Our autotune implementation assumes that all non-tensor arguments to the kernel are part of the key (because we specialize on all non-tensor values; this is how Dynamo works).


@oulgen oulgen requested a review from jansel November 6, 2023 04:32

oulgen commented Nov 6, 2023

Swapped over to using the code id, and added a test case where kernel.__name__ is the same but we do not cache, since the code id does not match.

original_name = kernel.__name__

# Distinguish between different functions using function id
cache_key = [id(kernel.fn)]

Is it possible that kernel.fn ends up being GC'ed in Python and we end up with an id that points to nothing? We've run into some situations in the past where something similar has caused issues


@zou3519 This is not a part of python I am familiar with. What would you recommend using in place of id?
In general, why would kernel.fn get GCed when we hold a reference to it?


I read the code a bit more. We should be fine here. As long as we hold a reference to kernel.fn then it won't get GC'ed (and it looks like we keep one in the kernel_side_table).

In some other places in Dynamo (the allowed_functions dict) we use the id of a nested function as a dictionary key without storing a reference to it, so the object can get GC'ed and its id later reused by an unrelated object.
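The lifetime rule discussed above can be sketched as follows (a hypothetical illustration; `make_fn` is a made-up stand-in for a user-defined kernel): `id()` is only unique while the object is alive, so keying a cache on it is safe only while a strong reference, like the one kernel_side_table holds, keeps the object from being collected.

```python
import gc

def make_fn():
    # stand-in for a user-defined kernel function (hypothetical)
    def f():
        pass
    return f

held = make_fn()
key = id(held)            # safe: we hold a strong reference to the object
gc.collect()
assert id(held) == key    # id is stable for as long as the object stays alive

# The hazard: taking id() of an object nobody keeps alive.
stale = id(make_fn())     # the temporary function is collected immediately,
                          # so this integer may later be reused for an
                          # unrelated object, and a cache keyed on it could
                          # silently hit the wrong entry.
```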

pytorchmergebot pushed three commits that referenced this pull request Nov 7, 2023
@facebook-github-bot facebook-github-bot deleted the gh/oulgen/23/head branch November 10, 2023 15:24
Skylion007 pushed four commits to Skylion007/pytorch that referenced this pull request Nov 14, 2023