Make Inductor benchmarker more compatible with Triton do_bench #160921
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/160921. Note: links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 75bc584 with merge base 82c7a1e. (This comment was automatically generated by Dr. CI and updates every 15 minutes.)
Force-pushed from de03957 to 93e10f3 (Compare)
```diff
@@ -183,7 +183,7 @@ def L2_cache_size(self: Self) -> int:

     def get_event_pairs(
         self: Self, iters: int
-    ) -> list[tuple[torch.cuda.Event, torch.cuda.Event]]:
+    ) -> List[tuple[torch.cuda.Event, torch.cuda.Event]]:
```
please use `list` instead of `List`
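(For context, a minimal sketch of how such event pairs typically drive GPU timing; the `benchmark_gpu_sketch` wrapper and its min-of-iterations reduction are illustrative assumptions, not the benchmarker's exact internals. The annotation below uses the builtin `list`, per the review comment.)

```python
import torch

def get_event_pairs(iters: int) -> list[tuple[torch.cuda.Event, torch.cuda.Event]]:
    # One (start, end) CUDA event pair per timed iteration.
    return [
        (
            torch.cuda.Event(enable_timing=True),
            torch.cuda.Event(enable_timing=True),
        )
        for _ in range(iters)
    ]

def benchmark_gpu_sketch(fn, iters: int = 10) -> float:
    event_pairs = get_event_pairs(iters)
    for start_event, end_event in event_pairs:
        start_event.record()
        fn()
        end_event.record()
    # Wait for all recorded events before reading elapsed times.
    torch.cuda.synchronize()
    # Report the fastest iteration, in milliseconds.
    return min(
        start_event.elapsed_time(end_event)
        for start_event, end_event in event_pairs
    )
```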
Force-pushed from 93e10f3 to d3d38c9 (Compare)
Force-pushed from d3d38c9 to 75bc584 (Compare)
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Make Inductor benchmarker more compatible with Triton do_bench (pytorch#160921)
Pull Request resolved: pytorch#160921
Approved by: https://github.com/BoyuanFeng
Common benchmark suites like TritonBench use `triton.testing.do_bench` for kernel timing measurement, which is not always fair to all backends. E.g., it includes torch.compile Dynamo invocation overhead and hence doesn't reflect real-world model use cases, where Dynamo overhead is usually hidden.

I also opened a PR to use this timing measurement function on the TritonBench side: meta-pytorch/tritonbench#333. But regardless of whether that PR lands, I think we should enhance Inductor `benchmark_gpu` to match `do_bench` features, to make it easier for people to migrate.
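A rough usage sketch of the two entry points follows (assumptions: a CUDA device is available, `f` and the tensor shape are invented for illustration, and `benchmarker` is the object exported from `torch._inductor.runtime.benchmarking`, an internal API that may change):

```python
import torch
import triton.testing
from torch._inductor.runtime.benchmarking import benchmarker  # internal API

def f(x):
    return torch.sin(x) + torch.cos(x)

compiled = torch.compile(f)
x = torch.randn(4096, 4096, device="cuda")
compiled(x)  # warm up so compilation itself is not measured

# do_bench times the whole Python callable, so CPU-side Dynamo overhead
# (guard evaluation, frame handling) can leak into the measurement.
ms_do_bench = triton.testing.do_bench(lambda: compiled(x))

# Inductor's own GPU benchmarker; this PR aligns its behavior with
# do_bench so suites like TritonBench can migrate more easily.
ms_inductor = benchmarker.benchmark_gpu(lambda: compiled(x))

print(f"do_bench: {ms_do_bench:.4f} ms, benchmark_gpu: {ms_inductor:.4f} ms")
```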
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben