Added limit on number of warps for coordesc autotuner #108997

Chillee · 2023-09-11T01:35:59Z

Stack from ghstack (oldest at bottom):

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov

[ghstack-poisoned]

pytorch-bot · 2023-09-11T01:36:01Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/108997

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There is 1 currently active SEV. If your PR is affected, please view it below:

A100 runners down: apt-get install nvidia-docker2, Could not get lock /var/lib/dpkg/lock-frontend

✅ You can merge normally! (4 Unrelated Failures)

As of commit a2e3493 with merge base a6b153b:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following job failed, but the failure was also present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

linux-jammy-py3.9-clang12-asan / test (default, 2, 6, linux.4xlarge) (gh)

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

cuda12.1-py3.10-gcc9-sm86 / test (inductor, 1, 1, linux.g5.4xlarge.nvidia.gpu, unstable) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

shunting314 · 2023-09-11T16:39:52Z

torch/_inductor/coordinate_descent_tuner.py

@@ -75,6 +75,11 @@ def get_rmax(self):
            # large enough. We should not pick this large RBLOCK anyway
            return 2**30

+    def get_warpsmax(self):
+        # Currently, CUDA has a maximum of 1024 threads, so 32 is the max


nit: 'CUDA has a maximum of 1024 threads' -> 'CUDA has a maximum of 1024 threads per block'?
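The arithmetic behind this limit can be sketched as follows. This is an illustrative sketch only, not the code from the PR: the helper names (`max_warps_per_block`, `is_valid_num_warps`, `candidate_num_warps`) are hypothetical, and the halve/double neighbor scheme is an assumption made for the example. It relies on two facts: CUDA allows at most 1024 threads per block, and a warp on NVIDIA GPUs is 32 threads.

```python
# Illustrative sketch only; all names here are hypothetical, not from the PR.
MAX_THREADS_PER_BLOCK = 1024  # CUDA limit on threads per block
WARP_SIZE = 32                # threads per warp on NVIDIA GPUs


def max_warps_per_block(max_threads=MAX_THREADS_PER_BLOCK, warp_size=WARP_SIZE):
    """Largest warp count whose total thread count still fits in one block."""
    return max_threads // warp_size


def is_valid_num_warps(num_warps):
    """Accept only warp counts in the range [1, max_warps_per_block()]."""
    return 1 <= num_warps <= max_warps_per_block()


def candidate_num_warps(current):
    """Halved and doubled neighbors of `current`, filtered by the limit.

    The halve/double scheme is an assumption for illustration.
    """
    neighbors = [current // 2, current * 2]
    return [n for n in neighbors if is_valid_num_warps(n)]


if __name__ == "__main__":
    print(max_warps_per_block())    # 1024 // 32
    print(candidate_num_warps(32))  # doubling to 64 would exceed the limit
```

With a cap like this in place, a tuner that is already at 32 warps would only consider moving down, since 64 warps would require 2048 threads in a single block.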

Chillee · 2023-09-12T00:12:25Z

@pytorchbot merge -f "failures unrelated"

pytorchmergebot · 2023-09-12T00:14:33Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

Added limit on number of warps

74022c2

[ghstack-poisoned]

Chillee mentioned this pull request Sep 11, 2023

Add heuristic for when evict_first should be set (and some other minor things) #108841

Closed

Chillee requested a review from shunting314 September 11, 2023 01:36

github-actions bot added module: inductor ciflow/inductor labels Sep 11, 2023

github-actions bot requested a review from ezyang September 11, 2023 01:36

Chillee changed the title ~~Added limit on number of warps~~ Added limit on number of warps for coordesc autotuner Sep 11, 2023

Chillee added 2 commits September 10, 2023 19:16

Update on "Added limit on number of warps for coordesc autotuner"

d2747e7

Update on "Added limit on number of warps for coordesc autotuner"

a2e3493

shunting314 reviewed Sep 11, 2023

shunting314 approved these changes Sep 11, 2023

pytorchmergebot added the merging label Sep 12, 2023

pytorchmergebot added Merged and removed merging labels Sep 12, 2023

pytorchmergebot closed this in 33c1136 Sep 12, 2023

facebook-github-bot deleted the gh/chillee/218/head branch September 15, 2023 14:23
