[Inductor cutlass backend] Robust Precompilation / Autotuning / Retuning in subprocesses #115654

Closed
wants to merge 2 commits into from

kadeng (Contributor) commented Dec 12, 2023

Stack from ghstack (oldest at bottom):

Makes autotuning in subprocesses more robust, specifically against benchmarked functions
that run for a long time or crash. Such functions could also completely corrupt the CUDA
context of the entire process.

This diff ensures that precompilation works well with autotuning in subprocesses, and that
autotuning subprocesses have robust timeouts after which they are killed.
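
To illustrate the timeout-and-kill idea described above, here is a minimal sketch in plain Python. It is not the code from this PR: the helper name `run_with_timeout`, the status strings, and the use of `subprocess` with `sys.executable -c` are all assumptions made for illustration, and no Inductor or CUDA APIs are involved.

```python
import subprocess
import sys


def run_with_timeout(code: str, timeout_s: float):
    """Run a Python snippet in a child process (hypothetical helper).

    Returns (status, stdout), where status is "ok", "timeout" or "crashed".
    The parent process keeps running in all three cases.
    """
    proc = subprocess.Popen(
        [sys.executable, "-c", code],
        stdout=subprocess.PIPE,
        stderr=subprocess.DEVNULL,
        text=True,
    )
    try:
        out, _ = proc.communicate(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        # The child ran too long: kill it, then reap it so it does not linger.
        proc.kill()
        out, _ = proc.communicate()
        return ("timeout", out)
    if proc.returncode != 0:
        # Non-zero exit code (or death by signal): treat as a crash.
        return ("crashed", out)
    return ("ok", out)


if __name__ == "__main__":
    print(run_with_timeout("print(1 + 1)", 30.0))
    print(run_with_timeout("import time; time.sleep(60)", 1.0))
    print(run_with_timeout("import os; os._exit(3)", 30.0))
```

Because the benchmarked code runs in a separate process, a hang or crash there only costs that child; the caller just sees a status and can move on to the next candidate.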

pytorch-bot bot commented Dec 12, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/115654

Note: Links to docs will display an error until the docs builds have been completed.

❌ 20 New Failures, 12 Unrelated Failures

As of commit f7c2cf2 with merge base afe6d27 (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

This was referenced Dec 12, 2023
kadeng (Contributor, Author) commented Dec 15, 2023

Moved to a (draft) feature branch; see #115919.

kadeng closed this Dec 15, 2023
facebook-github-bot deleted the gh/kadeng/40/head branch January 14, 2024 15:23