Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Inductor cutlass backend] Support for StreamK and non-zero workspace sizes #114687

Closed
wants to merge 16 commits into from

Conversation

kadeng
Copy link
Contributor

@kadeng kadeng commented Nov 28, 2023

Stack from ghstack (oldest at bottom):

The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though.

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

Copy link

pytorch-bot bot commented Nov 28, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/114687

Note: Links to docs will display an error until the docs builds have been completed.

❌ 18 New Failures, 14 Unrelated Failures

As of commit 366c377 with merge base afe6d27 (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

kadeng added a commit that referenced this pull request Nov 28, 2023
… sizes

WIP

ghstack-source-id: f723905c1913492f02786a8c10269c7586fd7372
Pull Request resolved: #114687
kadeng added a commit that referenced this pull request Dec 7, 2023
… sizes

WIP

ghstack-source-id: e330a20e8dbb4132dc7bc1c3d830c780157f95bc
Pull Request resolved: #114687
kadeng added a commit that referenced this pull request Dec 7, 2023
… sizes

WIP

ghstack-source-id: 36620de4afce44499522e1831a4fe2f0476c69d8
Pull Request resolved: #114687
kadeng added a commit that referenced this pull request Dec 7, 2023
… sizes

WIP

ghstack-source-id: 833a41137aba9906e4c9c311a0fcd3d13b16b002
Pull Request resolved: #114687
kadeng added a commit that referenced this pull request Dec 7, 2023
… sizes

WIP

ghstack-source-id: 1ec1e9d9ee3434fbc24814f0e90ba3ad8db0d646
Pull Request resolved: #114687
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
…o workspace sizes"


The StreamK tile scheduler offers performance improvements for certain otherwise problematic GEMM shapes. In Cutlass, most StreamK ops require temporary working memory, though. 

This diff adds support for temporary workspace memory being provided to Cutlass ops and enables StreamK-based GEMM Kernels in the Inductor Cutlass backend.

[ghstack-poisoned]
@kadeng
Copy link
Contributor Author

kadeng commented Dec 15, 2023

Moved to a (draft) feature branch, see #115919

@kadeng kadeng closed this Dec 15, 2023
@facebook-github-bot facebook-github-bot deleted the gh/kadeng/35/head branch January 14, 2024 15:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant