[Cutlass 3.3 submodule upgrade] #112861
Closed
[ghstack-poisoned]
kadeng
added a commit
that referenced
this pull request
Nov 3, 2023
ghstack-source-id: 6ef08d44aabacacf0827babadf2758e5a99eb780 Pull Request resolved: #112861
[ghstack-poisoned]
kadeng
added a commit
that referenced
this pull request
Nov 3, 2023
ghstack-source-id: 914166b584551faa2b4f73aa40d6fdd00701da9c Pull Request resolved: #112861
[ghstack-poisoned]
kadeng
added a commit
that referenced
this pull request
Nov 3, 2023
ghstack-source-id: d8af54a5f1f55128c8c155351d21b207b89ecf51 Pull Request resolved: #112861
…(experimental)" [ghstack-poisoned]
kadeng
added a commit
that referenced
this pull request
Nov 3, 2023
ghstack-source-id: 588d706b1c056aa2a7143923eaae1cd654b594fe Pull Request resolved: #112861
[ghstack-poisoned]
kadeng
added a commit
that referenced
this pull request
Nov 3, 2023
ghstack-source-id: b49a4c870571fa592fd0ae66b373d632a3c81e00 Pull Request resolved: #112861
@kadeng has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
Updates third_party/cutlass to Cutlass v3.3. No further changes appear necessary.

Cutlass release 3.3 has not been tagged yet; the revision hash is 1d7f2a207ec215e037099f4ba5632ccfa0249673 (Cutlass 3.3 with two minor hotfixes on top).

Cutlass 3.3 offers the following improvements:
- Adds support for mixed-precision GEMMs on Hopper and Ampere
- Adds support for < 16B aligned GEMMs on Hopper
- Enhancements to EVT
- Enhancements to Python interface
- Enhancements to sub-byte type handling in CuTe
- Several other bug fixes and performance improvements
- Minor doc update

Test Plan:
- CI (ciflow/trunk, ciflow/inductor)
- pytest test/inductor/test_max_autotune.py

Differential Revision: [D50988216](https://our.internmc.facebook.com/intern/diff/D50988216)

[ghstack-poisoned]
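To make the "< 16B aligned GEMMs" item more concrete, here is a minimal sketch of how one might estimate the byte alignment that a row-major GEMM operand can guarantee. It assumes the base pointer is already 16-byte aligned and that alignment is limited only by the row stride. `max_alignment_bytes` is a hypothetical helper written for illustration; it is not part of Cutlass or Inductor, and the real kernel selection logic may differ.

```python
def max_alignment_bytes(leading_dim: int, element_bytes: int, cap: int = 16) -> int:
    """Return the largest power-of-two alignment (in bytes, at most `cap`)
    that the start of every row satisfies.

    Assumption: the operand is stored row-major and its base pointer is
    itself `cap`-byte aligned, so only the row stride matters.
    """
    row_bytes = leading_dim * element_bytes
    align = cap
    # Halve the candidate alignment until it divides the row stride.
    while align > 1 and row_bytes % align != 0:
        align //= 2
    return align


if __name__ == "__main__":
    # fp16 (2 bytes per element) with different leading dimensions.
    for ld in (4096, 4100, 4095):
        print(ld, max_alignment_bytes(ld, element_bytes=2))
```

Under these assumptions, an fp16 operand with a leading dimension of 4096 keeps full 16-byte alignment, while 4100 drops to 8 bytes and 4095 to 2 bytes. The last two are the kind of shapes the "< 16B aligned" item refers to.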
kadeng
added a commit
that referenced
this pull request
Nov 9, 2023
ghstack-source-id: c05876f5571638881bfcc411d7f70698bd09ef1d Pull Request resolved: #112861
This was referenced Nov 9, 2023
I was waiting for the official v3.3 tag. It has now been released, and it fixes an important bug that in the meantime caused build failures. Waiting for CI now.
kadeng
added a commit
that referenced
this pull request
Dec 7, 2023
ghstack-source-id: 1363752e2699509ab1c5dde200bb2111ec0694d9 Pull Request resolved: #112861
kadeng
added a commit
that referenced
this pull request
Dec 7, 2023
ghstack-source-id: 4956e5d00692fcf9ec3048085c798ca334808679 Pull Request resolved: #112861
kadeng
added a commit
that referenced
this pull request
Dec 7, 2023
ghstack-source-id: cca382a4785bce0b6b64d443f2cbcc6a522c5116 Pull Request resolved: #112861
kadeng
added a commit
that referenced
this pull request
Dec 9, 2023
ghstack-source-id: 980c3b025d6d04ced8415cae15131f443c55f360 Pull Request resolved: #112861
kadeng
added a commit
that referenced
this pull request
Dec 10, 2023
ghstack-source-id: b289b21b3e5e937975644cfa92888b55285087c2 Pull Request resolved: #112861
kadeng
added a commit
that referenced
this pull request
Dec 10, 2023
ghstack-source-id: 878506f289216d2e15b54876fb5b5a3cf6b780a8 Pull Request resolved: #112861
kadeng
added a commit
that referenced
this pull request
Dec 10, 2023
ghstack-source-id: 3880723ea0a3ec4fab13373ef50c1149da0c2888 Pull Request resolved: #112861
This was referenced Dec 12, 2023
Closed
Moved to a (draft) feature branch, see #115919
Labels: ciflow/inductor, ciflow/trunk, module: inductor, topic: not user facing
Stack from ghstack (oldest at bottom):
Differential Revision: D50988216
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @muchulee8 @aakhundov @ColinPeppler