Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Inductor CUTLASS backend] Step 4: CUDA (template) kernels #107931

Closed
wants to merge 31 commits into from

Commits on Aug 25, 2023

  1. [Inductor CUTLASS backend] Step 4: CUDA (template) kernels

    [ghstack-poisoned]
    ipiszy committed Aug 25, 2023
    Configuration menu
    Copy the full SHA
    75cfaa0 View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 25, 2023
    Configuration menu
    Copy the full SHA
    0c6a52d View commit details
    Browse the repository at this point in the history

Commits on Aug 26, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    c0b885d View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    a96e751 View commit details
    Browse the repository at this point in the history
  3. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    5ad9fee View commit details
    Browse the repository at this point in the history
  4. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    51fa22c View commit details
    Browse the repository at this point in the history
  5. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    701bbee View commit details
    Browse the repository at this point in the history
  6. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    9949e3f View commit details
    Browse the repository at this point in the history
  7. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    46da92b View commit details
    Browse the repository at this point in the history
  8. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 26, 2023
    Configuration menu
    Copy the full SHA
    d4008e5 View commit details
    Browse the repository at this point in the history

Commits on Aug 27, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 27, 2023
    Configuration menu
    Copy the full SHA
    44bee1c View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 27, 2023
    Configuration menu
    Copy the full SHA
    c10e787 View commit details
    Browse the repository at this point in the history
  3. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 27, 2023
    Configuration menu
    Copy the full SHA
    c75608d View commit details
    Browse the repository at this point in the history
  4. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 27, 2023
    Configuration menu
    Copy the full SHA
    8530cde View commit details
    Browse the repository at this point in the history

Commits on Aug 29, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Aug 29, 2023
    Configuration menu
    Copy the full SHA
    0f879a0 View commit details
    Browse the repository at this point in the history

Commits on Sep 6, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 6, 2023
    Configuration menu
    Copy the full SHA
    43912cb View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 6, 2023
    Configuration menu
    Copy the full SHA
    c638a77 View commit details
    Browse the repository at this point in the history
  3. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 6, 2023
    Configuration menu
    Copy the full SHA
    0b54e9c View commit details
    Browse the repository at this point in the history
  4. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 6, 2023
    Configuration menu
    Copy the full SHA
    e4246c5 View commit details
    Browse the repository at this point in the history

Commits on Sep 7, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    67d8bb9 View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    052e685 View commit details
    Browse the repository at this point in the history
  3. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    97015e0 View commit details
    Browse the repository at this point in the history
  4. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    558ee89 View commit details
    Browse the repository at this point in the history
  5. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    6d6b0da View commit details
    Browse the repository at this point in the history
  6. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 7, 2023
    Configuration menu
    Copy the full SHA
    836ae1f View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 8, 2023
    Configuration menu
    Copy the full SHA
    04ea177 View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 8, 2023
    Configuration menu
    Copy the full SHA
    9183a36 View commit details
    Browse the repository at this point in the history

Commits on Sep 11, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    ea84a0d View commit details
    Browse the repository at this point in the history
  2. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    49c919d View commit details
    Browse the repository at this point in the history
  3. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 11, 2023
    Configuration menu
    Copy the full SHA
    e6ec8d5 View commit details
    Browse the repository at this point in the history

Commits on Sep 12, 2023

  1. Update on "[Inductor CUTLASS backend] Step 4: CUDA (template) kernels"

    This is the step 4 to add cutlass as an alternative inductor backend.
    Full tests can be found from the last PR in the stack.
    
    Feature request: #106991.
    
    
    
    
    cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ngimel yf225 chenyang78 kadeng muchulee8 aakhundov
    
    [ghstack-poisoned]
    ipiszy committed Sep 12, 2023
    Configuration menu
    Copy the full SHA
    e62717f View commit details
    Browse the repository at this point in the history