Add generic kernel instances for ck::tensor_operation::device::DeviceGemmMultipleD #741

Merged: zjing14 merged 16 commits into develop from lwpck-759 on Jun 14, 2023

@geyyer (Contributor) commented Jun 2, 2023

This PR adds the generic kernel instances requested in #709.

  • Add a generic kernel instance for gemm_add_add_fastgelu, along with a client example.
  • Do the same for gemm_add_fastgelu and gemm_fastgelu.
  • Tests:
    • ./02_gemm_add_add_fastgelu/client_gemm_add_add_fastgelu_generic 1 73 197 -1 -1 -1 -1 -1
    • ./02_gemm_add_add_fastgelu/client_gemm_add_fastgelu_generic 1 73 197 -1 -1 -1 -1
    • ./02_gemm_add_add_fastgelu/client_gemm_fastgelu_generic 1 73 197 -1 -1 -1
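
For readers unfamiliar with the three operations named in the list, here is a minimal CPU reference sketch in Python. It is not code from this PR or from the library. It assumes, as the operation names suggest, that each one computes a matrix product, adds zero, one, or two extra tensors elementwise, and then applies FastGelu. The function names `fast_gelu` and `gemm_multiple_d_fastgelu` are hypothetical, and the tanh-based GELU approximation is an assumption that may differ in detail from the formula the library uses.

```python
import math


def fast_gelu(x: float) -> float:
    # Assumption: the common tanh approximation of GELU.
    # The library's FastGelu may use a different but similar formula.
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def gemm_multiple_d_fastgelu(a, b, ds):
    """Reference for E = fast_gelu(A @ B + D0 + D1 + ...).

    a:  M x K matrix as a list of rows
    b:  K x N matrix as a list of rows
    ds: list of M x N matrices added before the activation
        (empty list ~ gemm_fastgelu, one entry ~ gemm_add_fastgelu,
         two entries ~ gemm_add_add_fastgelu)
    """
    m, k, n = len(a), len(b), len(b[0])
    e = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            # Plain dot product of row i of A with column j of B.
            acc = sum(a[i][p] * b[p][j] for p in range(k))
            # Elementwise addition of every extra D tensor.
            acc += sum(d[i][j] for d in ds)
            e[i][j] = fast_gelu(acc)
    return e
```

A reference like this can be handy for spot-checking a kernel's output on small or oddly sized problems, since it has no tiling or alignment requirements of its own.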
    

@geyyer marked this pull request as ready for review June 7, 2023 22:41
@geyyer requested review from asroy, qianfengz and zjing14 June 7, 2023 22:41
@zjing14 merged commit 54b68eb into develop Jun 14, 2023
@illsilin deleted the lwpck-759 branch December 7, 2023 18:57