Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Improve performance of broadcast_axis on CPU #17882

Merged
merged 5 commits into from
Jun 29, 2020

Commits on Jun 27, 2020

  1. fixing broadcast_axis kernel to int32

    Rohit Kumar Srivastava committed Jun 27, 2020
    Configuration menu
    Copy the full SHA
    78d30c2 View commit details
    Browse the repository at this point in the history
  2. fixing slice_axis kernel to int32

    Rohit Kumar Srivastava committed Jun 27, 2020
    Configuration menu
    Copy the full SHA
    ba9d26a View commit details
    Browse the repository at this point in the history
  3. adding comments explaining code optimizations

    Rohit Kumar Srivastava committed Jun 27, 2020
    Configuration menu
    Copy the full SHA
    e6e3bc7 View commit details
    Browse the repository at this point in the history
  4. combining CPU and GPU implementation method signatures and cleaned up

    code
    Rohit Kumar Srivastava committed Jun 27, 2020
    Configuration menu
    Copy the full SHA
    f68517b View commit details
    Browse the repository at this point in the history
  5. adding new broadcast_axis to np_matmul

    Rohit Kumar Srivastava committed Jun 27, 2020
    Configuration menu
    Copy the full SHA
    df8c232 View commit details
    Browse the repository at this point in the history