@LeiWang1999

This pull request updates the benchmarking scripts and the matrix multiplication and multi-head attention implementations, and modifies mfma_macro_generator.py to support different thread binding layouts. The most important changes are the submodule commit update, the new benchmarking scripts, and the thread binding layout support in mfma_macro_generator.py.

Benchmarking updates:

Matrix multiplication and multi-head attention implementations:

Code simplification and cleanup:

Submodule update:

  • 3rdparty/tvm: Updated the submodule commit to a new version.

@LeiWang1999 LeiWang1999 merged commit b481405 into microsoft:main Nov 19, 2024
5 of 6 checks passed