Skip to content

Conversation

@LeiWang1999
Copy link
Contributor

This pull request includes several changes to the bitblas module, focusing on improving functionality, optimizing performance, and refactoring code. The most important changes include adding a deprecation decorator, modifying tensorcore policies, enhancing the general_matmul module, and updating the ladder_permutate and lop3_permutate modules.

Code Enhancements and New Features:

  • bitblas/__init__.py: Added a deprecation decorator to mark functions as deprecated and emit warnings when used.
  • bitblas/module/__init__.py: Introduced the unpack_qweight function for unpacking quantized weights and updated the load_and_transform_weight method to use this function. [1] [2]

Tensorcore Policy Optimizations:

General Matmul Module Enhancements:

Ladder Permutate and LOP3 Permutate Module Updates:

@LeiWang1999 LeiWang1999 merged commit 2e60d2b into microsoft:main Aug 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant