[Dev][BitNET] Implement INT4xINT2 GEMM #233

LeiWang1999 · 2024-11-02T17:19:38Z

This PR includes:

INT4xINT2 Kernel Implementation and Correctness check
Fast Dequantize of INT4xINT2

TODO:

Integrate as a bitblas op.

- Adjusted the local fragment sizes for tensor core memory allocation in the MatmulFineGrainScheduler class. - Updated the allocation sizes for A_local, B_local, and C_local variables based on the new fragment sizes. - The changes ensure efficient memory utilization and improve performance. Refactor tensor core memory allocation in MatmulDequantizeFineGrainedScheduler - Modified the fragment sizes for tensor core memory allocation in the MatmulDequantizeFineGrainedScheduler class. - Updated the allocation sizes for A_frag, B_frag, and C_frag variables based on the new fragment sizes. - The changes optimize memory usage and enhance the efficiency of the dequantization process. Refactor tensor core memory allocation in MatmulDequantizeWeightPropagationScheduler - Adjusted the fragment sizes for tensor core memory allocation in the MatmulDequantizeWeightPropagationScheduler class. - Updated the allocation sizes for A_frag, B_frag, B_dequantize_frag, and C_frag variables based on the new fragment sizes. - The changes improve memory utilization and optimize the weight propagation process.

…xtent

LeiWang1999 added 13 commits October 4, 2024 05:49

Merge TL Update

acb4aa4

Merge branch 'main' of https://github.com/microsoft/BitBLAS into main

e305a72

submodule update

4aa081c

Re-implement macro with sub function.

811e5c7

lint fix

4a0afc9

Implement int4 tensorcore

2af586d

Merge branch 'main' of https://github.com/microsoft/BitBLAS into tl_e…

fd4973c

…xtent

lint fix

8f7767b

support uint2->uint4 fast dequantize

85a9308

Support int4 tensorcore decoding

9f9f397

lint fix

ff77090

Merge branch 'main' of https://github.com/microsoft/BitBLAS into tl_e…

a2862a0

…xtent

LeiWang1999 merged commit 451b466 into microsoft:main Nov 3, 2024
5 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Dev][BitNET] Implement INT4xINT2 GEMM #233

[Dev][BitNET] Implement INT4xINT2 GEMM #233

Uh oh!

LeiWang1999 commented Nov 2, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[Dev][BitNET] Implement INT4xINT2 GEMM #233

[Dev][BitNET] Implement INT4xINT2 GEMM #233

Uh oh!

Conversation

LeiWang1999 commented Nov 2, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant