[blockwise] GOI packing routine for qb4w #6373

digantdesai · 2024-05-07T01:10:57Z

Test: packing-test --gtest_filter="PACK_QD8_F32_QB4W_GEMM_GOI_W.*"

digantdesai · 2024-05-07T14:54:22Z

cc @alankelly - here is the first PR to start building blockwise support, i.e. qb4w. I closed the draft PR #6030.

alankelly · 2024-05-08T14:47:01Z

src/xnnpack/pack.h

@@ -202,6 +202,38 @@ XNN_INTERNAL void xnn_pack_qs8_qc4w_gemm_goi_w(
  size_t extra_bytes,
  const struct xnn_qs8_qc4w_packing_params* params);

+typedef void (*xnn_pack_qs8_qb4w_gemm_fn)(
+  size_t g,


Traditionally these parameters were undocumented and cryptic. Could you please break this tradition and give them better names?!

Updated. I feel nc/kc/nr/kr are quite well "baked" into the code, and we should document them somewhere, such as the docs dir. That might do more for readability. I can put up a doc change tomorrow.
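To make the naming discussion concrete, here is a minimal sketch of what descriptive names for these parameters could look like. The struct, its field names, and the helpers are hypothetical and not part of XNNPACK; the meanings given for g/nc/kc/nr/kr follow the usual GEMM packing convention (GOI = groups, output channels, input channels), and the block-size field is an assumption about how blockwise scales are grouped.

```c
#include <assert.h>
#include <stddef.h>

// Hypothetical, descriptive names for the terse packing parameters.
// Illustrative only; this struct does not exist in XNNPACK.
struct packing_dims {
  size_t groups;           // g:  number of groups
  size_t output_channels;  // nc: rows of the GOI weight matrix
  size_t input_channels;   // kc: columns (the reduction dimension)
  size_t output_tile;      // nr: output channels packed together per tile
  size_t input_tile;       // kr: input channels packed together per step
  size_t block_size;       // assumed: input channels that share one scale
};

// Offset of weight (group, oc, ic) in a dense, unpacked GOI array.
static size_t goi_index(const struct packing_dims* d,
                        size_t group, size_t oc, size_t ic) {
  return (group * d->output_channels + oc) * d->input_channels + ic;
}

// Number of scales per output channel when every block_size input
// channels share one scale (rounded up for a partial last block).
static size_t blocks_per_channel(const struct packing_dims* d) {
  assert(d->block_size != 0);
  return (d->input_channels + d->block_size - 1) / d->block_size;
}

// Output channels rounded up to a whole number of nr-sized tiles.
static size_t padded_output_channels(const struct packing_dims* d) {
  assert(d->output_tile != 0);
  return (d->output_channels + d->output_tile - 1) / d->output_tile *
         d->output_tile;
}
```

With 5 output channels, 64 input channels, nr = 4, and a block size of 32, this gives 2 scales per output channel and 8 padded output channels.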

packing-test --gtest_filter="PACK_QD8_F32_QB4W_GEMM_GOI_W.*"

mcr229 · 2024-05-14T00:34:45Z

src/microparams-init.c

@@ -930,6 +930,45 @@ void xnn_init_qs8_to_qs8_qc8w_scale_fp32_params(
  }
 }

+void xnn_init_qs8_qb8w_scale_fp32_params(


should this be named qb4w?

You may be coming from a consistency point of view within qb*w, but I would argue that it shouldn't matter, because the routine is independent of weight bit width. If you look at qc4w, it uses the qc8w routine. I am not sure we will ever write qb8w; this is just an artifact from qc.

This function is independent of quantization bit width, so qs8_qb8w is not needed. Maybe xnn_init_blockwise_scale_fp32_params?

Sounds good. Let me update.
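The point about bit-width independence can be sketched as follows. This is not the code from the PR: all function names are hypothetical, and the nibble order (low nibble first) and the zero point of 8 are assumptions made for the example. The only thing it is meant to show is that the per-block scale lookup is the same for 4-bit and 8-bit weights, while only the unpacking differs.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

// Hypothetical helper: the scale for input channel k depends only on
// the block index, not on how many bits each weight uses.
static float block_scale(const float* scales, size_t k, size_t block_size) {
  assert(block_size != 0);
  return scales[k / block_size];
}

// Dequantize one output channel of 4-bit weights, two per byte.
// Assumptions: the low nibble holds the even index, zero point is 8.
static void dequantize_row_4bit(const uint8_t* packed, const float* scales,
                                size_t kc, size_t block_size, float* out) {
  for (size_t k = 0; k < kc; k++) {
    const uint8_t byte = packed[k / 2];
    const int32_t q = (k % 2 == 0) ? (byte & 0xF) : (byte >> 4);
    out[k] = (float) (q - 8) * block_scale(scales, k, block_size);
  }
}

// Same loop for 8-bit weights: only the unpacking changes, the scale
// lookup is shared.
static void dequantize_row_8bit(const int8_t* weights, const float* scales,
                                size_t kc, size_t block_size, float* out) {
  for (size_t k = 0; k < kc; k++) {
    out[k] = (float) weights[k] * block_scale(scales, k, block_size);
  }
}
```

Both row functions call the same `block_scale`, which is why a name without a bit width fits the scale-initialization routine.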

digantdesai · 2024-05-17T03:42:00Z

Thanks @alankelly - let me push another PR for these name changes and other small fixes.

digantdesai · 2024-05-17T04:19:10Z

#6434 is the follow-up PR.

alankelly reviewed May 8, 2024

[blockwise] Packing routine for qb4w

fdd08a9

packing-test --gtest_filter="PACK_QD8_F32_QB4W_GEMM_GOI_W.*"

digantdesai force-pushed the goi_scalar_packing branch from 361695c to fdd08a9 Compare May 9, 2024 03:45

mcr229 reviewed May 14, 2024

copybara-service bot merged commit 084e96a into google:master May 16, 2024
3 checks passed

digantdesai mentioned this pull request May 17, 2024

[blockwise] Minor fixes for qb4w goi packing routine #6434

Closed

digantdesai mentioned this pull request May 29, 2024

QB4W Development #6502

Closed
