feat: add SpinQuant offline rotation and integrate with PTQ pipeline by gavingavin99 · Pull Request #262 · Tencent/AngelSlim

gavingavin99 · 2026-03-16T09:00:02Z

feat: add SpinQuant offline rotation and integrate with PTQ pipeline

Add angelslim/compressor/transform/ package: - TransformBase abstract class and TransformFactory with register decorator - SpinQuant implementation: R1/R2/R4 offline Hadamard rotation fused into weights - SpinQuantMapping for LLaMA/Qwen layer name resolution - fuse_ln_linear, center_embeddings utilities; hadamard_utils
Integrate transform into PTQ: TransformFactory.create() + run() is called before quantization in PTQ.init()
Extend config_parser: add TransformConfig, FullConfig.transform_config, SlimConfigParser support for optional transform: YAML section
Add Engine.prepare_compressor(transform_config=) passthrough and lm_eval()
Add tools/run_transform_offline.py for standalone transform + save
Add configs/qwen3/spinquant/ with SpinQuant + fp8_static / int4_awq examples

@register

- Add angelslim/compressor/transform/ package: - TransformBase abstract class and TransformFactory with @register decorator - SpinQuant implementation: R1/R2/R4 offline Hadamard rotation fused into weights - SpinQuantMapping for LLaMA/Qwen layer name resolution - fuse_ln_linear, center_embeddings utilities; hadamard_utils - Integrate transform into PTQ: TransformFactory.create() + run() is called before quantization in PTQ.__init__() - Extend config_parser: add TransformConfig, FullConfig.transform_config, SlimConfigParser support for optional transform: YAML section - Add Engine.prepare_compressor(transform_config=) passthrough and lm_eval() - Add tools/run_transform_offline.py for standalone transform + save - Add configs/qwen3/spinquant/ with SpinQuant + fp8_static / int4_awq examples

yghstill · 2026-03-17T15:22:22Z


-    # Step 7: Save compressed model
+    def find_modules_with_hooks(model: torch.nn.Module):
+        """查找并打印模型中所有带有 hook 的子模块"""


注释用英文

yghstill · 2026-03-17T15:23:42Z

 global:
  save_path: ./output

+


qwen3-8b_int4_awq.yaml应该不用修改？

不用改，是因为之前改了别的路径，不小心commit了，重新改了回去

yghstill · 2026-03-17T15:25:15Z

@@ -0,0 +1,77 @@
+# coding=utf-8


注意引用规范，使用Tencent的声明
参考自其他库的在下方注明

yghstill · 2026-03-17T15:26:12Z

hadamard_utils.py‎这个文件行数是否有方法精简，近10w行文件会不会过大

这个文件行数多是因为有些特殊的shape（非二次幂），需要单独设置hadamard核，把hadamard核函数写到文件里了，所以看起来行数比较多。精简的话可能通用性降低

linchuanxie · 2026-03-18T08:28:45Z

+    embedding.weight.data = new_weight
+
+
+# [TODO] check this function correct or not


这种注释要不要删掉

linchuanxie · 2026-03-18T08:31:32Z

这里面的mapping是适用于某一个模型还是都要适用，是否需要扩充

gavingavin99 force-pushed the dev_rotation branch from eee33f4 to 2a0d11c Compare March 17, 2026 14:42

gavinlee added 2 commits March 17, 2026 22:58

fix typos

7456d06

fix typos

f945650

yghstill reviewed Mar 17, 2026

View reviewed changes

gavinlee added 4 commits March 18, 2026 10:51

fix typos

3d9c75f

refactor: compact hadamard matrix data to reduce line count

493e147

revert config

84e4d21

revert configs

c3e6ab1

yghstill approved these changes Mar 18, 2026

View reviewed changes

linchuanxie reviewed Mar 18, 2026

View reviewed changes

gavingavin99 merged commit 26567b2 into Tencent:main Mar 18, 2026
5 checks passed

linchuanxie reviewed Mar 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add SpinQuant offline rotation and integrate with PTQ pipeline#262

feat: add SpinQuant offline rotation and integrate with PTQ pipeline#262
gavingavin99 merged 7 commits into
Tencent:mainfrom
gavingavin99:dev_rotation

gavingavin99 commented Mar 16, 2026 •

edited

Loading

Uh oh!

yghstill Mar 17, 2026

Uh oh!

yghstill Mar 17, 2026

Uh oh!

gavingavin99 Mar 18, 2026

Uh oh!

yghstill Mar 17, 2026

Uh oh!

yghstill Mar 17, 2026

Uh oh!

gavingavin99 Mar 18, 2026

Uh oh!

linchuanxie Mar 18, 2026

Uh oh!

Uh oh!

linchuanxie Mar 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		embedding.weight.data = new_weight


		# [TODO] check this function correct or not

Conversation

gavingavin99 commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gavingavin99 commented Mar 16, 2026 •

edited

Loading