adding fused uint4x2_mixed_mm to inductor #106516

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Summary: Test Plan: Reviewers: Subscribers: Tasks: Tags: cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 ipiszy ngimel yf225 chenyang78 kadeng muchulee8 aakhundov [ghstack-poisoned]

Summary: this is needed for int4 weight-only quantization, we're matching on the specific unpack operation that unpacks the uint4x2 into int4's so we can have a fused kernel for it. note, even if the user isn't specifically doing this, the two operations are mathematically equilvanet so it won't cause issues. Ideally at some point full prologue fusion for the mm arguments would be able to handle this chain but until then, this type of kernel is needed. Test Plan: python test/inductor/test_pattern_matcher.py -k "uint4x2" print test/inductor/test_torchinductor.py -k "uint4x2" Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Summary: this is needed for int4 weight-only quantization, we're matching on the specific unpack operation that unpacks the uint4x2 into int4's so we can have a fused kernel for it. note, even if the user isn't specifically doing this, the two operations are mathematically equilvanet so it won't cause issues (for some reason int8 bitwise logic in triton and pytorch doesn't match so that's the only exception). Ideally at some point full prologue fusion for the mm arguments would be able to handle this chain but until then, this type of kernel is needed. Test Plan: python test/inductor/test_pattern_matcher.py -k "uint4x2" print test/inductor/test_torchinductor.py -k "uint4x2" Reviewers: Subscribers: Tasks: Tags: [ghstack-poisoned]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adding fused uint4x2_mixed_mm to inductor #106516

adding fused uint4x2_mixed_mm to inductor #106516

Commits on Aug 3, 2023

Commits on Aug 10, 2023

Commits on Aug 11, 2023

Commits on Aug 14, 2023