Make torch fused op compilable #2182

jiqing-feng · 2025-11-06T05:33:35Z

After this change, the torch fused op can combine with the whole model torch.compile. It could bring 4x speed-up when input_size > 128 on Intel Xeon CPU.

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

Qubitium · 2025-11-06T16:33:05Z

@jiqing-feng Wow! Awesome. Thank you!

jiqing-feng requested a review from Qubitium November 6, 2025 05:34

jiqing-feng added 2 commits November 6, 2025 10:17

add compile

1f62143

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

rm inside compile

39a35a5

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>

Qubitium merged commit 9d524ef into ModelCloud:main Nov 6, 2025
1 check passed

Qubitium approved these changes Nov 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make torch fused op compilable #2182

Make torch fused op compilable #2182

Uh oh!

jiqing-feng commented Nov 6, 2025 •

edited

Loading

Uh oh!

Qubitium commented Nov 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Make torch fused op compilable #2182

Make torch fused op compilable #2182

Uh oh!

Conversation

jiqing-feng commented Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Qubitium commented Nov 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jiqing-feng commented Nov 6, 2025 •

edited

Loading