🚀 The feature, motivation and pitch
Exporting Qwen 3.5 MoE on x090 GPUs (using a 4090 as the experiment) runs out of memory (OOM).
One of the core causes is that aoti_compile duplicates some of the tensors on the GPU to preserve their original values during optimization.
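To illustrate why the duplication matters, here is a rough back-of-the-envelope sketch. The weight size and the duplicated fraction are hypothetical placeholders, not measurements from this export; only the 24 GiB capacity of the 4090 is a known figure. The helper names are made up for illustration.

```python
GIB = 1024 ** 3


def peak_gpu_bytes(weight_bytes: int, duplicated_fraction: float) -> int:
    """Estimate peak GPU memory when a fraction of the weights is held twice.

    Hypothetical model: the original tensors stay resident while copies of
    `duplicated_fraction` of them are kept alongside during optimization.
    """
    if not 0.0 <= duplicated_fraction <= 1.0:
        raise ValueError("duplicated_fraction must be between 0 and 1")
    return int(weight_bytes * (1.0 + duplicated_fraction))


def fits(weight_bytes: int, duplicated_fraction: float, capacity_bytes: int) -> bool:
    """Return True if the estimated peak stays within the GPU capacity."""
    return peak_gpu_bytes(weight_bytes, duplicated_fraction) <= capacity_bytes


capacity = 24 * GIB   # RTX 4090 memory capacity
weights = 18 * GIB    # hypothetical weight footprint, NOT a measured value

print(fits(weights, 0.0, capacity))  # no duplication: 18 GiB <= 24 GiB -> True
print(fits(weights, 0.5, capacity))  # half duplicated: 27 GiB > 24 GiB -> False
```

Under these assumed numbers, a model that fits comfortably on its own no longer fits once half of its tensors are duplicated, which is the kind of headroom loss described above.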
Alternatives
No response
Additional context
No response
RFC (Optional)
No response