Hey guys,
Is device mapping supported, i.e. automatically splitting a model between host (CPU) memory and the GPU? I'm asking especially about Qwen models.