
Conversation

@Qubitium (Collaborator) commented on Oct 1, 2025

@avtc Refactored code. May resolve your Q.to() crash.

Signed-off-by: Qubitium <Qubitium@modelcloud.ai>
@Qubitium Qubitium merged commit ec31f3d into main Oct 2, 2025
4 checks passed
@Qubitium Qubitium deleted the fix-q-to branch October 2, 2025 00:45
@avtc (Contributor) commented on Oct 2, 2025

Confirmed, this PR fixed the issue with Q.to().

@Qubitium (Collaborator, Author) commented on Oct 2, 2025

> Confirmed, this PR fixed the issue with Q.to().

Finally! A thorn in my backside finally plucked. You can pull main for a GPTQ quantization quality fix. You can now properly increase batch_size as high as your GPU can manage to raise throughput at the expense of more VRAM usage.

Each GPU/model pair has its own sweet spot for batch_size; there is no one-size-fits-all setting. A larger batch is sometimes slower, but not always.
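For reference, a minimal sketch of where that knob sits, assuming the GPTQModel `quantize()` call accepts `batch_size` as this thread describes. The model id, save path, and toy calibration strings below are placeholders, not part of this PR:

```python
from gptqmodel import GPTQModel, QuantizeConfig

# Placeholder model id and save path; substitute your own.
model_id = "meta-llama/Llama-3.2-1B"
quant_path = "Llama-3.2-1B-gptq-4bit"

# Toy calibration data; a real run needs a few hundred representative samples.
calibration_dataset = [
    "GPTQ quantizes weights layer by layer against calibration activations.",
    "Raising batch_size trades VRAM for quantization throughput.",
]

quant_config = QuantizeConfig(bits=4, group_size=128)
model = GPTQModel.load(model_id, quant_config)

# batch_size is the sweet-spot knob from the comment above: raise it until
# you approach the GPU's VRAM ceiling, then benchmark, since a bigger batch
# is not always faster.
model.quantize(calibration_dataset, batch_size=4)
model.save(quant_path)
```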

