ramtorch: percentage-based offload fix for text encoder moving to CPU and back inadvertently causing device mismatch error #2525

bghira · 2026-01-29T16:09:59Z

This pull request makes targeted improvements to how text encoders and linear layers are managed when using RamTorch, particularly around device placement and partial replacement scenarios. The changes ensure better compatibility and correctness when RamTorch is enabled, especially in mixed-device and percentage-based replacement cases.

Device management and RamTorch integration:

Improved logic in move_text_encoders (in factory.py) to always skip moving text encoders when RamTorch is enabled, regardless of the target device, to support both partial and full RamTorch setups.

Partial RamTorch replacement handling:

In replace_linear_layers_with_ramtorch (in ramtorch.py), added logic to track remaining eligible layers when using percentage-based replacement, ensuring only the correct subset is replaced.
Enhanced handling so that, after percentage-based replacement, any remaining eligible nn.Linear layers (or those matching specific patterns) are moved to the resolved device if they are still on CPU, maintaining device consistency.

… and back inadvertently causing device mismatch error

bghira added 2 commits January 29, 2026 10:09

ramtorch: percentage-based offload fix for text encoder moving to CPU…

e46cb88

… and back inadvertently causing device mismatch error

skip CUDA tests without GPU

0cb7d81

bghira merged commit 5638c97 into main Jan 29, 2026
2 checks passed

bghira deleted the bugfix/ramtorch-percentage-text-enc branch January 29, 2026 18:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ramtorch: percentage-based offload fix for text encoder moving to CPU and back inadvertently causing device mismatch error #2525

ramtorch: percentage-based offload fix for text encoder moving to CPU and back inadvertently causing device mismatch error #2525

Uh oh!

bghira commented Jan 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ramtorch: percentage-based offload fix for text encoder moving to CPU and back inadvertently causing device mismatch error #2525

ramtorch: percentage-based offload fix for text encoder moving to CPU and back inadvertently causing device mismatch error #2525

Uh oh!

Conversation

bghira commented Jan 29, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants