Fixed
- Strong partial-offload candidates no longer get buried under weaker full-GPU models because the final sort no longer counts GPU fit twice.
- Light partial offload is penalized less aggressively, while heavy dense offload still gets a strong discount.
- MoE partial-offload scoring now gives a milder penalty when the active working set can plausibly stay on GPU.