Reduce kv_cache_free_gpu_mem_fraction for deepseek_r1_distill_qwen_32…

b188273
Merged

[https://nvbugs/6044213][chore] unwaive and reduce free mem ratio in AutoDeploy's perf test: deepseek_r1_distill_qwen_32b #12965
