Open
Description
The LoRA configuration for the CPU-based deployment and GPU-based deployment differ. Update the CPU-based deployment to match that of the GPU deployment, e.g., one LoRA adapter named food-review-1
and "--max-loras"
"2"
.
@nirrozenbaum WDYT about removing --max-cpu-loras
in the GPU deployment?