Skip to content

fix(vllm-router): allow using prefill-decode for subset of models (by…

4d1dca0
Select commit
Loading
Failed to load commit list.
Merged

fix(vllm-router): allow using prefill-decode for subset of models (by checking labels) and add a fallback routing strategy #3

fix(vllm-router): allow using prefill-decode for subset of models (by…
4d1dca0
Select commit
Loading
Failed to load commit list.