Open
Description
https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/config/manifests/vllm/cpu-deployment.yaml has a couple instances where it references llama
but we use Qwen for this deployment. We should update these names