Skip to content

feat: add multi-deployment routing and num_replicas support#13

Merged
alez007 merged 1 commit intomainfrom
feat/multi-deployment-routing
Apr 11, 2026
Merged

feat: add multi-deployment routing and num_replicas support#13
alez007 merged 1 commit intomainfrom
feat/multi-deployment-routing

Conversation

@alez007
Copy link
Copy Markdown
Owner

@alez007 alez007 commented Apr 11, 2026

Allow the same model name to appear multiple times in config with different settings (e.g. GPU + CPU). The gateway round-robins requests across all deployments sharing a name. Each deployment also supports num_replicas for scaling identical copies via Ray Serve.

Allow the same model name to appear multiple times in config with
different settings (e.g. GPU + CPU). The gateway round-robins requests
across all deployments sharing a name. Each deployment also supports
num_replicas for scaling identical copies via Ray Serve.
@alez007 alez007 merged commit 13e0813 into main Apr 11, 2026
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant