[serve] Add strict enforcement of max_concurrent_queries
#42947
Labels
enhancement
Request for new feature and/or capability
P1
Issue that should be fixed within a few weeks
ray 2.10
ray-team-created
Ray Team created
serve
Ray Serve Related Issue
Currently, enforcement of `max_concurrent_queries` is "best effort": race conditions allow replicas to exceed the limit.
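To illustrate what strict enforcement could look like, here is a minimal sketch in plain `asyncio`. It is not Ray Serve's implementation: `StrictReplica`, `handle`, and `run_demo` are hypothetical names, and it assumes the replica itself does the check and rejects excess requests instead of trusting the caller's count.

```python
import asyncio


class StrictReplica:
    """Hypothetical replica that enforces its own concurrency cap."""

    def __init__(self, max_concurrent_queries: int):
        self.max_concurrent_queries = max_concurrent_queries
        self.in_flight = 0
        self.peak = 0  # highest concurrency observed, for verification

    async def handle(self, request_id: int) -> bool:
        # The check and the increment have no `await` between them, so on a
        # single event loop no other request can interleave here.
        if self.in_flight >= self.max_concurrent_queries:
            return False  # rejected; the caller would retry on another replica
        self.in_flight += 1
        self.peak = max(self.peak, self.in_flight)
        try:
            await asyncio.sleep(0.01)  # stand-in for real request handling
            return True
        finally:
            self.in_flight -= 1


async def run_demo(num_requests: int = 10, limit: int = 3):
    replica = StrictReplica(limit)
    results = await asyncio.gather(
        *(replica.handle(i) for i in range(num_requests))
    )
    # Returns (peak concurrency, number of accepted requests).
    return replica.peak, sum(results)


if __name__ == "__main__":
    peak, accepted = asyncio.run(run_demo())
    print(f"peak={peak} accepted={accepted}")
```

Because the replica is the single source of truth for its in-flight count, concurrent senders with stale views cannot push it past the limit. The cost is that rejected requests need to be rescheduled elsewhere.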