Skip to content

[Serve.llm][P/D] Support separate deployment config for PDProxy in Prefill disagg #53821

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 2 commits into from
Jun 16, 2025

Conversation

kouroshHakha
Copy link
Contributor

This PR is simple. It adds a separate configuration for controlling deployment config of PDProxy

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
@kouroshHakha kouroshHakha added the go add ONLY when ready to merge, run all tests label Jun 14, 2025
@kouroshHakha kouroshHakha marked this pull request as ready for review June 14, 2025 00:42
@Copilot Copilot AI review requested due to automatic review settings June 14, 2025 00:42
@kouroshHakha kouroshHakha requested a review from a team as a code owner June 14, 2025 00:42
Copilot

This comment was marked as outdated.

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
@kouroshHakha kouroshHakha requested a review from Copilot June 14, 2025 00:46
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces a dedicated deployment configuration for the PDProxy component in the prefill-decode disaggregated flow, allowing users to pass custom Ray Serve options.

  • Added a new proxy_deployment_config field to PDServingArgs for arbitrary Ray Serve deployment parameters.
  • Forwarded proxy_deployment_config through parse_args.
  • Applied the custom options in build_app via .options(**pd_config.proxy_deployment_config) on the PDProxyServer deployment.
Comments suppressed due to low confidence (2)

python/ray/llm/_internal/serve/deployments/prefill_decode_disagg/prefill_decode_disagg.py:42

  • [nitpick] Consider expanding this description to list common keys (e.g., num_replicas, ray_actor_options) or link directly to the Ray Serve deployment options docs for easier discoverability.
The Ray @server.deployment options for the proxy server.

python/ray/llm/_internal/serve/deployments/prefill_decode_disagg/prefill_decode_disagg.py:205

  • Add unit or integration tests to verify that various proxy_deployment_config parameters (e.g., replicas, resource tags) are correctly applied and take effect on the PDProxyServer deployment.
.options(**pd_config.proxy_deployment_config)

@kouroshHakha kouroshHakha enabled auto-merge (squash) June 16, 2025 22:07
@kouroshHakha kouroshHakha merged commit 9fb4510 into ray-project:master Jun 16, 2025
6 checks passed
elliot-barn pushed a commit that referenced this pull request Jun 18, 2025
…efill disagg (#53821)

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
rebel-scottlee pushed a commit to rebellions-sw/ray that referenced this pull request Jun 21, 2025
…efill disagg (ray-project#53821)

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
Signed-off-by: Scott Lee <scott.lee@rebellions.ai>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
go add ONLY when ready to merge, run all tests
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants