-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[serve] Add default configuration for autoscaling that works out of the box #42613
Comments
Specifically, this means that setting: |
One issue here is the default for As a temporary fix, when Open question: what should the defaults be? To start, let's go with |
Plan is:
|
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes #42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com>
Add default configuration for autoscaling that works out of the box. This can be used by setting `num_replicas="auto"`. Relationship between `num_replicas` and `autoscaling_config`: - If `num_replicas="auto"` is set without setting `autoscaling_config`, a default autoscaling configuration that works out the box will be used. - If `num_replicas="auto"` and `autoscaling_config` are both set, then the fields in `autoscaling_config` will override that of the default autoscaling configuration used by `num_replicas="auto"`. - If `num_replicas` is not set and `autoscaling_config` is set, the behavior doesn't change. Behavior between `num_replicas` and `max_concurrent_queries`: - If `num_replicas="auto"` and `max_concurrent_queries` is unset, the max concurrent queries will be overrided to a new default (5). - If `num_replicas="auto"` and `max_concurrent_queries` is manually configured, nothing is modified. Since `num_replicas="auto"` is a new API, there is no migration plan. Closes ray-project#42613 Signed-off-by: Cindy Zhang <cindyzyx9@gmail.com> Signed-off-by: tterrysun <terry@anyscale.com>
Provide a default configuration for autoscaling that works out of the box.
The text was updated successfully, but these errors were encountered: