Skip to content

Commit

Permalink
chore(helm): add retry limit for ray worker process (#1031)
Browse files Browse the repository at this point in the history
Because

- Current ray worker will respawn indefinitely after non zero exit code

This commit

- add retry limit for ray worker process
  • Loading branch information
heiruwu committed Jul 2, 2024
1 parent 04b8c86 commit d511c0b
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions charts/core/templates/ray-service/ray-service.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -74,6 +74,8 @@ spec:
value: http://core-grafana:80
- name: RAY_PROMETHEUS_HOST
value: http://core-prometheus:9090
- name: RAY_TASK_MAX_RETRIES
value: "2"
- name: RAY_worker_register_timeout_seconds
value: "3600"
# - name: RAY_REDIS_ADDRESS
Expand Down Expand Up @@ -155,6 +157,8 @@ spec:
privileged: true
imagePullPolicy: {{ $.Values.rayService.image.pullPolicy }}
env:
- name: RAY_TASK_MAX_RETRIES
value: "2"
- name: RAY_worker_register_timeout_seconds
value: "3600"
lifecycle:
Expand Down

0 comments on commit d511c0b

Please sign in to comment.