Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Misuse of Docker API and misunderstanding of Ray HA cause test_ray_serve flaky #621

Closed
2 tasks done
kevin85421 opened this issue Oct 8, 2022 · 2 comments · Fixed by #650
Closed
2 tasks done
Assignees
Labels
bug Something isn't working

Comments

@kevin85421
Copy link
Member

Search before asking

  • I searched the issues and found no similar issues.

KubeRay Component

ci

What happened + What you expected to happen

See #620.

Reproduction script

See #620.

Anything else

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!
@kevin85421 kevin85421 added the bug Something isn't working label Oct 8, 2022
@kevin85421 kevin85421 self-assigned this Oct 8, 2022
@kevin85421
Copy link
Member Author

kevin85421 commented Oct 17, 2022

cc @jasoonn

The following links may be useful.

How does the docker container communicate with the k8s cluster?

  • NodePort Service: head pod <--> local
  • Docker (network_mode: host): container <--> local

You can try curl 127.0.0.1:10001 (valid port) in both your local laptop and the container. You will get curl: (56) Recv failure: Connection reset by peer. You can also run curl 127.0.0.1:10002 (invalid port), and you will get curl: (7) Failed to connect to 127.0.0.1 port 10002: Connection refused.

The following command enables you to run a single test in the test suite. In addition, both STDOUT (1) and STDERR (2) will be redirected to file log.

RAY_IMAGE=rayproject/ray:2.0.0 python3 tests/compatibility-test.py RayFTTestCase.test_ray_serve 2>&1 | tee log

@kevin85421 kevin85421 removed their assignment Oct 17, 2022
@jasoonn
Copy link
Contributor

jasoonn commented Oct 17, 2022

@kevin85421
Thanks a lot for the information. I will work on it!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants