New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[operator] Use of '~' hard-coding in ray code #14155
Comments
I may be able to hack around this by setting HOME on images, will experiment. On a related note, assuming ray has been installed via conda, or needs the use of |
@DmitriGekhtman what do you think - is the use of a Use of |
OK, a workaround exists: by setting A side-note, I had to install |
Thanks for the suggestions -- agreed that Ray code should make fewer assumptions on the existence of a home directory -- .bashrc, etc cc @ijrsvt Hmm...as for uptime in particular, I think the code path that calls it could be avoided with some restructuring... |
@erikerlandson Are you using the Ray Cluster Launcher with the |
The focus here is on the K8s operator, which does this under the hood |
@ijrsvt I am using the ray operator to create clusters. To see specifically what I'm doing, the relevant demo is here: (note I'm working purely with the ray 2.0 head of dev branch, including the new client/server connection) |
Hi, I'm a bot from the Ray team :) To help human contributors to focus on more relevant issues, I will automatically add the stale label to issues that have had no activity for more than 4 months. If there is no further activity in the 14 days, the issue will be closed!
You can always ask for help on our discussion forum or Ray's public slack channel. |
bump |
Should be solved at the same time as this one #16093 |
What is the problem?
There are some places in the ray code where the use of
~
for 'home directory' is hard-coded.This is a problem for ray images where no home directory has been created. It is also a potential problem for any image running in openshift, where images run using anonymous random UID, and no passwd entry will exist.
Ray version and other system information (Python version, TensorFlow version, OS):
ray 2.0 (dev branch)
Reproduction (REQUIRED)
Please provide a short code snippet (less than 50 lines if possible) that can be copy-pasted to reproduce the issue. The snippet should have no external library dependencies (i.e., use fake or mock data / environments):
run a ray cluster using
quay.io/erikerlandson/ray-ubi
as the image. It will fail when trying to executesource ~/.bashrc
because~
maps to/
and there is no such .bashrc file. For example, any command run via_with_interactive
will fail in this way. Possibly there are other places~
is used.One possible solution is that all appearances of
~
in the ray code are located and replaced by an environment variable, perhaps calledRAY_HOME
, but the name is unimportant.I suspect the "root" problem is assumption of the use of a home directory during the installation of ray on the image, which is where the references to
~/.bashrc
and similar come from. So possibly aRAY_HOME
environment variable is less the issue than simply documenting requirements for a ray image to operate: for example "construct your image such that ray is in the path" or better yet "by the time 'ray start' is run, ray must be in the path and the ray libs be in the python environment", for example I replace invocations ofray start...
withpipenv run ray start ...
in places where that is configured on YAML.The text was updated successfully, but these errors were encountered: