-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ValueError: After taking into account object store and redis memory usage, the amount of memory on this node available for tasks and actors (-0.01 GB) is less than 0% of total. You can adjust these settings with ray.init(memory=<bytes>, object store memory=<bytes> #11966
Comments
This is a great question! Usually these memory should be set automatically by ray. Did you set artificial values when you run ray.init? |
Also cc @richardliaw |
i set num_cpus=16 only |
@ericl Do you know what's the recommended setup? Also, what's your total memory @LianShuaiLong? |
I run it in Machine Learning Platform, I set 16 CPUS , 8GPUS, 48GB memory for all my training trials |
i find that ray can start successfully when i set no params for ray.init(),why? |
Hmm, so are you saying ray.init() # works
ray.init(num_cpus) # doesn't work ? |
yeap,it failed when i set num_cpus/_temp_dir/_memory/ |
since i get this error again, i reopen this issue |
Is there a way to get out of this error? I have upgraded the ray to 1.8 as well.. but still shows up with this issue. On trying for both ray.init() and ray.init(num_cpus) the error remains. |
What about if you do num_cpus=4 or sth? (Provide a kwarg instead of arg) |
It's still the same. Initially had the same setup. On debugging, it showed with the mentioned error. did an initial checkup with the node and resources. They are as follows:
|
Hmm can you also tell me the memory size of the machine/container you runs your jupyter on? |
Hi @rkooo567, |
Any documentation about correctly initializing Ray inside a docker container? Hitting same issue here. |
cc: @DmitriGekhtman |
After upgrading to Ray 2.0, issue is gone on my side. |
Same issue for me with ray 2.0.0. When calling |
When you don't specify the object store memory, it uses 20% of available memory. I think your machine doesn't have enough available memory (20% of available memory is even less than 80MB). You can manually specify object_store_memory to avoid this.
the minimal you should specify is Better solution is to use an instance that has more available memory. |
What is the problem?
when i run ray in ML platform,
occurs
can you tell me the approximate value of memory size /object store memory i should set ?
thanks
Ray version and other system information (Python version, TensorFlow version, OS):
Reproduction (REQUIRED)
Please provide a script that can be run to reproduce the issue. The script should have no external library dependencies (i.e., use fake or mock data / environments):
If we cannot run your script, we cannot fix your issue.
The text was updated successfully, but these errors were encountered: