-
Notifications
You must be signed in to change notification settings - Fork 363
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
java.lang.RuntimeException: Cloud size 1 under 2 #1739
Comments
Hi @BhushG, |
@mn-mikke Thanks for the quick reply. We already tried that. We have taken Iris dataset for testing which is just of a few KBs and allocated 5GB to executors as well as the driver but still, it did not work. |
What version of Sparkling Water do you use? Could you share a code snippet that you tried to run? |
@mn-mikke We have tried these versions of Sparkling water: 3.28.0.1-1-2.4 and 3.26.11-2.4 on spark 2.4. scala version: 2.11.8 Here is the code snippet: def main(args: Array[String]): Unit =
} |
@mn-mikke we are using internal cluster mode and I've in fact set spark.dynamicAllocation.enabled to false |
@jakubhava Hi.. Is there any solution to this exception? Sometimes the model gets trained on cluster but when I deploy same model for same dataset on cluster, it fails with cloud size 0 under 2. I appreciate your help. |
@jakubhava @mn-mikke Is there any solution to this? or shall I use External backend? |
I'm also not able to start the External backend. Created new issue: #1759 |
Can you share the full YARN logs ( executors, driver)? We have fixed various clouding issues in the upcoming release 3.28.0.3 and I would like to verify if this issue is one of them. |
I am getting the same issue and have sent the full logs on the gitter channel. Thank you. |
Yes, this issue will be fixed in the upcoming 3.28.0.3 release |
Sparkling Water 3.28.0.3 is released which fixes the clouding issues mentioned above:
If you bump into any new issues, please create new or feel free to reopen this issue. |
Hi, I'm getting this exception when I'm executing the job on the YARN cluster. There is no problem executing same job on a local machine.
I've tried all of these settings: http://docs.h2o.ai/sparkling-water/2.1/latest-stable/doc/configuration/internal_backend_tuning.html , but still couldn't resolve this exception.
Here are the logs:
The text was updated successfully, but these errors were encountered: