-
Notifications
You must be signed in to change notification settings - Fork 110
A Few Issues found on Kubernetes #27
Comments
Hi @ericxu10101 , let me reply those questions inline:
That's true. Currently k8s jobs only work in 'default' namespace. It's a limitation of the current version. I'm planning to add that in the next version. Or if you have time, you can submit a PR for that.
The pods failed most likely due to the daemon thread inside each worker pod which terminates the worker when it lose connection to the master. It' doesn't mean the whole job failed, it only means the worker exited in a special way. And it shouldn't affect the final result (If you notice something otherwise, please create a new issue for it). I'm planning to address this issue in the next version to make sure the exit status of worker pods can be correctly set.
Currently 'name' is required for Fiber on K8s to work. Do you have a use case for 'generateName'? If it's general enough, it can be supported later. |
@ericxu10101 issues 1 and 2 have been fixed in #29 and #28. Feel free to install the newest version from the current master branch and test it out. Regarding issue 3, do you have a further explanation of the details of the issue? |
@calio thanks a lot. That's awesome ! 3 is not actually an issue, just wanted to confirm the behavior. So seems like giving an explicit name instead of using 'generateName' is the current limitation. I think supporting 'generateName' probably can be a future feature. It would be handy when scheduling the Job. Feel free to close this issue. And let me know where and how to track feature requests. Thanks. |
Hi @ericxu10101 , currently all features are tracked with Github issues. You can create a new issue with the tag "enhancement" and can track progress over there. |
I'm getting urllib3 unable to establish connection in process.py with pi-estimation example. I am unable to resolve it. Tried giving all the permission but nothing seems to work. And also I'm getting this via running fiber cli.
Anyone have idea what is happening here? Is it a bug? |
I am following k8s part on https://uber.github.io/fiber/getting-started/ and realized the following issues:
seems like k8s 'Job' only works in 'default' namespace ?
when I try on different namespace, the master pod keep failing and recreate.
poolwork pods terminate with 'Failed' status, while master pod returns 'Success'. Any way to address that ?
It looks like the k8s 'Job' must have explicit 'name' instead of 'generateName', otherwise master pod throws 'Pod not found' error. Is it known issue ?
Thanks
The text was updated successfully, but these errors were encountered: