--exit-when-healthy does not wait for the node to join the cluster first #15818
Comments
Thanks, will give it a try with --watch in the meantime :-)
As per redpanda-data/redpanda#15818 it also requires --watch to function properly.
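For anyone scripting the same restart sequence, here is a minimal sketch of the workaround discussed above: call the health check with both --watch and --exit-when-healthy, and only then take the node out of maintenance mode. The helper names, the injectable `run` parameter, the 300 second timeout, and the assumption that rpk exits with status 0 once the cluster is healthy are all illustrative and not taken from the rpk docs or from redpanda.nix.

```python
import subprocess
from typing import Callable, List

# Flags as discussed in this thread: --exit-when-healthy is combined with --watch.
HEALTH_CMD: List[str] = ["rpk", "cluster", "health", "--watch", "--exit-when-healthy"]


def wait_until_healthy(
    timeout_s: float = 300.0,
    run: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> bool:
    """Block until the health command exits; False on timeout or non-zero exit.

    Assumption: rpk exits with status 0 once it has seen a healthy cluster.
    `run` is injectable (hypothetical design choice) so this can be tested without rpk.
    """
    try:
        result = run(HEALTH_CMD, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return False
    return result.returncode == 0


def leave_maintenance(
    node_id: int,
    run: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> bool:
    """Disable maintenance mode for node_id, but only after the cluster is healthy."""
    if not wait_until_healthy(run=run):
        return False
    result = run(["rpk", "cluster", "maintenance", "disable", str(node_id)])
    return result.returncode == 0
```

Calling `leave_maintenance(1)` right after restarting the service would replace the sleep-then-check sequence, on the condition that --watch really does keep rpk waiting until the node has rejoined.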
Thanks @r-vasquez. @layus let us know if you hit any other issues or --watch doesn't work.
@dotnwat As you can see in fornybar/redpanda.nix#32, adding |
@r-vasquez friendly ping regarding the above message ^
Hello @layus, is it possible to get the rpk logs as well? I was looking into the logs but the failure doesn't seem to be the typical rpk error
Which may indicate that rpk is indeed waiting. But I'm not certain which specific test failure I should look into. Maybe it is:
I also tried reproducing the rpk wait time with a simple script using
But it works, it waits for the cluster to be healthy before finishing:
Let me know if I'm missing something from the logs or if I am looking at the wrong error here 😅
@r-vasquez Thank you for your detailed review. The error is related to a VM crashing, probably due to lack of RAM. Redpanda seems okay now with the --watch flag 😀.
Version & Environment
Redpanda version (rpk version): 23.2.17
Operating system (/etc/os-release): NixOS

What went wrong?
We had to insert a sleep instruction after restarting redpanda to let it join (or at least start joining) its cluster. Otherwise it would happily return immediately from --exit-when-healthy and apply maintenance disable with no effect.

What should have happened instead?
I would have expected rpk cluster health --exit-when-healthy to wait for the node to join the cluster before testing whether the cluster is healthy.

How to reproduce the issue?
See above snippet, without sleep.

Additional information
https://github.com/fornybar/redpanda.nix and #nix channel on slack