New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Upgrading instance through webapp fails for the first time. #8502
Comments
Does this mean second attempt works or is it just broken? |
Second attempt just works fine without error. |
It's weird that second attempt works but not the first one. I will have a look at this. |
cc @nydr and @garethbowen - I'm seeing a lot of HTML being returned when JSON is expected in the video above. Suspect that #8179 will help a lot here, but not sure if addresses the root cause. |
There's also a curious http2 error which doesn't make sense to me. The http2 change doesn't exist in the tag so I'm assuming it's something spurious. We need to dig deeper into the actual logs to see what happened. |
Hah, I think there's a bit of a race condition here where, for a brief moment, some of the old containers are still running while others are down and unexpected errors happen. I think that the incoming change that updates how haproxy and nginx respond when services are down will maybe fix this. |
Hi @dianabarsan , is there a PR or issue for this? Can you link that here ? |
@yrimal |
Confirming that this still happens with 4.4.0 -> 4.4.1 so it's not fixed by the issue Diana cited, though if I'm patient it does just work eventually. I think the yellow warning is shown unnecessarily and the upgrade trigger happens correctly. |
The warning is displayed "optimistically" after 1 minute - because we believed that 1 minute is enough time for containers to stop and restart. This is clearly not true. |
A nice solution would be to check the health of the containers somehow, and validate we are indeed on the right versions, but right now I don't have a solution for this, especially since deployments are either in docker or k8s, we'd need to add an endpoint to some external service that API can reach. |
@dianabarsan I see the warning immediately, not after 1 minute, just like in @1yuv 's video. It's displayed behind the dialog but if you close the dialog it's there. |
For docker compose deployment, we do have access to the docker Unix socket, so we could tell container state as well as new version image download progress. But yeah, as you already said, - it'd only be for docker compose and not k*s. Since we're trying to migrate away from docker compose hosting - likely not worth pursuing. |
Yea, after further testing I think I know what is happening:
The warning is displayed because previous version of API goes down and up because another container (likely haproxy or healthcheck) is updated. |
- increases interval before warning the user that the upgrade might have issues - handles case where nginx sends a 502 - prevents upgrade interruption when state is completing #8502
- increases interval before warning the user that the upgrade might have issues - handles case where nginx sends a 502 - prevents upgrade interruption when state is completing #8502
Describe the bug
When upgrade is staged and installed from webapp interface, first installation fails with following message:
To Reproduce
Steps to reproduce the behavior:
Install
button.Expected behavior
Installation should be successful in first attempt.
Screen recording
Screen.Recording.2023-08-30.at.10.02.14.PM.mov
The text was updated successfully, but these errors were encountered: