Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DO / helm: Can't Upgrade #6025

Closed
1 of 3 tasks
boldtrn opened this issue Sep 20, 2021 · 4 comments
Closed
1 of 3 tasks

DO / helm: Can't Upgrade #6025

boldtrn opened this issue Sep 20, 2021 · 4 comments
Assignees
Labels
bug Something isn't working right

Comments

@boldtrn
Copy link

boldtrn commented Sep 20, 2021

Bug description

I followed the upgrading instructions for DO/helm

helm upgrade -f values.yaml --timeout 20m --namespace posthog posthog posthog/posthog --atomic

Error: UPGRADE FAILED: release posthog failed, and has been rolled back due to atomic being set: post-upgrade hooks failed: job failed: BackoffLimitExceeded

When I tried to rerun the upgrade with a higher timeout value, as I thought that might have been the issue

helm upgrade -f values.yaml --timeout 120m --namespace posthog posthog posthog/posthog --atomic

Error: UPGRADE FAILED: release posthog failed, and has been rolled back due to atomic being set: post-upgrade hooks failed: warning: Hook post-upgrade posthog/templates/migrate.job.yaml failed: jobs.batch "posthog-migrate" already exists

Expected behavior

Run the latest version.

Environment

  • PostHog Cloud
  • self-hosted PostHog (ClickHouse-based), version/commit: please provide
  • self-hosted PostHog (Postgres-based, legacy), version/commit: please provide
@boldtrn boldtrn added the bug Something isn't working right label Sep 20, 2021
@jamesefhawkins
Copy link
Collaborator

Hey @tiina303 I thought maybe you could help here?

@tiina303
Copy link
Contributor

Hi @boldtrn

The second error about the migrate job happens because it was left around from the previous upgrade attempt, we just need to kill that job before running the upgrade again kubectl delete job posthog-migrate -n posthog.

Regarding to why the initial upgrade failed I'm would need a bit more information:

  1. What chart version are you currently on helm get manifest posthog | grep "HELM_INSTALL_INFO" -A1 (maybe it's a major upgrade that needs some manual steps to be performed too & that's why the upgrade failed)?
  2. While you were running the upgrade did any of the pods fail/restart loop, what were the logs in those pods, were any pods unschedulable what was the kubectl get pod <pod> -o yaml output for those?

@boldtrn
Copy link
Author

boldtrn commented Sep 22, 2021

@tiina303 thanks a lot for the additional help.

Running kubectl delete job posthog-migrate -n posthog and then re-running the upgrade again fixed the issue.

Running helm get manifest posthog returns Error: release: not found. Not sure if this indicates a potential issue?

I will close this issue, as the actual problem is resolved, but I would be happy to provide some more information to find out the underlying issue.

@boldtrn boldtrn closed this as completed Sep 22, 2021
@tiina303
Copy link
Contributor

tiina303 commented Sep 22, 2021

Running helm get manifest posthog returns Error: release: not found. Not sure if this indicates a potential issue?

Sorry I forgot the -n posthog flag, there release is typically installed in the posthog namespace, but if the issue is fixed that's not needed, I was trying to be proactive about further investigation in case it wasn't.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working right
Projects
None yet
Development

No branches or pull requests

3 participants