Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Swarm.Tracker - fix start_pid_remotely retrying rapidly #129

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

darrenclark
Copy link

Noticed a lot of this message in our logs:

remote tracker on #{remote_node} went down during registration, retrying operation..

It seems to happen randomly, but I think this will fix it

@pirvudoru
Copy link

pirvudoru commented May 19, 2024

I've seen this also happen to us just today in our production cluster. There haven't been any deploys in like 3 weeks, and everything ran smooth until this happened.

When this happened we got ~11M logs in 2 hours of this retrying and not being able to fix itself. Restarted the pods and then everything got back to normal

We are running 2 pods on a k8s cluster. Lib is a dependency of https://github.com/commanded/commanded-swarm-registry

We are using swarm lib 3.4.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants