strays/missing on sr3 status during start or stop #1069

Open
petersilva opened this issue May 27, 2024 · 1 comment
Labels
bug: Something isn't working
enhancement: New feature or request
ReliabilityRecovery: improve behaviour in failure situations.

Comments

@petersilva
Contributor

There is a window of time, while "stop" and "start" operations are in progress, during which the status of instances is reported incorrectly. During a stop, instance processes have been killed but their pid files have not yet been removed, so the instances are marked as missing. During a start, instance processes have been launched but have not yet written their pid files, so they cannot be claimed and are marked as strays.

As a result, if sr3 sanity runs while a flow is being stopped, it restarts the flow rather than allowing the stop to complete. During startup, sr3 sanity can kill instance processes that are marked stray only because they have not fully initialized yet, and then restart those instances.

Sub-optimal.
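
To make the race concrete, here is a minimal sketch of how a status check could tell a transitional state apart from a real missing or stray instance. This is not sr3's actual code: `InstanceView`, `classify`, `GRACE_SECONDS`, and the idea of recording when a start or stop was requested are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical grace period: how long after a start/stop request
# an inconsistent pid-file/process pairing is tolerated.
GRACE_SECONDS = 30.0


@dataclass
class InstanceView:
    """What a status check can observe about one instance (hypothetical)."""
    pid_file_exists: bool
    process_alive: bool
    # Seconds since a start or stop was requested; None if none was recorded.
    seconds_since_transition: Optional[float] = None


def classify(view: InstanceView, grace: float = GRACE_SECONDS) -> str:
    """Classify an instance, treating recent start/stop requests as transitional."""
    in_transition = (
        view.seconds_since_transition is not None
        and view.seconds_since_transition < grace
    )
    if view.pid_file_exists and view.process_alive:
        return "running"
    if view.pid_file_exists:
        # Process is gone but the pid file remains: mid-stop, or truly missing.
        return "stopping" if in_transition else "missing"
    if view.process_alive:
        # Process exists but has not written its pid file: mid-start, or a stray.
        return "starting" if in_transition else "stray"
    return "stopped"
```

Under these assumptions, a sanity pass would act only on instances classified as `missing` or `stray` and leave `starting` and `stopping` instances alone until the grace period expires.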

@petersilva added the bug, enhancement, and ReliabilityRecovery labels on May 27, 2024
@petersilva
Contributor Author

Left over from #1067.

@petersilva changed the title from "starting and stopping states" to "strays/missing on sr3 status when starting and stopping" on May 31, 2024
@petersilva changed the title from "strays/missing on sr3 status when starting and stopping" to "strays/missing on sr3 status while start or stop in progress" on May 31, 2024
@petersilva changed the title from "strays/missing on sr3 status while start or stop in progress" to "strays/missing on sr3 status during start or stop" on May 31, 2024
@petersilva reopened this on Oct 31, 2024