-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Listener not aware of pending work after restart/reinstall #4027
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Hey, @katarzynainit, If you schedule another workflow, you should be able to get the ones in the queue as well. It shouldn't present a problem for busy scale sets, but it is an issue on the back-end side that should be live soon. I'll keep this one open here for visibility, and I will notify you when the fix is done. |
Thank you, awaiting your update :) |
Just to add additional context here: Seeing the same on |
Hey @WyriHaximus, Can you also send the log so we can correlate what was going on with your scale set as well as with @katarzynainit. This way, we can inspect both situations and see if they have any similarities in building up the job message. And @katarzynainit, in the log you provided, you redacted the information related to the org and the scale set. If this information needs to stay private, please reach out to the support because we will need this information to see activity on your scale set and properly debug it. Thanks! |
@nikola-jokic Will do when I get home. But until then, I can give you the queue graph I've been using (it does the following query And this is from private repo Jobs have been queue for 12 hours at this point. |
@nikola-jokic I have provided full logs via our support channel. Adding additional finding to the ticket:
|
Here are the logs @nikola-jokic chaos.log. Looks like it restarted two and a half hours ago but didn't process anything. |
Checks
Controller Version
0.9.3
Deployment Method
Helm
Checks
To Reproduce
Describe the bug
The listener is not aware of the work and runners needed, if the work was scheduled when listener was offline.
After cancel/rerun on the job all works fine.
Describe the expected behavior
The work should be picked up by the listener once it is online, no matter if it was scheduled when listener was up or not.
Additional Context
I use fork, no changes in the listener code.
Newest runner image in use.
Controller Logs
Runner Pod Logs
The text was updated successfully, but these errors were encountered: