New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
High Concurrency Lock / Not In Active State Errors #1028
Comments
Have you checked if you have many stalled jobs? Maybe when you have so much concurrency the CPU is being saturated and is not able to keep up with maintaining the locks, then the jobs will stall and move back to wait, if the original job that was stalled continues to work then it will eventually fail with that missing lock error since it does not own the lock anymore. The jobs may have completed already by another worker anyway. |
So basically the recommendation here would be to reduce the concurrency per worker and instead add more physical workers. |
Yah we’re going to take a closer look glad to see we were in the right track |
could this have been related? We still get these even only very low CPU (on my local machine with 5 concurrency and ~50 jobs) |
@lukepolo Are you using flows or just standard jobs? any chance to produce a simple test case that reproduces the issue so that we can take a deeper look at it? |
Yup I’ll make a repo to recreate it this week |
We are getting a ton of errors when we have a high amount of jobs and a high number of concurrency : (600+)
Do we know why these can happen, is there anything I can do to prevent this from happening?
The text was updated successfully, but these errors were encountered: