-
Notifications
You must be signed in to change notification settings - Fork 14.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Scheduler frequently stops scheduling tasks/dags #698
Comments
What do the scheduler logs look like? The scheduler logs are very verbose, and you should clearly see where it's stuck. We restart it every 5 runs using |
The logs aren't really giving me the detail I need. I'll lower my num_runs from 6000 to 10 and see how that does. Also, I've a branch with more debug logging in the works. I need to test it before I open the PR. |
@mtustin-handy does this issue still occur? Please create Jira if so. |
I'm using one of the versions after 1.6 (specifically commit e1f9a28), and this problem is happening more than once a day. It has happened in all versions I have used. Right now I have
--num_runs=600
set on the scheduler, but that hasn't helped; the problem appears to occur before the scheduler can hit that count.I also use the CeleryExecutor, backed by redis for celery message passing and results, and mysql as the database.
For some reason the scheduler frequently gets into a state where it doesn't schedule new dagruns or tasks. I haven't yet investigated whether it's failing to schedule tasks for existing dag runs also (I suspect that it is).
The text was updated successfully, but these errors were encountered: