Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

QuartzHealthCheck never recovers after detecting failure #1496

Closed
bencyoung-Fignum opened this issue Jan 20, 2022 · 6 comments · Fixed by #1497
Closed

QuartzHealthCheck never recovers after detecting failure #1496

bencyoung-Fignum opened this issue Jan 20, 2022 · 6 comments · Fixed by #1497
Milestone

Comments

@bencyoung-Fignum
Copy link
Contributor

Describe the bug

The error count can only be incremented but is never cleared if things start working.

Version used

3.3.3

To Reproduce

Start a quartz app with the DB down and then start the DB. The health check always reports an error.

Expected behavior

I'd expect the health check to start reporting success when the DB comes up

@bencyoung-Fignum
Copy link
Contributor Author

I may be able to raise a PR if that's an acceptable route?

@lahma
Copy link
Member

lahma commented Jan 20, 2022

Sure, that would be great!

@bencyoung-Fignum
Copy link
Contributor Author

At the moment it seems to report unhealthy if there's any job errors whereas I'd expect it to be more of an infrastructural test? I.e. it's ok if jobs are failing but not if there are DB errors? Is there a way to tell the difference? The other option is to just check if the job store is healthy rather than listen for general errors?

@lahma
Copy link
Member

lahma commented Jan 20, 2022

I think it would be sufficient to check that scheduler is running and job store operates. A single small query should suffice, even trying to get non-existing item as long as error is what is expected.

@bencyoung-Fignum
Copy link
Contributor Author

I think this is done.

@bencyoung-Fignum
Copy link
Contributor Author

I've simplified and tested this now

@lahma lahma changed the title QuartzHealthCheck never recovers QuartzHealthCheck never recovers after detecting failure Mar 20, 2022
@lahma lahma modified the milestones: 4.0, vNext Mar 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants