Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FTDCS-33 added logs to failing graphiteSpike checks #170

Merged
merged 1 commit into from
Dec 15, 2021

Conversation

AniaMakes
Copy link
Contributor

Next-health has this health check: https://github.com/Financial-Times/next-health/blob/main/server/config/health-checks/platform.js. It is very flappy. The flaps are ephemeral - such at that if you click on "healthcheck source" in Heimdall immediately after it goes red, you'll get a "all healthy" source logs. As we can't see what numbers are triggering the flaps, it is really difficult - bordering on impossible - to fix the source of the problem. We are hoping that this log will help us see what is wrong when the flaps happen and act accordingly.

Q: Why just graphiteSpike, and not all of the checks?
A: This has a potential to be noisy and expensive. I would love to add this to all of the checks, but I want to use this one as a trial to see if it actually does give the information we want, and if that information is as useful as we hoped.

@AniaMakes AniaMakes requested a review from a team as a code owner December 15, 2021 12:11
Copy link
Member

@apaleslimghost apaleslimghost left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice, really good idea!

@AniaMakes AniaMakes merged commit e95ffbb into main Dec 15, 2021
@AniaMakes AniaMakes deleted the logs-on-graphite-spike-fail branch December 15, 2021 14:34
JSRedondo pushed a commit that referenced this pull request Jun 20, 2022
FTDCS-33 added logs to failing graphiteSpike checks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants