New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
All Prometheus' targets are omitted if any single ServiceMonitor is configured incorrectly #380
Comments
Yes this is something that shoulnd be part of validating manifests at creation time. Sadly this is currently not possible with third party resources, and I'm hesitant to making gracefully failing on errors in the workqueue, because that will likely cause actual errors to be swallowed and never seen. Validation at time of creation will be possible one day though using user aggredated apiservers, which is one of the next things on our list to tackle. |
This issue has been automatically marked as stale because it has not had any activity in last 60d. Thank you for your contributions. |
still an issue with |
@alice-sawatzky your case looks like something that should be addressed by team responsible for helm charts as we are not maintaining them. |
this was an internal helm chart that i maintain. I fixed the bug just fine, the issue is that an invalid |
I agree that this is something that needs to be validated much more strictly. No single PrometheusRule should be able to take out alerting, and probably shouldn't be accepted in the first place. As in it shouldn't be possible to create in the first place. |
This issue has been automatically marked as stale because it has not had any activity in last 60d. Thank you for your contributions. |
This issue has been automatically marked as stale because it has not had any activity in last 60d. Thank you for your contributions. |
This should now be fixed with the validating webhooks. Should you use the validating webhooks and this still occurs, please open a new issue. |
We had:
...however, when a new ServiceMonitor was created with a port number used (rather than the port name) all targets are removed from Prometheus.
We understand port numbers are invalid (only names are valid), but the invalid configuration should not have cleared all targets in Prometheus.
FYI the error we saw in the prometheus-operator pod's logs was:
The text was updated successfully, but these errors were encountered: