Monitoring stack doesn't respect manager_agent.prometheus config parameter #1992

Closed
karol-kokoszka opened this issue May 26, 2023 · 0 comments · Fixed by #2000

This issue is connected to scylla-cloud deployments.

Scylla Manager Agent lets end users change the default port on which the agent exposes metrics for Prometheus scraping.
https://github.com/scylladb/scylla-manager/blob/e25e51487cae81fc04eb67fa5b957249a4ac6801/dist/etc/scylla-manager-agent.yaml#L31-L35

Scylla Cloud uses this setting to override the default port 5090 with the custom port 56090.

The targets that the Prometheus server scrapes agent metrics from are created by the monitoring stack, and it appears to always build the scrapeUrl with port 5090.
See:

 {
        "discoveredLabels": {
          "__address__": "*****",
          "__meta_filepath": "/etc/scylla.d/prometheus/scylla_servers.yml",
          "__metrics_path__": "/metrics",
          "__scheme__": "http",
          "__scrape_interval__": "20s",
          "__scrape_timeout__": "15s",
          "cluster": "*****",
          "dc": "*****",
          "instance": "*****",
          "job": "manager_agent",
          "publicIp": "*****",
          "serverExternalId": "*****"
        },
        "labels": {
          "cluster": "*****",
          "dc": "*****",
          "instance": "*****",
          "job": "manager_agent",
          "publicIp": "*****",
          "serverExternalId": "*****",
          "serverId": "*****"
        },
        "scrapePool": "manager_agent",
        "scrapeUrl": "http://*****:5090/metrics",
        "globalUrl": "http://*****:5090/metrics",
        "lastError": "Get \"http://*****:5090/metrics\": context deadline exceeded",
        "lastScrape": "2023-05-25T13:22:41.865990115Z",
        "lastScrapeDuration": 15.000340933,
        "health": "down",
        "scrapeInterval": "20s",
        "scrapeTimeout": "15s"
      },

As a result, the metrics exposed by the manager agent are not collected.
This is visible on the manager dashboard as well:

[image: screenshot of the manager dashboard]

Please read the scylla-manager-agent.yaml file to determine the port on which the manager agent exposes its metrics.
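
To illustrate the request, here is a rough sketch of how the port could be derived from the agent config instead of being hard-coded. It assumes the setting is a top-level `prometheus` key whose value is an address such as `':56090'` or `127.0.0.1:56090`, and that a missing or commented-out key means the default 5090. The function names `agent_metrics_port` and `scrape_url` are hypothetical and are not part of the monitoring stack.

```python
import re

DEFAULT_AGENT_METRICS_PORT = 5090  # default port mentioned in this issue

# Matches an uncommented top-level line such as: prometheus: ':56090'
# (assumed key name and value format; commented-out lines start with '#').
_PROMETHEUS_LINE = re.compile(r"^prometheus:\s*['\"]?([^'\"#\s]*)['\"]?")


def agent_metrics_port(config_text, default=DEFAULT_AGENT_METRICS_PORT):
    """Return the metrics port found in the agent config text, or the default."""
    for line in config_text.splitlines():
        match = _PROMETHEUS_LINE.match(line)
        if match:
            # The value looks like "[host]:port"; keep what follows the last colon.
            _, _, port = match.group(1).rpartition(":")
            if port.isdigit():
                return int(port)
    return default


def scrape_url(host, config_text):
    """Build the scrape URL for one agent using the configured port."""
    return f"http://{host}:{agent_metrics_port(config_text)}/metrics"
```

With a config containing `prometheus: ':56090'`, `scrape_url("10.0.0.1", config_text)` would point at port 56090 rather than 5090. The host here is a placeholder.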
