Adding a configurable scrapeTimeout for prometheus operator. #539
Conversation
Can one of the admins verify this patch?
Note that we will generally also run all of this on CI; however, we're having some trouble with our Jenkins instances right now. They should be working again soon. As a side note, a section in the README on developing might be helpful for future reference. Do you want to create a PR to add such a section 🙂?
Happy to. I'll see if I can get my minikube working today or tomorrow.
Cheers,
Gavin
On Aug 7, 2017 8:49 AM, "Frederic Branczyk" <notifications@github.com> wrote:
make test executes all unit tests, and you can execute the e2e tests on a
local minikube by compiling the static binary (which is what is used for
the container images) with make crossbuild and then build the container
image with the docker host from within minikube by running eval $(minikube
docker-env), then you can build the container using make container and then
finally run the e2e tests using make e2e-tests.
Note that generally we will also run all of this on CI, however, we're
having some trouble with our jenkins instances right now, should be working
soon again though.
As a side note a section in the readme on developing might be helpful for
future reference. Do you want to create a PR to add such a section 🙂 ?
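For reference, the quoted build-and-test workflow can be sketched as the following command sequence (this assumes a running minikube and the Makefile targets named in the comment above; exact target names may have changed since):

```shell
# Run all unit tests
make test

# Build the static binaries used for the container images
make crossbuild

# Point the Docker CLI at minikube's Docker daemon
eval $(minikube docker-env)

# Build the container image inside minikube
make container

# Finally, run the end-to-end tests against the local cluster
make e2e-tests
```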
My opinion so far has been that a scrape timeout as long as the scrape interval should generally be sane for all setups.
Yes -- in this case we're querying an appliance (vsphere) which is
responsible for 10s - 100s of machines. We're querying its SOAP API and it
takes about 20 seconds to pull down the complete metrics set.
Also, I've found similar issues when polling SNMP traps on some embedded
devices where the response simply takes tens of seconds. A universal
scrapeTimeout precludes collecting any such metrics.
Given these circumstances, I think it's pretty reasonable to give the
ServiceMonitor an opportunity to override the default for the managed
Prometheus service.
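As context for the override being discussed, a per-endpoint timeout in a ServiceMonitor might look roughly like the sketch below (the monitor name, labels, and port are hypothetical; `scrapeTimeout` on the endpoint is the kind of field this thread is about):

```yaml
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: vsphere-exporter      # hypothetical name
spec:
  selector:
    matchLabels:
      app: vsphere-exporter   # hypothetical label
  endpoints:
    - port: metrics           # hypothetical port name
      interval: 60s
      scrapeTimeout: 30s      # give slow SOAP/SNMP-backed targets time to respond
```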
…On Mon, Aug 7, 2017 at 11:14 AM, Fabian Reinartz ***@***.***> wrote:
My opinion so far has been that the scrape timeout being as long as the
scrape interval should generally be sane for all setups.
Can you elaborate on the exact use case that makes you want to set it
explicitly?
In that case I think the functionality from PR #537 might already be all we need. I'd personally also prefer not to bloat the Prometheus object itself.
Sure. That PR didn't exist when I began work on this branch. I'd be happy with either one getting merged. I'd argue, however, that making the API spec attribute name match the configuration attribute in Prometheus is a good idea.
Closing in favor of #537 |
Any chance of getting this functionality into the operator? It would help with setting a global timeout for the ServiceMonitors that don't specify one. A use case would be the many Raspberry Pi clusters where some scraping times out due to low performance. I can send a new PR addressing this. Cc. @geerlingguy @brancz
I'm not entirely opposed to it, but I do have to wonder how much it will actually help you, as ServiceMonitors/PodMonitors can specify scrape timeouts themselves, which will always take precedence.
Many targets don't have scrape_timeout definitions, unlike the core ones (kube-apiserver, kube-scheduler, kube-controller), so having the ability to change the default value is desirable. Opened #3250 to address this.
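A global default of the kind proposed here would sit on the Prometheus object itself. A rough sketch (the object name and concrete durations are hypothetical; per-monitor timeouts would still take precedence, as noted above):

```yaml
apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  name: k8s               # hypothetical name
spec:
  scrapeInterval: 30s
  scrapeTimeout: 25s      # default for ServiceMonitors that don't set their own;
                          # must not exceed scrapeInterval
```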
Hi CoreOS team,
I couldn't find a comprehensive developer's guide for verifying that this works. I figured I'd do a local build and test it out in our QA cluster. If there's something faster and easier than just running go test in the pkg/prometheus subdirectory, I'd love to hear about it. Cheers!
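For reference, the unit-test run mentioned above can be invoked from the repository root with something like the following (the package path is taken from the comment; the Makefile target is the one referenced earlier in this thread):

```shell
# Run just the prometheus package's unit tests
go test ./pkg/prometheus/...

# Or run the full unit test suite
make test
```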