No metrics displayed with Thanos engine ranged query (go_routines[1d]) in query distributes installation (Prometheus engine returns correct output) #8078
Comments
Hey there, @s0rl0v! I'm here to assist you with any bugs, questions, or contributions you might have. Let's tackle this together! The issue you're experiencing with Thanos not displaying metrics for range queries in distributed mode could be related to several factors specific to the Thanos engine's handling of distributed queries. Here are some potential areas to investigate:
By reviewing these configurations and settings, you might be able to identify the root cause of the issue with the Thanos engine in distributed mode.
The flag is set only for the global query; the local ones run without it.
The gateways are present in the Stores menu, and there are no errors in the query logs associated with store unavailability.
The replica label is set to "site"; Azure metrics have the value "azure", and Huawei ones have "huawei".
enabled
I'll check this out.
There are no known issues, I've used search before creating this issue. |
Could you click on the Endpoints tab and share a screenshot? |
@fpetkovski Sure! |
Hi there. Is there a chance this is related to this issue? |
I have this exact same issue.
I was using thanos-0.34.1, then upgraded to thanos-0.37.2, but I'm still facing the same issue. |
Are you using storage GW? If so what's your retention? |
Can you try this flag please: queryDistributedWithOverlappingInterval := cmd.Flag("query.distributed-with-overlapping-interval", "Allow for distributed queries using an engines lowest MinT.").Hidden().Default("false").Bool() |
I have pretty much the same problem, with the same versions of almost all components (Thanos and Prometheus are identical). |
My quick and dirty fix was to add 1s in time partitioning (I have 7 days retention in Receive):
Still, some permanent solution would be better |
The flag might be on main and soon in 0.38. Can you try with 0.38rc and the flag? |
@MichaHoffmann I can confirm the |
@MichaHoffmann I made a quick and dirty installation in our development cluster, and it looks like it does indeed fix this problem and makes my workaround redundant 👍 |
It's an interesting issue that I think we understand but that's tricky to fix automatically. I hope one day we won't need the flag anymore! |
@MichaHoffmann It can be documented for anyone that does not intend to use this flag. Since even |
Thanos, Prometheus and Golang version used:
Thanos - v0.37.2
Prometheus - v2.55.1
Golang - v1.23.4
Object Storage Provider:
Azure
Huawei OBS
What happened:
I've set up Thanos Query in distributed mode with the following stores (aka local queries) in config:
extraArgs:
The local queries have Thanos Gateway hostnames in their corresponding configurations (Prometheus installations with Thanos sidecars).
When I run a range query (e.g. go_threads[1d]) against the distributed query via the Thanos engine, it returns nothing; the Prometheus engine returns correct output.

Querying a local query directly also yields correct results with both the Thanos and Prometheus engines.

What you expected to happen:
The distributed query returns the same output with the Prometheus and Thanos engines.
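For anyone who wants to check this outside the UI, the comparison can be scripted. The following is only a rough sketch, not something taken from this setup: it assumes the global querier exposes the Prometheus-compatible `/api/v1/query` endpoint and accepts an `engine` request parameter with the values `prometheus` and `thanos` (please verify this against your Thanos version). `THANOS_QUERY_URL`, `instantQueryURL` and `countSeries` are hypothetical names made up for illustration.

```go
package main

import (
	"encoding/json"
	"fmt"
	"io"
	"net/http"
	"net/url"
	"os"
	"time"
)

// instantQueryURL builds an instant-query URL for a Prometheus-compatible API.
// Assumption: the querier honours an "engine" parameter to select the engine.
func instantQueryURL(base, expr, engine string) string {
	v := url.Values{}
	v.Set("query", expr)
	v.Set("engine", engine)
	return base + "/api/v1/query?" + v.Encode()
}

// queryResponse keeps only the fields needed to count returned series.
type queryResponse struct {
	Status string `json:"status"`
	Data   struct {
		ResultType string            `json:"resultType"`
		Result     []json.RawMessage `json:"result"`
	} `json:"data"`
}

// countSeries decodes a response body and returns the number of result entries.
func countSeries(body []byte) (int, error) {
	var r queryResponse
	if err := json.Unmarshal(body, &r); err != nil {
		return 0, err
	}
	if r.Status != "success" {
		return 0, fmt.Errorf("query status %q", r.Status)
	}
	return len(r.Data.Result), nil
}

func main() {
	// THANOS_QUERY_URL is a hypothetical variable pointing at the global querier.
	base := os.Getenv("THANOS_QUERY_URL")
	if base == "" {
		fmt.Println("set THANOS_QUERY_URL to run the comparison")
		return
	}
	client := &http.Client{Timeout: 30 * time.Second}
	for _, engine := range []string{"prometheus", "thanos"} {
		resp, err := client.Get(instantQueryURL(base, "go_threads[1d]", engine))
		if err != nil {
			fmt.Println(engine, "request failed:", err)
			continue
		}
		body, err := io.ReadAll(resp.Body)
		resp.Body.Close()
		if err != nil {
			fmt.Println(engine, "read failed:", err)
			continue
		}
		n, err := countSeries(body)
		if err != nil {
			fmt.Println(engine, "decode failed:", err)
			continue
		}
		fmt.Printf("engine=%s series=%d\n", engine, n)
	}
}
```

If the behaviour described above is present, the `thanos` line would report zero series while the `prometheus` line reports a non-zero count. Pointing the same script at a local query should show matching counts for both engines.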
How to reproduce it (as minimally and precisely as possible):
Full logs to relevant components:
no errors whatsoever
Anything else we need to know: