
search: trace and observe each zoekt host #12516

Merged 2 commits into master from zoekt-tracing on Jul 28, 2020

Conversation

keegancsmith (Member) commented:
This commit moves tracing and observability from wrapping only the aggregated zoekt client to covering both the aggregated client and each individual zoekt replica, with a separate trace per replica.

To do this we add a hostname label to the Prometheus metric. We also change the category / family name to indicate whether a search is against a specific zoekt replica or is the aggregation.

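To make the shape of the change concrete, here is a minimal Go sketch of a request-duration histogram carrying a hostname label, roughly along the lines described above. The metric name, label names, and the observe helper are hypothetical illustrations, not the identifiers used in this PR.

```go
package search

import (
	"github.com/prometheus/client_golang/prometheus"
	"github.com/prometheus/client_golang/prometheus/promauto"
)

// requestDuration is a hypothetical per-host metric: the "hostname" label
// distinguishes each zoekt replica, and "category" distinguishes a search
// against a specific replica from the aggregation over all replicas.
var requestDuration = promauto.NewHistogramVec(prometheus.HistogramOpts{
	Name: "zoekt_request_duration_seconds",
	Help: "Duration of search requests to zoekt, labelled by host.",
}, []string{"hostname", "category"})

// observe records one request duration. For the aggregated client the
// hostname could be a sentinel value such as "aggregate".
func observe(hostname, category string, seconds float64) {
	requestDuration.WithLabelValues(hostname, category).Observe(seconds)
}
```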
keegancsmith requested review from uwedeportivo and a team on July 28, 2020 at 13:08.
pecigonzalo (Contributor) commented Jul 28, 2020:

In theory, each service should already have an identifier, since Prometheus adds one from service discovery. I'll verify.

Yeah, they already have instance. I think I'm getting this the other way around and this is the hostname of the target; is that the case?

uwedeportivo (Contributor) commented:

@pecigonzalo yes, correct

pecigonzalo (Contributor) commented:

I would be careful with this sort of metric, as it can create a cardinality explosion in the metrics DB. In most cases, I believe the downstream service should provide these metrics itself if possible, and this service should only report its own health (general latency to the upstream, etc.).

uwedeportivo (Contributor) commented:

@pecigonzalo Yes, you are right. You would be surprised by our labelling in general :-). This one is "only" a multiplier of 15 at the bigdata cluster; we have far worse offenders in our code base and need to clean up and reduce them at some point. This one is super useful for our current debugging situation with our bigdata customer.
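As a rough illustration of the multiplier being discussed: a Prometheus histogram with the default buckets exports 14 time series per label combination (11 buckets, +Inf, _sum, and _count), so a hostname label with 15 values multiplies the series for that metric by 15. The numbers below are assumptions for the example, not measurements from the cluster.

```go
package main

import "fmt"

func main() {
	// Illustrative cardinality arithmetic for one histogram metric.
	const (
		hosts          = 15 // zoekt replicas (the "multiplier of 15" above)
		seriesPerCombo = 14 // default buckets: 11 buckets + +Inf + _sum + _count
	)
	// Without a hostname label: 14 series. With it: 15 * 14 = 210.
	fmt.Println(hosts * seriesPerCombo) // prints 210
}
```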

uwedeportivo merged commit 5d25b0d into master on Jul 28, 2020.
uwedeportivo deleted the zoekt-tracing branch on July 28, 2020 at 16:20.