SelectMerge latency is high #689

kolesnikovae · 2023-05-12T11:51:10Z

Most of the queries spend significant time after all the samples were fetched and deduplicated. We need to figure out and fix what causes the latency.

For example, phlare-querier SelectMergeStacktraces:

Solving the problem may result in a very significant decrease in the overall query latencies (up to 50%).

It’s very likely that this is caused by pulling and merging resolved stacks from the ingesters: the size of the payload may be quite big. If this is the case, we may want to find a way to reduce it, e.g. by stack truncation:

We may want to optimize the representation of results in this API: notice that stacks take appx. 50MB (encoded in protobuf). We should consider building a truncated tree (w/o insignificant nodes) as close to the storage as possible, instead of passing an array of stack traces along the way. That would also decrease CPU time of the query execution, and allocations as well:

kolesnikovae · 2023-05-12T12:25:19Z

Also related: grafana/pyroscope#2107

cyriltovena · 2023-05-15T14:43:52Z

Based on our discussions I think this is a great idea to use an opaque format for the internal API. We should use the Pyroscope tree package https://github.com/grafana/pyroscope/blob/254c5759900d3a2da6e6dfaf4ef2767c05cb45bb/pkg/storage/tree/tree.go between query-frontend - querier - ingester and store-gateway.

kolesnikovae · 2023-05-24T10:46:13Z

#702 reduces the query duration 2-3 times in certain cases:

As a bonus, resource consumption also decreased:

Closing the issue for now

kolesnikovae added kind/performance area/database labels May 12, 2023

kolesnikovae self-assigned this May 12, 2023

kolesnikovae removed the area/database label May 12, 2023

kolesnikovae mentioned this issue May 18, 2023

Use pyroscope tree representation in MergeProfilesStacktraces API #702

Merged

kolesnikovae closed this as completed May 24, 2023

kolesnikovae added the area/database label Jul 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SelectMerge latency is high #689

SelectMerge latency is high #689

kolesnikovae commented May 12, 2023 •

edited

kolesnikovae commented May 12, 2023

cyriltovena commented May 15, 2023

kolesnikovae commented May 24, 2023

SelectMerge latency is high #689

SelectMerge latency is high #689

Comments

kolesnikovae commented May 12, 2023 • edited

kolesnikovae commented May 12, 2023

cyriltovena commented May 15, 2023

kolesnikovae commented May 24, 2023

kolesnikovae commented May 12, 2023 •

edited