
Loki: Cache extracted labels #75842

Merged

Conversation


@gtk-grafana gtk-grafana commented Oct 2, 2023

What is this feature?
Adds a simple Map to cache the previous results of Loki data-sample queries in the Loki language provider.

Why do we need this feature?
The loki-data-samples queries are currently triggered directly from the Monaco completion callbacks rather than from the React/UI application layer, so certain keystrokes in certain contexts trigger another query. When that query takes a long time to complete, the editor UX is very sluggish and difficult to work with. Since the values returned by this query are extracted label values, which are not expected to change from second to second, the solution proposed here is to cache the requests and serve "stale" labels, so the user doesn't have to wait for an API request every time they press the spacebar or a comma inside a stream selector.

TL;DR:
To prevent duplicate API calls when fetching labels for autocomplete, we cache two unique query strings and their associated labels.
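
For illustration, here is a minimal sketch of the idea (the names and signatures below are my own, not the PR's actual code): a Map keyed by query string, capped at two entries, with the oldest entry evicted on insert.

```typescript
// Minimal sketch of the caching idea, not the PR's actual implementation.
// Keys are query strings, values are the extracted label keys for that query.
const MAX_CACHE_SIZE = 2;
const queryToLabelKeysCache = new Map<string, string[]>();

async function getLabelKeysCached(
  query: string,
  fetchLabelKeys: (q: string) => Promise<string[]> // hypothetical fetcher
): Promise<string[]> {
  const cached = queryToLabelKeysCache.get(query);
  if (cached) {
    // Serve "stale" labels instead of waiting for another API request.
    return cached;
  }
  const labelKeys = await fetchLabelKeys(query);
  if (queryToLabelKeysCache.size >= MAX_CACHE_SIZE) {
    // Map preserves insertion order, so the first key is the oldest entry.
    const oldestKey = queryToLabelKeysCache.keys().next().value;
    if (oldestKey !== undefined) {
      queryToLabelKeysCache.delete(oldestKey);
    }
  }
  queryToLabelKeysCache.set(query, labelKeys);
  return labelKeys;
}
```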

Who is this feature for?
Users of Loki code editor.

Which issue(s) does this PR fix?:
Fixes #75512

Special notes for your reviewer:
This is the lightest implementation I could imagine in terms of CPU/memory consumption; I was worried about cases where the list of labels is quite long.

We could use LRU for the added feature of smarter cache purging (instead of purging the first inserted, we'd be purging the least recently used), but that's a lot of additional overhead, and the APIs of Map and LRU are very similar, so swapping it out later would be very easy.

Please check that:

  • It works as expected from a user's perspective.
  • If this is a pre-GA feature, it is behind a feature toggle.
  • The docs are updated, and if this is a notable improvement, it's added to our What's New doc.


github-actions bot commented Oct 2, 2023

Backend code coverage report for PR #75842
No changes


github-actions bot commented Oct 2, 2023

Frontend code coverage report for PR #75842

Plugin | Main  | PR     | Difference
loki   | 86.2% | 86.24% | +0.04%

@grafana-delivery-bot

Hello @gtk-grafana!
Backport pull requests need to be either:

  • Pull requests which address bugs,
  • Urgent fixes which need product approval, in order to get merged,
  • Docs changes.

If the current pull request addresses a bug fix, please label it with the type/bug label.
If it already has product approval, please add the product-approved label. For docs changes, please add the type/docs label.
If the pull request modifies CI behaviour, please add the type/ci label.
If none of the above applies, please consider removing the backport label and target the next major/minor release.
Thanks!

@gtk-grafana gtk-grafana marked this pull request as ready for review October 2, 2023 20:39
@gtk-grafana gtk-grafana requested a review from a team as a code owner October 2, 2023 20:39

@ivanahuckova ivanahuckova left a comment


Left a question below, but looks good. Great change!

@@ -84,4 +84,12 @@ export interface ContextFilter {
   description?: string;
 }
 
+export interface ExtractedLabelKeys {
Member


Yay for adding this 🤩!

@@ -16,7 +16,10 @@ export class CompletionDataProvider {
   constructor(
     private languageProvider: LanguageProvider,
     private historyRef: HistoryRef = { current: [] }
-  ) {}
+  ) {
+    this.queryToLabelKeysCache = new Map();
Member


We could use LRU for the added feature of smarter cache purging (instead of purging the first inserted, we'd be purging the least recently used), but that's a lot of additional overhead

I'm curious what you mean by "but that's a lot of additional overhead", because we already use LRUCache in LokiLanguageProvider, and I personally like that it handles all of the purging logic for us; we just need to set and get. That makes the code much cleaner and more consistent with the Loki codebase.

Contributor Author


We do already use the LRU cache, but in #75760 I'm proposing to replace it with the browser cache, so I didn't want to go all-in on LRU in case we want to move away from it.

If we're ok using a bit more memory we could certainly use LRU and/or increase the size of the cache; for this MVP I was going for the fastest/lightest possible implementation.

Swapping in LRU should be trivial as it has the same interface as Map: we'd just need to delete the bit where we manually delete the oldest element and swap the Map initializer for LRUCache.
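
A rough sketch of what that swap could look like (assuming the lru-cache package and its LRUCache named export; this is not code from the PR):

```typescript
// Hypothetical swap, not part of this PR: replace the plain Map with an LRU cache.
// get/set usage stays the same; the manual "evict the oldest entry" branch goes away.
import { LRUCache } from 'lru-cache';

const queryToLabelKeysCache = new LRUCache<string, string[]>({ max: 2 });

// queryToLabelKeysCache.get(query) and queryToLabelKeysCache.set(query, labelKeys)
// behave like the Map-based version, with eviction handled by the cache itself.
```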

Contributor Author


I guess the question is: how many items do we want to cache, how long do we want to cache them, and how bad is it if stale records are served throughout the session?

In other words: is the goal of this cache to prevent duplicate requests, or to cache every possible request in a session?
If we want to cache "everything", or as much as possible, we should use LRU or beef up the size of the map.
If we're worried about stale results, we should snap the range and check the date ranges when checking the cache (although in getParserAndLabelKeys we don't have the time range in scope), or stick to a small cache size so results aren't likely to stick around for a long time.

LRU is ~49K, which IMO is a tad overkill for a Map-like interface that keeps track of how recently each element was hit.
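
For what a time-aware cache key might look like (purely illustrative; the bucket size and helper name below are assumptions, and getParserAndLabelKeys would still need the time range passed in):

```typescript
// Illustrative only: snap the time range to a bucket and fold it into the cache key,
// so cached label keys are scoped to a time window rather than living forever.
const CACHE_BUCKET_MS = 5 * 60 * 1000; // assumed 5-minute buckets

function buildCacheKey(query: string, fromMs: number, toMs: number): string {
  const snappedFrom = Math.floor(fromMs / CACHE_BUCKET_MS) * CACHE_BUCKET_MS;
  const snappedTo = Math.ceil(toMs / CACHE_BUCKET_MS) * CACHE_BUCKET_MS;
  return `${snappedFrom}-${snappedTo}-${query}`;
}
```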


@ivanahuckova ivanahuckova left a comment


LGTM! Nice change 👍

@gtk-grafana gtk-grafana merged commit 5b63cdb into main Oct 3, 2023
18 checks passed
@gtk-grafana gtk-grafana deleted the gtk-grafana/logs/issues/75512/extracted-label-keys-memoize branch October 3, 2023 15:37
grafana-delivery-bot bot pushed a commit that referenced this pull request Oct 3, 2023
* add simple cache to extracted label values in completion provider

(cherry picked from commit 5b63cdb)
gtk-grafana added a commit that referenced this pull request Oct 3, 2023
Loki: Cache extracted labels (#75842)

* add simple cache to extracted label values in completion provider

(cherry picked from commit 5b63cdb)

Co-authored-by: Galen Kistler <109082771+gtk-grafana@users.noreply.github.com>
mildwonkey pushed a commit that referenced this pull request Oct 4, 2023
* add simple cache to extracted label values in completion provider
@zerok zerok modified the milestones: 10.2.x, 10.2.0 Oct 23, 2023
Development

Successfully merging this pull request may close these issues.

Loki autocomplete: too many sample requests
3 participants