Update from upstream repository #298

periklis · 2024-05-03T18:16:27Z

Refs:

LOG-5504

Signed-off-by: Michel Hollands <michel.hollands@gmail.com>

This code allows us to preprocess generic logs and replace highly variable dynamic data (timestamps, IPs, numbers, UUIDs, hex values, byte sizes and durations) with static placeholders, which makes pattern extraction easier and matching by the Drain algorithm more efficient and user-friendly. Additionally, there is logic that splits generic log lines into discrete tokens that can be used with Drain for better results than naively splitting the logs on every whitespace character. The tokenization here handles quote counting and emits quoted strings as part of the same token. Conversely, it also handles likely JSON logs that contain no whitespace better, by trying to split `{"key":value}` pairs (without actually parsing the JSON). All of this is done without regular expressions and without parsing the log lines in any specific format. That is why it is efficient in terms of CPU usage and allocations, and it should handle all log formats and unformatted logs equally well.
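To illustrate the idea described above, here is a minimal sketch of regex-free tokenization that keeps quoted strings inside one token, plus a simple numeric mask. It is not the code from this change: the function names `tokenize` and `maskNumber` and the `<NUM>` placeholder are hypothetical, and only plain digit runs are masked (timestamps, IPs, UUIDs and the JSON splitting are left out).

```go
package main

import (
	"fmt"
	"strings"
)

// tokenize splits a log line on spaces, but keeps a double-quoted string
// (including the spaces inside it) as part of the current token.
// Hypothetical helper for illustration only.
func tokenize(line string) []string {
	var tokens []string
	var cur strings.Builder
	inQuotes := false
	for i := 0; i < len(line); i++ {
		c := line[i]
		switch {
		case c == '"' && (i == 0 || line[i-1] != '\\'):
			// An unescaped quote toggles the quoted state.
			inQuotes = !inQuotes
			cur.WriteByte(c)
		case c == ' ' && !inQuotes:
			// A space outside quotes ends the current token.
			if cur.Len() > 0 {
				tokens = append(tokens, cur.String())
				cur.Reset()
			}
		default:
			cur.WriteByte(c)
		}
	}
	if cur.Len() > 0 {
		tokens = append(tokens, cur.String())
	}
	return tokens
}

// maskNumber replaces a token made up only of ASCII digits with a
// static placeholder. The "<NUM>" name is an assumption for this sketch.
func maskNumber(tok string) string {
	if tok == "" {
		return tok
	}
	for i := 0; i < len(tok); i++ {
		if tok[i] < '0' || tok[i] > '9' {
			return tok
		}
	}
	return "<NUM>"
}

func main() {
	line := `GET "/api/v1/query range" took 153 ms`
	for _, tok := range tokenize(line) {
		fmt.Printf("%q\n", maskNumber(tok))
	}
}
```

The sketch scans bytes once and allocates only for the emitted tokens, which is the property the description attributes to avoiding regular expressions. The escape check is deliberately naive and would misread a quote preceded by an escaped backslash.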

…mes (#12374)

…list (#12688)

…MRoleName for lambda-promtail CloudFormation template (#12728)

…age (#12740) Co-authored-by: J Stickler <julie.stickler@grafana.com>

Signed-off-by: Owen Diehl <ow.diehl@gmail.com>

Signed-off-by: Michel Hollands <michel.hollands@gmail.com>

Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com>

Followup to #12806 which exposes skipped pages more explicitly than as an error.

* refactors skip logic for bloom pages that are too large
* s/Seek/LoadOffset/ for LazyBloomIter
* removes unused code

…12807) This PR aims for full de-duplication of chunks and series from filter requests from the index gateway to the bloom gateway. Whenever we merge or de-duplicate slices, the inputs need to be sorted. It appears that the Removals (chunks) from the v1.Output are not guaranteed to be sorted. When comparing ShortRefs, all three of From, Through, and Checksum need to be used. Signed-off-by: Christian Haudum <christian.haudum@gmail.com>
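The sorting requirement can be sketched as follows. This is an illustration under assumptions, not the code from the PR: `shortRef` is a hypothetical stand-in with only the three fields named above, and their integer types are guesses.

```go
package main

import (
	"fmt"
	"sort"
)

// shortRef is a hypothetical stand-in for a chunk reference.
// Only the three compared fields are modelled; the types are assumed.
type shortRef struct {
	From, Through int64
	Checksum      uint32
}

// less orders refs by From, then Through, then Checksum, so that two refs
// compare as equal only when all three fields match.
func less(a, b shortRef) bool {
	if a.From != b.From {
		return a.From < b.From
	}
	if a.Through != b.Through {
		return a.Through < b.Through
	}
	return a.Checksum < b.Checksum
}

// sortAndDedupe sorts refs in place and returns a new slice without
// duplicates. Adjacent-duplicate removal is only correct on sorted input.
func sortAndDedupe(refs []shortRef) []shortRef {
	sort.Slice(refs, func(i, j int) bool { return less(refs[i], refs[j]) })
	out := make([]shortRef, 0, len(refs))
	for _, r := range refs {
		if len(out) == 0 || out[len(out)-1] != r {
			out = append(out, r)
		}
	}
	return out
}

func main() {
	refs := []shortRef{{10, 20, 1}, {5, 8, 7}, {10, 20, 2}, {10, 20, 1}}
	fmt.Println(sortAndDedupe(refs))
}
```

Note that `{10, 20, 1}` and `{10, 20, 2}` are kept as distinct entries: comparing on From and Through alone would wrongly merge them.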

Signed-off-by: thorker <th.kerber+github@gmail.com> Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com>

#12838) The bloom shipper uses metas to resolve available blocks. Metas are fetched from cache and, if not available there, from object storage. If fetching metas from cache fails (for example, due to a timeout), the request should not fail, but proceed as if no metas were available in the cache. Signed-off-by: Christian Haudum <christian.haudum@gmail.com>

Co-authored-by: J Stickler <julie.stickler@grafana.com> Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com>

Signed-off-by: Callum Styan <callumstyan@gmail.com> Co-authored-by: J Stickler <julie.stickler@grafana.com>

We've seen a few cases where creating the ULID failed for unknown reasons, and the ID is not really used. It was only useful early on in the development for debugging. Signed-off-by: Christian Haudum <christian.haudum@gmail.com>

There is a time window between listing metas and fetching them from object storage, which can lead to a race condition in which a meta is not found in object storage because it was deleted and superseded by a newer meta. This can happen when querying recent bloom data that is still subject to updates, and results in an error like this:

```
rpc error: code = Unknown desc = failed to get meta file bloom/tsdb_index_19843/XXXX/metas/18fbdc8500000000-1921d15dffffffff-270affee.json: storage: object doesn't exist (Trace ID: 4fe28d32cfa3e3df9495c3a5d4a683fb)
```

Signed-off-by: Christian Haudum <christian.haudum@gmail.com>
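The commit message describes the race but not the remedy. One plausible way to tolerate it, shown here purely as an assumption, is to skip metas that have disappeared between the list and the fetch. The names `errNotFound` and `fetchListed` are hypothetical.

```go
package main

import (
	"errors"
	"fmt"
)

// errNotFound is a hypothetical sentinel for "object doesn't exist".
var errNotFound = errors.New("object doesn't exist")

// fetchListed fetches every previously listed key. A key that has vanished
// since the listing (superseded by a newer meta) is skipped rather than
// treated as a failure; any other error still aborts the fetch.
func fetchListed(keys []string, get func(key string) (string, error)) ([]string, error) {
	var metas []string
	for _, k := range keys {
		m, err := get(k)
		if errors.Is(err, errNotFound) {
			continue // deleted between list and fetch
		}
		if err != nil {
			return nil, err
		}
		metas = append(metas, m)
	}
	return metas, nil
}

func main() {
	get := func(key string) (string, error) {
		if key == "old.json" {
			return "", fmt.Errorf("failed to get meta file %s: %w", key, errNotFound)
		}
		return "meta for " + key, nil
	}
	metas, err := fetchListed([]string{"old.json", "new.json"}, get)
	fmt.Println(metas, err)
}
```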

Signed-off-by: Michel Hollands <michel.hollands@gmail.com> Co-authored-by: J Stickler <julie.stickler@grafana.com>

…12815)

Signed-off-by: Christian Haudum <christian.haudum@gmail.com>

Trying to remove some cruft from traces.

Adds some timing information to pre-existing spans to help better understand bloom read path latency responsibility

…and Promtail (#12741) From https://systemd.io/NETWORK_ONLINE/:

**How do I make sure that my service starts after the network is really online?**

That depends on your setup and the services you plan to run after it (see above). If you need to delay your service after network connectivity has been established, include

```systemd
After=network-online.target
Wants=network-online.target
```

in the `.service` file. This will delay boot until the network management software says the network is “up”. For details, see the next question.

Signed-off-by: Christian Haudum <christian.haudum@gmail.com>

… other) (#12868) Signed-off-by: Michel Hollands <michel.hollands@gmail.com> Co-authored-by: Vladyslav Diachenko <82767850+vlad-diachenko@users.noreply.github.com> Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com> Co-authored-by: Michel Hollands <michel.hollands@gmail.com>

Co-authored-by: J Stickler <julie.stickler@grafana.com>

Signed-off-by: Kaviraj <kavirajkanagaraj@gmail.com>

… boundaries (#12880)

openshift-ci · 2024-05-03T18:43:20Z

@periklis: all tests passed!


JoaoBraveCoding

/lgtm

openshift-ci · 2024-05-03T18:44:09Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: JoaoBraveCoding, periklis


Needs approval from an approver in each of these files:

~~OWNERS~~ [periklis]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

MichelHollands and others added 30 commits April 26, 2024 13:05

fix: remove unused parameter causing lint error (#12801)

33e82ec

Signed-off-by: Michel Hollands <michel.hollands@gmail.com>

fix: add missing parentheses in meta monitoring dashboards (#12802)

151d0a5

Signed-off-by: Michel Hollands <michel.hollands@gmail.com>

fix(blooms): Reset error on LazyBloomIter.Seek (#12806)

76ba24e

fix(promtail): Handle docker logs when a log is split in multiple fra…

c0113db

…mes (#12374)

feat(blooms): limit bloom size during creation (#12796)

eac5622

fix(ksonnet): Do not generate rbac for consul if you are using member…

2d62fca

…list (#12688)

docs: Fix typo in structured-metadata.md (#12818)

6e1680b

fix: loki-operational.libsonnet (#12789)

51a841f

feat: parameterise the MaximumEventAgeInSeconds, LogGroupName, and IA…

8892dc8

…MRoleName for lambda-promtail CloudFormation template (#12728)

docs: Add info about step param for Patterns API (#12803)

74db5dd

docs: hint on line and timestamp functions in docs for line_format st…

c3a3bc3

…age (#12740) Co-authored-by: J Stickler <julie.stickler@grafana.com>

feat(blooms): compute chunks once (#12664)

bc78d13

Signed-off-by: Owen Diehl <ow.diehl@gmail.com>

ci: Add lokitool to the dist target (#12830)

c9b6604

Signed-off-by: Michel Hollands <michel.hollands@gmail.com>

feat(helm): Allow extraObject items as multiline strings (#12397)

af5be90

Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com>

fix(blooms): dont break iterator conventions (#12808)

1665e85

Followup to #12806 which exposes skipped pages more explicitly than as an error.

* refactors skip logic for bloom pages that are too large
* s/Seek/LoadOffset/ for LazyBloomIter
* removes unused code

fix: Fixes read & backend replicas settings (#12828)

d751134

Signed-off-by: thorker <th.kerber+github@gmail.com> Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com>

fix: Add missing Helm helper loki.hpa.apiVersion (#12755)

3070ea7

Co-authored-by: J Stickler <julie.stickler@grafana.com> Co-authored-by: Michel Hollands <42814411+MichelHollands@users.noreply.github.com>

chore: Add notes about promtail being feature complete (#12827)

5900417

Signed-off-by: Callum Styan <callumstyan@gmail.com> Co-authored-by: J Stickler <julie.stickler@grafana.com>

fix: Fix compactor matcher in the loki-deletion dashboard (#12790)

a03846b

feat(blooms): ingester aware bounded impl (#12840)

7bbd8b5

chore(blooms): Remove ID field from task struct (#12851)

48bbf98

We've seen a few cases where creating the ULID failed for unknown reasons, and the ID is not really used. It was only useful early on in the development for debugging. Signed-off-by: Christian Haudum <christian.haudum@gmail.com>

docs: Update template_functions.md (#12841)

ed84b23

docs: update the lokitool docs (#12805)

599a300

Signed-off-by: Michel Hollands <michel.hollands@gmail.com> Co-authored-by: J Stickler <julie.stickler@grafana.com>

fix: Ensure Drain patterns are valid for LogQL pattern match filter (#…

fd2301f

…12815)

docs: Update docker installation topic (#12770)

afbeedc

feat(blooms): Add in-memory LRU cache for meta files (#12862)

fcd544c

Signed-off-by: Christian Haudum <christian.haudum@gmail.com>

owen-d and others added 14 commits May 2, 2024 15:52

feat(blooms): ignore individual bloom-gw failures (#12863)

4c9b22f

chore: reduces span footprint + double recording (#12864)

5a643c7

Trying to remove some cruft from traces.

chore(blooms): additional spans for bloom read path (#12866)

8b34751

Adds some timing information to pre-existing spans to help better understand bloom read path latency responsibility

ci: make renovate commits come in as fixes (#12867)

b05172b

docs(helm): Improve the helm's NOTES.txt (#12744)

74b28ad

Co-authored-by: J Stickler <julie.stickler@grafana.com>

chore: Add dashboards for Bloom Compactor and Gateway (#12855)

079ba64

docs: Consistent quoting in Template functions docs (#12833)

11e02cc

Co-authored-by: J Stickler <julie.stickler@grafana.com>

feat(detectedFields): add parser to response (#12872)

2b3ae48

docs: Update logcli command reference (#12850)

e684ec8

Co-authored-by: J Stickler <julie.stickler@grafana.com>

fix: codec not initialized in downstream roundtripper (#12873)

b6049f6

Signed-off-by: Kaviraj <kavirajkanagaraj@gmail.com>

chore(instrumentation): reintroduce span propagation across scheduler…

c1d56c1

… boundaries (#12880)

chore(operator): Update Loki operand to v2.9.8 (#12874)

33a677b

periklis self-assigned this May 3, 2024

openshift-ci bot requested review from jcantrill and JoaoBraveCoding May 3, 2024 18:17

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 3, 2024

JoaoBraveCoding approved these changes May 3, 2024


openshift-ci bot assigned JoaoBraveCoding May 3, 2024

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label May 3, 2024

openshift-merge-bot bot merged commit 945bf0c into openshift:main May 3, 2024
7 checks passed
